Home How to filter a list of strings?

Questions

How to filter a list of strings?

December 27, 2021

I have a list of strings that contain Non-English/English words. I want to filter out only English words.

Example:


phrases = [
    "S/O अशोक कुमार, ब्लॉक न.-4डी, S/O Ashok Kumar, Block no.-4D.",
    "स्ट्रीट-15, विभाग 5. सिविक सेंटर Street-15, sector -5, Civic Centre",
    "भिलाई, दुर्ग, भिलाई, छत्तीसगढ़, Bhilai, Durg. Bhilai, Chhattisgarh,",
]

My code so far:

import re
regex = re.compile("[^a-zA-Z0-9!@#$&()\\-`.+,/\"]+")
for i in phrases:
    print(regex.sub(' ', i))

My output:

["S/O , .-4 , S/O Ashok Kumar, Block no.-4D.",
  "-15, 5. Street-15, sector -5, Civic Centre",
  ", , , , Bhilai, Durg. Bhilai, Chhattisgarh",]

My desire output

["S/O Ashok Kumar, Block no.-4D.",
 "Street-15, sector -5, Civic Centre",
 "Bhilai, Durg. Bhilai, Chhattisgarh,"]

>Solution :

If I look at your data it seems you could use the following:

import regex as re
lst=["S/O अशोक कुमार, ब्लॉक न.-4डी, S/O Ashok Kumar, Block no.-4D.",
      "स्ट्रीट-15, विभाग 5. सिविक सेंटर Street-15, sector -5, Civic Centre",
      "भिलाई, दुर्ग, भिलाई, छत्तीसगढ़, Bhilai, Durg. Bhilai, Chhattisgarh,",]
for i in lst:
    print(re.sub(r'^.*\p{Devanagari}.+?\b', '', i))

Prints:

S/O Ashok Kumar, Block no.-4D.
Street-15, sector -5, Civic Centre
Bhilai, Durg. Bhilai, Chhattisgarh,

See an online regex demo

^ – Start string anchor;
.*\p{Devanagari} – 0+ (Greedy) characters upto the last Devanagari letter;
.+?\b – 1+ (Lazy) characters upto the first word-boundary

byMR

Published December 27, 2021

Add a comment

Create a binary table with presence/absence from groups in pandas

byMR

December 27, 2021

Questions

Type guard function for the `[key, value]` entry

byMR

December 27, 2021

Questions

Vue bind child component data from parent

byMR

December 27, 2021

Questions

Compare multiple arrays of objects using java script

byMR

December 27, 2021

Questions

update/modify formula VBA

byMR

December 27, 2021

Questions

Why my texts are not being displayed in my column?

byMR

December 27, 2021

How to filter a list of strings?

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

Create a binary table with presence/absence from groups in pandas

Type guard function for the `[key, value]` entry

Vue bind child component data from parent

Compare multiple arrays of objects using java script

update/modify formula VBA

Why my texts are not being displayed in my column?

Keep Up to Date with the Most Important News

How to filter a list of strings?

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

Create a binary table with presence/absence from groups in pandas

Type guard function for the `[key, value]` entry

Vue bind child component data from parent

Compare multiple arrays of objects using java script

update/modify formula VBA

Why my texts are not being displayed in my column?

Discover more from Dev solutions