Extract features from text data in Python

March 9, 2022

I have a pandas DataFrame like this:

ID            email
1            abc@google.com
2            abc@facebook.com
3            abc@GOOGLE.COM
4            abc@tesla.com
5            abc@hilton.com
6            abc@FaceBook.com

I want to derive the company from each email address (the part after the @) and add one Yes/No column per company. Sample output:

ID            email                   WorkGoogle      WorkFacebook    etc.
1            abc@google.com          Yes             No              ..
2            abc@facebook.com        No              Yes             ..
3            abc@GOOGLE.COM          Yes             No              ..
4            abc@tesla.com           No              No              ..
5            abc@hilton.com          No              No              ..
6            abc@FaceBook.com        No              Yes             ..

The match needs to be case-insensitive (GOOGLE.COM and google.com are the same company).

Solution:

FYI: this solution is not performance-efficient; a faster approach likely exists.

I would first build a set of all company names:

# Take the text after "@", keep the first label of the domain, and lowercase it.
companies = {email.split('@')[1].split('.')[0].lower() for email in df['email']}
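
To see what that expression produces, here is a small self-contained sketch. The helper name `company_from_email` and the sample list are illustrative assumptions, not part of the original answer.

```python
def company_from_email(email: str) -> str:
    """Return the lowercased first label of the domain part of an address."""
    domain = email.split('@')[1]          # text after the "@"
    return domain.split('.')[0].lower()   # first label, lowercased


samples = ['abc@google.com', 'abc@GOOGLE.COM', 'abc@FaceBook.com']
companies = {company_from_email(e) for e in samples}
print(sorted(companies))  # ['facebook', 'google']
```

Note that this keeps only the first label after the @, so an address on a subdomain such as abc@mail.google.com would yield "mail" rather than "google".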

Then iterate over that set and add one column per company:

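for company in companies:
    # Compare the lowercased first domain label exactly, then map True/False to Yes/No.
    is_match = df['email'].str.split('@').str[1].str.split('.').str[0].str.lower() == company
    df['Work' + company.capitalize()] = is_match.map({True: 'Yes', False: 'No'})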
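
Since the answer says the loop is not the most efficient option, one possible vectorized alternative is sketched below. It is a minimal sketch, not the original author's method: it assumes the sample data from the question, swaps the loop for `Series.str.extract` plus `pd.get_dummies`, and the variable names `company`, `flags`, and `result` are made up for illustration.

```python
import pandas as pd

# Sample data copied from the question.
df = pd.DataFrame({
    'ID': [1, 2, 3, 4, 5, 6],
    'email': ['abc@google.com', 'abc@facebook.com', 'abc@GOOGLE.COM',
              'abc@tesla.com', 'abc@hilton.com', 'abc@FaceBook.com'],
})

# Capture the first domain label after "@" and lowercase it in one pass.
company = df['email'].str.extract(r'@([^.@]+)', expand=False).str.lower()

# One indicator column per company, named e.g. "WorkGoogle".
flags = pd.get_dummies(company.str.capitalize(), prefix='Work', prefix_sep='')

# Convert the indicators to the Yes/No strings requested in the question.
flags = flags.astype(bool).apply(lambda col: col.map({True: 'Yes', False: 'No'}))

result = pd.concat([df, flags], axis=1)
print(result)
```

Lowercasing before building the indicator columns is what makes GOOGLE.COM and google.com land in the same WorkGoogle column.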

by MR
