Home Split dataframe string (when string can hold n values of that cell variable), into multiple columns

Questions

Split dataframe string (when string can hold n values of that cell variable), into multiple columns

March 3, 2022

Currently working on a dataset with a lot of contact data, being Emails one of the variables.

A cell in the Emails column can have more than one email (1 to n) and they are all separated by a comma and a space.

For contacts with only two emails, the process would be quite straightforward. One can split the string and create a new column for that secondary email as follows

email_df[['Emails', 'SecondaryEmail']] = email_df['Emails'].str.split(', ', expand=True)

However this won’t work with more than 2 emails. Therefore, I wonder what is the most efficient way to split the emails when the number of emails can go from 1 to n (in this case the n is limited to around 10 but that won’t always be the case), into columns with only one email each (and different names each)?

>Solution :

Use Series.str.splitSeries.str.rsplit with DataFrame.pop for remove column Email after processing:

df = email_df.join(email_df.pop('Emails').str.split(', ', expand=True).add_prefix('Email'))

byMR

Published March 03, 2022

Add a comment

I want to return undefined but instead got "{"

byMR

March 3, 2022

Questions

Create a sub interface in Typescript

byMR

March 3, 2022

Questions

Password Generator using randint instead of random.choice

byMR

March 3, 2022

Questions

Send application/pdf content with http call

byMR

March 3, 2022

Questions

Need assistance with Nested Loops and Dictionary in Python to display tabular data

byMR

March 3, 2022

Questions

Add spaces before every repeating string in list

byMR

March 3, 2022

Split dataframe string (when string can hold n values of that cell variable), into multiple columns

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

I want to return undefined but instead got "{"

Create a sub interface in Typescript

Password Generator using randint instead of random.choice

Send application/pdf content with http call

Need assistance with Nested Loops and Dictionary in Python to display tabular data

Add spaces before every repeating string in list

Keep Up to Date with the Most Important News

Split dataframe string (when string can hold n values of that cell variable), into multiple columns

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

I want to return undefined but instead got "{"

Create a sub interface in Typescript

Password Generator using randint instead of random.choice

Send application/pdf content with http call

Need assistance with Nested Loops and Dictionary in Python to display tabular data

Add spaces before every repeating string in list

Discover more from Dev solutions