Home pandas split string column on any character that is not an alphabet

Questions

pandas split string column on any character that is not an alphabet

September 14, 2023

I have dataframe something like below

a    str_col
1    ABC*EFG
2    DDC/DSD
3.   sew^sds 
...

I want to split them on non alphabet and into a list. Desired df is as follows

a    str_col.   new_col
1    ABC*EFG.   [ABC, EFG]
2    DDC/DSD.   [DDC, DSD]
3.   sew^sds    [sew, sds]
...

I’ve tried

df['str_col'].str.split('^[a-zA-Z]+') but it created something like [, *EFG]`

>Solution :

You can use [^a-zA-Z], or \W+ (equivalent to [^a-zA-Z0-9_]) that should also work in your case:

df['new_col'] = df['str_col'].str.split(r'[^a-zA-Z]+')

df['new_col'] = df['str_col'].str.split(r'\W+')

Output:

     a  str_col     new_col
0  1.0  ABC*EFG  [ABC, EFG]
1  2.0  DDC/DSD  [DDC, DSD]
2  3.0  sew^sds  [sew, sds]

^[a-zA-Z]+ failed because ^ is an anchor to the start of the string when outside of […].

pandas

byMR

Published September 14, 2023

Add a comment

pandas split string column on any character that is not an alphabet

byMR

September 14, 2023

Questions

Javascript/jQuery replace every first text to specific text in string after comma

byMR

September 14, 2023

Questions

Sorting Rows of a 2D Array in JavaScript Based on Dates in the First Row

byMR

September 14, 2023

Questions

EF migration not creating table

byMR

September 14, 2023

Questions

Extracting data from nested JSON with escape backslash characters by using jq

byMR

September 14, 2023

Questions

C error: incompatible pointer types issue using typedef for int array

byMR

September 14, 2023

pandas split string column on any character that is not an alphabet

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

pandas split string column on any character that is not an alphabet

Javascript/jQuery replace every first text to specific text in string after comma

Sorting Rows of a 2D Array in JavaScript Based on Dates in the First Row

EF migration not creating table

Extracting data from nested JSON with escape backslash characters by using jq

C error: incompatible pointer types issue using typedef for int array

Keep Up to Date with the Most Important News

pandas split string column on any character that is not an alphabet

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

pandas split string column on any character that is not an alphabet

Javascript/jQuery replace every first text to specific text in string after comma

Sorting Rows of a 2D Array in JavaScript Based on Dates in the First Row

EF migration not creating table

Extracting data from nested JSON with escape backslash characters by using jq

C error: incompatible pointer types issue using typedef for int array

Discover more from Dev solutions