How to extract user IDs from URLs inside Pandas in Python?

The data frame has 1,050,000 rows.

Input: (a pandas dataframe column)

UserImage
    https://play-lh.googleusercontent.com/a/AItbvmkI4RoZOTFftgRqwJ0QVl-OqLw0PXFRQsQmzPwayQ=mo
    https://play-lh.googleusercontent.com/EGemoI2NTXmTsBVtJqk8jxF9rh8ApRWfsIMQSt2uE4OcpQqbFu7f7NbTK05lx80nuSijCz7sc3a277R67g
    https://play-lh.googleusercontent.com/a-/AFdZucpr-V6JJAWHdTjxYVPa15fmQC7pWl5Xd5StFt1E

Output:


UserIDs
AItbvmkI4RoZOTFftgRqwJ0QVl-OqLw0PXFRQsQmzPwayQ
EGemoI2NTXmTsBVtJqk8jxF9rh8ApRWfsIMQSt2uE4OcpQqbFu7f7NbTK05lx80nuSijCz7sc3a277R67g
AFdZucpr-V6JJAWHdTjxYVPa15fmQC7pWl5Xd5StFt1E

>Solution:

This looks like a perfect use case for a regex: capture the last path segment of each URL, dropping any trailing suffix such as =mo:

df['UserIDs'] = df['UserImage'].str.extract(r'^.*/([^/=]+)[^/]*$')

Or, if you want to keep only alphanumeric characters, underscores, and hyphens:

df['UserIDs'] = df['UserImage'].str.extract(r'^.*/([-\w]+)[^/]*$')

Note the raw strings (r'...'), which avoid Python warnings about escape sequences like \w.
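To see how the pattern behaves on a single URL before applying it to the whole column, here is a small sketch using the standard-library re module on one of the question's sample URLs:

```python
import re

# Same pattern as the pandas extract above
pattern = re.compile(r"^.*/([-\w]+)[^/]*$")

url = "https://play-lh.googleusercontent.com/a/AItbvmkI4RoZOTFftgRqwJ0QVl-OqLw0PXFRQsQmzPwayQ=mo"
m = pattern.match(url)

# The greedy ^.*/ consumes everything up to the last "/",
# [-\w]+ captures the ID (word characters and hyphens),
# and [^/]* swallows the trailing "=mo" suffix.
print(m.group(1))  # AItbvmkI4RoZOTFftgRqwJ0QVl-OqLw0PXFRQsQmzPwayQ
```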

Output:

                                           UserImage  \
0  https://play-lh.googleusercontent.com/a/AItbvm...   
1  https://play-lh.googleusercontent.com/EGemoI2N...   
2  https://play-lh.googleusercontent.com/a-/AFdZu...   

                                             UserIDs  
0     AItbvmkI4RoZOTFftgRqwJ0QVl-OqLw0PXFRQsQmzPwayQ  
1  EGemoI2NTXmTsBVtJqk8jxF9rh8ApRWfsIMQSt2uE4OcpQ...  
2       AFdZucpr-V6JJAWHdTjxYVPa15fmQC7pWl5Xd5StFt1E  
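Putting it together, a self-contained sketch that builds a small data frame from the question's three sample URLs and runs the extraction:

```python
import pandas as pd

# Sample data mirroring the question's UserImage column
df = pd.DataFrame({
    "UserImage": [
        "https://play-lh.googleusercontent.com/a/AItbvmkI4RoZOTFftgRqwJ0QVl-OqLw0PXFRQsQmzPwayQ=mo",
        "https://play-lh.googleusercontent.com/EGemoI2NTXmTsBVtJqk8jxF9rh8ApRWfsIMQSt2uE4OcpQqbFu7f7NbTK05lx80nuSijCz7sc3a277R67g",
        "https://play-lh.googleusercontent.com/a-/AFdZucpr-V6JJAWHdTjxYVPa15fmQC7pWl5Xd5StFt1E",
    ]
})

# Capture the last path segment, keeping only word characters and hyphens
df["UserIDs"] = df["UserImage"].str.extract(r"^.*/([-\w]+)[^/]*$")

print(df["UserIDs"].tolist())
```

On the full 1,050,000-row frame the same one-liner applies unchanged; Series.str.extract is vectorized, so no explicit loop is needed.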

