Follow

Follow

Contact

Home Regex to match multiple numbers within string

Questions

Regex to match multiple numbers within string

byMR

May 14, 2022

I have a regex that looks like this to extract order numbers from columns:

df["Orders"].str.extract('([0-9]{9,10}[/+ #_;.-]?)')

The orders column can look like this:

12
123456789
1234567890
123456789/1234567890
123456789/1/123456789
123456789+1234567890

The resulting new column in the dataframe after the regex should look like this:

NaN
123456789
1234567890
123456789/1234567890
123456789/123456789
123456789+1234567890

However, with my current regex I’m getting the following result:

How can I get the result that I’m looking for?

>Solution :

You can use

import pandas as pd
df = pd.DataFrame({'Orders':['12','123456789','1234567890','123456789/1234567890','123456789/1/123456789','123456789+1234567890', 'Order number: 6508955960_000010_1005500']})
df["Result"] = df["Orders"].str.findall(r'[/+ #_;.-]?(?<![0-9])[0-9]{9,10}(?![0-9])').str.join('').str.lstrip('/+ #_;.-')
df.loc[df['Result'] == '', 'Result'] = np.nan

See the regex demo. Details

[/+ #_;.-]?(?<![0-9])[0-9]{9,10}(?![0-9]) – matches an optional /, +, space, #, _, ;, . or - char, and then none or ten digit number not enclosed with other digits
Series.str.findall extracts all occurrences
.str.join('') concatenates the matches into a single string
.str.lstrip('/+ #_;.-') – removes the special chars that were matched with the number at the beginning of the string
df.loc[df['Result'] == '', 'Result'] = np.nan – if needed – replaces empty strings with np.nan values in the Result column.

Output:

>>> df
                  Orders                Result
0                    NaN                   NaN
1              123456789             123456789
2             1234567890            1234567890
3   123456789/1234567890  123456789/1234567890
4  123456789/1/123456789   123456789/123456789
5   123456789+1234567890  123456789+1234567890
>>>

byMR

Published May 14, 2022

Add a comment

Leave a ReplyCancel reply

Read more

Questions

Flutter _AssertionError 'initialValue == null || controller == null': is not true. Error

byMR

May 14, 2022

Questions

How can I set one array as a value of another array in PHP?

byMR

May 14, 2022

Questions

how to properly get day of the week (name) with moment.js?

byMR

May 14, 2022

Questions

Convert data frame into adjacency matrix format in R

byMR

May 14, 2022

Questions

view Terraform output from module using for_each and toset

byMR

May 14, 2022

Questions

How to convert PHP multidimensional associative array to API http query in the format the API specifies?

byMR

May 14, 2022