Match only the last instance of a regex that I only know partially

Advertisements I want to match just the last instance of a regular expression. For example in this string: some text here foo word bar and foo word foo word foo word bar more text here I am trying to only match the bolded portion. Here is what I have tried currently: /foo?.+?bar/gi However, it is… Read More Match only the last instance of a regex that I only know partially

February 27, 2024 MRLeave a comment

Need help figuring out short regex – how to match 1 a char, then either of a set including the char, then another char

Advertisements My regex does not match phrases as intended, and I don’t know if it’s possible or not to do what I’m trying to. Intended match (as string progresses) phrase starts with t FIRST character after beginning ‘t’ must not be ‘t’ has any number of ‘t’ or ‘y’ characters (can be 0) must end… Read More Need help figuring out short regex – how to match 1 a char, then either of a set including the char, then another char

February 24, 2024 MRLeave a comment

Regex to split a column in R after the second pipe and after the second T

Advertisements I have a 1 column dataframe of thousands of lines all built on the same pattern, for example: ids <- c("ETC|HMPI01000001|HMPI01000001.1 TAG: Genus Species, T05X3Ml2_CL10007Cordes1_1","ETC|HMPI31000002|HMPI31000002.1 TAG: Genus Species, T3X3Ml2_CL10157Cordes1_1", "ETC|HMPI01000007|HMPI01000007.1 TAG: Genus Species, T1X3Ml2_CL11231Cordes1_1") df <- as.data.frame(ids) > df ids 1 ETC|HMPI01000001|HMPI01000001.1 TAG: Genus Species, T05X3Ml2_CL10007Cordes1_1 2 ETC|HMPI31000002|HMPI31000002.1 TAG: Genus Species, T3X3Ml2_CL10157Cordes1_1 3 ETC|HMPI01000007|HMPI01000007.1… Read More Regex to split a column in R after the second pipe and after the second T

February 23, 2024 MRLeave a comment

Matching strings between symbols

Advertisements It’s a few days I’m trying to solve this issue but I can’t make it work. I have looked at many questions here on Stack-overflow but still I can’t figure out the correct way to solve it. I have string representing arithmetic expressions with numbers and "multi word" variables such as (Car speed /… Read More Matching strings between symbols

February 5, 2024 MRLeave a comment

Regex for finding string after the second occurrence of the character

Advertisements The problem is to get from the string ‘https://myapp-ui.private.dev.mysubdom.eu’ the substring ‘dev.mysubdom.eu’ without the private. So, in other words, I want to to get the substring after the occurrence of the second dot, the character ‘.’. What I tried and works (so the next string after the occurrence of the first dot) : to… Read More Regex for finding string after the second occurrence of the character

February 1, 2024 MRLeave a comment

How to match fixed length string with quantifiers

Advertisements I have strings like this: 123456-0001 123456-0012 123456-0123 How to match with next conditions: chars count after – should be 4 zeros count variable – from 1 to 3 I found ^\d{6}-0+([1-9]+)$ pattern but it matches for 123456-001 or 123456-00001. >Solution : You can use ^\d{6}-(?=\d{4}$)0+([1-9]\d*)$ See the regex demo. Details: ^ – start… Read More How to match fixed length string with quantifiers

January 31, 2024 MRLeave a comment

Extracting maximum number from DataFrame of strings (and some NaN values)

Advertisements Look at the DataFrame: import pandas as pd import numpy as np data=pd.DataFrame([‘random 15 numbers 128 and 12 letters’,’12-5′,’page 65′],columns=[‘text’]) I want to extract all numbers from the strings and write the maximum number into a new column. I achieved that with this code: data[‘list’]=data[‘text’].str.extractall(‘(\d+)’).unstack().values.tolist() data[‘max’]=data[‘list’].apply(lambda row:max([int(x) for x in row if x is… Read More Extracting maximum number from DataFrame of strings (and some NaN values)

January 18, 2024 MRLeave a comment

How do I parse a file name format like 'PUBLIC001' with a Regular Expression?

Advertisements Need help with a regular expression that parses a file name File will be named PUBLIC001 ‘PUBLIC’ is static text in all file names Last 3 digits- day of the year.001(Jan 1)-366(Dec31st on a leap year) is valid range What would be regular expression. Is there a way to limit the max to 366?… Read More How do I parse a file name format like 'PUBLIC001' with a Regular Expression?

January 16, 2024 MRLeave a comment

Regex don't match specific string

Advertisements I have a regex, for simplicity let’s say it’s: ([a-z]*[0-9]) For the string aa bbc1 cc it matches bbc1. Now I want to change the regex in such a way that c1 is not part of the match anymore. But only if it’s c1. Some examples: aa bbb1 cc will match bbb1 aa bbc1… Read More Regex don't match specific string

January 15, 2024 MRLeave a comment

Replace little text format hashtag in string using javascript

Advertisements I have a string returned by the LinkedIn API that contains a number of hashtags. They are formatted like this: {hashtag|\#|somehashtag} I am trying to use a regex with String.replaceAll that will replace all occurrence of these hashtags to a standard hashtag notation like this: #somehashtag I think this regex will identify the hashtags… Read More Replace little text format hashtag in string using javascript