Python extract repeating substring(s) between equal markers

December 12, 2021

Let's say I have a text file as follows:

    1. MarkerOne
    Some text
    EndMarkerOne
    2. Something else
    Some more text
    EndSomethingElse
    3. MarkerTwo
    Some Text
    EndMarkerTwo

Here MarkerOne and MarkerTwo are identical, as are EndMarkerOne and EndMarkerTwo. For example:

    1. Notice 
    Some text 
    End Notice
    2. Blabla 
    Some other text 
    End Blabla
    3. Notice 
    Some more text
    End Notice

Now I want to extract "Some text" and "Some more text" from the file as two separate substrings in a list.

I tried:

    import re

    # text holds the contents of the file shown above
    # The greedy * is what makes one match span several blocks
    pattern = r"\d+\. Notice[\S\t\n\v ]*End Notice"
    result = re.findall(pattern, text)
    print(result)

Unfortunately, this returns everything from the first "Notice" to the last "End Notice" as a single match instead of two separate results.

I need each match to stop at the nearest "End Notice", with the search for the pattern then resuming from that point.

Any ideas?

Solution:

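Use a non-greedy (lazy) quantifier: change * to *?. The greedy * consumes as much text as it can, so a single match stretches from the first "Notice" to the last "End Notice"; the lazy *? stops at the nearest "End Notice" instead. See also the question "What is the difference between .*? and .* regular expressions?".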

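    import re

    # *? is lazy, so each match ends at the nearest "End Notice";
    # the dot after \d+ is escaped so that it matches a literal "."
    ptn = re.compile(r"\d+\. Notice[\S\t\n\v ]*?End Notice")
    result = ptn.findall(text)  # text holds the file contents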
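
Note that the pattern above returns each whole block, markers included. If only the inner text is wanted, as the question asks, one option is to wrap the lazy part in a capture group, since findall then returns just the captured portion. The following is a minimal sketch rather than part of the original answer: the sample text is copied from the question, the helper name extract_notices is hypothetical, and it assumes every marker sits on its own line.

```python
import re

# Sample input copied from the question.
text = """1. Notice
Some text
End Notice
2. Blabla
Some other text
End Blabla
3. Notice
Some more text
End Notice
"""

# re.MULTILINE makes ^ match at the start of every line;
# re.DOTALL lets "." cross newlines so a body may span several lines.
# (.*?) is lazy, so it stops at the first line that starts with "End Notice".
ptn = re.compile(
    r"^[ \t]*\d+\. Notice[ \t]*\n(.*?)^[ \t]*End Notice",
    re.MULTILINE | re.DOTALL,
)


def extract_notices(source):
    """Return the text between each 'N. Notice' line and the next 'End Notice' line."""
    # With one capture group, findall returns only the captured bodies.
    return [body.strip() for body in ptn.findall(source)]


print(extract_notices(text))  # ['Some text', 'Some more text']
```

The strip() call removes the trailing newline and any surrounding spaces from each captured body, and the [ \t]* parts tolerate indented marker lines or trailing spaces after "Notice".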

regex

by MR

Published December 12, 2021
