Home Extract keywords from links

Questions

Extract keywords from links

March 31, 2022

I’m trying to extract the first 2 numbers in links like these:

https://primer.text.com/sdfg/8406758680-345386743-DSS1-S%20Jasd%12Odsfr%12Iwetds-Osdgf/ 
https://primer.text.com/sdfg/8945879094-849328844-DPE-S%20Jsdfe%12OIert-Isdfu/
https://primer.text.com/sdfg/8493093053-292494834-QW23%23Wsdfg%23Iprf%64Uiojn%32Asdfg-Werts/

The output should be like this:

id1 = ['8406758680', '8945879094','8493093053']
id2 = ['345386743', '849328844', '292494834']

I’m trying to do this using the re module.

Please, tell me how to do it.

This the code snippet I have so far:

def GetUrlClassId(UrlInPut):
    ClassID = ''
    for i in UrlInPut:
        if i.isdigit():
            ClassID+=i
        elif ClassID !='':
            return int(ClassID)
    return ""

def GetUrlInstanceID(UrlInPut):
    InstanceId = ''
    ClassID = 0
    for i in UrlInPut:
        if i.isdigit() and ClassID==1:
            InstanceId+=i
        elif InstanceId !='':
            return int(InstanceId)
        if i == '-':
            ClassID+=1
    return ""

I don’t want to use something like this. I would like to use regular expressions.

>Solution :

The regex pattern: /(\d{10})-(\d{9}) the brackets are needed to identify the groups of digits, the {} specifies an exact occurrence of a repetition, doc.

# urls separated by a white space
urls = 'https://primer.text.com/sdfg/8406758680-345386743-DSS1-S%20Jasd%12Odsfr%12Iwetds-Osdgf/ https://primer.text.com/sdfg/8945879094-849328844-DPE-S%20Jsdfe%12OIert-Isdfu/ https://primer.text.com/sdfg/8493093053-292494834-QW23%23Wsdfg%23Iprf%64Uiojn%32Asdfg-Werts/'

urls = urls.split() # as list

import re

ids = [re.search(r'/(\d{10})-(\d{9})', url).groups() for url in urls]
print(list(zip(*ids)))

Output

[('8406758680', '8945879094', '8493093053'), ('345386743', '849328844', '292494834')]

keyword-search

byMR

Published March 31, 2022

Add a comment

Grouping By bool produces wrong result

byMR

March 31, 2022

Questions

Error in getOptionChain expiry date for multiple tickers

byMR

March 31, 2022

Questions

equality operator with tuple: 'a', 'b' == ('a', 'b')

byMR

March 31, 2022

Questions

How can a centered <hr> be absolutely placed within a div, without losing its center alignment?

byMR

March 31, 2022

Questions

How to Make Laravel Eloquent "AND" in a Query?

byMR

March 31, 2022

Extract keywords from links

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

Grouping By bool produces wrong result

Error in getOptionChain expiry date for multiple tickers

equality operator with tuple: 'a', 'b' == ('a', 'b')

How can a centered <hr> be absolutely placed within a div, without losing its center alignment?

How to Make Laravel Eloquent "AND" in a Query?

Keep Up to Date with the Most Important News

Extract keywords from links

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

Grouping By bool produces wrong result

Error in getOptionChain expiry date for multiple tickers

equality operator with tuple: 'a', 'b' == ('a', 'b')

How can a centered <hr> be absolutely placed within a div, without losing its center alignment?

groupby and select max id from object in react native

How to Make Laravel Eloquent "AND" in a Query?

Discover more from Dev solutions