XPath: contains() on an XML document to find URLs, but it actually finds more

January 4, 2022

I’m trying to find the correct XPath expression to get only the URLs from all my documents, whatever the tag is. I’m testing with this document:

<urlset xmlns="https://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>https://url
    </loc>
    <lastmod>2019-08-07T15:01:51+00:00
    </lastmod>
  </url>
</urlset>

The following expression gives me these results:

//*[contains(., 'http')]//text()

https://url

2019-08-07T15:01:51+00:00

What I want is to get rid of the second line. I need to be able to get only URLs from any XML file.

>Solution :

Well, let’s ignore the fact that not all URLs contain "http" and not everything that contains "http" is a URL…

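To find all text nodes containing "http", just use //text()[contains(., 'http')]. Your original expression returns more because the string value of an element includes the text of all its descendants: <urlset> and <url> also "contain" http, so //text() then returns every text node beneath them, including the date.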
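As a rough illustration of the same idea outside an XPath engine, here is a minimal Python sketch using only the standard library. It assumes you are happy to emulate //text()[contains(., 'http')] by iterating over text nodes, since ElementTree's limited XPath subset does not support contains(). The function name texts_containing and the SAMPLE constant are made up for this example.

```python
import xml.etree.ElementTree as ET

# Sample document from the question (hypothetical constant name).
SAMPLE = """<urlset xmlns="https://www.sitemaps.org/schemas/sitemap/0.9">
  <url>
    <loc>https://url
    </loc>
    <lastmod>2019-08-07T15:01:51+00:00
    </lastmod>
  </url>
</urlset>"""


def texts_containing(xml_source, needle="http"):
    """Return the stripped text nodes that contain `needle`, whatever the tag."""
    root = ET.fromstring(xml_source)
    # itertext() yields each text node on its own, in document order,
    # so a parent element never "inherits" the text of its children here.
    return [text.strip() for text in root.itertext() if needle in text]


print(texts_containing(SAMPLE))  # ['https://url']
```

The namespace URI is not matched because it is an attribute-like declaration, not a text node. As noted above, a substring test on "http" is still only an approximation of "is a URL".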

Or you could find

by MR