XPath: How do I capture the previous element?

January 25, 2022

I have the following markup:

<p>File name</p>
<a href="https://somelink.pdf">Download</a>

I need to capture the link (the a element) and its name (the preceding p element) using CSS and XPath. First, I use a CSS selector to find all links whose href value ends in .pdf (a[href$=".pdf"]):

for i in response.css('a[href$=".pdf"]'):
    link = i.css('::attr("href")').get()
    name = i.xpath(?????????)
    print(name, link)

How do I capture the text in the p element using XPath?

Solution:

Starting from `a`

This XPath,

//a[.="Download"]/preceding-sibling::p[1]

will select, for each a element whose string value equals "Download", the nearest preceding sibling p element.

Starting from `p`

This XPath,

//p[.="File name"]/following-sibling::a[1]

will select, for each p element whose string value equals "File name", the nearest following sibling a element.

In either case, you can select the text node child by appending /text() to the XPaths.

scrapy

by MR
