Pagination with BeautifulSoup in python

March 12, 2022

I am working on a web scraping project for this site:
https://yellowpages.com.eg/en/search/fast-food
I managed to scrape the data, but I am struggling with the pagination.
I want a loop that finds the Next button on each page and then uses the URL scraped from that button to repeat the same process on the following page.

import requests
from bs4 import BeautifulSoup

url = 'https://yellowpages.com.eg/en/search/fast-food'
while True:
    r = requests.get(url)
    soup = BeautifulSoup(r.content, 'lxml')
    pages = soup.find_all('ul', class_='pagination center-pagination')
    for page in pages:
        nextpage = page.find('li', class_='waves-effect').find('a', {'aria-label': 'Next'})
    if nextpage:
        # Build the absolute URL of the next page from the scraped href
        uu = nextpage.get('href')
        url = 'http://www.yellowpages.com.eg' + str(uu)
        print(url)
    else:
        break

This code prints only the first next-page URL and then breaks out of the loop instead of continuing through the remaining pages.

Solution:

The problem is that

nextpage = page.find('li', class_='waves-effect').find('a', {'aria-label': 'Next'})

does return the Next button, but only while there is no Previous button. find() returns just the first matching li, so as soon as you leave the first page it picks the Previous item, the inner find() for the Next link returns None, and the loop breaks.

In contrast, page.find_all('li', class_='waves-effect') returns both the Previous and the Next items.
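The difference can be reproduced offline. The snippet below is a minimal sketch: the markup is a hypothetical stand-in for a middle page, modelled only on the selectors used in the question (the hrefs are made up), and it uses the built-in html.parser so that lxml is not required.

```python
from bs4 import BeautifulSoup

# Hypothetical pagination markup for a page that has both buttons.
html = """
<ul class="pagination center-pagination">
  <li class="waves-effect"><a aria-label="Previous" href="/page-1">prev</a></li>
  <li class="waves-effect"><a aria-label="Next" href="/page-3">next</a></li>
</ul>
"""
page = BeautifulSoup(html, "html.parser").find("ul", class_="pagination")

# find() stops at the first matching <li>, which here holds the Previous link.
first_li = page.find("li", class_="waves-effect")
first_next = first_li.find("a", {"aria-label": "Next"})

# find_all() returns every matching <li>; the last one holds the Next link.
all_li = page.find_all("li", class_="waves-effect")
last_next = all_li[-1].find("a", {"aria-label": "Next"})

print(first_next)         # None: no Next link inside the Previous item
print(last_next["href"])  # href of the Next link
```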

To get the Next button more reliably (this still depends on the site's markup, so treat it as a best guess), change that line to

nextpage = page.find_all('li', class_='waves-effect')[-1].find('a', {'aria-label': 'Next'})
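A possible alternative, not part of the answer above, is to skip the li lookup and search for the anchor by its aria-label directly, so the result does not depend on the position of the list items. The sketch below assumes the Next link always carries aria-label="Next". The helper names find_next_url and iter_page_urls, the fetch parameter, and the example.com pages are all hypothetical and exist only so the loop can be run without network access.

```python
from urllib.parse import urljoin
from bs4 import BeautifulSoup


def find_next_url(html, current_url):
    """Return the absolute URL of the 'Next' link, or None if there is none."""
    soup = BeautifulSoup(html, "html.parser")
    # Look the anchor up by its aria-label instead of by <li> position.
    link = soup.find("a", attrs={"aria-label": "Next"})
    if link is None or not link.get("href"):
        return None
    # urljoin handles both relative and absolute hrefs.
    return urljoin(current_url, link["href"])


def iter_page_urls(start_url, fetch):
    """Yield each page URL in order; `fetch` maps a URL to its HTML text."""
    url, seen = start_url, set()
    while url and url not in seen:  # `seen` guards against pagination cycles
        seen.add(url)
        yield url
        url = find_next_url(fetch(url), url)


# Hypothetical two-page site used only to exercise the functions offline.
FAKE_SITE = {
    "https://example.com/list": (
        '<ul class="pagination"><li class="waves-effect">'
        '<a aria-label="Next" href="/list/p2">next</a></li></ul>'
    ),
    "https://example.com/list/p2": (
        '<ul class="pagination"><li class="waves-effect">'
        '<a aria-label="Previous" href="/list">prev</a></li></ul>'
    ),
}

urls = list(iter_page_urls("https://example.com/list", FAKE_SITE.__getitem__))
print(urls)
```

Against the real site, fetch could be something like lambda u: requests.get(u).text, with the per-page scraping done inside the loop over iter_page_urls.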

web-scraping-language

byMR

Published March 12, 2022
