Follow

Follow

Contact

Home Webscrape a table with BeautifulSoup

Questions

Webscrape a table with BeautifulSoup

byMR

March 25, 2022

I’m trying to get the tables (and then the tr and td contents) with requests and BeautifulSoup from this link: https://www.basketball-reference.com/teams/PHI/2022/lineups/ , but I get no results.

I tried with:

import requests
from bs4 import BeautifulSoup

url = "https://www.basketball-reference.com/teams/PHI/2022/lineups/"
page = requests.get(url)
soup = BeautifulSoup(page.text, 'html.parser') 

tables = soup.find_all('table')

However the result of tables is [].

>Solution :

It looks like the tables are placed in the comments, so you have to adjust the response text:

page = page.text.replace("<!--","").replace("-->","")
soup = BeautifulSoup(page, 'html.parser')

Example

import requests
from bs4 import BeautifulSoup
import pandas as pd

url = "https://www.basketball-reference.com/teams/PHI/2022/lineups/"
page = requests.get(url)
page = page.text.replace("<!--","").replace("-->","")
soup = BeautifulSoup(page, 'html.parser') 

tables = soup.find_all('table')

python-requests

byMR

Published March 25, 2022

Add a comment

Leave a ReplyCancel reply

Read more

Questions

cant access json array using php

byMR

March 25, 2022

Questions

Parse an array of json object using jq

byMR

March 25, 2022

Questions

Class method won't give the right output

byMR

March 25, 2022

Questions

Clear inline style after reset

byMR

March 25, 2022

Questions

In general, static languages are type checked at compile time. Is typescript also type checked at compile time?

byMR

March 25, 2022

Questions

SQL to fetch value of one column such that a certain value in another column does not exist

byMR

March 25, 2022