Home Select all <table> elements without classes or ids with BeautifulSoup

Questions

Select all <table> elements without classes or ids with BeautifulSoup

January 30, 2024

I am trying to select all <table> elements on some web pages with BeautifulSoup. The table elements do not have specific classes or ids.

import bs4
import requests

def get_keycode_soup(url):
    res = requests.get(url)
    res.raise_for_status()
    return bs4.BeautifulSoup(res.text, features="html.parser")

def parse_qmk_soup():
    qmk_soup = get_keycode_soup("https://docs.qmk.fm/#/keycodes")
    tables = qmk_soup.select("table")
    # pass line for breakpoint
    pass

def main():
    parse_qmk_soup()

if __name__ == "__main__":
    main()

I have also tried selecting all the different table elements with

tables = qmk_soup.find_all("table")
# and
table_rows = qmk_soup.find_all("tr")

Whenever I pause the debugger on the pass line, tables is always None.

I have tried some similar methods to this post and this post, but since there do not appear to be any other descriptive tags on the tables I’m trying to select, iterating feels inefficient.

Is there a way to simply select all the <table> elements on their own?

Edit: it appears that the page requires JS to load the tables as suggested by @DeepSpace below. Additionally, see the answer from @MendelG regarding following where the data is loaded from in case you might obtain the data from the source.

>Solution :

If you inspect your browser’s Network calls, and view the HTTP requests, you’ll see that the data is loaded from a different website URL, which is:

https://docs.qmk.fm/keycodes.md?cache-bust=1706627991267

The thing is, it’s really a markdown file (.md). However, at least you obtain the original file

So, there isn’t really any HTML to parse, to obtain it in a readable format.

html-parsing

byMR

Published January 30, 2024

Add a comment

How does the exit code of “test” act as a “if” condition?

byMR

January 30, 2024

Questions

New ggplot Error in `palette()` with code I have been running for months

byMR

January 30, 2024

Questions

Why is array.fill() not returning the modified array?

byMR

January 30, 2024

Questions

change the key value of only the first object in the array react

byMR

January 30, 2024

Questions

How to get values from a Stripe\Collection JSON output

byMR

January 30, 2024

Questions

selecting an element by a class name with a dot in it with Cheerio

byMR

January 30, 2024

Select all <table> elements without classes or ids with BeautifulSoup

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

How does the exit code of “test” act as a “if” condition?

New ggplot Error in `palette()` with code I have been running for months

Why is array.fill() not returning the modified array?

change the key value of only the first object in the array react

How to get values from a Stripe\Collection JSON output

selecting an element by a class name with a dot in it with Cheerio

Keep Up to Date with the Most Important News

Select all <table> elements without classes or ids with BeautifulSoup

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

How does the exit code of “test” act as a “if” condition?

New ggplot Error in `palette()` with code I have been running for months

Why is array.fill() not returning the modified array?

change the key value of only the first object in the array react

How to get values from a Stripe\Collection JSON output

selecting an element by a class name with a dot in it with Cheerio

Discover more from Dev solutions