How to read text off a website using python (Simple explanation)

March 24, 2022

I’m looking to make a program that can get the text off a website when given the website’s URL. I would like to be able to get all the text between the <p> tags. Everything I have found online seems to overcomplicate this and involves some coding in C, which I am not well versed in. To summarize, this is what I would like the code to look like (best-case scenario). If there’s anything unclear in the question that I can clarify, please let me know in the comments.

import WebReader as WR

StringOfWebText = WR.getParagrahText("WebsiteURL")

>Solution :

You probably want to look into something like BeautifulSoup paired with requests. You can then extract text from a page with a simple solution like this:

import requests
from bs4 import BeautifulSoup

r = requests.get("https://google.com")
soup = BeautifulSoup(r.text, "html.parser")
print(soup.text)  # all visible text on the page, with the tags stripped

There’s also tag-searching and other useful features built into BS4, if you need to be able to handle that.
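
Since the question asks specifically for the text inside the <p> tags, the tag-searching mentioned above could be sketched roughly as follows. This is only a minimal illustration, not a definitive implementation: the function names `paragraph_text_from_html` and `get_paragraph_text` are hypothetical (chosen to echo the question's wished-for API), and the 10-second timeout is an arbitrary assumption.

```python
import requests
from bs4 import BeautifulSoup


def paragraph_text_from_html(html):
    """Return the text of every <p> element in an HTML string, one per line."""
    soup = BeautifulSoup(html, "html.parser")
    # find_all("p") returns every <p> tag in document order.
    # get_text(" ", strip=True) joins the text pieces inside each tag
    # with a single space and trims surrounding whitespace.
    paragraphs = [p.get_text(" ", strip=True) for p in soup.find_all("p")]
    return "\n".join(paragraphs)


def get_paragraph_text(url):
    """Download a page and return the text of its <p> elements.

    Hypothetical helper: the name and the timeout value are assumptions.
    """
    r = requests.get(url, timeout=10)
    r.raise_for_status()  # raise an exception on 4xx/5xx responses
    return paragraph_text_from_html(r.text)
```

Calling `get_paragraph_text("https://example.com")` would then return the page's paragraph text as a single string. Keeping the parsing in its own function means it can be tried on a literal HTML string without any network access.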

by MR

Published March 24, 2022
