Getting non-English text from an HTML doc

July 19, 2022

I'm trying to get the title of an HTML document in Python, but I'm getting weird symbols. I guess that's because of the encoding, but the HTML document is in UTF-8.
Is there any way I can get normal letters?

Here is the code and what I'm getting:

from bs4 import BeautifulSoup

with open("index.html") as file:
    src = file.read()

soup = BeautifulSoup(src, "lxml")

title = soup.title.text

print(title)

Р“Р»Р°РІРЅР°СЏ СЃС‚СЂР°РЅРёС†Р°

Solution:

You need to specify the encoding when opening the file. Without it, open() falls back to the platform's default encoding, which is often not UTF-8 on Windows:

with open("index.html", encoding='utf-8') as file:
    src = file.read()
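
To see why the title comes out garbled, here is a minimal sketch that reproduces the effect with the standard library only. It assumes the original title was "Главная страница" (recovered by reversing the garbled output above) and that the default encoding on the asker's machine was cp1251. Both are guesses that fit the output shown, not facts from the question. The temporary file stands in for index.html.

```python
import tempfile
from pathlib import Path

# Assumed original title, recovered from the garbled output in the question.
title = "Главная страница"

with tempfile.TemporaryDirectory() as tmp:
    # Hypothetical stand-in for the asker's index.html, saved as UTF-8.
    path = Path(tmp) / "index.html"
    path.write_text(f"<title>{title}</title>", encoding="utf-8")

    # Simulate a platform default of cp1251 (an assumption about the asker's machine).
    with open(path, encoding="cp1251") as file:
        garbled = file.read()

    # Read the same bytes with the encoding the file was actually saved in.
    with open(path, encoding="utf-8") as file:
        fixed = file.read()

print(garbled)  # UTF-8 bytes misread as cp1251
print(fixed)    # readable Cyrillic text
```

The bytes on disk are the same in both reads. Only the codec used to turn them into text differs, which is why passing encoding='utf-8' to open() is enough to fix the output.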


by MR

Published July 19, 2022
