Unicode-Dammit

Sat 17 May 2025

title: "Unicode Dammit" author: "Rj" date: 2019-04-20 description: "-" type: technical_note draft: false


from bs4 import UnicodeDammit
print("inside main method")

dammit = UnicodeDammit(b"Sacr\xc3\xa9 bleu!")
inside main method
print(type(dammit))
<class 'bs4.dammit.UnicodeDammit'>
print(dammit.unicode_markup)
Sacré bleu!
print(type(dammit.unicode_markup))
<class 'str'>
print(dammit.original_encoding)
iso-8859-9
print(type(dammit.original_encoding))
<class 'str'>


Score: 5

Category: basics