Unicode-Dammit
Sat 17 May 2025
title: "Unicode Dammit"
author: "Rj"
date: 2019-04-20
description: "-"
type: technical_note
draft: false
from bs4 import UnicodeDammit
print("inside main method")
# Pass raw bytes; UnicodeDammit guesses their encoding and decodes them to str.
dammit = UnicodeDammit(b"Sacr\xc3\xa9 bleu!")
inside main method
print(type(dammit))
<class 'bs4.dammit.UnicodeDammit'>
print(dammit.unicode_markup)
Sacré bleu!
print(type(dammit.unicode_markup))
<class 'str'>
print(dammit.original_encoding)
iso-8859-9
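The encoding name reported by original_encoding is a guess, and it may differ depending on which character-detection library is installed alongside bs4. The guess matters because the same bytes decode to different text under different codecs. The following is a minimal standard-library sketch, not part of the original note, that decodes the same byte string by hand under UTF-8 and under ISO-8859-9. The variable names are chosen for illustration only.

```python
# The same bytes used above, decoded by hand with two different codecs.
raw = b"Sacr\xc3\xa9 bleu!"

# Under UTF-8, the pair \xc3\xa9 is a single two-byte character.
as_utf8 = raw.decode("utf-8")

# Under ISO-8859-9, every byte is its own character, so the pair
# becomes two separate characters.
as_latin5 = raw.decode("iso-8859-9")

print(as_utf8)                        # Sacré bleu!
print(as_latin5)                      # SacrÃ© bleu!
print(len(as_utf8), len(as_latin5))   # 11 12
```

If the decoded text shows a two-character sequence where a single accented letter was expected, the bytes were most likely UTF-8 read through a single-byte codec.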
print(type(dammit.original_encoding))
<class 'str'>
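When the encoding is already known or strongly suspected, UnicodeDammit accepts a list of candidate encodings as its second positional argument and tries those before falling back to detection. The sketch below assumes the input really is UTF-8. The choice of ["utf-8"] as the candidate list is an illustration, not something taken from the original note.

```python
from bs4 import UnicodeDammit

raw = b"Sacr\xc3\xa9 bleu!"

# Second positional argument: encodings to try before any auto-detection.
# Assumption for this sketch: the bytes are known to be UTF-8.
dammit = UnicodeDammit(raw, ["utf-8"])

print(dammit.unicode_markup)     # decoded str
print(dammit.original_encoding)  # the candidate that decoded successfully
```

Supplying candidates this way makes the result independent of whichever detection library happens to be installed.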
Category: basics