Jinhang Jiang
Dec 3, 2022

--

I’m not really sure what your data looks like in this case. As far as I understand, utf8 encoding probably won’t display properly. If the built-in txt reader doesn’t work for you, you perhaps need to find a different way to read the data into pandas to use our package. For the text cleaning, all the bad Unicode can be removed using the preprocessing function. I hope this answers your questions.

--

--

Jinhang Jiang
Jinhang Jiang

No responses yet