- Codec: `cp1254`;
- Aliases: `windows-1254`;
- Language: Turkish.
Okay, apparently my decision to add chardet to caterpillar turned out to be a dumpster fire. I've used uchardet to great success in the past, but I guess for short strings it's almost as good as a PRNG. I realized this as I released the feature, to which end [I quote](https://github.com/zmwangx/caterpillar/releases/tag/v0.8):
> Add encoding detection for batch manifest (with varying degrees of success due to sample size, better just use UTF-8).
In other words, it fucking, doesn't, work.
So I'm following my own advice now: just use UTF-8. v0.1.5 does just that. Hopefully it addresses the situation.
---
It would still help to see a sample though. I used to know the Python 3's Windows default encoding rules (yes, wrestled with it long before this), but in practice it seems all over the place.
If you know how to upload `m3u8.txt` to a gist without butchering the encoding (e.g., with [defunkt/gist](https://github.com/defunkt/gist)), please share it as a gist; otherwise, an ephemeral file sharing site like [file.io](https://www.file.io/) works for me too.
P.S. I've been very busy and off QQ for at least a month. I'll be back though... at some point.