Does anybody know of tables of Unicode codepoint frequency in real-world text for use in building compression tables?

I could calculate it based on the material I have available to me, but that seems like it's going to disadvantage a bunch of people that don't speak English.

Follow

@isomer it's not clear what you want to weigh with

Sign in to participate in the conversation
Qoto Mastodon

QOTO: Question Others to Teach Ourselves
An inclusive, Academic Freedom, instance
All cultures welcome.
Hate speech and harassment strictly forbidden.