this post was submitted on 21 Jun 2023
660 points (100.0% liked)

Reddit Migration

21 readers
2 users here now

### About Community Tracking and helping #redditmigration to Kbin and the Fediverse. Say hello to the decentralized and open future. To see latest reeddit blackout info, see here: https://reddark.untone.uk/

founded 1 year ago
 

r/ModCoord: The admins in charge of demodded subreddits are mass-removing images of Huffman previously shared on them

the OP asked how are they gonna remove them all, so I replied they'll run a query for terms like "fuck spez". only to find my comment invisible, because they're already blocking it. spez protective word filter, it's active right now.

you are viewing a single comment's thread
view the rest of the comments
[–] innrautha@kbin.social 2 points 1 year ago (1 children)

Run the text through a normalizer that strips combining characters then run your filter.

[–] Trebach@kbin.social 2 points 1 year ago (1 children)

There are characters that have two or three combining diacritic marks already included, so if you start with one of those, it'll make it harder to filter out.

[–] argv_minus_one@beehaw.org 1 points 1 year ago

That's what the normalizer is for. Normalize to NFD, strip combiners, map homoglyphs to ASCII, and look for ASCII “fuck spez”.