FINDING · EVALUATION

Evidence from Youdao Translate suggests it deploys a machine-learning or NLP-based classifier alongside keyword rules: measured rules included repeated components (e.g., 螺+螺+螺+螺+螺+螺+蟢+D+哒+大) and nonsensical multi-token sequences that no human rule author would write, yet which consistently triggered censorship. Youdao returned 9,414 unique rules from the general test set — the most of any service — while also producing the most structurally anomalous rule patterns.

From 2024-ruo-lostLost in Translation: Characterizing Automated Censorship in Online Translation Services · §6 Results / §10 Future Work · 2024 · Free and Open Communications on the Internet

Implications

Tags

censors
cn
techniques
keyword-filteringml-classifier

Extracted by claude-sonnet-4-6 — review before relying.