FINDING · DETECTION

Information Gain feature selection from 408 candidates identified informal language markers (informal, nonflu, swear), Chinese modal and general particles signaling mood and relational framing, and physical-feeling words used metaphorically as the top predictors of censored Weibo content — all with statistically significant differences between censored and uncensored classes.

From 2018-ng-detectingDetecting Censorable Content on Sina Weibo: A Pilot Study · §5, §6 · 2018 · Hellenic Conference on Artificial Intelligence

Implications

Tags

censors
cn
techniques
keyword-filteringml-classifier

Extracted by claude-sonnet-4-6 — review before relying.