FINDING · EVALUATION

Among inaccessible URLs that also triggered OONI anomalies, approximately 58% were generated by the Top2Vec-Trends pipeline (combining Top2Vec topic modeling with Google Trends keyword expansion), while LDA-TFIDF and Top2Vec alone each accounted for only 13–14%. BERTopic-generated pages were least effective at producing censored candidates.
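To make the denominator explicit, the percentages above can be read as each pipeline's share of the URLs that were both inaccessible and flagged by OONI. A minimal sketch of that arithmetic follows, assuming each candidate URL is attributed to exactly one pipeline. The record fields (`pipeline`, `accessible`, `ooni_anomaly`) and the toy records are hypothetical illustrations, not the paper's data or code.

```python
from collections import Counter


def pipeline_shares(records):
    """Return each pipeline's fraction of URLs that are both
    inaccessible and flagged as an OONI anomaly.

    `records` is a list of dicts with hypothetical keys:
    'pipeline' (str), 'accessible' (bool), 'ooni_anomaly' (bool).
    """
    # Count only URLs that failed to load AND triggered an anomaly.
    hits = Counter(
        r["pipeline"]
        for r in records
        if not r["accessible"] and r["ooni_anomaly"]
    )
    total = sum(hits.values())
    if total == 0:
        return {}
    return {name: count / total for name, count in hits.items()}


# Toy records for illustration only; these are NOT figures from the paper.
toy_records = [
    {"pipeline": "Top2Vec-Trends", "accessible": False, "ooni_anomaly": True},
    {"pipeline": "Top2Vec-Trends", "accessible": False, "ooni_anomaly": True},
    {"pipeline": "Top2Vec-Trends", "accessible": False, "ooni_anomaly": True},
    {"pipeline": "LDA-TFIDF", "accessible": False, "ooni_anomaly": True},
    {"pipeline": "Top2Vec", "accessible": True, "ooni_anomaly": False},
    {"pipeline": "BERTopic", "accessible": False, "ooni_anomaly": False},
]

shares = pipeline_shares(toy_records)
for name, share in sorted(shares.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {share:.0%}")
```

Pipelines with no qualifying URLs are simply absent from the result, so a share of zero and "not evaluated" look the same here; a fuller analysis would need to distinguish the two.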

From 2024-tang-automatic · Automatic Generation of Web Censorship Probe Lists · §5.4 · 2024 · Privacy Enhancing Technologies

Implications

Tags

censors
cn
techniques
measurement-platform

Extracted by claude-sonnet-4-6 — review before relying.