FINDING · EVALUATION

A proposed HTTP censorship detection algorithm combining status-code comparison, response-length Z-score, HTML TF-vector cosine similarity, and redirect-hostname matching achieves F1 scores of 0.83 (censored) and 0.77 (uncensored), outperforming OONI (0.80 / 0.70), length-difference methods (0.70 / 0.66), and HTML-similarity methods (0.52 / 0.34) on a manually annotated set of 3,000 responses across six Indian ISPs.
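The four signals named above can be sketched as follows. This is a minimal illustration, not the paper's implementation: the response representation (a dict with `status`, `body`, `location`), the tokenizer, the helper names, and the thresholds and combination rule in `flag_censored` are all assumptions made for this example.

```python
import math
import re
from collections import Counter
from urllib.parse import urlsplit


def length_z_score(test_len, control_lens):
    """Z-score of a test response length against control response lengths."""
    mean = sum(control_lens) / len(control_lens)
    var = sum((x - mean) ** 2 for x in control_lens) / len(control_lens)
    std = math.sqrt(var)
    if std == 0:
        # All controls have the same length: any deviation is unbounded.
        return 0.0 if test_len == mean else math.inf
    return (test_len - mean) / std


def tf_vector(html):
    """Term-frequency vector over lowercase alphanumeric tokens (assumed tokenizer)."""
    return Counter(re.findall(r"[a-z0-9]+", html.lower()))


def cosine_similarity(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def redirect_host(location):
    """Lowercased hostname of a Location header value, or '' if absent."""
    return (urlsplit(location).hostname or "").lower() if location else ""


def signals(test, control, control_lens):
    """Compute the four comparison signals for one test/control response pair."""
    return {
        "status_differs": test["status"] != control["status"],
        "length_z": length_z_score(len(test["body"]), control_lens),
        "html_similarity": cosine_similarity(
            tf_vector(test["body"]), tf_vector(control["body"])
        ),
        "redirect_host_differs": redirect_host(test.get("location"))
        != redirect_host(control.get("location")),
    }


def flag_censored(sig, z_limit=3.0, sim_floor=0.5):
    """Hypothetical combination rule; the thresholds are illustrative only."""
    return (
        sig["status_differs"]
        or sig["redirect_host_differs"]
        or (abs(sig["length_z"]) > z_limit and sig["html_similarity"] < sim_floor)
    )
```

In this sketch, `control_lens` would hold body lengths from several control fetches of the same URL, so that the Z-score reflects the page's normal length variation rather than a single comparison.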

From 2020-singh-india · How India Censors the Web · §4.3, Table 1 · 2020 · Web Science

Implications

Tags

censors
in
techniques
measurement-platform
keyword-filtering
rst-injection
packet-injection

Extracted by claude-sonnet-4-6 — review before relying.