FINDING · EVALUATION

Of 21.8 billion raw measurements, approximately 7% (1.5 billion) were initially flagged as blocked; iterative HTML clustering and DBSCAN image clustering then removed ~500 million false positives, leaving ~1 billion confirmed blocked measurements. The clustering process formed 457 new response clusters, of which 308 were confirmed blockpages and 149 were false positives, with Cloudflare bot-checks being a notable source of false positives in HTTPS measurements.

From 2020-raman-censoredCensored Planet: An Internet-wide, Longitudinal Censorship Observatory · §5.1.3, §5.1.4 · 2020 · Computer and Communications Security

Implications

Tags

censors
generic
techniques
measurement-platformdpi

Extracted by claude-sonnet-4-6 — review before relying.