Dynamic LDA applied to ICLab longitudinal data for India (2016–2020) identified 14 distinct censored topic clusters, including religious conflict, piracy, educational fraud, and political dissent. The input was the 677 URLs that were overtly censored at least once, out of 6,012 tested (11.3%). The model required monthly time slices; daily and weekly slices produced unstable results because the number of documents per slice fluctuated too widely.
From 2022-waheed-darwin-s — Darwin's Theory of Censorship: Analysing the Evolution of Censored Topics with Dynamic Topic Models
· §3.1, Table 1
· 2022
· Workshop on Privacy in the Electronic Society
Implications
Circumvention tools targeting India should prioritize coverage of domains related to religious conflict, political dissent, and piracy, which are among the censored topic clusters the model identified.
Monthly granularity is sufficient for tracking how censored topics in India evolve, and finer slices were unstable in this dataset; real-time blocklist updates may therefore be over-engineering.
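The granularity finding can be illustrated with a minimal sketch, which is not the authors' pipeline. It bins observation dates into daily, weekly, or monthly slices and reports the per-slice document counts, the kind of list that dynamic topic model implementations such as gensim's `LdaSeqModel` take as their `time_slice` argument. The function names `slice_counts` and `coefficient_of_variation`, and the use of the coefficient of variation as a stability indicator, are assumptions made for illustration; the source does not say how instability was measured.

```python
from collections import Counter
from datetime import date
from statistics import mean, pstdev


def slice_counts(dates, granularity):
    """Count documents per non-empty time slice, in chronological order.

    `granularity` is "daily", "weekly" (ISO weeks), or "monthly".
    Slices with no documents are omitted (a simplification).
    """
    def key(d):
        if granularity == "monthly":
            return (d.year, d.month)
        if granularity == "weekly":
            iso = d.isocalendar()
            return (iso[0], iso[1])  # (ISO year, ISO week number)
        if granularity == "daily":
            return (d.toordinal(),)
        raise ValueError(f"unknown granularity: {granularity!r}")

    counts = Counter(key(d) for d in dates)
    return [counts[k] for k in sorted(counts)]


def coefficient_of_variation(counts):
    """Population standard deviation divided by the mean of the counts.

    Used here as a rough, assumed indicator of how unevenly documents
    are spread across slices; higher means less even.
    """
    return pstdev(counts) / mean(counts)


if __name__ == "__main__":
    # Hypothetical observation dates, not taken from the ICLab data.
    observed = [date(2016, 1, 1), date(2016, 1, 15), date(2016, 2, 3)]
    for g in ("daily", "weekly", "monthly"):
        counts = slice_counts(observed, g)
        print(g, counts, coefficient_of_variation(counts))
```

On real measurement data, comparing the counts across granularities gives a quick check on whether finer slices leave too few, or too unevenly distributed, documents per slice before a dynamic topic model is fitted.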