FINDING · DETECTION
The classifier uses a 3,000-dimension binary vector recording which upstream and downstream packet sizes appear across the full session, combined with aggregate biflow statistics (total packets, burst length, transmission time, incoming/outgoing fractions). This packet-size histogram is the highest-dimensionality feature in the set.
From 2017-deng-random — The Random Forest based Detection of Shadowsock's Traffic · §IV.B, Table 1 · 2017 · Intelligent Human-Machine Systems and Cybernetics
Implications
- Padding packets to a small set of fixed sizes (e.g., fixed 1460-byte MTU fills) would collapse the 3,000-dim size-histogram to near-zero entropy, defeating this feature class entirely.
- Shaping burst-length and inter-packet timing distributions to match benign HTTPS would neutralize the statistical biflow features used alongside the size histogram.
Tags
Extracted by claude-sonnet-4-6 — review before relying.