FINDING · DETECTION
A Random Forest classifier with 100 CART trees and a sqrt(C) feature-selection strategy achieves over 85% accuracy detecting Shadowsocks traffic from biflow statistics. Accuracy increases monotonically with train-set and test-set size before plateauing.
From 2017-deng-random — The Random Forest based Detection of Shadowsock's Traffic · §V, Abstract · 2017 · Intelligent Human-Machine Systems and Cybernetics
Implications
- Biflow-level features (packet counts, burst statistics, timing) are sufficient for >85% RF classification — Shadowsocks operators must add traffic-shaping or padding to break these features.
- ML detector accuracy is train-set-limited at small scales, so obscuring flow shape during deployment limits effective adversarial retraining.
Tags
Extracted by claude-sonnet-4-6 — review before relying.