FINDING · EVALUATION

API and WebUI interfaces show statistically significant response length differences in opposite directions across models. Gemini API responses averaged 2,333 characters vs. 1,746 for WebUI (34% longer API; t=5.028, p<0.0001, Cohen's d=0.50). ChatGPT WebUI responses averaged 2,752 characters vs. 1,389 for API (98% longer WebUI; t=-9.800, p<0.0001, d=-0.98). The divergent direction across models suggests fundamentally different generation parameters rather than simple post-hoc filtering, indicating architectural or policy-level differences at the provider level.

From 2026-lipphardt-dualDual Standards: Examining Content Moderation Disparities Between API and WebUI Interfaces in Large Language Models · §4.6 · 2026 · Free and Open Communications on the Internet

Implications

Tags

censors
generic
techniques
ml-classifier

Extracted by claude-sonnet-4-6 — review before relying.