Case Study: CableBurg ToS Contract Red-Flag Analysis
A real cable company's Terms of Service — anonymized as "CableBurg" — analyzed for consumer red flags. The same document, three approaches: a solo model baseline, and two Verdion tournament tiers.
↑ Select a tier to explore findings ↑
What it missed
- 33+ additional red flags — dense sections like Binding Arbitration treated as single items rather than broken into constituent risks
- Compound-risk analysis — no identification of how clauses interact to create cascading exposure (e.g., termination + debt collection + arbitration forming a suppression stack)
- Severity escalation — no consideration of how flags affect vulnerable populations differently
- Actionable deadline table — time-critical windows buried in prose rather than surfaced as a decision tool
- Self-improvement yielded no improvement — three iterations produced essentially the same output
View full transcript SVG
What the tournament found
- 3.75x more red flags — 45 vs ~12, each with specific clause citation, plain-English explanation, and severity rating
- Granular decomposition — complex clauses broken into individual constituent risks rather than summarized as one item
- MEDIUM-HIGH severity ratings — nuanced intermediate ratings the solo model never produced
- Adversarial refinement — judge models evaluated competing analyses and selected the most thorough, eliminating weak reasoning