TOON Benchmarks
Performance and efficiency comparisons against other popular data formats.
Efficiency Ranking
Each format's overall performance, balancing accuracy against token cost (accuracy per 1K tokens).
TOON achieves 73.9% accuracy (vs JSON's 69.7%) while using 39.6% fewer tokens.
Per-Model Accuracy
Accuracy across 4 LLMs on 209 data retrieval questions.
claude-haiku-4-5-20251001
gemini-2.5-flash
gpt-5-nano
grok-4-fast-non-reasoning
Token Efficiency
Token counts are measured using the GPT-5 o200k_base tokenizer. Savings are calculated against formatted JSON (2-space indentation) as the primary baseline. Actual savings vary by model and tokenizer.
Mixed-Structure Track
Datasets with nested or semi-uniform structures. CSV is excluded as it cannot properly represent these structures.
🛒 E-commerce orders with nested structures
Tabular: 33%
TOON
72,771 tokens
vs JSON (-33.1%)
vs JSON compact (+5.5%)
vs YAML (-14.2%)
vs XML (-40.5%)
JSON
108,806 tokens
JSON compact
68,975 tokens
YAML
84,780 tokens
XML
122,406 tokens
🧾 Semi-uniform event logs
Tabular: 50%
TOON
153,211 tokens
vs JSON (-15.0%)
vs JSON compact (+19.9%)
vs YAML (-0.8%)
vs XML (-25.2%)
JSON
180,176 tokens
JSON compact
127,731 tokens
YAML
154,505 tokens
XML
204,777 tokens
🧩 Deeply nested configuration
Tabular: 0%
TOON
631 tokens
vs JSON (-31.3%)
vs JSON compact (+11.9%)
vs YAML (-6.2%)
vs XML (-37.4%)
JSON
919 tokens
JSON compact
564 tokens
YAML
673 tokens
XML
1,008 tokens
Flat-Only Track
Datasets with flat tabular structures where CSV is a strong contender.
👥 Uniform employee records
Tabular: 100%
CSV
46,954 tokens
TOON
49,831 tokens
vs JSON (-60.7%)
vs JSON compact (-36.8%)
vs YAML (-50.0%)
vs XML (-66.0%)
JSON
126,860 tokens
JSON compact
78,856 tokens
YAML
99,706 tokens
XML
146,444 tokens