benchmarks
Benchmark suite results
Token reduction
54.12%
balanced mode, benchmark suite
Character reduction
58.23%
balanced mode, benchmark suite
Line reduction
66.67%
fixed benchmark fixtures
Warning preservation
100%
heuristic checks; review critical output
The repository benchmark uses fixed baseline and Noyap-mode responses. Token counts use gpt-tokenizer. Meaning, warning, and language checks are heuristic, so flagged or critical cases still deserve human review.
measured
What the benchmark measures
✔ token reduction
✔ character reduction
✔ line reduction
✔ meaning preservation
✔ warning preservation
✔ Thai and English quality checks
repo files
Benchmark artifacts
github
summary.md https://github.com/ppwnr88/noyap/blob/main/benchmarks/results/summary.md
results.json https://github.com/ppwnr88/noyap/blob/main/benchmarks/results/results.json
cases.json https://github.com/ppwnr88/noyap/blob/main/benchmarks/cases.json