Benchmarking Backends¶

Zomi‑Syl includes a built‑in benchmarking framework for evaluating syllabification backends.
You can benchmark a single backend, compare multiple backends, or run diagnostics to ensure models load correctly.

Benchmarking is essential for:

evaluating rule‑based vs CRF performance
validating new backends
regression testing
dialect‑specific evaluation
release readiness

📊 Benchmark a Single Backend¶

To benchmark one backend:

zomi-syl models benchmark crf

This runs:

syllable accuracy
boundary F1
confidence scoring (if supported)
dataset‑level summary

Benchmark a specific dialect:

zomi-syl models benchmark crf --dialect tedim

Output as JSON:

zomi-syl models benchmark crf --json

⚖️ Compare Multiple Backends¶

Compare two or more backends:

zomi-syl models compare rule crf

This produces a side‑by‑side comparison of:

accuracy
boundary F1
speed (if available)
metadata summary

Example:

zomi-syl models compare rule crf transformer

🌐 Compare All Backends¶

To benchmark every registered backend:

zomi-syl models compare --all

Useful for:

release validation
regression testing
backend development
model selection

🧪 Benchmark Datasets¶

Benchmarking uses datasets defined in:

src/zomi_syl/evaluation/

Default dataset:

crf_golden.tsv — gold standard regression set

Specify a dataset:

zomi-syl models benchmark crf --dataset crf_golden

Specify a dataset version:

zomi-syl models benchmark crf --dataset-version latest

🩺 Backend Diagnostics¶

Before benchmarking, run a full backend health check:

zomi-syl models doctor

This validates:

registry entries
metadata schema
model file presence
backend loadability
single prediction
batch prediction

This is especially important when:

adding new backends
modifying metadata
updating model files

See: Adding New Backends

📁 Benchmark Reports¶

Export JSON:

zomi-syl models benchmark crf --json > report.json

Advanced users can generate Markdown/HTML reports using the evaluation module.

🧠 Benchmarking in Python¶

You can also run benchmarks programmatically:

from zomi_syl.evaluation.benchmark import run_benchmark

result = run_benchmark(backend="crf", dialect="tedim")
print(result)

This returns:

accuracy
boundary F1
dataset version
backend metadata

🛠 Tips for Backend Developers¶

When developing a new backend:

Run diagnostics:

zomi-syl models doctor

Benchmark your backend:

zomi-syl models benchmark my_backend

Compare against existing backends:

zomi-syl models compare rule crf my_backend

Add regression tests using:

tests/backends/template_test_my_backend.py

See: Developer Documentation

Next Steps¶

Install the library: Installation
Learn the CLI: Top‑Level Commands
Explore dialect profiles: Dialect Profiles
Add your own backend: Adding New Backends

Keys	Action
`?`	Open this help
`n`	Next page
`p`	Previous page
`s`	Search