Welcome to the Zomi‑Syl Documentation

zomi-syl is a modular, dialect‑aware Zomi syllabification library with support for multiple backends, dialect profiles, benchmarking tools, and a clean developer workflow. This documentation provides an overview of the system, how to use it, and how to extend it with new models.


🚀 What is Zomi‑Syl?

Zomi‑Syl provides:

  • A unified syllabification API
  • Multiple backends (rule‑based, CRF, transformer‑ready)
  • Dialect‑aware syllabification
  • A full CLI for analysis, benchmarking, and diagnostics
  • Developer tooling for adding new backends
  • Training and evaluation scripts

It is designed for linguists, developers, and researchers working with Zomi language processing.


📦 Installation

pip install zomi-syl

🧠 Quick Start

Syllabify a word

zomi-syl syllabify itna

Analyze a word

zomi-syl analyze itna --json

Python API

import zomi_syl as zs

zs.syllabify("itna")
zs.analyze("itna")

🧩 Key Components

Backends

Zomi‑Syl supports multiple syllabification backends:

  • Rule‑based
  • CRF
  • Transformer‑ready

See: Backend Overview

Dialect Profiles

Profiles define onsets, nuclei, codas, and rules for each dialect.

See: Dialect Profiles

CLI Tools

The CLI provides:

  • Syllabification
  • Analysis
  • Batch processing
  • Benchmarking
  • Backend comparison
  • Diagnostics

See: Top‑Level Commands

Developer Guides

Zomi‑Syl includes extensive developer documentation:

  • Adding new backends
  • Unified Metadata Schema (UMS)
  • CRF training
  • Test templates
  • Recommended folder structure

See: Developer Documentation


📊 Benchmarking

Benchmark a backend:

zomi-syl models benchmark crf

Compare multiple backends:

zomi-syl models compare rule crf

See: Benchmarking Guide


🛠 Adding New Models

Zomi‑Syl is designed for extensibility.
New backends can be added without modifying the core engine.

See: Adding New Backends


📚 Documentation Structure

This documentation includes:

  • User Guide
  • Installation
  • CLI usage
  • Python API
  • Dialect profiles

  • Developer Guide

  • Backend architecture
  • Unified Metadata Schema
  • Adding new backends
  • Testing
  • Training CRF models

  • Reference

  • CLI command reference
  • Folder structure
  • Release checklist

🧪 Testing

Run all tests:

pytest

Golden CRF regression data:

tests/golden/crf_golden.tsv

📦 Release Notes

Changelog is generated automatically:

make changelog

See: [Looks like the result wasn't safe to show. Let's switch things up and try something else!]


📜 License

MIT License — see LICENSE.


🙌 Contributing

See: [Looks like the result wasn't safe to show. Let's switch things up and try something else!]