The structural Markdown layer (Markdown Metrics) is deliberately language-opaque. The prose layer adds language-aware signals on top of the same AST: readability formulas, lexical diversity, wording quality, Japanese script composition, and JTF conformance.

Architectural constraints

  1. Layered, not folded. Prose metrics are a separate top-level section in the output schema. They never silently modify the DMI, MCC, MRPC, or Filler / Lazy Risk weights.
  2. Per-block language tag. Language detection runs per Markdown block (paragraph, heading, list item, blockquote), not per document.
  3. Structural artifacts stay excluded. Code fences, inline code, link destinations, image alt-text, YAML/TOML/JSON front matter, HTML/MDX, and table delimiters are stripped before any readability or wording calculation.
  4. Short-text refusal. Grade-level formulas are suppressed when words < 100 OR sentences < 5.
  5. Feature-gated dictionaries. Dictionary-dependent features ship behind Cargo --features flags so the default binary stays small.
  6. Deterministic and reproducible. No network, no cloud, no sampling.
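
The short-text refusal rule (constraint 4) can be sketched as a gate in front of a grade-level formula. This is a minimal illustration, not the project's actual code: the names `is_too_short` and `fkgl` are hypothetical, and the Flesch-Kincaid Grade Level is used only as a representative grade-level formula with its standard published coefficients.

```rust
/// Thresholds from the short-text refusal rule (constraint 4).
const MIN_WORDS: usize = 100;
const MIN_SENTENCES: usize = 5;

/// True when a text is too short for a grade-level formula to be meaningful.
/// Hypothetical helper name, chosen for this sketch.
pub fn is_too_short(words: usize, sentences: usize) -> bool {
    words < MIN_WORDS || sentences < MIN_SENTENCES
}

/// Flesch-Kincaid Grade Level, or `None` when the short-text rule applies.
/// Hypothetical signature; the counts are assumed to be computed upstream.
pub fn fkgl(words: usize, sentences: usize, syllables: usize) -> Option<f64> {
    if is_too_short(words, sentences) {
        return None; // suppressed rather than reported as a misleading number
    }
    // Past the gate, words >= 100 and sentences >= 5, so neither divisor is zero.
    let words_per_sentence = words as f64 / sentences as f64;
    let syllables_per_word = syllables as f64 / words as f64;
    Some(0.39 * words_per_sentence + 11.8 * syllables_per_word - 15.59)
}
```

With these definitions, `fkgl(80, 6, 120)` returns `None` because the text has fewer than 100 words, while `fkgl(120, 6, 180)` returns a grade of about 9.91. Returning `Option` rather than a sentinel value keeps a suppressed score from being mistaken for a real one downstream.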

Tier model

| Tier | Cargo features | Adds | Binary cost |
|---|---|---|---|
| 0 (default) | none | Unicode-block language detection; UAX #29 segmentation; vowel-group syllables; Tateishi RS; basic wording heuristics; JTF mechanical checks | ~100–300 KB |
| 1a | syllables-cmu | CMU Pronouncing Dictionary | +1–2 MB |
| 1b | japanese-jouyou | Jōyō grade proxy, hyōgai ratio | +10 KB |
| 1c | japanese-jlpt | JLPT N5–N1 word and kanji bands | +300 KB |
| 1d | lingua | High-accuracy trigram language detection | +2–5 MB |
| 2a | japanese-morph | Lindera + IPADIC, bunsetsu, POS, Shibasaki grade | +50 MB |
| 2b | japanese-unidic | Vibrato + UniDic; jReadability | external dict |
| 2c | lexical-diversity | MTLD, HD-D, Yule’s K | +50 KB |
| 2d | vale-rules | Parse vale-compatible YAML rule packs | +200 KB |
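
As a rough sketch of how a tier boundary might look in code, the example below pairs the Tier 0 vowel-group syllable estimate with an optional dictionary path compiled only under the `syllables-cmu` feature. The function names are hypothetical, and `cmu_lookup` is a two-entry stand-in, not the CMU Pronouncing Dictionary integration itself.

```rust
/// Tier 0 estimate: count maximal runs of vowels (a, e, i, o, u, y).
/// Every non-empty word is treated as having at least one syllable.
pub fn vowel_group_syllables(word: &str) -> usize {
    let mut groups = 0;
    let mut prev_was_vowel = false;
    for c in word.chars() {
        let is_vowel = matches!(c.to_ascii_lowercase(), 'a' | 'e' | 'i' | 'o' | 'u' | 'y');
        if is_vowel && !prev_was_vowel {
            groups += 1; // a new vowel run starts here
        }
        prev_was_vowel = is_vowel;
    }
    groups.max(1)
}

/// Hypothetical stand-in for a dictionary lookup; only compiled when the
/// crate is built with `--features syllables-cmu`.
#[cfg(feature = "syllables-cmu")]
fn cmu_lookup(word: &str) -> Option<usize> {
    match word {
        "queue" => Some(1),
        "naive" => Some(2),
        _ => None,
    }
}

/// Tier 1a build: prefer the dictionary, fall back to the heuristic.
#[cfg(feature = "syllables-cmu")]
pub fn syllables(word: &str) -> usize {
    cmu_lookup(word).unwrap_or_else(|| vowel_group_syllables(word))
}

/// Default build: heuristic only, so no dictionary data is linked in.
#[cfg(not(feature = "syllables-cmu"))]
pub fn syllables(word: &str) -> usize {
    vowel_group_syllables(word)
}
```

Because the dictionary path is removed at compile time rather than switched off at run time, the default binary carries none of its data, which is the point of the feature-gating constraint.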

Pages

| Page | Purpose |
|---|---|
| Block-level language detection | Per-block English/Japanese/other tagging. |
| English readability ensemble | Flesch, FKGL, Fog, SMOG, ARI, Coleman-Liau, Dale-Chall, FORCAST, LIX/RIX. |
| Lexical diversity | MATTR, hapax, density, sentence/word moments. |
| Wording quality | Passive, hedges, weasels, wordy, adverbs, nominalizations, cliches, illusions. |
| Inclusive language | alex / retext-equality flags. |
| Japanese script composition | Kanji/hiragana/katakana ratios, registers. |
| Tateishi RS + Jōyō grade | Japanese readability formulas. |
| JTF rules | Japan Translation Federation 12 rules. |
| textlint-ja subset | Selected textlint-rule-preset-ja-technical-writing rules. |
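
To make the script-composition idea concrete, here is a minimal sketch that classifies characters by Unicode block and derives a kanji ratio. It assumes only the basic Hiragana, Katakana, and CJK Unified Ideographs blocks; the type and function names are hypothetical, and the real page may count extension blocks, punctuation, or half-width forms differently.

```rust
/// Per-script character counts for one block of text (hypothetical type).
#[derive(Debug, Default, PartialEq)]
pub struct ScriptCounts {
    pub kanji: usize,
    pub hiragana: usize,
    pub katakana: usize,
    pub other: usize,
}

/// Classify each non-whitespace character by its Unicode block.
pub fn count_scripts(text: &str) -> ScriptCounts {
    let mut counts = ScriptCounts::default();
    for c in text.chars().filter(|c| !c.is_whitespace()) {
        match c {
            '\u{3040}'..='\u{309F}' => counts.hiragana += 1, // Hiragana block
            '\u{30A0}'..='\u{30FF}' => counts.katakana += 1, // Katakana block
            '\u{4E00}'..='\u{9FFF}' => counts.kanji += 1,    // CJK Unified Ideographs
            _ => counts.other += 1,
        }
    }
    counts
}

/// Share of kanji among all counted characters, or `None` for empty input.
pub fn kanji_ratio(counts: &ScriptCounts) -> Option<f64> {
    let total = counts.kanji + counts.hiragana + counts.katakana + counts.other;
    if total == 0 {
        None
    } else {
        Some(counts.kanji as f64 / total as f64)
    }
}
```

For the input `漢字とカタカナ`, `count_scripts` reports 2 kanji, 1 hiragana, and 4 katakana. The same block test could also serve as a dictionary-free language hint for a Markdown block, in the spirit of the Tier 0 Unicode-block detection.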