This metric addresses the AI-era documentation problem:
The document is just filler: structure is lazy, there are no references, it is large but useless.
Sub-scores (17.1–17.8)
| Sub-score | What it captures |
|---|---|
| UnanchoredProseMass | Fraction of words living in sections with no evidence anchors. |
| LowArtifactDensity | 1 − sat(A / (W/800); 0.5, 2.0): too few code blocks, tables, or diagrams (A) per 800 words (W). |
| LowRepoGrounding | 1 − RepositoryGroundingScore. |
| LazySectioning | Heading density, large-section rate, “shallow big doc” flag (W > 2,500 AND max heading depth ≤ 2). |
| RepetitionDensity | Near-duplicate paragraphs, detected when token-shingle Jaccard similarity exceeds 0.82. |
| SpecificityScarcity | Identifiers + paths + version tokens + inline code tokens relative to W. |
| ReferenceHollowness | Bibliography entries without verifiable DOI/arXiv/RFC/URL anchors. |
| PlaceholderDensity | TODO/TBD/FIXME/XXX/lorem and empty links per 1,000 words. |
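Three of the rules in the table are concrete enough to sketch in code. The Python below is an illustration, not the tool's implementation. It assumes that `sat(x; lo, hi)` is a linear ramp clamped to [0, 1], that shingles are word 3-grams, and that an empty document scores 0. None of these details are stated on this page, and all function names are hypothetical.

```python
import re


def sat(x, lo, hi):
    """Assumed saturation: 0 at or below lo, 1 at or above hi, linear between."""
    if hi <= lo:
        raise ValueError("hi must exceed lo")
    return min(1.0, max(0.0, (x - lo) / (hi - lo)))


def low_artifact_density(artifacts, words):
    """1 - sat(A / (W/800); 0.5, 2.0), with A = artifact count, W = word count."""
    if words <= 0:
        return 0.0  # assumption: an empty document carries no penalty
    return 1.0 - sat(artifacts / (words / 800), 0.5, 2.0)


def shingles(text, k=3):
    """Set of k-token shingles (k = 3 is an assumption, not from the page)."""
    tokens = re.findall(r"\w+", text.lower())
    return {tuple(tokens[i:i + k]) for i in range(len(tokens) - k + 1)}


def jaccard(a, b):
    """Jaccard similarity of two sets; 0.0 when both are empty."""
    if not a and not b:
        return 0.0
    return len(a & b) / len(a | b)


def near_duplicate_pairs(paragraphs, threshold=0.82, k=3):
    """Index pairs of paragraphs whose shingle Jaccard exceeds the threshold."""
    sets = [shingles(p, k) for p in paragraphs]
    return [
        (i, j)
        for i in range(len(sets))
        for j in range(i + 1, len(sets))
        if jaccard(sets[i], sets[j]) > threshold
    ]


def is_shallow_big_doc(words, max_heading_depth):
    """The 'shallow big doc' flag: W > 2,500 AND max heading depth <= 2."""
    return words > 2500 and max_heading_depth <= 2
```

Under these assumptions, a 1,600-word document needs at least four artifacts to drive LowArtifactDensity to 0, and a document with none scores 1.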
Formula
Bands
| Score | Band |
|---|---|
| 0.00 – 0.20 | Low. |
| 0.21 – 0.40 | Mild. |
| 0.41 – 0.60 | Review. |
| 0.61 – 0.80 | High. |
| 0.81 – 1.00 | Severe. |
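The band table can be read as a simple lookup. The sketch below assumes the score is rounded to two decimals before comparison, which is one way to close the gaps between adjacent ranges such as 0.20 and 0.21. The rounding rule and the `band` helper are assumptions, not documented behavior.

```python
# Upper bound of each band, in ascending order, taken from the table above.
BANDS = [
    (0.20, "Low"),
    (0.40, "Mild"),
    (0.60, "Review"),
    (0.80, "High"),
    (1.00, "Severe"),
]


def band(score):
    """Map a Filler Risk score in [0, 1] to its band name."""
    if not 0.0 <= score <= 1.0:
        raise ValueError("score must lie in [0, 1]")
    rounded = round(score, 2)  # assumption: two-decimal rounding
    for upper, name in BANDS:
        if rounded <= upper:
            return name
    return BANDS[-1][1]  # unreachable for valid input; kept as a safe default
```

For example, `band(0.5)` falls in the Review band and `band(0.81)` in Severe.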
Diagnostic labels
High scores attach stable string labels reviewers can act on:
- large-unanchored-prose
- low-repository-grounding
- lazy-sectioning
- low-artifact-density
- near-duplicate-paragraphs
- specificity-scarcity
- hollow-references
- placeholder-heavy
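One way to picture how labels attach is a per-sub-score threshold check. In the sketch below, the pairing of sub-scores to labels is inferred from their names, and the 0.6 cut-off is purely hypothetical. This page does not state what counts as a "high" sub-score.

```python
# Sub-score name -> diagnostic label (pairing inferred from the names).
LABELS = {
    "UnanchoredProseMass": "large-unanchored-prose",
    "LowRepoGrounding": "low-repository-grounding",
    "LazySectioning": "lazy-sectioning",
    "LowArtifactDensity": "low-artifact-density",
    "RepetitionDensity": "near-duplicate-paragraphs",
    "SpecificityScarcity": "specificity-scarcity",
    "ReferenceHollowness": "hollow-references",
    "PlaceholderDensity": "placeholder-heavy",
}


def diagnostic_labels(sub_scores, threshold=0.6):
    """Return labels for every sub-score above the (hypothetical) threshold."""
    return [
        label
        for name, label in LABELS.items()
        if sub_scores.get(name, 0.0) > threshold
    ]
```

Because the labels are plain strings in a fixed order, a reviewer or CI job can match on them without parsing the numeric sub-scores.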
Example output
References
- Pirolli, P. & Card, S. (1999). Information Foraging. Psychological Review 106(4): 643–675. Motivates the evidence-anchor and specificity-scarcity sub-scores. DOI.
- Halliday, M. A. K. (1985). Spoken and Written Language. Oxford University Press. Lexical-density basis used by SpecificityScarcity.
- Manning, C. D., Raghavan, P. & Schütze, H. (2008). Introduction to Information Retrieval, ch. 6. Cambridge University Press. Jaccard / token-shingle methods used by RepetitionDensity. Stanford online edition.
See also
- DMI — uses Filler Risk as one of its inputs.
- Repository Grounding — feeds LowRepoGrounding.
- Evidence Coverage — feeds UnanchoredProseMass.