Versioning

The project maintains three independent version streams: the specification, the wire protocol, and the reference compiler. Every package declares the protocol_version it targets so that consumers can negotiate compatibility.
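
As a rough illustration of what negotiating on protocol_version could look like on the consumer side, here is a minimal Python sketch. It rests on two assumptions that this page does not state: that protocol identifiers always have the shape ckf-MAJOR.MINOR (inferred from the single example ckf-1.0), and that a consumer accepts a package when the major numbers match and the package's minor is not newer than its own. The names parse_protocol_version and is_compatible are hypothetical, not part of any published API.

```python
import re
from typing import NamedTuple


class ProtocolVersion(NamedTuple):
    major: int
    minor: int


# Assumed format, inferred from the one example on this page ("ckf-1.0").
_PROTOCOL_RE = re.compile(r"ckf-(\d+)\.(\d+)")


def parse_protocol_version(value: str) -> ProtocolVersion:
    """Parse a string such as 'ckf-1.0' into (major, minor)."""
    match = _PROTOCOL_RE.fullmatch(value)
    if match is None:
        raise ValueError(f"unrecognized protocol_version: {value!r}")
    return ProtocolVersion(int(match.group(1)), int(match.group(2)))


def is_compatible(package_version: str, consumer_version: str) -> bool:
    """Hypothetical rule: same major, package minor not newer than consumer's."""
    package = parse_protocol_version(package_version)
    consumer = parse_protocol_version(consumer_version)
    return package.major == consumer.major and package.minor <= consumer.minor


if __name__ == "__main__":
    # A consumer built for ckf-1.0 reading a package that declares ckf-1.0.
    print(is_compatible("ckf-1.0", "ckf-1.0"))
```

A real consumer may well apply a looser or stricter rule; the point is only that the declared protocol_version, not the spec or compiler version, is the value to compare.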

Current versions

  • Specification — the 22-section schema and its semantics. Currently v0.2, marked experimental.
  • Protocol — the wire format consumers parse. Currently ckf-1.0. Independent from the spec.
  • Compiler — the reference implementation. Currently v1.3.1. Bumps when observable pipeline behavior changes.

Semver policy (spec)

  • MAJOR — incompatible schema changes (renamed fields, removed sections, changed types).
  • MINOR — backward-compatible additions (new optional fields, new sections, new enum values).
  • PATCH — clarifications and fixes that do not affect serialized packages.
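
The policy above can be read as a small decision rule: the most disruptive change in a release determines the bump. The Python sketch below encodes that reading. The Change enum and the required_bump function are hypothetical names introduced for illustration; the change kinds are taken from the examples in the list, and any real spec change would need a human judgment about which kind it is.

```python
from enum import Enum
from typing import Iterable


class Change(Enum):
    # Kinds of change named in the semver policy list.
    RENAMED_FIELD = "renamed_field"
    REMOVED_SECTION = "removed_section"
    CHANGED_TYPE = "changed_type"
    NEW_OPTIONAL_FIELD = "new_optional_field"
    NEW_SECTION = "new_section"
    NEW_ENUM_VALUE = "new_enum_value"
    CLARIFICATION = "clarification"


# Incompatible schema changes force a MAJOR bump.
BREAKING = {Change.RENAMED_FIELD, Change.REMOVED_SECTION, Change.CHANGED_TYPE}
# Backward-compatible additions force at least a MINOR bump.
ADDITIVE = {Change.NEW_OPTIONAL_FIELD, Change.NEW_SECTION, Change.NEW_ENUM_VALUE}


def required_bump(changes: Iterable[Change]) -> str:
    """Return 'MAJOR', 'MINOR', or 'PATCH' for a set of spec changes."""
    kinds = set(changes)
    if not kinds:
        raise ValueError("no changes given, so no version bump is needed")
    if kinds & BREAKING:
        return "MAJOR"
    if kinds & ADDITIVE:
        return "MINOR"
    # Only clarifications and fixes that leave serialized packages untouched.
    return "PATCH"


if __name__ == "__main__":
    print(required_bump([Change.NEW_SECTION, Change.CLARIFICATION]))
```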

Timeline

Version | Date | Notes
CKF-0.1 | Sep 2025 | First working schema; heuristic compiler; markdown-only.
CKF-0.2 | Jan 2026 | 22-section canonical schema; multi-format encodings (md/yaml/json); MCP server first cut.
Compiler v1.0 | Mar 2026 | Reduce step formalized; chunked compile + reduce.
Compiler v1.02 | Apr 2026 | Promotion module: atomic_units → if_then_rules / playbooks / anti_patterns.
Compiler v1.03 | May 2026 | Global sanitizer (language + completeness + truncation). Regression on rich sections of short sources.
Compiler v1.03.1 | May 2026 | Field-aware sanitizer fixes the v1.03 regression; retrieval / procedures / playbooks preserved.
Compiler v1.1 | May 2026 | Unified pipeline (runCkfPipeline) used by /compiler, MCP, Lab, admin recompile. Language lock enforced end-to-end.
Compiler v1.2 | May 2026 | Source preflight (language/format/records, hash/empty-source guard), record-level segmentation with source_manifest, coverage modes (summary/balanced/complete), domain-agnostic numeric integrity guards (currencies, dates, durations, citations), language recovery.
Compiler v1.3.1 | May 2026 | Canonical PDF metadata extraction: title, subtitle, authors[], edition, publisher, year and ISBN derived from front/back-matter via deterministic heuristics override LLM-inferred values. Controlled source_type vocabulary ('PDF e-book' / 'PDF document'). Title sanitization removes section-suffix contamination. Every override is logged as an auditable warning.

See the project review post for the long-form story.

Migration

When the protocol bumps a major version, this site publishes a migration guide alongside the new spec. Older packages remain valid against their declared protocol_version; consumers decide whether to upgrade.

No automatic upgrade

The protocol does not promise lossless upgrades across major versions. The migration guide will list which fields require manual review.

Proposing a change

Open a discussion on the GitHub repository. Substantive changes go through a short RFC describing the problem, the proposed schema delta, and at least one worked example.
