CKF Compiler Pro
Industrial pipeline — chunk · extract · reduce · validate. BYOK with OpenAI, Anthropic, Google, DeepSeek or OpenRouter. Handles up to ~960k characters per run. · reference compiler v1.3.1
Engine
OpenAI API key. Get a key. Stored only in this browser tab (sessionStorage).
Curious how the LLM extraction differs from the heuristic one? See side-by-side →
Auto upgrades to complete for record-oriented sources (JSONL, FAQ, normative texts).
Sources
Pipeline
- 1ExtractRead all sources, normalize whitespace and headings.
- 2ChunkSplit semantically by markdown sections (~12k chars each).
- 3Map · ExtractPer-chunk extraction via tool-calling — strict JSON schema.
- 4Reduce · MergeDeduplicate concepts, merge facts/relations, score utility.
- 5ValidateCheck required fields, types, and counts.
- 6SerializeEmit canonical Markdown + JSON CKF package.
Output
From human file to AI agent, in 4 steps
CKF translates human-oriented documents into structured knowledge that agents can reason over — without hallucinating and with full traceability.
PDF, DOCX, MD, TXT, transcripts — anything humans read.
Extracts entities, concepts, rules, principles and procedures into 22 typed sections.
Markdown / JSON / YAML — portable, versionable, auditable, traceable to source.
LLMs, RAG and agents consume pre-compiled knowledge — fewer tokens, less hallucination, higher precision.