>2,000,000 Tokens / Second
The fastest production-grade tokenizer ever built.
First-principles engineering from the XERV Research Division.
Soham Pal
Crayon redefines what’s possible in large-scale text processing by combining information theory, SIMD optimization, cache-aware architecture, and zero-copy memory design.
With sustained throughput of >2 million tokens per second on commodity hardware, Crayon enables real-time processing of massive datasets, accelerating AI training, inference preprocessing, and interactive research workflows.