🦛 Chonkie ✨
The no-nonsense RAG chunking library that’s lightweight, fast, and ready to CHONK your texts!
Ever found yourself making a RAG pipeline yet again (your 2,342,148th one), only to realize you’re stuck having to write chunking with bloated software library X or the painfully feature-less library Y? WHY CAN’T THIS JUST BE SIMPLE, UGH?
Well, look no further than Chonkie! (chonkie boi is a gud boi 🦛)
Feature-rich
All the CHONKs you’d ever need for your RAG applications
Easy to use
Install, Import, CHONK - it’s that simple!
Lightning Fast
CHONK at the speed of light! zooooom
Wide Support
Supports all your favorite tokenizer, model and API CHONKs
Lightweight
No bloat, just CHONK - only 9.7MB base installation
Cute Mascot
psst it’s a pygmy hippo btw! Moto Moto approved
Quick Start
Get started with Chonkie in three simple steps: Install, Import and CHONK!
Want more features? :
Chonkie follows a special approach to dependencies, keeping the base installation lightweight while allowing you to add extra features as and when needed. Please check the Installation page for more details.
Release the CHONK! 🦛✨
Chonkie Cloud
Don’t wanna chunk locally? No problem! Chonkie Cloud is here to save the day!
- Make a free account on Chonkie Cloud
- Get your API key
- Send your CHONK reqests!
Build With Chonkie
Ready to learn more about Chonkie?
Concepts
Learn about Chonkie’s core concepts and values
Installation
Learn about different installation options
Chunkers
Explore different chunking strategies
Chonkie Cloud
Chonkie’s hosted chunking service! 🦛☁️
Embeddings
Explore different embedding strategies
GitHub
Star us on GitHub and contribute
Support
Got questions? We’re here to help!