Sentence Chunker
Splits text into chunks while preserving complete sentences, ensuring that each chunk maintains proper sentence boundaries and context.
Authorizations
Your API Key from the Chonkie Cloud dashboard
Body
The file to chunk.
Tokenizer or token counting function to use. Can be a string identifier or an instance.
Maximum number of tokens per chunk.
Number of overlapping tokens between chunks.
Minimum number of sentences to include in each chunk.
Minimum number of characters per sentence.
Use approximate token counting for faster processing (deprecated).
Delimiters to split sentences on.
Include delimiters in the chunk text. If so, specify whether to include the previous or next delimiter.
prev
, next
Whether to return chunks as text strings or as SentenceChunk
objects.
texts
, chunks
Response
The actual text content of the chunk.
The starting character index of the chunk within the original input text.
The ending character index (exclusive) of the chunk within the original input text.
The number of tokens in this specific chunk, according to the tokenizer used.
List of Sentence
objects contained within this chunk.
Represents a single sentence with metadata, used within sentence-based chunks.