Authorizations
Your API Key from the Chonkie Cloud dashboard
Body
The file to chunk.
Tokenizer or token counting function to use. Can be a string identifier or an instance.
Maximum number of tokens per chunk.
Number of overlapping tokens between chunks.
Minimum number of sentences to include in each chunk.
Minimum number of characters per sentence.
Use approximate token counting for faster processing (deprecated).
Delimiters to split sentences on.
Include delimiters in the chunk text. If so, specify whether to include the previous or next delimiter.
prev
, next
Whether to return chunks as text strings or as SentenceChunk
objects.
texts
, chunks
Response
Successful Response: A list of SentenceChunk
objects.
A list containing SentenceChunk
objects, each detailing a segment of the original text and the sentences within it.
The actual text content of the chunk.
The starting character index of the chunk within the original input text.
The ending character index (exclusive) of the chunk within the original input text.
The number of tokens in this specific chunk, according to the tokenizer used.
List of Sentence
objects contained within this chunk.