Recursively chunk documents into smaller chunks.
RecursiveChunker
via the API, check out the API reference documentation.
RecursiveRules
to determine how to chunk the text.
The rules are a list of RecursiveLevel
objects, which define the delimiters and whitespace rules for each level of the recursive tree.
Find more information about the rules in the Additional Information section.
RecursiveChunk
objects with additional sentence metadata:
RecursiveRules
class to determine the chunking rules. The rules are a list of RecursiveLevel
objects, which define the delimiters and whitespace rules for each level of the recursive tree.
RecursiveLevel
expects the list of custom delimiters to not include whitespace.
If whitespace as a delimiter is required, you can set the whitespace
parameter in the RecursiveLevel
class to True.
Note that if whitespace = True
, you cannot pass a list of custom delimiters.