Chefs are simple classes that automatically prepare raw data for later use. They are designed to make preprocessing and data transformation easy and reusable.
Chefs are available only in Python.

TableChef

Extracts tables from markdown text and prepares them for later use.

TextChef

Processes plain text files and returns structured Document objects.

MarkdownChef

Processes markdown files, extracting tables, code blocks, and images into a MarkdownDocument.
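
As a rough illustration of what separating code blocks and images from markdown involves, the sketch below uses only the standard library. The `ParsedMarkdown` container and `parse_markdown` function are hypothetical stand-ins; the actual fields of MarkdownDocument and the MarkdownChef API are not described here, so nothing below should be read as the library's interface.

```python
import re
from dataclasses import dataclass, field

FENCE = "`" * 3  # three backticks, built this way to keep the example easy to embed

# Hypothetical patterns for illustration; NOT taken from MarkdownChef.
_CODE = re.compile(
    rf"^{FENCE}(\w*)[^\n]*\n(.*?)^{FENCE}\s*$", re.DOTALL | re.MULTILINE
)
_IMAGE = re.compile(r"!\[([^\]]*)\]\(([^)\s]+)[^)]*\)")


@dataclass
class ParsedMarkdown:
    """Hypothetical container; the real MarkdownDocument may differ."""

    code_blocks: list[tuple[str, str]] = field(default_factory=list)  # (language, source)
    images: list[tuple[str, str]] = field(default_factory=list)  # (alt text, URL)


def parse_markdown(text: str) -> ParsedMarkdown:
    """Collect fenced code blocks and inline images from markdown text."""
    code_blocks = [(m.group(1), m.group(2)) for m in _CODE.finditer(text)]
    # Remove code first so image syntax inside code samples is not counted.
    without_code = _CODE.sub("", text)
    images = _IMAGE.findall(without_code)
    return ParsedMarkdown(code_blocks=code_blocks, images=images)


if __name__ == "__main__":
    sample = (
        "# Title\n\n![logo](img/logo.png)\n\n"
        + FENCE + "python\nprint('hi')\n" + FENCE + "\n\nDone\n"
    )
    print(parse_markdown(sample))
```

Stripping code blocks before searching for images is a design choice of this sketch: it avoids treating image syntax shown inside a code sample as a real image.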

MistralOCR

Extracts text from images and PDFs using Mistral’s OCR API.

LiteParse

Extracts text from PDFs, office documents, and images locally using LiteParse.