opendataloader-pdf: PDF Parser for AI-ready data (github.com)
github.comOpenDataLoader PDF is an open-source parser designed to convert PDFs into AI-ready Markdown, JSON, and HTML. It features a hybrid mode combining deterministic local processing with AI for complex layouts. Extract Markdown, JSON (with bounding boxes), and HTML from any PDF. #1 in benchmarks.