Reducto

Developer APIs B2B Software and Services Generative AI Documents AI Engineering, Product and Design Artificial Intelligence (AI)Enterprise Software Search Document Management Artificial Intelligence Data Engineering

San Francisco, CA, USAFounded 202330+ employeesPrivate

$108.4M raisedSeries A· last: series b (Oct 2025)· 4 rounds

Backed by(14 investors)

First Round Capital · Lead Y Combinator · Lead Andreessen Horowitz · Lead Benchmark · Lead+2 more

Reducto is a company that specializes in converting complex documents into AI-ready inputs, leveraging state-of-the-art vision models developed by a team from MIT. The technology enables AI teams to process unstructured data, such as medical records and financial statements, with high accuracy and reliability. These models read documents in a way that mimics human understanding, addressing a critical bottleneck in AI workflows.

Founders

Adit Abraham

Raunak Chowdhuri

YC W24

Hiring Pitch

Nearly 80% of enterprise data is in unstructured formats like PDFs

PDFs are the status quo for enterprise knowledge in nearly every industry. Insurance claims, financial statements, invoices, and health records are all stored in a structure that’s simply impractical for use in digital workflows. This isn’t an inconvenience—it’s a critical bottleneck that leads to dozens of wasted hours every week.

Traditional approaches fail at reliably extracting information in complex PDFs

OCR and even more sophisticated ML approaches work for simple text documents but are unreliable for anything more complex. Text from different columns are jumbled together, figures are ignored, and tables are a nightmare to get right. Overcoming this usually requires a large engineering effort dedicated to building specialized pipelines for every document type you work with.

Reducto breaks document layouts into subsections and then contextually parses each depending on the type of content. This is made possible by a combination of vision models, LLMs, and a suite of heuristics we built over time. Put simply, we can help you:

Accurately extract text and tables even with nonstandard layouts
Automatically convert graphs to tabular data and summarize images in documents
Extract important fields from complex forms with simple, natural language instructions
Build powerful retrieval pipelines using Reducto’s document metadata
Intelligently chunk information using the document’s layout data