Skip to main content
Back to companies
San Francisco, CA, USAFounded 20237+ employeesPrivate
$2.2M raisedSeed· last: seed (Mar 2025)· 1 rounds

The LLM Eval and Observability Platform for AI Quality

Founders

YC W25

Hiring Pitch

Confident AI is the leading LLM evaluation platform that helps teams evaluate, test, benchmark, optimize, monitor, and red-team LLM applications. Powered by DeepEval, the go-to LLM evaluation framework with over 600k monthly downloads, 5.3k GitHub stars, and over 40 million evaluations conducted, Confident AI is trusted by hundreds of companies from leading startups to international corporations.

Tech Stack

Confident AI is building an open-source LLM evaluation framework called DeepEval to help companies evaluate their LLM applications. While we provide the algorithms, companies are free to use their own LLMs for evaluation and our job is to make sure they get accurate evaluation results and a good user experience while using our framework.

Confident AI's commercial product brings DeepEval to the cloud. While DeepEval is great, it can only do so much as a testing framework that runs locally in notebooks or CI/CD pipelines. With Confident AI, companies can get instant access to benchmark and LLM testing reports, catch regressions at scale, and monitor LLM applications in production.

Skip jobs list

Open Positions at Confident AI (3 Jobs)

3 open · 2 filled

Showing 3 jobs