About LLM Evaluation & Testing
The LLM Evaluation & Testing category is part of the LLMOps & AI Observability market map and tracks 18 companies building in this segment. The map covers evaluation frameworks, production monitoring, safety guardrails, prompt management, cost optimization, and model lifecycle tools that keep AI systems reliable and secure. Curated by Hartmann Capital's venture research team.
Companies in LLM Evaluation & Testing
- Braintrust — Series A, $45M
- Patronus AI — Series A, $20M
- Galileo — Series B, $68M
- Confident AI — Seed, $2.2M
- Maxim AI — Seed, $3M
- HoneyHive — Seed, $7.4M
- Promptfoo — Series A, $23.4M
- Comet ML — Series B, $63M
- Openlayer — Series A, $14.5M
- Deepchecks — Seed, $18.3M
- Kolena — Series A, $21M
- Trismik — Pre-Seed, $2.8M
- Okareo — Pre-Seed
- Cleanlab — Series A, $30M
- Athina AI — Seed, $4.1M
- Giskard — Seed, $1.6M
- LastMile AI — Seed, $10M
- AIMon — Pre-Seed, $2.3M
Frequently Asked Questions
- What companies are in the LLM Evaluation & Testing category?
- The LLM Evaluation & Testing category includes 18 companies: Braintrust, Patronus AI, Galileo, Confident AI, Maxim AI, HoneyHive, Promptfoo, Comet ML, Openlayer, Deepchecks, Kolena, Trismik, Okareo, Cleanlab, Athina AI, Giskard, LastMile AI, and AIMon. The category is part of the LLMOps & AI Observability market map maintained by Hartmann Capital.
- How many LLM Evaluation & Testing startups are tracked?
- Hartmann Capital tracks 18 companies in the LLM Evaluation & Testing segment of the LLMOps & AI Observability market map.
- What are the best-funded LLM Evaluation & Testing companies?
- The five best-funded companies in LLM Evaluation & Testing are Galileo ($68M), Comet ML ($63M), Braintrust ($45M), Cleanlab ($30M), and Promptfoo ($23.4M). Browse the full list in the LLMOps & AI Observability market map.
- How can I submit my startup?
- You can submit your startup for inclusion by visiting the submission page. Submissions are reviewed by Hartmann Capital's research team.