What companies are in the LLM Evaluation & Testing category?

The LLM Evaluation & Testing category includes 18 companies: Braintrust, Patronus AI, Galileo, Confident AI, Maxim AI, HoneyHive, Promptfoo, Comet ML, Openlayer, Deepchecks, Kolena, Trismik, Okareo, Cleanlab, Athina AI, Giskard, LastMile AI, AIMon. This is part of the LLMOps & AI Observability market map maintained by Hartmann Capital.

How many LLM Evaluation & Testing startups are tracked?

Hartmann Capital tracks 18 companies in the LLM Evaluation & Testing segment of the LLMOps & AI Observability market map.

What are the best funded LLM Evaluation & Testing companies?

Top funded companies in LLM Evaluation & Testing include Galileo ($68M), Comet ML ($63M), Braintrust ($45M), Cleanlab ($30M), Promptfoo ($23.4M). Browse the full list with funding details in the interactive LLMOps & AI Observability market map at vcmaps.com.

How can I submit my startup to this category?

You can submit your startup for inclusion by visiting vcmaps.com/submit. Submissions are reviewed by Hartmann Capital's research team.

LLM Evaluation & Testing Companies — LLMOps & AI Observability Market Map 2026

About LLM Evaluation & Testing

The LLM Evaluation & Testing category is part of the LLMOps & AI Observability market map, tracking 18 companies building in this segment. Evaluation frameworks, production monitoring, safety guardrails, prompt management, cost optimization, and model lifecycle tools keeping AI systems reliable and secure. Curated by Hartmann Capital's venture research team.

Companies in LLM Evaluation & Testing

Braintrust — Series A, $45M
Patronus AI — Series A, $20M
Galileo — Series B, $68M
Confident AI — Seed, $2.2M
Maxim AI — Seed, $3M
HoneyHive — Seed, $7.4M
Promptfoo — Series A, $23.4M
Comet ML — Series B, $63M
Openlayer — Series A, $14.5M
Deepchecks — Seed, $18.3M
Kolena — Series A, $21M
Trismik — Pre-Seed, $2.8M
Okareo — Pre-Seed
Cleanlab — Series A, $30M
Athina AI — Seed, $4.1M
Giskard — Seed, $1.6M
LastMile AI — Seed, $10M
AIMon — Pre-Seed, $2.3M

Frequently Asked Questions

What companies are in the LLM Evaluation & Testing category?: The LLM Evaluation & Testing category includes 18 companies: Braintrust, Patronus AI, Galileo, Confident AI, Maxim AI, HoneyHive, Promptfoo, Comet ML, Openlayer, Deepchecks, Kolena, Trismik, Okareo, Cleanlab, Athina AI, Giskard, LastMile AI, AIMon. This is part of the LLMOps & AI Observability market map maintained by Hartmann Capital.
How many LLM Evaluation & Testing startups are tracked?: Hartmann Capital tracks 18 companies in the LLM Evaluation & Testing segment of the LLMOps & AI Observability market map.
What are the best funded LLM Evaluation & Testing companies?: Top funded companies in LLM Evaluation & Testing include Galileo ($68M), Comet ML ($63M), Braintrust ($45M), Cleanlab ($30M), Promptfoo ($23.4M). Browse the full list in the LLMOps & AI Observability market map.
How can I submit my startup?: You can submit your startup for inclusion by visiting the submission page. Submissions are reviewed by Hartmann Capital's research team.

Other Categories in LLMOps & AI Observability

AI Observability & Monitoring AI Safety & Guardrails Prompt Engineering & Management AI Cost Optimization & FinOps MLOps & Model Management AI Gateway & Inference Infrastructure AI Data Privacy & Compliance

← Back to LLMOps & AI Observability Market Map