AI agent evaluation pricing — free open source to enterprise

Free open-source ProofAgent Harness today (Apache 2.0, BYO LLM, runs locally). Enterprise Platform with all 5 evaluation tiers, hosted operations, SOC 2 + HIPAA-ready, on-premises deployment.

Open-source Harness — free forever

The ProofAgent Harness is Apache 2.0 and free for any use. Install with pip, bring your own LLM (Anthropic, OpenAI, Gemini, Bedrock, Ollama, vLLM, lm-studio via LiteLLM), and run unlimited evaluations locally. Multi-turn adversarial testing, 183 bundled traps, 3-juror consensus scoring, pytest integration, full transcripts, evidence-linked findings — all included.

Hosted Platform — free during beta

The hosted Platform is free during beta with 5 evaluations per day, BYO LLM key, 10+ proof metrics with full transcripts, 3-level readiness verdict per run, dashboard with metrics radar, and CSV export.

Enterprise — contact for tailored pricing

Enterprise tier adds unlimited evaluations, REST API + webhooks + SDK, custom rubrics per vertical, private hosted engine, all 5 evaluation tiers, 11+ production metrics, regression tracking, agent evolution dashboard, SSO/SAML, RBAC, audit logs, SOC 2 Type II alignment, HIPAA-ready BAAs, GDPR-aligned data processing, on-premises and private cloud deployment, dedicated SLA, and expert human reviewers. Contact us for tailored quotes.