Rhesis AI
Open-source testing platform for LLM and agentic applications. Test generation, adversarial probing, and regression tracking.
Rhesis AI is an open-source (MIT) testing platform for LLM and AI agent applications. It goes beyond scoring to provide a full testing workflow: define requirements, generate test scenarios (including edge cases), execute them, review results, and track fixes. Engineers work in the SDK, non-technical team members work in the UI.
The platform generates test scenarios from requirements and can connect to knowledge sources via MCP (Notion, GitHub, Jira). Its red-teaming agent, Polyphemus, continuously probes for jailbreaks, prompt injection, and PII extraction. Every failed test links to its root cause across multi-step and multi-agent flows.
Covers conversational AI, RAG, NL-to-SQL, and agentic systems across any LLM provider.
Pricing: Free
Rhesis AI Alternatives
Explore 30 products in the Observability & Analytics category. View all Rhesis AI alternatives.
Sentrial
Production monitoring for AI agents with automated failure detection and diagnosis
Comet Opik
Comet provides an end-to-end model evaluation platform for AI developers.
Is your product missing?