Rhesis AI
Open-source testing platform for LLM and agentic applications. Test generation, adversarial probing, and regression tracking.
Rhesis AI is an open-source (MIT) testing platform for LLM and AI agent applications. It goes beyond scoring to provide a full testing workflow: define requirements, generate test scenarios (including edge cases), execute them, review results, and track fixes. Engineers work in the SDK, non-technical team members work in the UI.
The platform generates test scenarios from requirements and can connect to knowledge sources via MCP (Notion, GitHub, Jira). Its red-teaming agent, Polyphemus, continuously probes for jailbreaks, prompt injection, and PII extraction. Every failed test links to its root cause across multi-step and multi-agent flows.
Covers conversational AI, RAG, NL-to-SQL, and agentic systems across any LLM provider.
Pricing: Free
Rhesis AI Alternatives
Explore 32 products in the Observability & Analytics category. View all Rhesis AI alternatives.
Future AGI
Open-source platform for testing, monitoring, and improving AI agents with tracing, evals, guardrails, and gateway
Sentrial
Production monitoring for AI agents with automated failure detection and diagnosis
Agenta
Open-source prompt management, evaluation, and observability for LLM apps
Ragas
Open-source evaluation and testing framework for LLM and RAG applications
Is your product missing?