Rhesis AI
Open-source testing platform for LLM and agentic applications. Test generation, adversarial probing, and regression tracking.
Rhesis AI is an open-source (MIT) testing platform for LLM and AI agent applications. It goes beyond scoring to provide a full testing workflow: define requirements, generate test scenarios (including edge cases), execute them, review results, and track fixes. Engineers work in the SDK; non-technical team members work in the UI.
The platform generates test scenarios from requirements and can connect to knowledge sources via MCP (Notion, GitHub, Jira). Its red-teaming agent, Polyphemus, continuously probes for jailbreaks, prompt injection, and PII extraction. Every failed test links to its root cause across multi-step and multi-agent flows.
It covers conversational AI, RAG, NL-to-SQL, and agentic systems, and works with any LLM provider.
Pricing: Free
Rhesis AI Alternatives
Explore 41 products in the Observability & Analytics category.
Helicone
Open-source LLM observability platform for monitoring, debugging, and improving AI applications.
Langfuse
Traces, evals, prompt management and metrics to debug and improve your LLM application.