How much does IonRouter cost?

IonRouter pricing: Per token usage.

Does IonRouter have a free tier?

Yes, IonRouter offers a free tier or free trial.

Is IonRouter open source?

No, IonRouter is not open source.

Home / Inference APIs / IonRouter

IonRouter

Free Trial

High-throughput inference API with OpenAI-compatible access to open-source models at half market rate

IonRouter is a managed inference API by Cumulus Labs that provides OpenAI-compatible access to open-source AI models at roughly half the cost of competitors. Powered by a custom IonAttention engine optimized for NVIDIA Grace Hopper hardware, it supports LLMs (Qwen, DeepSeek, GLM), vision, video generation, and text-to-speech models. Developers swap their base URL and get sub-100ms model swap times with up to 7,167 tokens/second throughput.

Pricing: Per token usage

Hosting Cloud

Pricing Usage Based, from $0.02/M tokens

HQ 🇺🇸 United States

Founded 2025

License PROPRIETARY

Visit website →

Pricing

Posts