Run top AI models through a simple, pay-per-use API on low-cost, scalable, production-ready infrastructure.

Deep Infra is a machine learning platform that hosts models such as Llama-2-7b-chat, Mistral-7B, and OpenChat-3.5, along with specialized models for developers such as CodeLlama-34b-Instruct. It requires no MLOps work and offers fast, low-latency API deployments with auto-scaling.

Pricing: Per-token usage
