DeepInfra

Free Trial

Run the top AI models using a simple API, pay per use. Low cost, scalable and production ready infrastructure.

DeepInfra hosts more than 70 open-weight models (Llama, Mistral, Mixtral, Gemma, and others) as serverless API endpoints. You send requests to an OpenAI-compatible API and pay per token, with no setup or infrastructure management. The platform auto-scales with demand.

DeepInfra competes primarily on price and latency, often offering lower per-token costs than other inference providers for the same models. The platform supports text generation, embeddings, image generation, and text-to-speech. Its OpenAI-compatible API makes switching from OpenAI as simple as changing the base URL.

Hosting: Cloud
Pricing: Usage based (per token), from $0.02/M tokens
HQ: 🇺🇸 United States
Founded: 2022
License: Proprietary
Compliance: SOC 2 · HIPAA · GDPR · SSO

What is DeepInfra?

DeepInfra is a serverless inference platform that hosts open-weight AI models as API endpoints. You send requests to an OpenAI-compatible API and pay per token, with no infrastructure setup or management required.

Supported Models

DeepInfra hosts 77 open-weight models. Model families include Meta Llama (3.1, 3.2, and the 4 Scout and 4 Maverick variants, with up to 1M-token context), DeepSeek V3, Qwen 2.5 and 3, NVIDIA Nemotron, Mistral, Google Gemma, GLM-5, and Kimi K2.5. Beyond text generation, DeepInfra supports embedding models (BGE-M3), image generation (Flux 3 at $0.07/image), text-to-speech (Chatterbox), and speech recognition. 65 of the 77 models support function calling, and 30 are reasoning models.
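
Because the embedding models sit behind the same OpenAI-compatible API as the chat models, calling BGE-M3 looks like a standard embeddings request. A minimal sketch using the OpenAI Python SDK; the base URL is DeepInfra's OpenAI-compatible endpoint, and the "BAAI/bge-m3" model ID is an assumption based on DeepInfra's Hugging Face-style naming, so check the model catalog for the current identifier.

```python
# Minimal sketch: BGE-M3 embeddings via the OpenAI-compatible endpoint.
# The base URL and the "BAAI/bge-m3" model ID are assumptions; verify both
# against DeepInfra's model catalog before use.
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_DEEPINFRA_API_KEY",
    base_url="https://api.deepinfra.com/v1/openai",
)

resp = client.embeddings.create(
    model="BAAI/bge-m3",  # assumed model ID
    input=["serverless inference", "pay per token"],
)
print(len(resp.data), "vectors of dimension", len(resp.data[0].embedding))
```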

API and Integration

The API is compatible with the OpenAI client libraries. If you are already using the OpenAI SDK, switching to DeepInfra requires changing the base URL and API key. The platform supports streaming responses, function calling, JSON mode, and structured output. Framework integrations include LangChain, LlamaIndex, AutoGen, and Vercel AI SDK. Custom model deployment and LoRA adapter support are available for both text and image models.
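
A minimal sketch of that switch with the official OpenAI Python SDK. The base URL below is DeepInfra's OpenAI-compatible endpoint; the Llama model ID is an assumption and may not match what is currently listed.

```python
# Minimal sketch: pointing the OpenAI SDK at DeepInfra instead of OpenAI.
# Only the API key and base_url change; the call itself is unchanged.
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_DEEPINFRA_API_KEY",  # a DeepInfra key, not an OpenAI key
    base_url="https://api.deepinfra.com/v1/openai",
)

# Streaming chat completion, same shape as with OpenAI's own models.
stream = client.chat.completions.create(
    model="meta-llama/Meta-Llama-3.1-8B-Instruct",  # assumed model ID
    messages=[{"role": "user", "content": "What does serverless inference mean?"}],
    stream=True,
)
for chunk in stream:
    delta = chunk.choices[0].delta.content
    if delta:
        print(delta, end="", flush=True)
```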

Infrastructure and Performance

DeepInfra runs on its own US-based infrastructure, including NVIDIA Blackwell HGX B200 systems. The optimization stack includes TensorRT-LLM, speculative decoding, multi-token prediction, and KV-cache-aware routing. Top models reach output speeds of 200-317 tokens per second, with time-to-first-token as low as 0.35 seconds. MoE models on Blackwell with NVFP4 quantization achieve up to a 20x cost reduction compared to dense models on older hardware.
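
Those latency figures are easy to sanity-check: time how long the first streamed token takes to arrive. A rough sketch, reusing the assumed endpoint and model ID from above; the result includes network round-trip from your machine, so treat it as an upper bound on server-side time-to-first-token.

```python
# Rough time-to-first-token (TTFT) measurement over a streaming request.
import time
from openai import OpenAI

client = OpenAI(
    api_key="YOUR_DEEPINFRA_API_KEY",
    base_url="https://api.deepinfra.com/v1/openai",  # assumed endpoint
)

start = time.perf_counter()
stream = client.chat.completions.create(
    model="meta-llama/Meta-Llama-3.1-8B-Instruct",  # assumed model ID
    messages=[{"role": "user", "content": "Hello"}],
    stream=True,
)
for chunk in stream:
    if chunk.choices and chunk.choices[0].delta.content:
        print(f"TTFT: {time.perf_counter() - start:.2f}s")
        break
```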

Pricing

DeepInfra uses per-token pricing, with rates varying by model. Prices range from $0.02 per million tokens (Llama 3.2 3B) to around $1.50 per million tokens for larger models. There are no minimum commitments, and you pay only for what you use. Automatic tier progression reduces rates as spending increases. The platform holds SOC 2, ISO 27001, GDPR, and HIPAA compliance certifications.
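
Since billing is purely per token, monthly cost is simple arithmetic. A back-of-the-envelope sketch using the $0.02/M rate quoted above; note that many models price input and output tokens at different rates, so this is illustrative only.

```python
# Back-of-the-envelope cost estimate at a flat per-token rate.
def estimate_cost(input_tokens: int, output_tokens: int,
                  rate_per_million_usd: float) -> float:
    """Cost in USD, assuming one flat rate for input and output tokens."""
    return (input_tokens + output_tokens) / 1_000_000 * rate_per_million_usd

# e.g. 10M input + 2M output tokens per month on a $0.02/M model:
print(f"${estimate_cost(10_000_000, 2_000_000, 0.02):.2f}/month")  # $0.24/month
```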

Who Should Use DeepInfra?

DeepInfra is a good fit for teams that want to use open-weight models without managing GPU infrastructure. The OpenAI-compatible API makes it easy to switch from OpenAI or test multiple open-source models. The Blackwell infrastructure gives it a performance and cost edge for large-scale workloads. It is particularly useful for cost-sensitive production use cases where open-weight models perform well enough for the task.
