deepinfra
Run the top AI models using a simple API, pay per use. Low cost, scalable and production ready infrastructure.
Deep Infra combines a machine learning platform with models like Llama-2-7b-chat, Mistral-7B, and OpenChat-3.5. With no ML Ops required, it enables fast, low-latency API deployments and auto-scaling, including specialized models such as CodeLlama-34b-Instruct for developers.
Pricing: Per token usage
🙋♀️ Resources
Similar Products
We have 20 products in the Inference APIs category. Here are the latest 3:
Is your product missing? 👀 Add it here →