deepinfra
Run the top AI models using a simple API, pay per use. Low cost, scalable and production ready infrastructure.
Deep Infra combines a machine learning platform with models like Llama-2-7b-chat, Mistral-7B, and OpenChat-3.5. With no ML Ops required, it enables fast, low-latency API deployments and auto-scaling, including specialized models such as CodeLlama-34b-Instruct for developers.
Pricing: Per token usage
![Screenshot of deepinfra webpage](https://static.runit.ai/rails/active_storage/representations/proxy/eyJfcmFpbHMiOnsiZGF0YSI6OTYsInB1ciI6ImJsb2JfaWQifX0=--d26b90383b631f605d2aceff427c643f65438354/eyJfcmFpbHMiOnsiZGF0YSI6eyJmb3JtYXQiOiJqcGciLCJyZXNpemVfdG9fbGltaXQiOls2NDAsNDgwXX0sInB1ciI6InZhcmlhdGlvbiJ9fQ==--b0c1c9855994c273b59ef891730b2e839687391b/screenshot.jpg)
🙋♀️ Resources
Similar Products
We have 18 products in the Inference APIs category. Here are the latest 3:
Is your product missing? 👀 Add it here →