BentoML
Open Source
Free Trial
BentoML is the platform for software engineers to build AI products.
BentoCloud provides fully managed infrastructures for deploying BentoML, OpenLLM, or any model, optimized for performance, scalability, and cost-efficiency. Supporting models like Llama 2, Stable Diffusion, Flan-T5, Segment Anything, and CLIP.
Pricing: Pay-as-you-go
HQ
🇺🇸 United States
Resources
BentoML Alternatives
Explore 76 products in the Inference APIs category. View all BentoML alternatives.
Lyceum
EU-hosted inference cloud for open-source models, OpenAI-compatible
Free Trial
From From $0.13/1M tokens
vLLM
High-throughput LLM inference engine with PagedAttention for efficient GPU memory usage
Open Source
Free Trial
From Free (open-source)
Work on BentoML? Feature it at the top of Inference APIs.
Is your product missing?