Fast, cost-efficient, serverless APIs for LLM Serving and Fine-Tuning

Anyscale Endpoints offers fast, cost-efficient, serverless APIs for serving and fine-tuning large language models (LLMs), with a focus on production readiness. Users can start with common LLMs, including the Llama-2 family and Mistral 7B, and fine-tune them for specific applications.

Pricing: Pay-as-you-go

