Home / Fine-tuning / Compare

Fine-tuning Pricing Comparison

21 providers compared by pricing model, free tiers, hosting options, and headquarters. Last updated May 2026.

15 with free tiers · 7 open source · 2 self-hostable · 2 European

Provider	Pricing Model	Starting Price	Free Tier	Hosting	Open Source	HQ
Amazon Bedrock	Pay-per-use	Pay-per-token	✓	Cloud	—	🇺🇸 United States
Anyscale	—	—	—	—	—	🇺🇸 United States
Axolotl	—	—	—	—	✓	🇺🇸 United States
fal	Pay-per-use	$0.02/megapixel	✓	Cloud	—	🇺🇸 United States
FinetuneDB	—	—	—	—	—	🇺🇸 United States
Hugging Face	Pay-per-use	$0.033/hr (CPU)	✓	Cloud	✓	🇺🇸 United States
Klu	—	—	✓	—	—	🇺🇸 United States
Lamini	—	—	✓	—	—	🇺🇸 United States
LangSmith	Freemium	$39/seat/mo	✓	Cloud + Self-hosted	—	🇺🇸 United States
LLaMA-Factory	—	—	✓	—	✓	🇺🇸 United States
Ludwig	—	—	—	—	✓	🇺🇸 United States
Modal	Pay-per-use	$30/mo free credits	✓	Cloud	—	🇺🇸 United States
Monster API	Pay-per-use	—	✓	Cloud	—	🇺🇸 United States
OpenAI	Pay-per-use	$0.05/1M tokens	✓	Cloud	—	🇺🇸 United States
OVHcloud AI	Pay-per-use	$0.91/hr (L4 GPU)	✓	Cloud	—	🇫🇷 France
Prem AI	Freemium	Free	✓	—	—	🇨🇭 Switzerland
Replicate	Pay-per-use	Per-second GPU billing	—	Cloud	—	🇺🇸 United States
together.ai	Pay-per-use	Pay-per-token	—	Cloud + Self-hosted	—	🇺🇸 United States
torchtune	—	—	✓	—	✓	🇺🇸 United States
T TRL	—	—	✓	—	✓	🇺🇸 United States
Unsloth	—	—	✓	—	✓	🇺🇸 United States

ℹ️ Pricing units vary by provider type: per-token for LLM APIs, per-GPU-hour for compute platforms, per-request for media generation. Verify current rates on each provider's website.

Providers with free tiers

These fine-tuning providers offer free credits, free tiers, or open-source self-hosting options to get started without upfront costs.

Amazon Bedrock

Managed API access to foundation models on AWS with built-in fine-tuning and ...

From: Pay-per-token

fal

Build the next generation of creativity with fal. Lightning fast inference.

From: $0.02/megapixel

Hugging Face

The open-source AI platform with 500K+ models, inference endpoints, and fine-...

From: $0.033/hr (CPU)

Klu

Collaborate on prompts, evaluate, and optimize LLM-powered Apps with Klu.

Lamini

Enterprise LLM fine-tuning platform with Memory Tuning for near-zero hallucin...

LangSmith

LangSmith is a unified DevOps platform for developing, collaborating, testing...

From: $39/seat/mo

Show all 15 providers with free tiers

LLaMA-Factory

Open-source fine-tuning framework for 100+ LLMs with a web UI

Modal

Run generative AI models, large-scale batch jobs, job queues, and much more.

From: $30/mo free credits

Monster API

Access, finetune, deploy LLMs using our affordable and scalable APIs.

OpenAI

API access to GPT, o-series reasoning, DALL-E, and Whisper models

From: $0.05/1M tokens

OVHcloud AI

European cloud provider with AI inference, training, and deployment services

From: $0.91/hr (L4 GPU)

Prem AI

Fine-tune and deploy LLMs on your own infrastructure with full data sovereignty

From: Free

torchtune

PyTorch-native library for fine-tuning LLMs on consumer and enterprise GPUs

TRL

Hugging Face library for training language models with RLHF, SFT, and DPO

Unsloth

Fine-tune LLMs up to 30x faster with 90% less memory usage

Frequently asked questions

Which fine-tuning offer a free tier?

15 of the 21 fine-tuning listed offer a free tier or free credits. Examples: Amazon Bedrock, fal, Hugging Face, Klu, Lamini. Use the "Free tier" filter above to see the full list.

Which fine-tuning are open source?

7 open-source options are listed. Examples: Axolotl, Hugging Face, LLaMA-Factory, Ludwig, torchtune. Most can be self-hosted alongside or instead of any managed offering.

Are there European fine-tuning?

Yes. 2 providers in this category are headquartered in Europe, including OVHcloud AI, Prem AI. The European providers page has the full cross-category list with hosting regions.

Which fine-tuning can be self-hosted?

2 of the 21 listed support self-hosting, either as the primary deployment model or alongside a managed cloud offering. Examples: LangSmith, together.ai.

How to choose an inference API provider

The right provider depends on workload type, latency requirements, and budget. Most providers use pay-per-token pricing for LLMs and per-second GPU billing for custom models. Token-based pricing varies by model, so the cheapest provider for one model may not be cheapest for another.

Free tiers are useful for prototyping but often come with rate limits. For production, compare per-token costs for your specific model, cold start latency, rate limits, and whether the provider supports the models you need.

Teams with data residency requirements should check hosting options and provider headquarters. European providers like OVHcloud AI, Prem AI keep data within EU jurisdiction. See the full European AI Infrastructure directory. Self-hostable options like LangSmith and together.ai give full control over data location.

For a deeper analysis, read AI Inference API Providers Compared on the blog. Pricing changes frequently, so verify current rates on each provider's website. Submit a correction.

Browse all Fine-tuning tools or explore the full AI Infrastructure Landscape.

Is your product missing?

Add it here →