Scaleway

Model-as-a-service

Serve generative AI models and pay either for dedicated infrastructure or per million tokens.

Generative APIs

Serve the latest AI models via API and pay per million tokens.

| Model | Type | Input price | Output price |
|---|---|---|---|
| qwen3-235b-a22b-instruct-2507 | Chat | €0.75/million tokens | €2.25/million tokens |
| gpt-oss-120b | Chat | €0.15/million tokens | €0.60/million tokens |
| gemma-3-27b-it | Chat and Vision | €0.25/million tokens | €0.50/million tokens |
| whisper-large-v3 | Audio transcription | €0.003/audio minute | Free |
| holo2-30b-a3b | Chat and Vision | €0.30/million tokens | €0.70/million tokens |
| voxtral-small-24b-2507 | Audio transcription and Chat | €0.15/million tokens | €0.35/million tokens |
| mistral-small-3.2-24b-instruct-2506 | Chat and Vision | €0.15/million tokens | €0.35/million tokens |
| llama-3.3-70b-instruct | Chat | €0.90/million tokens | €0.90/million tokens |
| deepseek-r1-distill-llama-70b | Chat | €0.90/million tokens | €0.90/million tokens |
| qwen3-embedding-8b | Embeddings | €0.10/million tokens | Free |
| qwen3-coder-30b-a3b-instruct | Chat | €0.20/million tokens | €0.80/million tokens |
| pixtral-12b-2409 | Chat and Vision | €0.20/million tokens | €0.20/million tokens |
| mistral-nemo-instruct-2407 | Chat | €0.20/million tokens | €0.20/million tokens |
| bge-multilingual-gemma2 | Embeddings | €0.10/million tokens | Free |
| llama-3.1-8b-instruct | Chat | €0.20/million tokens | €0.20/million tokens |
| mistral-small-3.1-24b-instruct-2503 | Chat and Vision | €0.15/million tokens | €0.35/million tokens |
| qwen2.5-coder-32b-instruct | Chat | €0.90/million tokens | €0.90/million tokens |
| llama-3.1-70b-instruct | Chat | €0.90/million tokens | €0.90/million tokens |
| devstral-small-2505 | Chat | €0.15/million tokens | €0.35/million tokens |
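
As a rough illustration of how the per-token prices add up, here is a minimal Python sketch. It assumes the first price listed for each model applies to input tokens and the second to output tokens; that reading is an assumption, as are the function and dictionary names. It ignores the free tier and taxes.

```python
from decimal import Decimal

# (first price, second price) in EUR per million tokens, copied from the list above.
# Assumption: the first figure is the input price, the second the output price.
PRICES = {
    "qwen3-235b-a22b-instruct-2507": (Decimal("0.75"), Decimal("2.25")),
    "gpt-oss-120b": (Decimal("0.15"), Decimal("0.60")),
}


def usage_cost_eur(model: str, input_tokens: int, output_tokens: int) -> Decimal:
    """Pre-tax cost in EUR for a given token volume (hypothetical helper)."""
    price_in, price_out = PRICES[model]
    total = input_tokens * price_in + output_tokens * price_out
    return total / Decimal(1_000_000)


# 2 million input tokens and 0.5 million output tokens on gpt-oss-120b
cost = usage_cost_eur("gpt-oss-120b", 2_000_000, 500_000)
print(f"EUR {cost:.2f}")
```

Under these assumptions, output-heavy workloads on models with asymmetric prices cost noticeably more than input-heavy ones at the same total token count.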
Legal notice

Prices before tax.
You benefit from a free tier on the first 1,000,000 tokens. You'll be charged from token number 1,000,001.
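
The free-tier rule can be sketched as simple arithmetic. The example below is a minimal sketch, not the billing logic itself: it uses llama-3.1-8b-instruct, whose two listed prices are both €0.20 per million tokens, so the split between input and output does not matter. How the free tier is shared across models, or whether it renews, is not stated here, so the sketch assumes a single allowance; the helper names are hypothetical.

```python
from decimal import Decimal

FREE_TIER_TOKENS = 1_000_000  # from the notice: the first 1,000,000 tokens are free


def billable_tokens(total_tokens: int, free_tokens: int = FREE_TIER_TOKENS) -> int:
    """Tokens charged once the free tier is used up (never negative)."""
    return max(0, total_tokens - free_tokens)


def token_cost_eur(total_tokens: int, price_per_million: Decimal) -> Decimal:
    """Pre-tax cost in EUR for a model with a single per-token price."""
    return billable_tokens(total_tokens) * price_per_million / Decimal(1_000_000)


# 3.5 million tokens at EUR 0.20 per million: only 2.5 million are billed
cost = token_cost_eur(3_500_000, Decimal("0.20"))
print(f"EUR {cost:.2f}")
```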

Managed Inference

Deploy your managed AI infrastructure with dedicated GPUs and optimized models. You are charged for usage of the GPU type you choose. Billing starts only once the model is deployed.

| Model | GPU | Price | Approx. per month |
|---|---|---|---|
| llama-3.1-8b-instruct | L4-1-24G | €0.93/hour | ~€679/month |
| | L40S-1-48G | €1.72/hour | ~€1256/month |
| | H100-1-80G | €3.40/hour | ~€2482/month |
| | H100-2-80G | €6.68/hour | ~€4876/month |
| llama-3.3-70b-instruct | H100-2-80G | €6.68/hour | ~€4876/month |
| llama-3.1-70b-instruct | H100-1-80G | €3.40/hour | ~€2482/month |
| | H100-2-80G | €6.68/hour | ~€4876/month |
| llama-3.1-nemotron-70b-instruct | H100-1-80G | €3.40/hour | ~€2482/month |
| | H100-2-80G | €6.68/hour | ~€4876/month |
| mistral-7b-instruct-v0.3 | L4-1-24G | €0.93/hour | ~€679/month |
| | L40S-1-48G | €1.72/hour | ~€1256/month |
| | H100-1-80G | €3.40/hour | ~€2482/month |
| | H100-2-80G | €6.68/hour | ~€4876/month |
| mixtral-8x7b-instruct-v0.1 | H100-1-80G | €3.40/hour | ~€2482/month |
| | H100-2-80G | €6.68/hour | ~€4876/month |
| mistral-nemo-instruct-2407 | L40S-1-48G | €1.72/hour | ~€1256/month |
| | H100-1-80G | €3.40/hour | ~€2482/month |
| | H100-2-80G | €6.68/hour | ~€4876/month |
| pixtral-12b-2409 | L40S-1-48G | €1.72/hour | ~€1256/month |
| | H100-1-80G | €3.40/hour | ~€2482/month |
| | H100-2-80G | €6.68/hour | ~€4876/month |
| molmo-72b-0924 | H100-2-80G | €6.68/hour | ~€4876/month |
| qwen2.5-coder-32b-instruct | H100-1-80G | €3.40/hour | ~€2482/month |
| | H100-2-80G | €6.68/hour | ~€4876/month |
| bge-multilingual-gemma2 | L4-1-24G | €0.93/hour | ~€679/month |
| | L40S-1-48G | €1.72/hour | ~€1256/month |
| sentence-t5-xxl | L4-1-24G | €0.93/hour | ~€679/month |
Legal notice

Prices before tax.
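
The approximate monthly figures for Managed Inference appear consistent with multiplying the hourly GPU price by 730 hours (365 × 24 / 12) and rounding to the nearest euro. The page does not state this rule, so the 730-hour month in the sketch below is an inferred assumption, and the helper name is hypothetical.

```python
from decimal import Decimal, ROUND_HALF_UP

HOURS_PER_MONTH = 730  # assumption: 365 * 24 / 12, inferred from the listed figures

# Hourly pre-tax GPU prices in EUR, copied from the Managed Inference list
HOURLY_EUR = {
    "L4-1-24G": Decimal("0.93"),
    "L40S-1-48G": Decimal("1.72"),
    "H100-1-80G": Decimal("3.40"),
    "H100-2-80G": Decimal("6.68"),
}


def approx_monthly_eur(hourly_price: Decimal) -> Decimal:
    """Hourly price times 730 hours, rounded to the nearest euro."""
    monthly = hourly_price * HOURS_PER_MONTH
    return monthly.quantize(Decimal("1"), rounding=ROUND_HALF_UP)


for gpu, hourly in HOURLY_EUR.items():
    print(f"{gpu}: ~EUR {approx_monthly_eur(hourly)}/month")
```

This assumes the deployment runs continuously for the whole month; a deployment that runs for fewer hours would cost proportionally less under hourly billing.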