About OVHcloud AI Endpoints
OVHcloud AI Endpoints is the serverless inference API from French cloud provider OVHcloud, headquartered in Roubaix. Launched in April 2025, it lets developers call 40+ open-weight models (Llama, Qwen, DeepSeek and others) for chat, voice processing, document analysis and image analysis without managing GPUs or ML stacks, and includes a sandbox for testing before scaling. Models are served from OVHcloud's Gravelines data centre in northern France under EU jurisdiction, protected from non-European regulations. Pricing is pay-as-you-go per million tokens per model, with availability across Europe, Canada and APAC.
Quick facts
Pricing model
Paid
Free trial
—
Open source
—
Self-hostable
No
Public API
No
Target audience
B2B
Company size fit
All
DPA available
Yes
Data stored in EU
Yes

Replicate
Weights & Biases
RunPod