Replicate
4.5 (2,400 reviews)
Overview
Replicate is a cloud platform for running open-source machine learning models through an HTTP API. It hosts thousands of community-contributed models, including LLMs and image, audio, and video models, and lets you run any of them with a single HTTP call. Billing is per second of compute used. Replicate also supports deploying your own private models with custom hardware configurations.
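To make the "single HTTP call" concrete, here is a minimal sketch in Python that builds such a request without sending it. The endpoint URL, the `Bearer` authorization scheme, and the `version`/`input` body fields are assumptions based on Replicate's public HTTP API and should be checked against the current API reference. `build_prediction_request` is a hypothetical helper name, not part of any Replicate library.

```python
import json
import urllib.request

# Assumed endpoint for creating a prediction; verify against the API reference.
API_URL = "https://api.replicate.com/v1/predictions"


def build_prediction_request(version, model_input, token):
    """Build (but do not send) a POST request that would start a model run.

    version     -- identifier of the model version to run (assumed field name)
    model_input -- dict of model-specific inputs, e.g. {"prompt": "..."}
    token       -- API token, sent as a Bearer token (assumed auth scheme)
    """
    body = json.dumps({"version": version, "input": model_input}).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


if __name__ == "__main__":
    # Placeholder values; nothing is sent over the network here.
    req = build_prediction_request("<model-version-id>", {"prompt": "hello"}, "<api-token>")
    print(req.get_method(), req.full_url)
```

Passing the returned object to `urllib.request.urlopen` with a real token and version ID would submit the run. The response format is not shown because it depends on the model.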
Key Features
- 50,000+ hosted open-source models
- Pay-per-second compute
- Streaming outputs
- Private model deployment
- Custom hardware (CPU, GPU tiers)
- Model versioning
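As a rough illustration of pay-per-second billing, the sketch below multiplies run time by a per-second rate. The rate used is a made-up placeholder, not a Replicate price; actual rates depend on the hardware tier and are listed on Replicate's pricing page. `estimate_cost` is a hypothetical helper.

```python
from decimal import Decimal


def estimate_cost(seconds, price_per_second):
    """Return the cost of one run: billed seconds times the per-second rate.

    Both arguments are passed as strings or Decimals to avoid float rounding.
    """
    return Decimal(seconds) * Decimal(price_per_second)


if __name__ == "__main__":
    # Hypothetical rate of 0.001 USD per second, for illustration only.
    print(estimate_cost("12.5", "0.001"))
```

With the placeholder rate, a 12.5-second run comes to 0.0125 USD. Substitute the published rate for the hardware you select.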