Together AI

API Aggregator

Fast inference for open-source models

together.ai

Together AI provides fast, reliable inference for 200+ open-source models including DeepSeek, Llama, and Mixtral. Known for being up to 4x faster than vLLM with optimized infrastructure for production workloads.

Pros

  • 200+ open-source models
  • Up to 4x faster than vLLM
  • Fine-tuning support
  • GPU cloud available
  • Great for LLMs

Cons

  • Primarily text models
  • Complex pricing tiers
  • Learning curve for fine-tuning
  • Variable costs at scale

Key Features

API Access
Models Available200+
Fine-tuning
Speed4x faster

Best For

llm applicationsdevelopersfine tuningopen source

Pricing

Serverless

Pay as you go
  • Pay per token
  • 200+ models
  • No minimum
  • Instant access

GPU Cloud

Pay as you go
  • H100: $3.36/hr
  • A100: $2.40/hr
  • Dedicated capacity
  • Custom models

Company

Company
Together AI
Founded
2022
Headquarters
San Francisco, USA

Ready to try Together AI?

Visit Together AI