DeepSeek V3 MoE model with 671B total parameters and 37B active, hosted on TogetherAI.
Specifications
Context
64K
Maximum Output
8K
Inputtext
Outputtext
Performance (7-day Average)
Collecting…
Collecting…
Collecting…
Pricing
Input$1.38/MTokens
Output$1.38/MTokens
Availability Trend (24h)
Performance Metrics (24h)
Similar Models
$2.20/$2.20/M
ctx128Kmax—avail—tps—
InOut
DeepSeek R1 reasoning model distilled to Llama 70B architecture, hosted on TogetherAI.
$0.97/$0.97/M
ctx128Kmax—avail—tps—
InOut
Meta's Llama 3.1 70B optimized for fast inference on TogetherAI.
$1.32/$1.32/M
ctx128Kmax—avail—tps—
InOut
Alibaba's Qwen2.5 7B model optimized for fast inference on TogetherAI.
$1.32/$1.32/M
ctx128Kmax—avail—tps—
InOut
Alibaba's QwQ reasoning model preview with enhanced thinking capabilities, hosted on TogetherAI.