doubao-1.5-vision-pro-250328

Common Name: Doubao 1.5 Vision Pro (250328)

ByteDance
Released on May 23, 12:00 AM
Tool Invocation: Supported

Advanced vision-language model with enhanced image understanding and analysis capabilities. It offers a 64K context window and is designed for complex visual reasoning and multimodal tasks.

Specifications

Context: 64K
Maximum Output: 16.4K
Input: text, image
Output: text

Performance (7-day Average)

Collecting…

Pricing

Input: ¥3.00/MTokens
Output: ¥9.00/MTokens
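
As a rough illustration of how the per-million-token prices above translate into a per-request cost, the following minimal sketch multiplies token counts by the listed rates. The helper name `estimate_cost_cny` and the example token counts are hypothetical, and the sketch assumes token counts are already known; the listing does not say how images are converted to tokens, so that step is not modeled here.

```python
from decimal import Decimal

# Prices taken from the Pricing section, in CNY per million tokens.
INPUT_PRICE_PER_MTOK = Decimal("3.00")
OUTPUT_PRICE_PER_MTOK = Decimal("9.00")
TOKENS_PER_MTOK = Decimal(1_000_000)


def estimate_cost_cny(input_tokens: int, output_tokens: int) -> Decimal:
    """Hypothetical helper: estimate the cost of one request in CNY.

    Assumes billing is strictly linear in token counts, with no
    minimum charge, rounding, or discount (none are stated in the listing).
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        Decimal(input_tokens) * INPUT_PRICE_PER_MTOK
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_MTOK
    )
    return total / TOKENS_PER_MTOK


if __name__ == "__main__":
    # Example: a request with 50,000 input tokens and 2,000 output tokens.
    print(estimate_cost_cny(50_000, 2_000))
```

Under these assumptions, a request with 50,000 input tokens and 2,000 output tokens would cost about ¥0.168. `Decimal` is used instead of floats so that currency arithmetic stays exact.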

Similar Models

Input ¥2.00 / Output ¥8.00 per MTokens
Context: 128K
Maximum Output: 16K

DeepSeek's reasoning-focused model hosted on ByteDance infrastructure, optimized for complex problem-solving and logical reasoning tasks. Supports 128K context with strong analytical capabilities.

Input ¥1.50 / Output ¥4.50 per MTokens
Context: 64K
Maximum Output: 16K

Lightweight vision-language model from the Doubao 1.5 series, balancing efficiency with multimodal understanding. Supports text and image inputs with 64K context for cost-effective visual tasks.

Input ¥3.00 / Output ¥9.00 per MTokens
Context: 64K
Maximum Output: 16K

Premium multimodal model combining thinking capabilities with advanced vision understanding. Supports text, image, and video inputs with 64K context for sophisticated reasoning over visual content.

Input ¥4.00 / Output ¥16.00 per MTokens
Context: 64K
Maximum Output: 16K

Professional thinking-enhanced model designed for complex reasoning and analytical tasks. Supports 64K context with text and image inputs, excelling at multi-step problem solving.