o3

openai large reasoning

240 questions answered · 240 with human benchmark data · Released 2025-04-16

o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following. Use it to think through multi-step problems that involve analysis across text, code, and images.

Alignment 51
Consensus 75
Confidence 85

Scores by Category

Notable Questions

Response Confidence

Related Models

Most Different

Qualia Garden Exploring AI values alignment