61 posts tagged “llm-reasoning”
Improving performance of LLMs through spending more tokens "reasoning" about a problem, as seen in OpenAI's o-series, DeepSeek's R1, Qwen's QwQ, Google's Gemini 2.5 and Anthropic's Claude 3.7 Sonnet.
2024
Notes on OpenAI’s new o1 chain-of-thought models
OpenAI released two major new preview models today: o1-preview
and o1-mini
(that mini one is not a preview)—previously rumored as having the codename “strawberry”. There’s a lot to understand about these models—they’re not as simple as the next step up from GPT-4o, instead introducing some major trade-offs in terms of cost and performance in exchange for improved “reasoning” capabilities.
«« first
« previous
page 3 / 3