Release: llm 0.32a2

12th May 2026

Release llm 0.32a2 — Access large language models from the command-line

A bunch of useful stuff in this LLM alpha, but the most important detail is this one:

Most reasoning-capable OpenAI models now use the /v1/responses endpoint instead of /v1/chat/completions. This enables interleaved reasoning across tool calls for GPT-5 class models. #1435

This means you can now see the summarized reasoning tokens when you run prompts against an OpenAI model, displayed in a different color to standard error. Use the -R or --hide-reasoning flags if you don't want to see that.

Posted 12th May 2026 at 5:45 pm

Simon Willison’s Weblog

Recent articles

Monthly briefing