How streaming LLM APIs work

How streaming LLM APIs work. New TIL. I used curl to explore the streaming APIs provided by OpenAI, Anthropic and Google Gemini and wrote up detailed notes on what I learned.

Also includes example code for receiving streaming events in Python with HTTPX and receiving streaming events in client-side JavaScript using fetch().

Posted 22nd September 2024 at 3:48 am

Simon Willison’s Weblog

Recent articles

Monthly briefing