Simon Willison’s Weblog

Pelicans on a bicycle. I decided to roll out my own LLM benchmark: how well can different models render an SVG of a pelican riding a bicycle?

I chose that because a) I like pelicans and b) I'm pretty sure there aren't any pelican on a bicycle SVG files floating around (yet) that might have already been sucked into the training data.

My prompt:

Generate an SVG of a pelican riding a bicycle

I've run it through 16 models so far - from OpenAI, Anthropic, Google Gemini and Meta (Llama running on Cerebras), all using my LLM CLI utility. Here's my (Claude assisted) Bash script: generate-svgs.sh

Here's Claude 3.5 Sonnet (2024-06-20) and Claude 3.5 Sonnet (2024-10-22):

Gemini 1.5 Flash 001 and Gemini 1.5 Flash 002:

GPT-4o mini and GPT-4o:

o1-mini and o1-preview:

Cerebras Llama 3.1 70B and Llama 3.1 8B:

And a special mention for Gemini 1.5 Flash 8B:

The rest of them are linked from the README.

Posted 25th October 2024 at 11:56 pm

Recent articles

Vibe scraping and vibe coding a schedule app for Open Sauce 2025 entirely on my phone - 17th July 2025
Happy 20th birthday Django! Here's my talk on Django Origins from Django's 10th - 13th July 2025
Grok: searching X for "from:elonmusk (Israel OR Palestine OR Hamas OR Gaza)" - 11th July 2025

svg 47 ai 1448 openai 319 generative-ai 1265 llama 76 llms 1245 llm 218 anthropic 169 gemini 101 cerebras 10 pelican-riding-a-bicycle 41

Monthly briefing

Sponsor me for $10/month and get a curated email digest of the month's most important LLM developments.

Pay me to send you less!

Sponsor & subscribe