Simon Willison’s Weblog

13th March 2023 - Link Blog

Int-4 LLaMa is not enough - Int-3 and beyond. The Nolano team are experimenting with shrinking the LLaMA models even further than the 4-bit quantization popularized by llama.cpp.
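To get a feel for what dropping from 4 bits to 3 costs, here is a minimal sketch of plain symmetric round-to-nearest quantization over a small block of weights. This is an illustration only: it is not the scheme llama.cpp uses, nor the Nolano team's method, and the function names (`quantize`, `dequantize`, `mean_abs_error`) and the block size of 32 are assumptions made up for the example.

```python
import random


def quantize(weights, bits):
    """Round-to-nearest symmetric quantization of a block of floats.

    Returns (integer codes, scale). Hypothetical helper for illustration.
    """
    qmax = 2 ** (bits - 1) - 1  # 7 for 4-bit, 3 for 3-bit
    # One shared scale per block; fall back to 1.0 if the block is all zeros.
    scale = max(abs(w) for w in weights) / qmax or 1.0
    codes = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Map integer codes back to approximate float weights."""
    return [c * scale for c in codes]


def mean_abs_error(weights, bits):
    """Average absolute difference between original and restored weights."""
    codes, scale = quantize(weights, bits)
    restored = dequantize(codes, scale)
    return sum(abs(a - b) for a, b in zip(weights, restored)) / len(weights)


if __name__ == "__main__":
    random.seed(0)
    # Assumed toy data: 32 normally distributed "weights".
    block = [random.gauss(0.0, 1.0) for _ in range(32)]
    for bits in (8, 4, 3):
        print(f"{bits}-bit mean abs error: {mean_abs_error(block, bits):.4f}")
```

With only 7 usable levels at 3 bits versus 15 at 4 bits, each step between representable values more than doubles, which is why going below 4 bits generally needs something smarter than naive rounding.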

Posted 13th March 2023 at 11:55 pm

This is a link post by Simon Willison, posted on 13th March 2023.

Tags: ai, generative-ai, llama, local-llms, llms
