Simon Willison’s Weblog

13th March 2023 - Link Blog

Int-4 LLaMa is not enough - Int-3 and beyond (via) The Nolano team are experimenting with reducing the size of the LLaMA models even further than the 4bit quantization popularized by llama.cpp.

Posted 13th March 2023 at 11:55 pm

Recent articles

Writing about Agentic Engineering Patterns - 23rd February 2026
Adding TILs, releases, museums, tools and research to my blog - 20th February 2026
Two new Showboat tools: Chartroom and datasette-showboat - 17th February 2026

This is a link post by Simon Willison, posted on 13th March 2023.

ai 1872 generative-ai 1659 llama 79 local-llms 146 llms 1624

Monthly briefing

Sponsor me for $10/month and get a curated email digest of the month's most important LLM developments.

Pay me to send you less!

Sponsor & subscribe