Archive for Sunday, 10th September 2023

Sunday, 10th September 2023

Release blip-caption 0.1 — Generate captions for images with Salesforce BLIP

10th Sep 2023, 6:17 am

promptfoo: How to benchmark Llama2 Uncensored vs. GPT-3.5 on your own inputs. promptfoo is a CLI and library for “evaluating LLM output quality”. This tutorial in their documentation about using it to compare Llama 2 to gpt-3.5-turbo is a good illustration of how it works: it uses YAML files to configure the prompts, and more YAML to define assertions such as “not-icontains: AI language model”.

# 4:19 pm / cli, testing, ai, generative-ai, llms

The AI-assistant wars heat up with Claude Pro, a new ChatGPT Plus rival. I'm quoted in this piece about the new Claude Pro $20/month subscription from Anthropic:

Willison has also run into problems with Claude's morality filter, which has caused him trouble by accident: "I tried to use it against a transcription of a podcast episode, and it processed most of the text before—right in front of my eyes—it deleted everything it had done! I eventually figured out that they had started talking about bomb threats against data centers towards the end of the episode, and Claude effectively got triggered by that and deleted the entire transcript."

# 5:07 pm / arstechnica, ai, generative-ai, llms, anthropic, claude, press-quotes

All models on Hugging Face, sorted by downloads (via) I realized this morning that “sort by downloads” against the list of all of the models on Hugging Face can work as a reasonably good proxy for “which of these models are easiest to get running on your own computer”.

# 5:24 pm / machine-learning, ai, hugging-face

Release datasette-sqlite-trace 0.1 — Datasette plugin that prints all executed SQL to stderr

10th Sep 2023, 10:03 pm · datasette

← Saturday, 9th September 2023

Monday, 11th September 2023 →

M	T	W	T	F	S	S
				1	2	3
4	5	6	7	8	9	10
11	12	13	14	15	16	17
18	19	20	21	22	23	24
25	26	27	28	29	30

Simon Willison’s Weblog

Sunday, 10th September 2023