Simon Willison’s Weblog

Subscribe

August 2022

67 posts: 5 entries, 24 links, 2 quotes, 36 beats

Aug. 29, 2022

TIL SQLite VACUUM: database or disk is full — I was trying to run `VACUUM` against a large SQLite database file (~7GB) using `sqlite-utils vacuum data.db` and I got this error:
Release datasette-sitemap 0.1 — Generate sitemap.xml for Datasette sites
Release datasette-sitemap 0.1.1 — Generate sitemap.xml for Datasette sites

Aug. 30, 2022

Release datasette-block-robots 1.1 — Datasette plugin that blocks robots and crawlers using robots.txt
Release datasette-sitemap 1.0 — Generate sitemap.xml for Datasette sites

Aug. 31, 2022

Exploring 12 Million of the 2.3 Billion Images Used to Train Stable Diffusion’s Image Generator. Andy Baio and I collaborated on an investigation into the training set used for Stable Diffusion. I built a Datasette instance with 12m image records sourced from the LAION-Aesthetics v2 6+ aesthetic score data used as part of the training process, and built a tool so people could run searches and explore the data. Andy did some extensive analysis of things like the domains scraped for the images and names of celebrities and artists represented in the data. His write-up here explains our project in detail and some of the patterns we’ve uncovered so far.

# 2:10 am / machine-learning, ai, stable-diffusion, generative-ai, laion, training-data

Farmbound, or how I built an app in 2022. Stuart Langridge describes the architecture and decision process behind his new mobile web game, Farmbound.

# 11:23 pm / stuart-langridge, web

2022 » August

MTWTFSS
1234567
891011121314
15161718192021
22232425262728
293031