Simon Willison’s Weblog

September 2022

81 posts: 9 entries, 28 links, 9 quotes, 35 beats

Sept. 20, 2022

I Resurrected “Ugly Sonic” with Stable Diffusion Textual Inversion (via) “I trained an Ugly Sonic object concept on 5 image crops from the movie trailer, with 6,000 steps [...] (on a T4 GPU, this took about 1.5 hours and cost about $0.21 on a GCP Spot instance)”

# 3:35 am / machine-learning, ai, max-woolf, stable-diffusion, generative-ai, text-to-image

Wasmtime Reaches 1.0: Fast, Safe and Production Ready! The Bytecode Alliance are making some confident promises in this post about the performance and stability of their Wasmtime WebAssembly runtime. They also highlight some exciting use-cases for WebAssembly on the server, including safe 3rd party plugin execution and User Defined Functions running inside databases.

# 10:11 pm / webassembly

Fastly Compute@Edge JS Runtime (via) Fastly’s JavaScript runtime, designed to run at the edge of their CDN, uses the Mozilla SpiderMonkey JavaScript engine compiled to WebAssembly.

# 10:20 pm / javascript, mozilla, spidermonkey, webassembly, fastly

Sept. 21, 2022

Introducing LiteFS (via) LiteFS is the new SQLite replication solution from Fly, now ready for beta testing. It’s from the same author as Litestream but has a very different architecture; LiteFS works by implementing a custom FUSE filesystem which spies on SQLite transactions being written to the journal file and forwards them on to other nodes in the cluster, providing full read-replication. The signature Litestream feature of streaming a backup to S3 should be coming within the next few months.

# 6:56 pm / databases, replication, sqlite, fly, litestream, ben-johnson

Sept. 24, 2022

Running training jobs across multiple nodes scales really well. A common assumption is that scale inevitably means slowdowns: more GPUs means more synchronization overhead, especially with multiple nodes communicating across a network. But we observed that the performance penalty isn’t as harsh as what you might think. Instead, we found near-linear strong scaling: fixing the global batch size and training on more GPUs led to proportional increases in training throughput. On a 1.3B parameter model, 4 nodes means a 3.9x gain over one node. On 16 nodes, it’s 14.4x. This is largely thanks to the super fast interconnects that major cloud providers have built in: @awscloud EC2 P4d instances provide 400 Gbps networking bandwidth, @Azure provides 1600 Gbps, and @OraclePaaS provides 800 Gbps.

Linden Li

# 4:03 pm / machine-learning, ai, gpus
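The speedups quoted above can be turned into a rough scaling efficiency figure (speedup divided by node count). This is only a back-of-the-envelope sketch using the two numbers from the quote; the function name `scaling_efficiency` is made up for illustration and is not from the original thread.

```python
def scaling_efficiency(speedup, nodes):
    """Fraction of ideal linear scaling achieved (1.0 would be perfectly linear)."""
    return speedup / nodes


# (nodes, speedup over a single node) as quoted for the 1.3B parameter model
quoted = [(4, 3.9), (16, 14.4)]

for nodes, speedup in quoted:
    efficiency = scaling_efficiency(speedup, nodes)
    print(f"{nodes} nodes: {speedup}x speedup, {efficiency:.1%} of linear")
```

On those figures, 4 nodes retain about 97.5% of ideal linear scaling and 16 nodes about 90%.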

Release webvtt-to-json 0.1 — Convert WebVTT to JSON, optionally removing duplicate lines
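As a rough illustration of what converting WebVTT to JSON with duplicate removal can look like, here is a minimal sketch. It is not the implementation used by webvtt-to-json: the function name `webvtt_to_cues`, the output shape, and the rule of dropping a caption line when it repeats the previously kept line are all assumptions made for this example.

```python
import json
import re

# Matches a cue timing line such as "00:00:01.000 --> 00:00:04.000"
# (the hours component is optional in WebVTT timestamps).
TIMING = re.compile(
    r"^((?:\d+:)?\d{2}:\d{2}\.\d{3})\s+-->\s+((?:\d+:)?\d{2}:\d{2}\.\d{3})"
)


def webvtt_to_cues(text, dedupe=False):
    """Parse WebVTT text into a list of {"start", "end", "lines"} dicts.

    Assumption for this sketch: with dedupe=True, a caption line is dropped
    when it is identical to the previously kept line, and cues left with no
    lines are skipped.
    """
    cues = []
    previous = None  # last caption line kept, only used when dedupe=True
    blocks = re.split(r"\n\s*\n", text.replace("\r\n", "\n").strip())
    for block in blocks:
        rows = block.split("\n")
        timing_index = next(
            (i for i, row in enumerate(rows) if TIMING.match(row)), None
        )
        if timing_index is None:
            continue  # header, NOTE or STYLE block: no timing line
        match = TIMING.match(rows[timing_index])
        lines = [row.strip() for row in rows[timing_index + 1:] if row.strip()]
        if dedupe:
            kept = []
            for line in lines:
                if line != previous:
                    kept.append(line)
                    previous = line
            lines = kept
            if not lines:
                continue
        cues.append(
            {"start": match.group(1), "end": match.group(2), "lines": lines}
        )
    return cues


if __name__ == "__main__":
    sample = (
        "WEBVTT\n\n"
        "00:00:00.000 --> 00:00:02.000\nhello\n\n"
        "00:00:02.000 --> 00:00:04.000\nhello\nworld\n"
    )
    print(json.dumps(webvtt_to_cues(sample, dedupe=True), indent=2))
```

Auto-generated captions often repeat the previous line in each new cue as text scrolls, which is why dropping consecutive repeats is a plausible way to get a readable transcript.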

Sept. 25, 2022

Release ttml-to-json 0.1 — Convert TTML to JSON
Release webvtt-to-json 0.2 — Convert WebVTT to JSON, optionally removing duplicate lines
Release ttml-to-json 0.2 — Convert TTML to JSON
TIL Whisky sour — I picked up the recipe for this one from [this video](https://www.tiktok.com/t/ZTRaxyxQP/) by [@notjustabartender](https://www.tiktok.com/@notjustabartender) on TikTok.
TIL Ensure labels exist in a GitHub repository — I wanted to ensure that when [this template repository](https://github.com/simonw/action-transcription) was used to create a new repo that repo would have a specific set of labels.

Sept. 26, 2022

Release datasette 0.63a0 — An open source multi-tool for exploring and publishing data

Sept. 28, 2022

TIL Athena error: The specified key does not exist — I was trying to run Athena queries against compressed JSON log files stored in an S3 bucket.
Release datasette-pretty-json 0.2.2 — Datasette plugin that pretty-prints any column values that are valid JSON objects or arrays

Sept. 29, 2022

Release datasette-publish-fly 1.2 — Datasette plugin for publishing data using Fly

Exploring 10m scraped Shutterstock videos used to train Meta’s Make-A-Video text-to-video model

Make-A-Video is a new “state-of-the-art AI system that generates videos from text” from Meta AI. It looks incredible—it really is DALL-E / Stable Diffusion for video. And it appears to have been trained on 10m video preview clips scraped from Shutterstock.

[... 923 words]

TIL HTML video that loads when the user clicks play — Today I figured out how to use the `<video>` tag to show a static thumbnail that gets replaced by the loaded video only when the user clicks play.

Sept. 30, 2022

A tool to run caption extraction against online videos using Whisper and GitHub Issues/Actions

I released a new project this weekend, built during the Bellingcat Hackathon (I came second!). It’s called Action Transcription, and it’s a tool for capturing captions and transcripts from online videos.

[... 1,362 words]

nat/natbot (via) Extremely devious hack by Nat Friedman: it opens a browser using Playwright and then passes a DOM representation to GPT-3 in order to power a chat-style interface for driving the browser. It’s worth diving into the code to look at the prompt it uses, which is fascinating.

# 1:01 am / playwright, gpt-3, openai

TIL GraphQL fragments — One of [the scripts](https://github.com/simonw/datasette.io/blob/main/build_directory.py) that builds and deploys [datasette.io](https://datasette.io/) uses a GraphQL query to retrieve information from GitHub about the repositories used for the various Datasette tools and plugins.

Weeknotes: Datasette Cloud preview invitations

This week I finally started sending out invitations for people to try out the preview of the new Datasette Cloud, my SaaS offering for Datasette.

[... 713 words]
