Simon Willison’s Weblog

Subscribe
Atom feed for datasette Random

1,520 posts tagged “datasette”

Datasette is an open source tool for exploring and publishing data.

2022

Release datasette-upload-csvs 0.8.2 — Datasette plugin for uploading CSV files and converting them to database tables
Release datasette-upload-csvs 0.8.1 — Datasette plugin for uploading CSV files and converting them to database tables
Release datasette-upload-csvs 0.8 — Datasette plugin for uploading CSV files and converting them to database tables

Spevktator: OSINT analysis tool for VK. This is a really cool project that came out of a recent Bellingcat hackathon. Spevktator takes 67,000 posts from five popular Russian news channels on VK (a popular Russian social media platform) and makes them available in Datasette, along with automated translations to English, post sharing metrics and sentiment analysis scores. This README includes some detailed analysis of the data, plus a link to an Observable notebook that implements custom visualizations against queries run directly against the Datasette instance.

# 5th September 2022, 8:48 pm / political-hacking, datasette, observable, bellingcat

Exploring the training data behind Stable Diffusion

Visit Exploring the training data behind Stable Diffusion

Two weeks ago, the Stable Diffusion image generation model was released to the public. I wrote about this last week, in Stable Diffusion is a really big deal—a post which has since become one of the top ten results for “stable diffusion” on Google and shown up in all sorts of different places online.

[... 2,897 words]

Release datasette-sitemap 1.0 — Generate sitemap.xml for Datasette sites
Release datasette-block-robots 1.1 — Datasette plugin that blocks robots and crawlers using robots.txt
Release datasette-sitemap 0.1.1 — Generate sitemap.xml for Datasette sites
Release datasette-sitemap 0.1 — Generate sitemap.xml for Datasette sites

Building a searchable archive for the San Francisco Microscopical Society

Visit Building a searchable archive for the San Francisco Microscopical Society

The San Francisco Microscopical Society was founded in 1870 by a group of scientists dedicated to advancing the field of microscopy.

[... 1,845 words]

Release datasette-multiline-links 0.1 — Make multiple newline separated URLs clickable in Datasette
Release datasette-publish-vercel 0.14.2 — Datasette plugin for publishing data using Vercel
Release datasette-edit-schema 0.5.1 — Datasette plugin for modifying table schemas

Digitizing 55,000 pages of civic meetings (via) Philip James has been building public, searchable archives of city council meetings for various cities—Oakland and Alamedia so far—using my s3-ocr script to run Textract OCR against the PDFs of the minutes, and deploying them to Fly using Datasette. This is a really cool project, and very much the kind of thing I’ve been hoping to support with the tools I’ve been building.

# 22nd August 2022, 4:26 pm / archiving, ocr, political-hacking, datasette, fly

Analyzing ScotRail audio announcements with Datasette—from prototype to production

Visit Analyzing ScotRail audio announcements with Datasette - from prototype to production

Scottish train operator ScotRail released a two-hour long MP3 file containing all of the components of its automated station announcements. Messing around with them is proving to be a huge amount of fun.

[... 4,428 words]

Release datasette-mp3-audio 0.2 — Turn .mp3 URLs into an audio player in the Datasette interface
Release datasette-mp3-audio 0.1 — Turn .mp3 URLs into an audio player in the Datasette interface

The Datasette Newsletter: Datasette Lite, Datasette Tutorials, Datasette Cloud. It’s been quite a while since I’ve sent one of these out now—hoping to get this on to a more regular schedule.

# 19th August 2022, 1:20 am / datasette

Plugin support for Datasette Lite

Visit Plugin support for Datasette Lite

I’ve added a new feature to Datasette Lite, my distribution of Datasette that runs entirely in the browser using Python and SQLite compiled to WebAssembly. You can now install additional Datasette plugins by passing them in the URL.

[... 865 words]

Release datasette-sentry 0.2 — Datasette plugin for configuring Sentry
Release datasette-sentry 0.2a1 — Datasette plugin for configuring Sentry
Release datasette 0.62 — An open source multi-tool for exploring and publishing data

Litestream backups for Datasette Cloud (and weeknotes)

My main focus this week has been adding robust backups to the forthcoming Datasette Cloud.

[... 1,604 words]

datasette on Open Source Insights (via) Open Source Insights is "an experimental service developed and hosted by Google to help developers better understand the structure, security, and construction of open source software packages". It calculates scores for packages using various automated heuristics. A JSON version of the resulting score card can be accessed using https://deps.dev/_/s/pypi/p/{package_name}/v/

# 11th August 2022, 1:06 am / open-source, security, datasette

Release datasette-scale-to-zero 0.2 — Quit Datasette if it has not received traffic for a specified time period

Introducing sqlite-html: query, parse, and generate HTML in SQLite (via) Another brilliant SQLite extension module from Alex Garcia, this time written in Go. sqlite-html adds a whole family of functions to SQLite for parsing and constructing HTML strings, built on the Go goquery and cascadia libraries. Once again, Alex uses an Observable notebook to describe the new features, with embedded interactive examples that are backed by a Datasette instance running in Fly.

# 3rd August 2022, 5:31 pm / go, html, sqlite, datasette, alex-garcia

Cleaning data with sqlite-utils and Datasette (via) I wrote a new tutorial for the Datasette website, showing how to use sqlite-utils to import a CSV file, clean up the resulting schema, fix date formats and extract some of the columns into a separate table. It’s accompanied by a ten minute video originally recorded for the HYTRADBOI conference.

# 31st July 2022, 7:57 pm / documentation, tutorials, datasette, sqlite-utils

Release datasette-sqlite-fts4 0.3.2 — Datasette plugin that adds custom SQL functions for working with SQLite FTS4

Weeknotes: Joining the board of the Python Software Foundation

Visit Weeknotes: Joining the board of the Python Software Foundation

A few weeks ago I was elected to the board of directors for the Python Software Foundation.

[... 2,081 words]

Release datasette-sqlite-fts4 0.3.1 — Datasette plugin that adds custom SQL functions for working with SQLite FTS4