Simon Willison’s Weblog


1,520 posts tagged “datasette”

Datasette is an open source tool for exploring and publishing data.

2023

Release datasette-debug-permissions 0.2 — A Datasette plugin that outputs debug information about permission checks
Release datasette-debug-permissions 0.1 — A Datasette plugin that outputs debug information about permission checks
Release datasette-configure-fts 1.1.2 — Datasette plugin for enabling full-text search against selected table columns

Datasette Cloud and the Datasette 1.0 alphas. I sent out the Datasette Newsletter for the first time in quite a while, with updates on Datasette Cloud, the Datasette 1.0 alphas, a note about the security vulnerability in those alphas and a summary of some of my research into combining LLMs with Datasette.

# 22nd August 2023, 7:56 pm / projects, datasette, datasette-cloud, llms

Datasette 1.0 alpha series leaks names of databases and tables to unauthenticated users. I found and fixed a security vulnerability in the Datasette 1.0 alpha series, described in this GitHub security advisory.

The vulnerability allowed unauthenticated users to see the names of the databases and tables in an otherwise private Datasette instance—though not the actual table contents.

The fix is now shipped in Datasette 1.0a4.

The vulnerability affected Datasette Cloud as well, but thankfully I was able to analyze the access logs and confirm that no unauthenticated requests had been made against any of the affected endpoints.

# 22nd August 2023, 5:44 pm / releases, security, datasette

Release datasette 1.0a4 — An open source multi-tool for exploring and publishing data
Release datasette-auth-tokens 0.4a1 — Datasette plugin for authenticating access using API tokens
Release datasette-ripgrep 0.8.1 — Web interface for searching your code using ripgrep, built as a Datasette plugin
Release datasette-publish-fly 1.3.1 — Datasette plugin for publishing data using Fly
Release datasette-edit-schema 0.5.3 — Datasette plugin for modifying table schemas

Datasette Cloud, Datasette 1.0a3, llm-mlc and more

Datasette Cloud is now a significant step closer to general availability. The Datasette 1.0a3 alpha release is out, with a mostly finalized JSON format for 1.0. Plus new plugins for LLM and sqlite-utils and a flurry of things I’ve learned.

[... 1,690 words]

Introducing datasette-write-ui: a Datasette plugin for editing, inserting, and deleting rows. Alex García is working with me on Datasette Cloud for the next few months, graciously sponsored by Fly. We will be working in public, releasing open source code and documenting how to build a multi-tenant SaaS product using Fly Machines.

Alex’s first project is datasette-write-ui, a plugin that finally lets you directly edit data stored inside Datasette. Alex wrote about the plugin on our new Datasette Cloud blog.

# 16th August 2023, 1:48 am / plugins, datasette, datasette-cloud, fly, alex-garcia

Welcome to Datasette Cloud. We launched the Datasette Cloud blog today! The SaaS hosted version of Datasette is ready to start onboarding more users—this post describes what it can do so far and hints at what’s planned to come next.

# 16th August 2023, 1:46 am / projects, datasette, datasette-cloud

Release datasette-render-markdown 2.2.1 — Datasette plugin for rendering Markdown

Dependency Management Data. This is a really neat CLI tool by Jamie Tanna, built using Go and SQLite but with a feature that embeds a Datasette instance (literally shelling out to start the process running from within the Go application) to provide an interface for browsing the resulting database.

It addresses the challenge of keeping track of the dependencies used across an organization, by gathering them into a SQLite database from a variety of different sources—currently Dependabot, Renovate and some custom AWS tooling.

The “Example” page links to a live Datasette instance and includes video demos of the tool in action.
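The "shelling out" approach described above can be sketched in a few lines. The original tool does this from Go; this is a rough Python equivalent, not the tool's actual code. The function names and the `dependencies.db` filename are hypothetical, and it assumes the `datasette` executable is installed and on the PATH.

```python
import shutil
import subprocess


def build_datasette_command(db_path, port=8001):
    """Build the argument list for `datasette serve <db> --port <port>`."""
    return ["datasette", "serve", db_path, "--port", str(port)]


def start_datasette(db_path, port=8001):
    """Start Datasette as a child process and return the Popen handle.

    Assumes `datasette` is installed; raises if it cannot be found.
    """
    if shutil.which("datasette") is None:
        raise RuntimeError("datasette executable not found on PATH")
    return subprocess.Popen(build_datasette_command(db_path, port))
```

A caller would run something like `proc = start_datasette("dependencies.db")` once the database has been built, then call `proc.terminate()` on shutdown so the child process does not outlive the parent.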

# 11th August 2023, 3:54 pm / cli, packaging, sqlite, datasette

Release datasette-upload-dbs 0.3 — Upload SQLite database files to Datasette

Datasette 1.0a3. A new Datasette alpha release. This one previews the new default JSON API design that’s coming in 1.0—the single most significant change in the 1.0 milestone, since I plan to keep that API stable for many years to come.
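As a rough illustration of what consuming that JSON might look like, here is a minimal sketch. It assumes the default response is an object with `ok`, `next` and `rows` keys, where `rows` is a list of objects keyed by column name; check the release notes for the exact shape. The sample payload and the `rows_from_response` helper are made up for illustration.

```python
import json

# Hypothetical payload in the assumed shape; not real output from any instance.
SAMPLE_RESPONSE = (
    '{"ok": true, "next": null, '
    '"rows": [{"id": 1, "name": "Cleo"}, {"id": 2, "name": "Pancakes"}]}'
)


def rows_from_response(text):
    """Parse a JSON response body and return its list of row dictionaries."""
    payload = json.loads(text)
    if not payload.get("ok"):
        raise ValueError("response did not report ok: true")
    return payload["rows"]


for row in rows_from_response(SAMPLE_RESPONSE):
    print(row["id"], row["name"])
```

Returning rows as plain dictionaries keeps client code short, which is part of the appeal of a stable default format.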

# 9th August 2023, 8:49 pm / json, projects, datasette

Release datasette 1.0a3 — An open source multi-tool for exploring and publishing data

Weeknotes: Plugins for LLM, sqlite-utils and Datasette

The principal theme for the past few weeks has been plugins.

[... 1,203 words]

Release datasette-upload-dbs 0.2 — Upload SQLite database files to Datasette
Release datasette-auth-tokens 0.4a0 — Datasette plugin for authenticating access using API tokens

Weeknotes: Self-hosted language models with LLM plugins, a new Datasette tutorial, a dozen package releases, a dozen TILs

A lot of stuff to cover from the past two and a half weeks.

[... 1,742 words]

Data analysis with SQLite and Python. I turned my 2hr45m workshop from PyCon into the latest official tutorial on the Datasette website. It includes an extensive handout which should be useful independently of the video itself.

# 2nd July 2023, 4:48 pm / python, speaking, sqlite, my-talks, datasette

Release datasette-render-markdown 2.2 — Datasette plugin for rendering Markdown
Release datasette-upload-csvs 0.8.3 — Datasette plugin for uploading CSV files and converting them to database tables
Release datasette-render-images 0.4 — Datasette plugin that renders binary blob images using data-uris

Weeknotes: Parquet in Datasette Lite, various talks, more LLM hacking

I’ve fallen a bit behind on my weeknotes. Here’s a catchup for the last few weeks.

[... 769 words]

Vector Search. Amjith Ramanujam provides a very thorough tutorial on implementing vector similarity search using SentenceTransformers embeddings (all-MiniLM-L6-v2) executed using sqlite-utils, then served via datasette-sqlite-vss and deployed using Fly.
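The core idea behind vector similarity search can be sketched without any of those tools. This is not the tutorial's code: it swaps in tiny hand-written vectors in place of real SentenceTransformers embeddings and uses a brute-force cosine similarity comparison rather than datasette-sqlite-vss. All names and values here are illustrative.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def most_similar(query, documents, top_n=2):
    """Return the names of the top_n documents closest to the query vector."""
    scored = [(cosine_similarity(query, vec), name) for name, vec in documents.items()]
    scored.sort(reverse=True)
    return [name for _, name in scored[:top_n]]


# Toy three-dimensional "embeddings"; real models produce hundreds of dimensions.
DOCUMENTS = {
    "cats": [1.0, 0.0, 0.1],
    "dogs": [0.9, 0.1, 0.2],
    "finance": [0.0, 1.0, 0.0],
}

print(most_similar([1.0, 0.0, 0.0], DOCUMENTS))  # ['cats', 'dogs']
```

Brute force is fine for a handful of rows. A dedicated index matters once the table grows large enough that comparing the query against every stored vector becomes too slow.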

# 2nd June 2023, 5:02 am / sqlite, ai, datasette, fly, vector-search, embeddings

Exploration de données avec Datasette. One of the great delights of open source development is seeing people run workshops on your project, even more so when they’re in a language other than English! Romain Clement presented this French workshop for the Python Grenoble meetup on 25th May 2023, using GitHub Codespaces as the environment. It’s pretty comprehensive, including a 300,000+ row example table which illustrates Datasette plugins such as datasette-cluster-map and datasette-leaflet-geojson.

# 27th May 2023, 12:36 am / tutorials, datasette, github-codespaces, leaflet

Release datasette-sqlite-url-lite 0.1 — A pure Python alternative to sqlite-url ready to be used in Datasette Lite