Simon Willison’s Weblog

Subscribe
Atom feed for shot-scraper Random

67 posts tagged “shot-scraper”

shot-scraper is a command-line utility for taking screenshots of websites and scraping content from them using JavaScript.

2022

Dumping the HTML of a page using shot-scraper. New in 1.0 is the “shot-scraper html URL” command, which outputs the HTML of a page once JavaScript has finished executing there. You can pass in additional custom JavaScript to run before the shapshot is taken, and you can also specify a CSS selector on the page to return just that fragment of HTML.

# 15th October 2022, 9:30 pm / shot-scraper

shot-scraper 1.0 (via) Only a minor release in terms of features, but I decided that I'm comfortable enough with the CLI design at this point that I'm ready to stamp a 1.0 on it and commit to not making backwards-incompatible changes (at least without shipping a 2.0 release, which I'd like to avoid if possible).

Full release notes:

# 15th October 2022, 9:28 pm / cli, projects, shot-scraper

Release shot-scraper 1.0 — A command-line utility for taking automated screenshots of websites

Automating screenshots for the Datasette documentation using shot-scraper

Visit Automating screenshots for the Datasette documentation using shot-scraper

I released shot-scraper back in March as a tool for keeping screenshots in documentation up-to-date.

[... 1,810 words]

Release datasette-screenshots 0.62 — Screenshots of Datasette, taken using shot-scraper
Release shot-scraper 0.17 — A command-line utility for taking automated screenshots of websites

Weeknotes: Datasette Lite, s3-credentials, shot-scraper, datasette-edit-templates and more

Visit Weeknotes: Datasette Lite, s3-credentials, shot-scraper, datasette-edit-templates and more

Despite distractions from AI I managed to make progress on a bunch of different projects this week, including new releases of s3-credentials and shot-scraper, a new datasette-edit-templates plugin and a small but neat improvement to Datasette Lite.

[... 1,562 words]

Release shot-scraper 0.16 — A command-line utility for taking automated screenshots of websites
Release shot-scraper 0.15.1 — A command-line utility for taking automated screenshots of websites
Release shot-scraper 0.15 — A command-line utility for taking automated screenshots of websites
Release shot-scraper 0.14.3 — A command-line utility for taking automated screenshots of websites
Release shot-scraper 0.14.2 — A command-line utility for taking automated screenshots of websites

A tiny web app to create images from OpenStreetMap maps

Visit A tiny web app to create images from OpenStreetMap maps

Earlier today I found myself wanting to programmatically generate some images of maps.

[... 1,388 words]

Release shot-scraper 0.14.1 — A command-line utility for taking automated screenshots of websites
Release shot-scraper 0.14 — A command-line utility for taking automated screenshots of websites

simonw/datasette-screenshots (via) I started a new GitHub repository to automate taking screenshots of Datasette for marketing purposes, using my shot-scraper browser automation tool.

# 17th May 2022, 5:56 pm / projects, datasette, github-actions, shot-scraper

Release shot-scraper 0.13 — A command-line utility for taking automated screenshots of websites
Release shot-scraper 0.12 — A command-line utility for taking automated screenshots of websites

Pillar Point Stewards, pypi-to-sqlite, improvements to shot-scraper and appreciating datasette-dashboards

Visit Pillar Point Stewards, pypi-to-sqlite, improvements to shot-scraper and appreciating datasette-dashboards

This week I helped Natalie launch the Pillar Point Stewards website and built a new tool for loading PyPI package data into SQLite, in order to help promote the excellent datasette-dashboards plugin by Romain Clement.

[... 1,985 words]

Release shot-scraper 0.11 — A command-line utility for taking automated screenshots of websites
Release shot-scraper 0.10 — A command-line utility for taking automated screenshots of websites

Weeknotes: Tildes not dashes, and the big refactor

After last week’s shot-scraper distractions with Playwright, this week I finally managed to make some concrete progress on the path towards Datasette 1.0.

[... 1,292 words]

Instantly create a GitHub repository to take screenshots of a web page

Visit Instantly create a GitHub repository to take screenshots of a web page

I just released shot-scraper-template, a GitHub repository template that helps you start taking automated screenshots of a web page by filling out a form.

[... 1,177 words]

Scraping web pages from the command line with shot-scraper

Visit Scraping web pages from the command line with shot-scraper

I’ve added a powerful new capability to my shot-scraper command line browser automation tool: you can now use it to load a web page in a headless browser, execute JavaScript to extract information and return that information back to the terminal as JSON.

[... 1,277 words]

Release shot-scraper 0.9 — A command-line utility for taking automated screenshots of websites
Release shot-scraper 0.8 — A command-line utility for taking automated screenshots of websites
Release shot-scraper 0.7 — A command-line utility for taking automated screenshots of websites
Release shot-scraper 0.6 — A command-line utility for taking automated screenshots of websites

@newshomepages (via) Ben Welsh used my shot-scraper tool and GitHub Actions to launch a Twitter bot which tweets screenshots of newspaper homepages on a scheduled basis. Ben says: “The tech is so easy, I was able to pull it off in a couple hours at zero cost. A decade ago I ran a similar project using the cloud resources of the day. [...] It costs thousands of dollars and the screenshots were of much lower quality. Incredible progress!”

# 12th March 2022, 7:21 pm / twitter, github-actions, playwright, shot-scraper, ben-welsh

Release shot-scraper 0.5 — A command-line utility for taking automated screenshots of websites