Simon Willison’s Weblog

Subscribe

3rd August 2008 - Link Blog

PDFMiner. Useful looking PDF parsing library in Python—can produce an XML representation of the text and style information in a PDF document.

This is a link post by Simon Willison, posted on 3rd August 2008.

Monthly briefing

Sponsor me for $10/month and get a curated email digest of the month's most important LLM developments.

Pay me to send you less!

Sponsor & subscribe