Simon Willison’s Weblog

Subscribe

Wednesday, 18th December 2013

What are some good resources to learn how to cleanse data using Python?

http://gnosis.cx/TPiP/ “Text Processing in Python” is a free online book that covers a bunch of useful topics related to data cleanup. It’s over 10 years old now but is still mostly relevant—the chapter on regular expressions is particularly good.

2013 » December

MTWTFSS
      1
2345678
9101112131415
16171819202122
23242526272829
3031