Simon Willison’s Weblog

Subscribe

Wednesday, 11th March 2009

Guardian + Lucene = Similar Articles + Categorisation. Alf Eaton loaded 13,000 Guardian articles tagged Science in to Solr and Lucene and is using Solr’s MoreLikeThisHandler to find related articles and automatically apply Guardian tags to Nature News articles.

# 12:53 pm / alf-eaton, full-text-search, guardian, lucene, naturenews, openplatform, search, solr

Get our full university data. “The Guardian’s university rankings are the most visited part of Education Guardian”—and now they’re available as a spreadsheet.

# 1:52 pm / datastore, guardian, leaguetables, openplatform, university

2009 » March

MTWTFSS
      1
2345678
9101112131415
16171819202122
23242526272829
3031