For a project that I’m working on at the National Museum of Australia, I’ve started collecting various sources of date-identified data. Most recently I had a go at extracting historical population data from the Australian Bureau of Statistics.
The data can all be downloaded as .xls files, but they’re not simple, flat spreadsheets – they’re data [...]
discontents
working for the triumph of content over form, ideas over control, people over systems
Out of the cube
Cloudy biographies and portrait walls
With a bit of time to play over Christmas I had a go at applying some of the techniques described at ProgrammingHistorian to the ADB Online. I thought it might be interesting to create some word clouds, both for what they could reveal about the content of the ADB, and to see what they had [...]