Making use of Open Data
Oooo! new data
I was excited to see that the Wales Audit Office had released a set of data as open data. Open data is data that anyone can find access and use and it is the most useful sort of data.
Dyfrig Williams wrote about the process they’d gone through to release this data set (a summary of the audit data from each local authority in Wales for each year). The data is a simple table and you can download it as a CSV file (essentially a file that will work in any spreadsheet programme) here.
But there are problems
I downloaded the file and quickly spotted some problems. These are not errors exactly but just things that are missing or inconsistent and will make some uses of the data a bit harder. But this is not a complaint, because one of the attractive features of open data is that I could resolve these problems. I can do this because the Wales Audit Office have released the data under the Open Government Licence. This tells me I don’t need their permission to do anything with the data and there are no limits to what I can do with it (apart from I have to make it clear where it came from).
I can fix the problems
These are the things I did to my copy of the data.
I changed the format of the “financial year column” because in the Wales Audit Office file some of these are numbers and some are text.
I added a column of GSS codes. GSS codes are codes that are used to identify local authorities (and other boundaries). Having the GSS code means you don’t have to worry about whether the data says Anglesey Council, or Isle of Anglesey Council or Ynys Môn. And with the GSS code I could add “polygons” for each council. Polygons are basically instructions on how to draw the outline of each council and information about where to put the drawing on a map.
And given the Wales Audit Office an improved file
And I’ve been able to hand back to the Wales Audit Office a KML file. This is a file suitable for use in mapping software. Anyone who wants to visualise the Wales Audit Office data on a map can just open the KML file and get going.
Why did I do this?
I’m part of the core team at ODI-Cardiff so I get excited about open data.
It took me a very few minutes.
I’m trying to get better at using a Google service called Fusion Tables and this is a good opportunity to experiment.
I’m actually quite interested in what this data might tell us.