The Sheffield City Council Open Data website contains a lot of datasets that are the kind of thing that would be excellent to import into OSM and I wanted to begin the process of identifying which datasets would be suitable for an import into OSM and classify them in terms of priority.
Some of the data is under the INSPIRE license or doesn’t have a license listed. This will need contact with the council to determine if the data can be re-released under OGL or otherwise licensed for OSM. Everything under the OGL is ready to go from the looks of things but will still need care when importing as well as attribution.
Feedback, Help and Discussion
From here I now need feedback on this list of data sets as to where the community priorities lie in Sheffield as well as if I’ve missed any important data sets from the Sheffield City Council Open Data.
It would be a massive help as well if more experienced OSM members were willing to lend a hand evaluating the import potential and problems associated with each dataset. Feedback and discussion is greatly appreciated!
I used the grit bin data a while ago, it was good. I considered using the waste bin data, it was questionable around The Moor, so I wasn’t sure on the age of the data. I can’t remember if that was the show stopper, or the technical, or because I moved away.
The grit bin data has been incredibly useful from my experience surveying round Sheffield using street complete! Thank you for the work you put into that! I want to see if it’d be possible to set up a semi-automated import from the live data for grit bins at some point in the future.
As someone with experience doing imports in Sheffield what would be your recommendation for evaluating these datasets for import viability?
Many of these datasets will probably need some form of manual conflation, rather than just a straight import. But it would be great if people can make use of them. Any sort of conflation/matching and ongoing maintenance is much easier if the is a suitable ID for each object in the dataset. It’s also helpful if the datasets are complete (in the sense that they include all objects of a certain type in OSM in that area) for then you can look for unexpected objects in OSM as well as missing ones.
If anyone is interested in using the Public Rights of Way data, do let me know, as I’d be happy to add it to my comparison tool at https://osm.mathmos.net/prow/progress/ if it would be useful to someone.
“live data” do you just mean update the data, or is there something more available?
It would be interesting to compare current OSM to new data, see where OSM has changed since then, if the council has been adding adding, removing or moving them.
Defiantly check the data out before doing anything, check out your local ones, see if they are accurate in terms of up to date or geospatially. Consider each dataset totally different, they may be made by different people, made years apart, have different attributes.
Some of this data wont be good enough, sometimes its just part of the dataset, sometimes its the whole dataset. That’s fine, move on, don’t be a completionist.
Just looking at the “Public Toilets” dataset, it is so small (41 locations) that you should do it all manually. However I spot checked one (100052216711) it seems like the toilets are long gone. The metadata says “20 February 2025”, but that’s when the file was uploaded, the data must be old. This is pretty typical, my new council has a toilets dataset that is really outdated too. Do manual surveys/open notes, but don’t trust the dataset.
The “Litter Bins on the Highway Network” dataset has a “startdate” attribute, presumably the new ones will be way more trust worth than older ones, but check it out. Have a survey and see.
Oh nothing actually real-time but some of those data sets are supposedly updated nightly so monitoring them for changes and updates and writing some automation to identify any changes made and generate a change-set that can be reviewed and then pushed to OSM so it doesn’t become an import that needs re-doing every time the data changes enough.
And ty for all the tips! Will publish updates here when I get round to surveying the accuracy of the different data sets and absolutely need to keep my perfectionism in check.
A lot of these datasets are out of date. The public toilets one is hilariously out of date, showing toilets that have been closed for decades. I think these are better used as indicators for where to go and survey rather than importing directly.
Some of the datasets are more or less out of date than the others it seems, mostly the ones the Council has a legal obligation to keep up to date info about like street lighting seem to be more up to date.
I’m not sure how to do it but could always set up some of them as a street complete quest like the grit bins have been?