Hello everyone,
I’m planning to fix up a data issue in Bolivia and would like to get your feedback before I proceed.
The background:
Quite a while ago, user rodolfovargas imported 10,000 trees (natural=tree ) in Bolivia. During that import, the circumference tag was set in centimeters, but the unit of measurement was not specified.
According to the OSM wiki, the value for circumference should be in meters or, according to the common practice the unit should be explicitly stated (e.g., circumference=250 cm ).
Currently, all 10,000 nodes still have the original values (e.g., circumference=250 ), which are clearly meant to be centimeters.
What I plan to do:
-
For 9,999 of these trees, I will divide the value by 100, thus converting centimeters to meters, like this:
circumference=123→circumference=1.23
circumference=250→circumference=2.50 -
For 1 tree, the value is
circumference=0. Since infinitely thin trees do not really exist, I will delete this tag entirely.
Why this matters:
Besides the fact that the current tagging is simply incorrect, these unrealistically thick trees are distorting statistics. I need accurate statistical data to estimate height of trees in a JOSM plugin I’m developing (UrbanEye3D).
Fixing these values will greatly improve its functionality and the overall data quality in the region and tree species statistics worldwide.
How I plan to do it:
I’ve written a simple Python script that:
- Reads node IDs from an OSC file of the import in question.
- Fetches the current version of each node from a fresh
planet.osmfile . - Compares the current
circumferencevalue with the imported one.
If they are still the same (i.e., no one has edited them since the import), the script applies the changes described above.
As of now, all 10,000 nodes still carry the original values.
If anyone has suggestions, concerns, or objections, I’d be happy to hear them before I start.
Best regards,
Zkir