Hello,
This is a thread to discuss the proposed bulk POI name change in Azerbaijan.
The discussion has started in the thread Concerns about Azerbaijan
To the name= tags of POIs, MenKauRa has been adding, after a backslash, also the name of the town. In the same changeset discussion, he says that he feels it necessary to do this since some people’s apps might not find the POI otherwise. It seems to me that MenKauRa has simply reinvented, in a poorer form, the long-deprecated is_in:city tag. The result is that names of POIs in Azerbaijan look like nowhere else; they look like the result of a bad import. As an example, see the cafe at 40.1731, 49.4699 and the other POIs to the northwest of it: there is absolutely no reason to specify the town in the name= tag (moreover, many of these are generic names, not the POI’s actual name).
First of all, I’d like to express gratitude and respect to @menkaura, who has been drawing the map of Azerbaijan for 13 years, and who has drawn almost all the thousands of the POIs I’m talking about now. I don’t agree with the naming approach that he used, but without his contribution, there wouldn’t be anything to discuss at all.
My proposal is to change such names in bulk by just removing the part after a backslash.
With AI assistance, I prepared a draft changeset to evaluate existing data, possible impact, and so on. Here is an overview spreadsheet: https://docs.google.com/spreadsheets/d/14-kqFjgPMc7wvABBiaqSpgs0j3mwOnWjDbBcqP5843k/edit?usp=sharing
There are three sheets in the spreadsheet:
-
Changes. List of all tag changes sorted by the number of occurrences of each value of the name’s part after a backslash. It’s supposed to be the name of a settlement or another place where the POI is located.
How did I get it? Via Overpass, I got all nodes in Azerbaijan that contain a backslash in the name tag. Then I used Claude to generate a changeset and a report, providing the .osm file with raw data from Overpass, providing an example of a changeset generated by JOSM (I manually changed the name of one POI), and asking to remove everything after backslashes in name tags.
To evaluate the changeset, I additionally asked to generate a spreadsheet that you can see above (I used the same context, so theoretically Claude could generate the report not based on the actual changeset but on its own previous response).
The number of occurrences is calculated in the spreadsheet using a formula.
-
Suspicious values. Via Overpass, I got all place=* *in Azerbaijan. Then I provided this raw data and previously generated report to Claude and asked it to find substrings supposed to be removed from the name tag, which don’t match with any place in OSM.
The majority of the found values are names of places anyway, like streets, residential complexes, and so on, which just aren’t tagged with the place=** tag.
Previously, two cases that obviously shouldn’t be changed the same way have been found manually. It was discussed here Concerns about Azerbaijan - #13 by evgenykatyshev
-
Summary. It has been added by Claude on its own. Contains total numbers and the most popular values.
To avoid possible AI-related mistakes, the same changeset and report might be generated with a plain script.
I’d like to discuss with the community the best way to perform this bulk update, if it’s considered to be done at all.