I really don’t see how this is realistically achievable, even if the Canadian community suddenly grows by an order of magnitude. There are now 1.4 million interpolation lines in Canada. From what I understand from the responses in this thread, they are all potentially of questionable quality and therefore all need to be resurveyed.

That makes me wonder if this data belongs imported into OSM in the first place. It would be much better to prepare the data as an external dataset in a way that it can be easily used with OSM as fallback data. We’ve been doing this for many years now in the US with the house number interpolation data from TIGER. Nominatim (the search engine) can import it on the side and use it as a fallback in the US but it will always prefer house numbers from OSM if they are available. The added bonus of an external dataset would be that it can be easily updated with each new version of CanVec.

There is a quirk with the current tagging schema of interpolations. Each interpolation creates at least two address nodes that are on first sight indistinguishable from exactly mapped house numbers. You’d have to look if the address node is part of an interpolation way to understand that it is not an exact number but an estimate. Very few data users do that. And that’s where the low-quality interpolations do a lot more harm then it seems on first sight.

1 Like