Overture Maps first dataset release

Reading that, it had a link to Overture Places.

Zoomed in to a local shopping centre at Overture Places, which we have as: OpenStreetMap.

So, from that one small spot - the spot they have the real estate agent mapped is out on the footpath, & has the wrong agency name; the convenience store is named with two variations of the same current name, one in the correct spot, one out in the street, but also a third time as IGA, which changed ~4 years ago; The Facialist closed down ~3 years ago; & some of the business names just don’t exist :face_with_monocle: :thinking:

Even “better”, just up a bit they have Sea World Gold Coast marked: Overture Places
Sea World is one of the biggest theme parks in all of Australia, but is about 20km north of here, not in a suburban back yard!

3 Likes

In regard to the question of reusing this info in OSM, another thing I noticed is lots of “work from home” businesses shown, often with names attached to them.

I happen to know that hardly any (if any?) of these businesses have signs posted outside saying that this business is here.

In this case, we’d have to very careful with what we copy, & as always, it shouldn’t be done without first verifying what is on the ground!

9 Likes

I have seen similar effects in other datasets - usually it is result of importing official business register, full of such business without an actual offices.

3 Likes

A “short list” consensus (initial distillation of quality w.r.t. the .alpha-0) is that it is a relative shit-show.

It is .alpha-0. I’ve been on teams (Apple application/OS software, Adobe system software, Santa Cruz Operation Xenix…) that make such (very early versions of) things, and in retrospect, they are “not usually particularly great.”

It’s the first “pickle out of the barrel.” We (OSM) might offer patience, we might say “do better,” we might openly mock and ridicule (I’m not), we might encourage patience and better results in future endeavors.

We might also “double down” and continue (as @hfs , @SimonPoole , and others have) to “tear it open and see how it bleeds.” I’m in the latter camp, not necessarily doing the tearing, but certainly munching popcorn as others do. Thanks to everyone who tears this stuff apart.

1 Like

To be fair, the Overture Maps data also provides the confidence of a POI:

confidence:
        description: The confidence of the existence of the place. It's a number between 0 and 1. 0 means that we're sure that the place doesn't exist (anymore). 1 means that we're sure that the place exists. If there's no value for the confidence, it means that we don't have any confidence information.

schema/schema/places/place.yaml at main · OvertureMaps/schema · GitHub

Many of the POIs in question have a very small confidence. Probably the POIs with low confidence should be filtered out and only POIs with sufficiently high confidence should be used.
By the way, the color on this map indicates the category of the POI.

2 Likes

In my opinion this stuff should only be used as input for StreetComplete or EveryDoor quest to ask mapper “Is shop/gas station/barber/… at this location?”. And leave everything else up to mapper to add. And if mapper says no, remove entry from quest.

7 Likes

Thanks!

Can we see that confidence level at all?

& again, do we know the legend?

Overture Maps data contains the confidence. This map (only a small area) also shows the confidence, by clicking on the marker:
hmb: places: 931 rows

Here it is:

1 Like

Yowza, OSM-community. Thank you to all who tear this apart. I am delighted to see this sort of “eat the red meat” we have been thrown. We are up to the task. Good discussion can only help; good for us.

The main issue I see is, we know the POI data is junk (and we actually knew that before it was released because we’ve been on the receiving end of complaints about the low quality Facebook data for years), and anybody that actually inspects the data will realize that it is junk, but that has nothing to with outside perception.

Not just the Facebook fanbois and shills, but the media have decided that this is a wonderful thing and the last thing they will do is point to OSM as the better data source.

6 Likes

Google is not much better. Too many street names that are wrong, even when sent photographic evidence left wrong. So you get routed on Wase to the wrong location or it can’t find a location. How POI can have different streetnames on them versus the streetname right out the doorstep is a riddle. OSM QA will sure as heck let you know.
Having worked with MapWithAI pulling in MS footprint buildings, 99.99% requires correcting so if one sees buildings, a token representation or a figment of the shadow hunters. The data is dated at that, many newer buildings missing.

Without some sort of filtering, I doubt that it is even worth doing that with it. Locally to me (in England) there are about 3 times as many invalid POIs as valid ones, and (very small sample but a rough guess) maybe 30% of pois missing, though that might be a process or category issue at Overture Maps’ side.

As Simon said above, OSM has had numerous reports about the problems with Facebook’s data (whether used in FB, Instagram or elsewhere) - as reports to the the help site, forums like this and to the DWG where we pretty much created a special reporting category for it.

9 Likes

For shop data it is much better, at least in my area.

3 Likes

Have they been attempting to restructure the data?

I am seeing similar in my town.

Lots of unverifiable work from home, or where the business is
registered to receive mail. On this subject one of the bowling clubs is
shown at a residential address which is probably where the secretary
lives rather than on the bowling green and clubhouse where OSM has it
correctly placed.

Outdated POIs, seeing both previous businesses and current at the same
place.

Some seriously misplaced such as a school that is in a village several
miles away.

One that looks like it could be a GDPR breach, the name of a lady on
her address with nothing more to indicate a type of business. I know
her and thought she works in a local cafe.

7 Likes

I see the same here, Austria. Recently two people (or was it one) mapped hundreds of PoIs in the area, the ones you find on shady yellow-pages sites, and from the looks, in overture now.

They are a pain to weed out. Maybe that is the idea?

2 Likes

Interesting reflections from @migurski on the POI dataset, which superficially look really bad (but @bdon helpful site doesn’t visualize the confidence score, so hard to judge with my expertise if Overture know its bad or think the bad data is good :slight_smile: Overture Places

Michal Migurski 📦: "@richlv @zverik@en.osm.town @simon@en.osm.town @a…" - Mastodon.

1 Like

Interesting. I wonder for which kind of data we would be part of the conversation, or what would put us there (if that’s what we want). Might give us a hint where to focus our efforts.

1 Like

Oh, I think another issue that LWG (pinging @Stereo ) should consider that we encountered in Poland is that many companies (think shop chains) were happy to give away for free location data of their shops but were so scared whenever a word “licence” was uttered and the topic was stuck.
It would make sense to streamline this process from the viewpoint of legal side and typical corporate bureaucracy, so that intent is met with procedural capability. I think for that we’d need someone who has been on the other (i.e. corporate) side. I mean, they already supply this data to other vendors, right?

12 Likes

It is extremely unlikely that the OMF “wants” to use any of our data (outside of a complete surrender that is).

Currently they don’t have a choice wrt road data and some other bits and pieces, but I’m sure they are busy spending their not unsubstantial funds on rectifying that.

I would suggest reading up on the history of FB and OSM, not the least the “craft mapper” discussion started by Michal.

3 Likes