For the popcorn crowd, MapWithAI addons for JOSM and Rapid inserting buildings by the zillions in the most preposterous shapes and in the most ludicrous places and MagicWand to name 2.
(not related to AI ferrets, but)
âRapidâ is not inserting zillions of buildings, mappers using these tools incorrectly allegedly are. If you see people doing that, please report them.
Edit: Iâve added âallegedlyâ above, because (based on DWG reports) weâre just not getting reports of âzillionsâ of buildings added in this way. We are getting some (and have dealt with those). Maybe thereâs a big unreported problem out there - Iâm not convinced, but please do report any such problems so that we can engage with the problem mappers (and help you do do that too)
While I agree, I think there is also some responsibility when you offer such low barrier tools which make it easy for everyone to perform massive imports of such data, if 999 out of a thousand are refraining thereâll still always be someone doing it.
I agree (and have made similar points elsewhere re the HOT tasking manager and Maproulette). Just to check, Iâve had a quick look at a local area right now. In that area, of the âRapid assist featuresâ:
- Facebook Roads is garbage (1)
- Microsoft Buildings is mostly correct at detecting something missing (2)
- Overture places are garbage (3)
- Open Data Footways isnât showing anything
(1) Anything worth calling a road here is mapped, so all suggestions are false positives, mostly âschoolboy errorsâ such as hedge shadows etc.
(2) The vast majority are genuine missing buildings. The geometry is usually there or thereabouts, but editing is needed about half the time (ish)
(3) See other forum threads ad nauseam. That doesnât mean that there isnât useful stuff in that dataset (as a prompt for a survey, for example) but in this area itâs not actionable in Rapid.
In each case Rapidâs info box below âaddâ says âDoes this look like an accurate feature? Select this to start editing it so that you can connect, tag, and save it to OpenStreetMap. â. Maybe more can be added there, but too much text will simply mean that it will not be read.
The âRapid Featuresâ section of the Rapid tutorial could do with more work - it says you can add and ignore, and note that adding may cause issues with existing data, but doesnât cover âthis might be a false positive and not exist at allâ. However, to make that change youâll need to engage with the editor authors - writing forum posts wonât make that happen.
I looked in TagInfo before Rapid was even on the map and then there were 17+ million in OSM with the source tag pointing at MapWithAi. Today thereâs over 28 million with the tag thatâs used by Rapid. Cant remember if same as MwAI, but since the take-off in the graph is 2020, the bulk is likely MwAI
PS Does the person to whom i replied about âAI was not allowed in OSMâ know the reply was moved? When I look in the rodent thread, the post is without a reply indication.
Yes, Iâm aware.
Just to remove any and all confusion, I should probably clarify that I donât see a problem with analytical systems taking a look at aerial imagery and rough-sketching a starting place as MwAI/Rapid (same thing) does.
I do have a problem with wholesale swallowing FurAffinity, DeviantArt, e6 etc and regurgitating the pirated results to us.
We have a lot of MS building footprints additions here. Theyâre often rather wonky, not really respecting multiple sources, but better than nothing. They do need human correction and correlation against cadastral land parcels and surveys, and general common sense about houses, garages, and how people live.
I donât really have problems with image recognition tasks applied to OSM and turned into possibly useful data, as of now. I doubt that implicit biases in the training sets are going to get somebody arrested, bulldozed, or shot at present, at least not with what Iâve seen from the current state of the art. Itâs a small task for a small matrix, and the worst outcome for places with lots of human editors is that much will be fixably wrong. Just donât dump bulk garbage data synthesized from Western-centric assumptions on communities where the assumptions donât hold, and who canât fix it readily, please!
I hope this stuff stays limited in scope, and easy for the DWG to revert if someone starts spamming the map with poor or harmful data.
They are not âbetter than nothingâ, they are actively negative and require more work to fix than simply doing it properly the 1st time around (which essentially means they are never ever going to be improved).
I might have agreed before I knew about replace geometry. Now I just drag them out of the way, redraw them from scratch, then Ctrl+Shift+G.
If you have to do this for a whole area I find it quicker to do the new building traces in a fresh layer and then use the conflate plugin to do the geometry replacement/additions.
Oh I could see that, yeah!
You could, but the point that Simon was making was that itâs actually quicker for the crap buildings not to have been added in the first place.
By all means use things like Rapid to detect missing buildings, but donât then say âWell, itâs a building. It doesnât really match the building on the imagery. Iâm sure someone will come along later and tidy it upâ.
Both of the responses are actually making my case as you note
Just happen to come along⊠The red outline is before hitting the Ctrl+Shift+G to transpose the chronology to the âcorrectâ shape. for the countlessless time. According the JOSM measurement plugin the old shape was 116 m2, the new shape 80 m2)
Bing
ESRI⊠seems the right top white blob, possibly a furgone, went away.
Have you contacted the mapper about the less than stellar quality of their edits? Did you get a reply?
Did this 2 minutes before making the mistake of venturing here and seeing the OP earned a thumbs down on top, the context of course lost since the comment was a reply in another thread on AI. Shakes head and walks away, my feet reached the door first.
Yes. In my case theyâre usually misguided (under-guided?) noobs trying to help after a hurricane by very loosely tracing buildings on a completely unaffected island. MapwithAI detections might actually have been better in many cases.
And no, I donât normally contact them. Their history tends to have been a two day sprint X years ago and nothing since so engagement is unlikely to improve future mapping.
Indeed it always takes some more time to modify inaccurate mapping, especially when trying hard to keep the history.. Iâd say an exception to this good practice could be made in cases where version 1 of a building set is very poor. The history isnât worth much when it was carelessly done in the first place. Freed from that constraint, it still would technically take a little bit of time to select all and press delete, but it would be a vanishingly small amount.
Another downside to poorly drawn buildings, no matter the method, is that if people see that a city is blanketed in buildings, they may not realize the buildings are poorly drawn because they arenât necessarily looking at aerial imagery. Unlike with roads, we have few reliable tools to automatically evaluate the quality of a building.
In the past, this has been a problem with other features that we track coverage of by visual inspection, such as landuse and landcover areas. People used to map these areas very crudely, possibly disincentivizing more granular mapping slightly later on when our standards were higher. Nowadays, at least we have the tools to detect abnormally large landuse areas or those that cross major streets.
Iâve been somewhat wary of the Microsoft building dataset because of the potential to preempt higher-quality local imports by mappers who sweat every detail. Even Esriâs curated local building+address datasets donât correct the often artistic placement of units within an apartment building. I know we arenât going to come up with better imports or more dedicated mappers absolutely everywhere in anything less than a geologic timeframe, so the challenge is finding a good balance.
Sometimes there are considerations other than quality. Iâm somewhat notorious for the thousands of buildings I drew in my hometown back before we had the tools and imagery to draw them well. Back then, local government agencies had much better building datasets with addresses, but they were proprietary and cost real money, so we couldnât import them. Fortunately, our crude buildings helped convince the authorities to release their data into the public domain. Now we can import the better buildings, using the Replace Geometry tool. (Though itâs slow going because I donât have as much free time anymoreâŠ)
Yes. JOSMâs Conflation plugin (Github) is a great way to do it.
@watmildon 's fantastic User Diary is what first taught me that method:
- âUsing the JOSM Conflation plugin to add 1500 addresses in 10 minutesâ (April 2023)
- (I even wrote my own step-by-step tutorial: âHow to Conflate Addressesâ)
- Includes a few extra tips/tricks too.
- (I even wrote my own step-by-step tutorial: âHow to Conflate Addressesâ)
The instant I learned how to use it efficiently, it made (re)mapping things SO MUCH BETTER/FASTERâand most important, way more fun.
Side Note: Ever since stumbling across @watmildon 's tutorial, according to âHow Did You Contributeâ, Iâve jumped up to position #14 in the entire US.
How to Use JOSMâs Layers + Conflation
You then have:
- The main layer
- OSM data.
- Can be awesome â very poor quality.
- A secondary layer
- Address / âAIâ / imported / generated data.
- Can be very high â atrocious quality.
You can then:
- Select the (poorly drawn) OSM buildings.
- Set them as âSubjectâ.
- Swap to the secondary layer + select the best stuff there.
- Manually draw/adjust/realign as needed.
- Set those as âReferenceâ.
- Run JOSMâs Conflation.
- One-by-one, go through and approve the best ones.
The Conflation plugin will âsave the historyâ of the original nodes, while swapping in new/better shapes!
On Quality of âAI Data vs. AI Dataâ
Like some users said:
- Microsoft buildings are typically âmehâ â awful.
- ⊠but your countyâs GIS buildings may be âhigh / very highâ quality!
- And even include the individual duplexes/townhouses already split!
For example, hereâs a GIF:
Even having this layer as an option lets you very quickly know, at-a-glance:
- âThatâs not a single building⊠but itâs actually 2+ houses combinedâ!
- You wouldnât believe how much time this saved me in more dense towns/citiesâeven with my âbuilding cuttingâ trick!
On Quality of âAI Assistance vs. Fully Manual/Hand-drawn OSMâ
There was a big section of a town I recently went through, where an OSM user drew buildings âlike they were using crayonsâ! (And didnât even use âQâ to square them off!)
See before/after âPoorly-drawn OSM vs. County GIS buildingsâ GIF:
Even most of Microsoftâs âmehâ buildings would be a HUGE step in the right direction compared to that!
And for those who care about high quality⊠being able to import AI stuff as a base, then do manual tweaking on top, saves lots of tedious drawing as well.
So, my adding of buildings might be split like this:
- 50% = Verify + Hit âAcceptâ.
- AI is correct on most square/rectangle/L-shaped buildings.
- 25% = Drag to correct spot / Add-or-Delete 2->6ish nodes.
- AI accidentally detected outside patio as a part of the house/roof.
- I add a missing garage.
- 25% = Completely ignore.
- I then manually draw everything super-high quality, just as I would if I had a blank slate.
This frees up more time to then focus on the most fun aspects of mapping, and go into even more detail now that the buildings are there!
(For example, I recently began tagging a lot more building:level
/ roof:level
/ roof:shape
⊠or drive-throughs and fire hydrants!!! )
Yes, this is one annoyance I mentioned to @watmildon.
Right now, the MapWithAI plugin always tries to deduplicate the OSM->imported buildings layer.
If there are crudely drawn buildings, those take priority, so you get a giant blank spot in the MapWithAI layer.
I wish there could be a checkbox to disable the deduplication step, so someone like me could selectively replace:
- Microsoft with County GIS buildings.
- Going from âmehâ â high quality.
- Poorly-drawn with County GIS.
- Going from âawfulâ â high quality.
Then I could use those Conflation tricks above, to swap in even more better buildings.