How to generate BuildingFootprints from a public WMS source?

I’ve been carrying around an idea for a while now, where my own attempts have failed so far and I now want to ask the community.

Preamble:
In my state there are very good aerial images with 10cm resolution, which are already released for OpenStreetMap.
To get a feeling for the size: The state covers an area of 2.569 km². The area is already excellently mapped in OpenStreetMap, but new buildings and also roads are being built at a rapid pace. In the course of this year, the new aerial images will be published with the date of recording in 2022.
This is the reason why I wanted to discuss my project with you and clarify whether this is at all possible for a single person without deeper programming knowledge and large infrastructure like me.

The idea:
I would like to have BuildingFootprints created with the new aerial imagery, which would include all previously unmapped buildings (and roads, if applicable). Very similar to the BuildingsFootprints or RoadDetections from Microsoft, only due to the much better aerial image resolution also in a higher quality and timeliness. I would then want to check and map these building outlines either in JOSM or using a MapRoulette challenge. Editing in JOSM would have the advantage that the address of the new building could be entered at the same time using the AddressHelper plugin.

Originally I got this idea from the project ohsome2label and in its wake I also looked at some other projects, which partly follow similar approaches and seem to be suitable for my project, but probably all of them are not yet completely finished and partly have only been created as a scientific thesis.
Just this week I came across this GitHub repo, which looks very promising, but completely overwhelmed me due to its scope.

Questions:

  • What are the technical hardware requirements for such a project? What costs would have to be expected?

  • Which software could be best suited for this? How much technical knowledge in programming languages would be required?

  • What preparatory work is required to obtain processable media files from the WMS layer? From what do I create the sample set for recognition?

  • What is the expected success rate in terms of quality and usability of the result data? And how long does such a project take to realize? In a nutshell: Is it even worth it?

Now I hope to get an answer from the community regarding feasibility and to share experiences, examples and approaches. Have I forgotten anything? Thanks a lot!

Have you considered/are there alternatives? That would mainly be open data building outlines from your cadastre or similar.

Thanks, but if I had an alternative solution I probably wouldn’t have opened this post in the first place, would I?
Do you have any suggestions for alternatives? I am happy to hear them.

Unfortunately, we cannot use the Austrian cadastral data for OSM, despite explicit requests for the license.
There is of course the basemap.at, which contains building areas and these also as a vector. However, it often takes years until buildings appear on basemap.at. And the building areas are here mostly not the representation of the entire roof area but often include the entire basement, which may be much larger than the building itself or the other way around.
One random example:
image
The building outline in basemap.at does not match the actual building here. And no, the roof does not overhang by 1.5 meters on all sides.

Next example is a school complex. The northern part is completely missing in basemap.at, although the building was demolished and rebuilt in 2016. So the basemap here matches the old structure from 2015, so is now 8 years behind.
In addition, the southern part no longer exists, as can be seen in the 2020 aerial photo.
2020|basemap|2015

image
There are thousands of such cases.

So I would not like to use these open data building outlines from the cadastre for OSM, would you?