Why does OSMF Budget €25,000 on Amazon

pnorman · August 15, 2023, 9:40am

Yes - there’s not a great way to handle the accounting. Using the two examples you gave, we can see the problems.

If we didn’t have Fastly sponsorship, we’d have to pay a CDN. At our volumes, we’d never pay the listed commercial rates, but we don’t know what price we’d get actually doing the negotiations, which we wouldn’t do unless we had to switch. We’d also probably change our usage policies, so we’d have different traffic levels. At this point, we put the risk in the budget request with a maximum cost.

The AWS render server is different. It’s important for capacity right now, but if we lost the credits, we’d go with a cheaper option than EC2. We also have a very low risk of losing credits compared to donated servers, since once the credits are in our account they’re good until they expire at a known date. If we put a cash value on the risk, it would be the capex cost of replacing it, not the dollar value of AWS credits it uses. We didn’t call this risk out in the budget, because it is minor compared to other risks, and substantially less than other donated servers.

Matija_Nalis · August 15, 2023, 3:36pm

It is quite sad when the serious lack of communication and social skills (aggressive approach, blatantly ignoring “assume good faith” policy and other etiquette guidelines, replying when angry / ignoring “WP:NAM”, poor anger control, incessant posting of retorts, repeatedly ignoring other people suggestions to moderate as well as dis-likes as indication of problem with their writing style, continuing to enforce validating their own bad behaviour instead of noting that there must be a problem with it as so many people complain, and general argumentative and trollish behaviour, arrogant self-righteousness even in the face of facts proving them completely wrong, inability to accept and admit to others that they have been wrong and acted inappropriately etc.) completely overwhelmes the actually quite reasonable request for information and clarification and more transparency, and makes people want to just blacklist them as a troll vulgaris domesticus.

Sad, but quite understandable. It is human social behaviour 101 (what you say doesn’t matter in the least if you don’t know how to say it and do in a way that puts people in a “here comes aggressor, defend now!” mode). Hopefully OP will learn from this and acquire better communication skills to discuss issues in more civil way, and even complain if need be in more amiable and sociable way, before they get blacklisted by the majority of the community as a troll.

And yet a simple sincere apology (instead of trying to reframe the issue so they remain blameless) would suffice upon noticing community response and lash-back, e.g.

“I apologize I overracted, I misunderstood the situation and was angry, so I forgot to assume good faith and my lack of knowledge, so I inappropriately come out agressively. I however still have questions on the sucjet pertaining to XXX (like: what is the difference between XXX1 and XXX2), and would like to suggest that OSMF be more transparent about YYYY in the future, by including more information in financial reports about ZZZ. I find that important because of QQQQ”.

Simple, admitting own mistakes (instead of desperately trying to find any smaller mistake of others and “try to make them more wrong then me”), non-agressive and constructive, and yet still asking for same information in non-confrontational way. It would make people see him as a valid peer, and support the idea.

P.S. I actually connected their identity on GitHub with this identity of Discourse forum, and in my experience (on e.g. StreetComplete issue tracker) previously, they actually seem as valuable members which do want to help the OSM project. But their lacking anger management issues however might turn away most of the community unless they learn to manage it (much) better. And that would be a loss (for both sides).

WarpathPeacock · August 15, 2023, 6:30pm

@NorthCrab You have made many implicit and explicit assumptions throughout this whole conversation. You are engaging bad faith. I will be changing the title of this thread to “Why does OSMF budget 25,000 euros on Amazon” because the implication of the current title is not accurate.

WarpathPeacock · August 15, 2023, 6:31pm

I will also lock this thread for a couple hours.

Firefishy · August 15, 2023, 6:38pm

OpenStreetMap Ops team AWS usage.

AWS EC2 - virtual machines:

OSM tile render server (USA) - $3500/month (including data transfer which is ~50% of cost)
palulukon.openstreetmap.org Fully sponsored, no cost to OSM. Would find alternative if not sponsored. EC2 On-Demand, potential to optimise using Spot Instances, but significant ops investment required.

AWS S3 - object storage:

Summary: 272TB used. Growth of 3% per month.

Quoted amount include data transfer, API usage and storage costs.

openstreetmap-storage-backups - 112.4 TB - $120/month
Backups including some historical. Backups are not de-duped by design (heavy admin / risk burden). Some opportunity to manually cleanup, but very low priority. No automatic cleanup.
openstreetmap-planet - 71.1 TB - $100/month
Historical and current copies of published planet files. Deep-Archive, for future restore to AWS hosted planet service with full back catalog. No automatic cleanups.
openstreetmap-tile-aggregated-logs - 32.1 TB - $125/month
Archival of processed tile CDN usage logs. Historical reference for Ops to work out tends and usage patterns. More data here than provided by public logs: Index of /tile_logs @pnorman can clarify.
openstreetmap-wal - 28.7 TB - $400/month
Live streaming “Write Ahead Log” copies of the OpenStreetMap core Postgres database. The WAL files are used for syncing follower instances of the core Postgres database server. Vital asset to our data recovery plans. Can be used for recovery between full weekly database backup or corruption. For clarity this database is private and not published via planet data (eg: messages, users etc). Automatic cleanup after 1 year.
openstreetmap-imagery-backups - 18.2 TB - $35/month
Backups of imagery provided to OpenStreetMap. Deep archival. Primarily backups of imagery hosted on kessie. No automatic cleanups.
openstreetmap-fastly-logs - 5.3 TB - $125/month
Inbound fastly CDN logs for processing. Key to us finding and managing abuse, source for publish tile log analysis: Index of /tile_logs Automatic Cleanup after 31 days.
openstreetmap-gps-traces - 2.8 TB - $80 to $225/month (varies due to access by website users)
The GPS traces that are uploaded to OpenStreetMap.org, the storage backend for website: Public GPS Traces | OpenStreetMap Formerly provided by NFS service, moved to S3 to simply admin burden and to seamlessly work across our hosting data centres. No automatic cleanup, but opportunity to improve costs with S3 “tier” lifecycle rules.
openstreetmap-fastly-processed-logs - 1.9 TB - $50/month
Archival of processed tile CDN view logs. Historical reference for Ops to work out tends and usage patterns. More data than provided by public logs: Index of /tile_logs @pnorman can clarify.
openstreetmap-user-avatars - 113.1 GB - $5/month
The user “avatar” images as uploaded by users. No automatic cleanup, but opportunity to improve costs with S3 “tier” lifecycle rules. Formerly provided by NFS service, moved to S3 to simply admin burden and to seamlessly work across our hosting data centres.
openstreetmap-aws-cloudtrail - 76.0 GB - $2/month
Storage backend for AWS Cloudtrail API access logging service. Security monitoring. No automatic cleanup.
openstreetmap-gps-images - 62.7 GB - $10/month
The processed display images used by OpenStreetMap.org on Public GPS Traces | OpenStreetMap
Formerly provided by NFS service, moved to S3 to simply admin burden and to seamlessly work across our hosting data centres.
openstreetmap-backups - 21.1 GB $0.03/month
Historical database backups from OSM in first few years. No automatic cleanup.

Please remember all storage solutions also carry an ongoing human administrative / management overhead cost which is not accounted for in the above numbers.

Other AWS services

We also use AWS like Cloudtrail, Athena, Glue, etc. There are all minimal expenses (<$20/month) or covered by free tier.

Usage of Credits

All costs above have been covered by AWS credits since Nov 2022. The credits cannot be used for purchasing Savings Plans or Reserved Instances etc which can be used for offsetting / reducing future costs. See AWS credits FAQ. Credits are valid for 1 year and any unused credits expire. Credits cannot be exchanged for cash. Crypto Mining is not a permitted use of credits.

Credits need to be requested annually from AWS and it is not guaranteed we will get credits and this is why we have a budget line item for contingency.

WarpathPeacock · August 16, 2023, 11:54am

Thread is unlocked. Please to continue discussing the additional information provided by Firefishy

StC · August 16, 2023, 12:37pm

I relay here a question that this thread has raised within the French crowd: is this a purely financial issue, or is there some kind of technical commitment to consider? Are there specific APIs or architectures that would make the switch to another storage infrastructure costly?

InfosReseaux · August 16, 2023, 12:37pm

Dear all

Thank you @Firefishy for extensive details about what components are currently hosted.

Was there any discussion in the past about possible rivalry this hosting strategy could raise in regard of Amazon involvment in Overture foundation?

This kind of problems could be taken seriously due to the impact it could have, years after decision making.

Best regards

Stereo · August 16, 2023, 2:09pm

Amazon is involved in Overture, and also a big user and supporter of OpenStreetMap. I see no rivalry there. These buckets were created long before Overture was a thing. OWG is independent in the products and technologies it chooses to use.

Firefishy · August 16, 2023, 2:12pm

The website uses Rails Active Storage which is responsible for openstreetmap-user-avatars, openstreetmap-gps-traces and openstreetmap-gps-images. Active Storage uses the S3 API. Prior to using S3 we used NFS, but NFS did not scale well beyond a single data centre. We extensively evaluated Ceph but the administrative burden of managing it is too high for our small team with our expected limited usage. AWS S3 was a feature match, price match and has first class compatibility with Active Storage. While other providers do provide a S3 compatible API, other providers were not deeply considered.

The other S3 buckets are effectively purely storage. We push data via the AWS cli. A significant portion of the storage used is “deep archive” storage which is expensive to retrieve (very cheap to store). We are quite happy with AWS S3 and at the moment don’t have a reason to re-evaluate.

SimonPoole · August 16, 2023, 4:17pm

Generally large companies are complex, while everything you say is true, the large user and contributor to OSM (historically, it may or may have not died with the OMF) was Amazon Logistics. The geo-services provided by AWS on the other hand have to my knowledge never been directly OSM based and have used Here and ESRI (the bits from ESRI may have some OSM in them).

That said, having at least a cordial relationship with OMF members that matter to us, for example Amazon, ESRI and Tomtom, is probably a good idea, and if it is only to have some channel to rein in some of the more egregious behaviour of the OMF.

NorthCrab · August 16, 2023, 9:19pm

Thank you all. At this juncture, I am choosing to refrain from any future involvement in OSM global operations. I’ve provided additional context in my diary entry here: NorthCrab's Diary | 🌂 The Past, The Present, The Future | OpenStreetMap. I no longer view this as “my” issue and will thus cease all related commentary. OSM moderation has taken many of my texts entirely out of context, seemingly to cast me in a negative light. I’m not willing to continue the discussion in such an environment, where there’s an overemphasis on subjective views and a lack of grounded argumentation. Moreover, I find it unsettling when a thread is closed “to cool off” even when tensions have largely subsided. To make matters worse, one party continued to comment even after the thread was locked, which, to me, is a textbook form of censorship. Times have changed, and I recognize that. It’s time for me to shift my attention to other matters.

jimkats · August 16, 2023, 9:27pm

I will dare to say that from what I understand from the beginning of this topic, you misunderstood the concept of “budget”, thus the initial tension. And it’s not the OSM moderation taking some of your texts out of context, is the general climate you brought by the rapid replies with the majority being more or less negatively targeting the OWG and OSMF operations part.

iandees · August 16, 2023, 10:30pm

Thanks for taking the time to put this together, @Firefishy. This is a useful summary!

apm-wa · August 17, 2023, 3:28am

Moderator of this channel stepping in. @NorthCrab, your tone verges on disrespect for the volunteers who keep the servers running, you are combative, and you are accusing the volunteers of bad faith in their responses. Further, you seem not to understand the difference between a “budget” and an “expenditure report” (aka “balance sheet”). I strongly urge you to calm down, ask questions in a dispassionate manner, and refrain from insinuating that the OWG is somehow hiding an illicit expenditure.

Firefishy · August 17, 2023, 4:28am

Some context, this thread was temporarily locked (15 Aug 19:31) 7 minutes before I posted my entry (15 Aug 19:38) which took a lot longer than 7 minutes to write. Only after I posted did I see the thread had been locked in the interim. I guess my entry posted because I am a super admin here (I setup this instance of discourse and manage its server and software infrastructure) and therefore I guess discourse allowed me to post it.

Firefishy · August 17, 2023, 5:56am

@NorthCrab You and I are both Linux admins, we both map for fun, we both are in agreement that wasting money on cloud services, especially when there is physical hardware to hand, would be a dishonourable way to spend donated project money.

The OSM ops team is extremely small. We don’t have the resources (human capital, time, electrical power or even large enterprise disks) to run a large on-premise ceph / gluster / NFS / whatever cluster to store the shared data (across DCs) and backup data we store. I would not feel comfortable locating important backups in the same racks that we are backing up. Building a cluster that would survive our requirement of surviving a data centre outage would be difficult. In the past online and offline we have spent a long time discussing options for example: Establish an object store · Issue #169 · openstreetmap/operations · GitHub

Our use of AWS (and S3) is limited and is the pragmatic choice. The amount of money we spend on AWS is small and I believe justified. Our AWS costs currently being 100% covered by free credits helps with the value proposition and allows us the ops team to focus on running the many other aspects of the OSM infrastructure that require our attention.

It would be great if we could meet up and find common ground. A few times a year I visit family in Lithuania, I see you map in Poland, maybe we could meet up with drinks in Warszawa? Or video call or whatever.

o_andras · August 17, 2023, 10:34am

If “there is so much text” then maybe you should try to relax for a second, not hash and rehash what you’ve already said so many times in previous posts, and wait for a clarification…

I came here after having read your diary, but the picture is quite different from what I imagined while reading it.

You seem to just have confused budgeting with actual spending, which are obviously not the same.

AFAIU, the OSMF got 25k EUR in credit from AWS. The OWG didn’t get 25k EUR from the OSMF, and they didn’t buy 25k EUR in AWS credits.

Consider that maybe by replying right away you’re sending the signal that you do expect to receive an answer today. This is for sure a cultural/personal difference/preference, but: when I need something from someone at work but they’re in vacation, I never message them until they get back. Two reasons:

I wouldn’t like to receive work-related messages during my own vacations;
Lots of people don’t seem to understand the concept of vacations, so they end up working anyway (i.e. replying).

Finally, this: