Thank you for all of this good information. I can confirm that --flat-nodes does work as you describe. Although the run is not quite complete yet, processing the entire planet’s worth of data uses only a fraction of the RAM and is a lot faster. It is clearly the way to go when processing large datasets.
I am curious: under what conditions would one not want to use --flat-nodes? If there really aren’t any, it may be worth someone adding an addendum to the tutorial that specifically mentions the flag and its benefits, since it allows processing data that might otherwise be impossible to process, something a newcomer is not likely to realize.