web services (big part of the internet) outage:
"team member using an established playbook executed a command which was intended to remove a small number of servers for one of the S3 subsystems that is used by the S3 billing process. Unfortunately, one of the inputs to the command was entered incorrectly and a larger set of servers was removed than intended." (Seems they never heard of selection menus.)
"In theory, a series of failsafes should keep the fallout from such errors localized, but Amazon
says that some of the key systems involved hadn’t been fully restarted in many years and “took longer than expected” to come back online." (sounds like some vacuum robots reported here).
"The company now claims it’s “making several changes as a result of this operational event.”
Not surprising in a way, given how the internet was built totally lacking in security features to begin with, so what else is lacking. One hopes strategic nuclear missile systems are better designed (well, they are supposed to require two keys at once etc. -- maybe taking too long to launch when needed (hard to read that fine print in the instructions). In haste, coffee is spilled on the launch panel...)