March 20 incident postmortem

fenriquez · March 31, 2018, 4:24pm

Thanks for the transparency. What can be done on the device firmware side to work around this kind of outage? I had built a few precautions into my devices for when the device senses there is no cloud or cellular connectivity (i.e., storing data locally and waiting until the connection is restored before transmitting). However, these measures did not work in this event.

For context, we run a connected-machines service. Our customers rely on us to sense when their machines are operating and providing a service, and they in turn bill their own customers based on this information. Hence, the data we get from the IoT devices is not real-time sensitive, but we need to ensure it reaches its destination. We could not do that during the downtime, and our local storage solution did not kick in because the Particle devices thought they had connectivity. We use the Particle cloud and webhooks to send the messages from Particle to our final dashboard and data analysis solutions. We liked this approach (as opposed to sending the data from the device directly to our cloud) because of the built-in security and data efficiency of the Particle.publish methods.
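
To make the question concrete, here is a minimal sketch of the kind of store-and-forward logic described above, changed so that a reading is only discarded after a positive delivery confirmation rather than whenever the device believes it is connected. It is plain C++ and an illustration under assumptions, not our actual firmware: `StoreAndForward`, `SendFn`, `enqueue`, and `flush` are hypothetical names, the queue lives in RAM (real firmware would likely need persistent storage to survive a reset), and the send callback is assumed to return true only on a confirmed delivery.

```cpp
#include <cstddef>
#include <deque>
#include <functional>
#include <string>
#include <utility>

// Hypothetical store-and-forward queue. None of these names come from Device OS.
class StoreAndForward {
public:
    // Assumption: sendFn returns true only when delivery was positively confirmed.
    using SendFn = std::function<bool(const std::string&)>;

    StoreAndForward(SendFn sendFn, std::size_t capacity)
        : send_(std::move(sendFn)), capacity_(capacity) {}

    // Buffer a reading. When the buffer is full, drop the oldest entry so
    // memory use stays bounded, and count the loss.
    void enqueue(const std::string& payload) {
        if (capacity_ == 0) return;
        if (queue_.size() >= capacity_) {
            queue_.pop_front();
            ++dropped_;
        }
        queue_.push_back(payload);
    }

    // Send buffered readings oldest first. Stop at the first unconfirmed send
    // and keep that reading (and everything behind it) for the next attempt.
    std::size_t flush() {
        std::size_t sent = 0;
        while (!queue_.empty() && send_(queue_.front())) {
            queue_.pop_front();
            ++sent;
        }
        return sent;
    }

    std::size_t pending() const { return queue_.size(); }
    std::size_t dropped() const { return dropped_; }

private:
    SendFn send_;
    std::size_t capacity_;
    std::deque<std::string> queue_;
    std::size_t dropped_ = 0;
};
```

On a Particle device the callback could perhaps wrap `Particle.publish` with the `WITH_ACK` flag, but I assume a cloud-level acknowledgement alone would not prove that the webhook reached our backend, so some end-to-end confirmation may still be needed.
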
Thanks in advance for the suggestions.
