Programing best practices to reduce cloud disconnects

sazp96 · August 13, 2015, 1:36pm

Hello folks,

I’m running two Photons on 0.4.4. At random time through the day the photons go offline and online. This happens anywhere from 3 to 10 times per day.

The weird part is that once the Photon comes back online, it does not call void setup(). During the “reset” the photon continues running as planned, and there is no SOS light or anything.

More than a reset, it seems like the Photon is loosing connection to the cloud and once it connect again it triggers the offline/online event. This is just my opinion but would love to hear what other think.

Anybody else has experienced this?

Correct reset:

“Partial” reset that doesn’t call Setup():

AndyW · August 13, 2015, 4:04pm

You are correct, the most common cause of this behaviour is a problem somewhere between the device and the cloud.

For brief interruptions, you may not even see an offline event, only an online one.

There can be numerous causes of this, the vast majority of which are beyond the control of either you or particle (e.g., your ISP, their connection(s) to the internet, the interbnet as a whole, etc etc.)

In short, it is to be expected.

bko · August 13, 2015, 4:10pm

Another really common thing is that your DHCP lease expires and you have to wait while it is gets renewed. This happens to me every 24 hours, but if I wanted to I could configure my router to give out longer address leases. Not really a problem in a robust system that recovers gracefully.

sazp96 · August 13, 2015, 4:11pm

Thanks @AndyW and @bko .

Could having a lot of code on a subscribe handler increase this type of issue?

Or more generally, having code that takes more than a couple second to execute before loop() is completed. Could that cause this issue?

bko · August 13, 2015, 4:15pm

Yes, this can be a problem, depending on how you are delaying. The actual delay(10000); function handles servicing the cloud while it is waiting, but a loop that you are running that is longer than about 10 seconds (the cloud time-out), will cause problems. You can help this by adding a call to the cloud service routine when you have time for it, which is Spark.process(); on Photon and slightly different right now on a Core.

sazp96 · August 13, 2015, 4:33pm

Thanks @bko. I don’t have many delays, but I do have a lot of copying and parsing of strings. I will add some Spark.process() in the slower pieces of code.

What about it in webhook handlers? Here is a extreme example, let’s say I have a handler that takes 20 seconds to run. But it has Spark.process() every 4 seconds. Would that work well?

bko · August 13, 2015, 4:44pm

I am not sure, but I would urge you to think of your webhook handler more like an interrupt handler: Do the minimum amount of work possible to copy the data or whatever, and set a flag that there is work to be done. Then in loop() you can read the flag and do the work.

AndyW · August 13, 2015, 5:57pm

Can I suggest renaming the thread to something that is less ominous sounding ?

I re-iterate - this is expected/unavoidable behaviour; and besides, it sounds like the thread is swiftly pivoting to something more about recommended programming practices.

sazp96 · August 13, 2015, 7:44pm

Thanks for the feedback @bko. I will refactor my code.

@AndyW, you are right. I think this new name is more descriptive.

sazp96 · August 16, 2015, 3:09pm

Hello @AndyW and @bko. Just wanted to report back that after heavily reducing the amount of code on the handler, the number of disconnects has been reduced ~40.

Thanks for the help!

Topic		Replies	Views
Photon Reset vs WiFi Disconnect General	3	1035	May 19, 2018
Photons go off line and on line repeatedly Troubleshooting	1	1003	August 9, 2016
Photon losing connectivity and Resets Troubleshooting	1	2852	July 30, 2015
Photon losing connection from time to time Troubleshooting	10	2641	January 29, 2016
Photon mysteriously goes Off-Line Troubleshooting	2	1160	March 21, 2016

Programing best practices to reduce cloud disconnects

Related topics