3
Build 7685 - Unresponsive and 100% CPU usage after a few hours
Problem reported by Sébastien Riccio - 1/16/2021 at 8:17 AM
Submitted
Well... it was too good to be true. Since the update to the latest build, service runs a few hours and suddently the process cpu usage goes to 100% and the services stop responding.

We had to manually kill the process 3 times every X hours since the update. It rarely happened before the update.


Considering a downgrade to previous build if it happens again in the next hours.
I hope that the changes for Auto Folder clean are downgrade friendly and that it will not break stuff when downgrading...

Some Zabbix graphs showing the issue started after the update.


And a zoomed in view to see what happened the few last hours

Sébastien Riccio
System & Network Admin

12 Replies

Reply to Thread
1
Gabriele Maoret - SERSIS Replied
No issue at all here.

Upgraded 10 hours ago and all is fine till now...
Gabriele Maoret - Head of SysAdmins at SERSIS Currently manages 6 SmarterMail installations (1 in the cloud for SERSIS which provides services to a few hundred third-party email domains + 5 on-premise for customers who prefer to have their mail server in-house)
1
Sébastien Riccio Replied
I hope it will stay like this for you. We updated 23:00 cet yesterday. First lock started at 11:30 today (so after 12h30 of uptime)


I'll have to stay around to kill the process if it happens again :/ and hope that having to kill the process will not corrupt any files like it used to do with for example json config and kind of db files...
Sébastien Riccio System & Network Admin https://swisscenter.com
1
Alessandro Pereira Replied
Our update had no problem, we don't use MAPI.
0
Sébastien Riccio Replied
We don't either. I hope this is a false alarm but as it already happened 3 times since the update, we're being cautionous here.
It might be some background procedure running after the update that gets stuck il a loop. Let's see if it happens again...
Sébastien Riccio System & Network Admin https://swisscenter.com
0
Sébastien Riccio Replied
Hi Ron,

No OS reboot but : stop service, uninstall, launch installer. As we always do to try to keep the downtime as low as possible.

I probably try a full OS reboot if problem comes again.
Sébastien Riccio System & Network Admin https://swisscenter.com
1
echoDreamz Replied
Have you had any further issues Sebastien?
1
Sébastien Riccio Replied
Hello echoDreamz,

Not since the latest service kill/restart. I'll update the thread if it happens again.

Sébastien Riccio System & Network Admin https://swisscenter.com
3
echoDreamz Replied
Would be good too if ST could indicate if a build is safe to downgrade back to the previous build in the changelog, just incase breaking issues do occur, we know if it is safe to rollback.
2
Sébastien Riccio Replied
No issue today since the latest restart. Let's say that it was a temporary issue due to dunno what. So false alarm :)
Sébastien Riccio System & Network Admin https://swisscenter.com
1
FrankyBoy Replied
Hello everyone, 48 hours have passed since the update and no issues reported yet. I tested the fixes for the issues that were the most critical for us (handling of message text encoding and the issue of attached files that contained accented characters) and everything seems fixed. On the other hand, our customers being mainly businesses, the ultimate test will be when they return to work on Monday morning. I keep my fingers crossed!
1
Sébastien Riccio Replied
Yes FrankyBoy. Crossing fingers too, here they start working right now. It's the real test.
Sébastien Riccio System & Network Admin https://swisscenter.com
1
Gabriele Maoret - SERSIS Replied
We updated 48 hour ago and never had issues
Gabriele Maoret - Head of SysAdmins at SERSIS Currently manages 6 SmarterMail installations (1 in the cloud for SERSIS which provides services to a few hundred third-party email domains + 5 on-premise for customers who prefer to have their mail server in-house)

Reply to Thread