This question was originally posted on DCIM Support by Mark on 2018-03-21
Good afternoon all,
we continue to be experiencing an issue where our Virtual DCE 7.3.1 generates a multitude of communication lost alarms across multiple device types.
It seems that the lost connectivity alarms are mostly generated during the late evening or overnight hours, with hardly any occurring during business hours.
I was told it is possibly a latency issue given the server is located in one city and the the devices are spread out as far as other cities 350 Kilometers away that could be causing these timeouts.
I checked the KBase, to see if anyone had added similar topic, recently and noticed: . My issue is essentially the same except on version 7.3.1... using SNMPv1 and although some devices have up to date firmware the lost connectivity still occurs on them also.
Any assistance, guidance or suggestions with this issue would be greatly appreciated in the meantime I will a closer look at the above example and see if I have missed any of the steps that were offered above.
Thank you in advance,
Solved! Go to Solution.
This comment was originally posted on DCIM Support by Ditte Drewer Mathiasen on 2018-03-27
Hi Mark , thanks for posting. Let's hope you find your answer here.
Kind regards Ditte
This answer was originally posted on DCIM Support by Steven Marchetti on 2018-03-27
Intermittent SNMP issues can be difficult to pin down but if it's happening at certain times a day, that may make it easier. First, take a look at the following k-base:
That goes through some basics of troubleshooting SNMP. If you think it is a latency issue, things like timeouts and retries should potentially be increased as noted in the document.
You may also want to take a look a the logs on the devices themselves. Are there other systems polling the devices at that time? Perhaps another NMS trying numerous community names if SNMP V1? The newest firmware on APC devices should be able to indicate if there is such an issue by showing unauthorized attempts.
You may also want to look at other network activities at this time. Is there a great deal more traffic due to backups and vulnerability sweeps?
You may also want to consider getting DCE up to date. We're up to 7.5 now and there have been updates to things like SNMP processes. The issues noted in the post you mentioned would not likely be the same as SNMP version 1 and version 3 issues are different and that was version 3 specific. Still, it's best to be up to date on all aspects.
If these issues are happening at a certain time, you can also look at things on the DCE server that may be set to happen overnight such as backups being performed, discoveries set to run, etc. If the issues are happening to specific locations however, it's more likely that it is network based than server based.
Discuss challenges and get support in energy and automation with 30,000+ experts and peers.
Over 10,000+ support articles are available to help you find answers to your product and business challenges.
Find peer based solutions to your questions. Provide answers for fellow community members!