TECHZEN Zenoss User Community ARCHIVE  

Windows monitoring issues

Subject: Windows monitoring issues
Author: Mohan J
Posted: 2017-07-24 09:42

Hello Everyone,

We are running Zenoss 4.2.5 and monitoring all kinds of devices. We have an issue with Windows server monitoring: the rrd files are not updating on time. We have been seeing the error messages below on all the collectors. The zenpython daemon has been restarted many times, but nothing helps. These errors relate to Windows services, and no other errors are found; we are seeing them for almost all the servers.

We are seeing broken graphs and the rrd files are not updating, and we are not sure of the cause. All we can see is these errors in the zenpython log file. All other data collection is running fine, except for Windows.

ZenPack in use: ZenPacks.zenoss.Microsoft.Windows-2.7.0.egg

Errors
2017-07-24 09:31:35,211 ERROR zen.RRDUtil: rrdtool reported error rrdcached: illegal attempt to update using time 1500903095.000000 when last update time is 1500903095.000000 (minimum one second step) Devices/10.64.65.78/nsi/state_state
2017-07-24 09:31:35,212 ERROR zen.RRDUtil: rrdtool reported error rrdcached: illegal attempt to update using time 1500903095.000000 when last update time is 1500903095.000000 (minimum one second step) Devices/10.64.65.78/nsi/state_state
2017-07-24 09:31:35,213 ERROR zen.RRDUtil: rrdtool reported error rrdcached: illegal attempt to update using time 1500903095.000000 when last update time is 1500903095.000000 (minimum one second step) Devices/10.64.65.78/sppsvc/state_state
2017-07-24 09:31:35,214 ERROR zen.RRDUtil: rrdtool reported error rrdcached: illegal attempt to update using time 1500903095.000000 when last update time is 1500903095.000000 (minimum one second step) Devices/10.64.65.78/sppsvc/state_state
2017-07-24 09:31:35,215 ERROR zen.RRDUtil: rrdtool reported error rrdcached: illegal attempt to update using time 1500903095.000000 when last update time is 1500903095.000000 (minimum one second step) Devices/10.64.65.78/sppsvc/state_state
2017-07-24 09:31:35,216 ERROR zen.RRDUtil: rrdtool reported error rrdcached: illegal attempt to update using time 1500903095.000000 when last update time is 1500903095.000000 (minimum one second step) Devices/10.64.65.78/sppsvc/state_state
2017-07-24 09:31:35,217 ERROR zen.RRDUtil: rrdtool reported error rrdcached: illegal attempt to update using time 1500903095.000000 when last update time is 1500903095.000000 (minimum one second step) Devices/10.64.65.78/xagt/state_state
2017-07-24 09:31:35,218 ERROR zen.RRDUtil: rrdtool reported error rrdcached: illegal attempt to update using time 1500903095.000000 when last update time is 1500903095.000000 (minimum one second step) Devices/10.64.65.78/xagt/state_state
2017-07-24 09:31:35,219 ERROR zen.RRDUtil: rrdtool reported error rrdcached: illegal attempt to update using time 1500903095.000000 when last update time is 1500903095.000000 (minimum one second step) Devices/10.64.65.78/xagt/state_state
2017-07-24 09:31:35,220 ERROR zen.RRDUtil: rrdtool reported error rrdcached: illegal attempt to update using time 1500903095.000000 when last update time is 1500903095.000000 (minimum one second step) Devices/10.64.65.78/xagt/state_state
2017-07-24 09:31:41,434 WARNING zen.MicrosoftWindows: receive failure on 10.75.32.13: HTTP status: 400.
2017-07-24 09:31:41,601 WARNING zen.MicrosoftWindows: 10.75.32.13: Windows Perfmon Error on 10.75.32.13: HTTP status: 400.

Regards,
Mohan

------------------------------
Mohan J
------------------------------


Subject: RE: Windows monitoring issues
Author: Jane Curry
Posted: 2017-07-25 02:10

Have you changed the cycle time for this datapoint for this device? With rrd files, that will stop values being saved, as the step value in the rrd file cannot easily be changed.
Cheers,
Jane
--
Zenoss Master
Skills 1st Limited, 2 Cedar Chase, Taplow, Bucks, SL6 0EU, UK.
Registered in England & Wales, Company No. 3458854.
Tel: +44 (0)1628 782565 Skype: jane_curry_uk
Email: jane.curry@skills-1st.co.uk Web: http://www.skills-1st.co.uk
Copyright (c) 2016 Jane Curry < jane.curry@skills-1st.co.uk >. All rights reserved.


Subject: RE: Windows monitoring issues
Author: Mohan J
Posted: 2017-07-25 06:49

Thank You Jane for your quick reply!!

I have not changed the cycle time for this datapoint and, FYI, I am seeing the alerts for multiple servers.

Regards,
Mohan

------------------------------
Mohan J
------------------------------


Subject: RE: Windows monitoring issues
Author: Mohan J
Posted: 2017-07-26 04:36

Hi Jane,

Please share your thoughts on this issue: what can be done to prevent these errors? We are having issues only with Windows devices, and monitoring is completely affected. I need to restart the zenpython daemon to collect data for Windows devices whenever it breaks. We could schedule a cron job to restart the zenpython daemon, but that is not a feasible solution.

Looking into the zenpython logs, the error is for the state_state datasource, and it does not have any datapoints defined in Zenoss.


------------------------------
Mohan J
------------------------------


Subject: RE: Windows monitoring issues
Author: Mohan J
Posted: 2017-07-27 10:36

Hi everyone,

Can anyone please let us know how to overcome this issue? We are still seeing these errors in all the zenpython logs. Please let me know if you need any more details for further troubleshooting.


------------------------------
Mohan J
------------------------------


Subject: RE: Windows monitoring issues
Author: Mohan J
Posted: 2017-08-01 09:23

Hi Jane,

Can you provide your input on this issue? I am still facing it in our environment.

------------------------------
Mohan J
------------------------------


Subject: RE: Windows monitoring issues
Author: Jane Curry
Posted: 2017-08-02 11:58

Well, your logfile is saying that you are trying to update the rrd file twice inside the same second. Are you sure that the template applied really hasn't had its cycle time changed?
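
To see which datapoints are being rejected for a same-second update, the zen.RRDUtil lines can be tallied per rrd path. The following is only a minimal sketch: the regular expression is derived from the log lines quoted in this thread, the function name is made up for illustration, and it is written as a standalone Python 3 script rather than anything shipped with Zenoss.

```python
import re
from collections import Counter

# Pattern modelled on the zen.RRDUtil error lines quoted in this thread.
DUP_UPDATE = re.compile(
    r"illegal attempt to update using time (?P<new>\d+)\.\d+ "
    r"when last update time is (?P<last>\d+)\.\d+ "
    r"\(minimum one second step\) (?P<path>\S+)"
)


def summarise_duplicate_updates(lines):
    """Count rejected updates per rrd path where new time == last update time."""
    counts = Counter()
    for line in lines:
        match = DUP_UPDATE.search(line)
        if match and match.group("new") == match.group("last"):
            counts[match.group("path")] += 1
    return counts


# Three lines copied from the log excerpt earlier in the thread.
sample = [
    "2017-07-24 09:31:35,211 ERROR zen.RRDUtil: rrdtool reported error rrdcached: illegal attempt to update using time 1500903095.000000 when last update time is 1500903095.000000 (minimum one second step) Devices/10.64.65.78/nsi/state_state",
    "2017-07-24 09:31:35,212 ERROR zen.RRDUtil: rrdtool reported error rrdcached: illegal attempt to update using time 1500903095.000000 when last update time is 1500903095.000000 (minimum one second step) Devices/10.64.65.78/nsi/state_state",
    "2017-07-24 09:31:35,213 ERROR zen.RRDUtil: rrdtool reported error rrdcached: illegal attempt to update using time 1500903095.000000 when last update time is 1500903095.000000 (minimum one second step) Devices/10.64.65.78/sppsvc/state_state",
]

summary = summarise_duplicate_updates(sample)
for path, count in summary.most_common():
    print(count, path)
```

Fed the whole zenpython log instead of the sample list, the same function would show whether the rejections are confined to the state_state datapoints of Windows services or are more widespread.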

Have you upgraded the Windows ZenPack recently?

I am assuming that this DID collect data for the state.state datapoint at some stage? 

Look at the Monitoring Template called WinService (/Server/Microsoft) and check that there is a datapoint called state.state - this datapoint didn't exist in earlier versions of the ZenPack.

Another possibility to check is whether the time on your Zenoss box is correct.
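
One rough sanity check, using only values already in the log excerpt, is to compare the wall-clock stamp on a log line with the epoch time in the same error message. The sketch below is an illustration only: the function name is invented, it needs Python 3, and a tidy whole-hour offset merely suggests a timezone difference. It does not prove the clock is right, so it is still worth comparing against a trusted time source.

```python
from datetime import datetime, timezone


def log_clock_offset_seconds(log_stamp, epoch_seconds):
    """Log wall-clock time (read as if it were UTC) minus the epoch time, in seconds."""
    wall = datetime.strptime(log_stamp, "%Y-%m-%d %H:%M:%S")
    wall = wall.replace(tzinfo=timezone.utc)
    return int(wall.timestamp()) - int(epoch_seconds)


# Both values come from the first error line quoted in this thread.
offset = log_clock_offset_seconds("2017-07-24 09:31:35", 1500903095)
print(offset / 3600.0)  # prints -4.0
```

For the quoted line the difference is exactly four hours, which is what a collector logging in a UTC-4 local timezone would produce. A difference with stray minutes or seconds would be a stronger hint of clock trouble.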

The error message is actually from rrdcached which has a set of journal files under /opt/zenoss/var/rrd_journals - have a look in there.  Pick the latest file and see if there are errors in there.  You can restart the rrdcached daemon with:
zenrrdcached stop; zenrrdcached start

Note that the zenoss daemon is zenrrdcached but the process that actually runs (if you check with ps -ef) is rrdcached:

zenoss 2679 1 0 18:01 ? 00:00:00 /usr/bin/rrdcached -b /opt/zenoss/perf -p /opt/zenoss/var/rrdcached.pid -l /opt/zenoss/var/rrdcached.sock -j /opt/zenoss/var/rrd_journals
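
Picking the latest journal file, as suggested above, can be scripted. This is a hedged sketch rather than a Zenoss tool: the directory is the one given in this post, the helper names are hypothetical, and the layout of the journal entries is not described in this thread, so the script only prints the tail of the newest file for manual inspection.

```python
import os

JOURNAL_DIR = "/opt/zenoss/var/rrd_journals"  # path taken from this post


def newest_file(directory):
    """Return the most recently modified regular file in directory, or None."""
    paths = [os.path.join(directory, name) for name in os.listdir(directory)]
    files = [path for path in paths if os.path.isfile(path)]
    return max(files, key=os.path.getmtime) if files else None


def tail(path, count=20):
    """Return the last `count` lines of a text file."""
    with open(path, "r", errors="replace") as handle:
        return handle.readlines()[-count:]


if __name__ == "__main__":
    # Does nothing on a machine without the Zenoss journal directory.
    if os.path.isdir(JOURNAL_DIR):
        latest = newest_file(JOURNAL_DIR)
        if latest is not None:
            print("Latest journal:", latest)
            for line in tail(latest):
                print(line.rstrip())
```

Run it as the zenoss user so that the journal files are readable.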


Cheers,
Jane

------------------------------
Jane Curry
Skills 1st United Kingdom
jane.curry@skills-1st.co.uk
------------------------------




