[Nagiosplug-help] check_nrpe plugins work fine for a while then start throwing "NRPE: unable to read output"
Kyle Smith
kyles at apple.com
Thu May 21 16:59:42 CEST 2009
Hey Philip,
Thanks for the response, see my comments inline...
On May 20, 2009, at 6:09 PM, Philip Marcus wrote:
> Hey Kyle,
>
> I'm just throwing out some random ideas off the top of my head...
> To test, I just enabled debug in one of my nrpe configs, restarted
> the process and did see more output from the daemon.log file in my
> /var/log dir (running Ubuntu).
> Do you see any output from nrpe? You should see a startup and stop
> message. If not, check the logging facility, and stop/start the nrpe
> daemon again.
Yes, I see nrpe messages for startup and shutdown of the daemon in my
system log file (I'm on OS X, so the logfile is /var/log/system.log). My
issue is that I don't see any more messages with debug enabled than I
do with debug disabled. This is hampering my ability to work this
issue since I can't tell what nrpe is doing internally. Here's a
snippet of the logging I see for a shutdown and restart of nrpe on my
host manta. As part of the startup logging it tells me I have
configured nrpe to accept command args, which is accurate...
May 21 07:55:30 manta nrpe[10200]: Caught SIGTERM - shutting down...
May 21 07:55:30 manta nrpe[10200]: Cannot remove pidfile '/var/run/nrpe.pid' - check your privileges.
May 21 07:55:30 manta nrpe[10200]: Daemon shutdown
May 21 07:55:33 manta sudo[11969]: kylesmith : TTY=ttys000 ; PWD=/Users/kylesmith ; USER=root ; COMMAND=/usr/local/nagios/etc/init.d/init-script start
May 21 07:55:33 manta nrpe[11971]: Starting up daemon
May 21 07:55:33 manta nrpe[11971]: Warning: Daemon is configured to accept command arguments from clients!
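
One thing that may be worth ruling out here, offered as an assumption
rather than a confirmed diagnosis: if nrpe emits its extra debug output
at a low syslog priority, the syslog configuration on the host may be
dropping those messages before they reach /var/log/system.log. A
minimal Ruby sketch for checking which priorities actually arrive is
below. It shells out to the standard logger(1) command; the "daemon"
facility and the "nrpe-probe" tag are assumptions (substitute whatever
log_facility is set to in nrpe.cfg).

```ruby
# Priorities to probe, from least to most severe.
PRIORITIES = %w[debug info notice warning err].freeze

# Build one `logger` command line per priority for the given facility.
# The facility and tag defaults are assumptions for illustration; the
# tag just makes the probe lines easy to grep for afterwards.
def probe_commands(facility = "daemon", tag = "nrpe-probe")
  PRIORITIES.map do |prio|
    ["logger", "-p", "#{facility}.#{prio}", "-t", tag,
     "probe at #{facility}.#{prio}"]
  end
end

if __FILE__ == $PROGRAM_NAME
  # Send each probe message through the local syslog daemon.
  probe_commands.each { |argv| system(*argv) }
  puts "Sent #{PRIORITIES.size} probes; grep the system log for 'nrpe-probe'."
end
```

If the debug and info probes never show up in the log but notice and
above do, the filtering is happening in syslog rather than in nrpe.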
>
>
> Regarding the problem where after a while the remote check fails,
> that's a difficult one to pin down. It sounds like something is
> timing out. I don't know anything about your environment or what
> the actual check is doing, but it sounds like maybe some local
> disk access is timing out (I've seen this with kerberized NFS),
> or maybe some other local environment setting or keyring or something
> is expiring. When you log in (as whichever user), you get a fresh
> "cert" and the command works, but the cert the existing process is
> running under has timed out. Like I said, just a stab in the dark.
I understand what you are getting at and I will investigate such
possibilities.
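
For context on the error itself: as far as I understand it, "NRPE:
unable to read output" generally means the daemon ran the command but
got nothing back on stdout, which would be consistent with a plugin
that dies early (for example on an expired credential) and writes its
complaint only to stderr. A hedged sketch of a Ruby wrapper that always
produces one status line is below; run_check and emit are hypothetical
helper names, not part of any Nagios library, and the exit codes are
the usual plugin conventions (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN).

```ruby
# Conventional Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3
STATUS_NAMES = %w[OK WARNING CRITICAL UNKNOWN].freeze

# Run the block, which should return [exit_code, message].
# Any StandardError is converted into an UNKNOWN result that records
# the exception and a little of the daemon's environment, so a line
# is always available for stdout even when the check itself fails.
def run_check
  code, message = yield
  code = UNKNOWN unless code.is_a?(Integer) && code.between?(OK, UNKNOWN)
  [code, "#{STATUS_NAMES[code]}: #{message}"]
rescue StandardError => e
  [UNKNOWN, "UNKNOWN: #{e.class}: #{e.message} " \
            "(user=#{ENV['USER'].inspect}, home=#{ENV['HOME'].inspect})"]
end

# Print the status line, flush it, and return the exit code so the
# caller can pass it to exit.
def emit(result, out = $stdout)
  code, line = result
  out.puts line
  out.flush
  code
end
```

A plugin built this way would end with something like
exit(emit(run_check { ... })), so a failure inside the check shows up
in the Nagios console as an UNKNOWN line with the exception text rather
than as an empty read.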
>
>
> Good luck,
Thanks again! >bows deeply<
kyle
>
>
> Philip
>
>
> On Wed, May 20, 2009 at 3:56 PM, Kyle Smith <kyles at apple.com> wrote:
> Hey Nagios Folks,
>
> We're seeing some strange behavior with a few of our nrpe plugins
> and are wondering if anyone has any ideas on how to work on the
> problem.
>
> We are running OS X 10.5.6 with NRPE 2.12 and NRPE plugins 1.4.13.
>
> We have some plugins written in Ruby (boss loves it, I am learning
> it :-)) that work fine with a freshly started nrpe daemon. Start up
> nrpe on the client, force checks from the gui, see a green nagios
> console, nice. But after a while in the console under "Status
> Information" we start seeing "NRPE: unable to read output" rather
> than our normal success messages. Bounce the nrpe on the client,
> force the check through the gui, and things start working again for
> a while.
>
> While seeing the errors I can still run the plugin on the client no
> problem, but when I try to run it from the server via the command line
> (/usr/local/nagios/libexec/check_nrpe -H IP.of.client -c command) I
> see the error "NRPE: unable to read output".
>
> Another data point - We have one system where things were working
> fine but we needed to start passing params to nrpe. I recompiled
> after running ./configure --enable-command-args and set
> dont_blame_nrpe=1 in nrpe.cfg. Start up a new nrpe and it works
> fine, can pass params from the server no problem...but then a check
> that was working fine for weeks starts throwing "NRPE: unable to
> read output" after 3 or 4 successful invocations. Arg!! I had to
> roll back to the previous nrpe and check_nrpe.
>
> I am trying to set nrpe debug on in nrpe.cfg on the client and then
> bounce nrpe, but I don't see any more messages in /var/log/system.log
> as compared to when I run with nrpe debug turned off. It's like
> turning on debug has no effect.
>
> I am curious about 2 issues - 1) why no extra messages in the system
> log file with debug enabled? and 2) Anyone seen anything like this
> happening where nrpe plugins work fine for a while then start
> throwing the error?
>
> TIA
>
> kyle
>
>
>
> ------------------------------------------------------------------------------
> Register Now for Creativity and Technology (CaT), June 3rd, NYC. CaT
> is a gathering of tech-side developers & brand creativity
> professionals. Meet
> the minds behind Google Creative Lab, Visual Complexity, Processing, &
> iPhoneDevCamp asthey present alongside digital heavyweights like
> Barbarian
> Group, R/GA, & Big Spaceship. http://www.creativitycat.com
> _______________________________________________
> Nagiosplug-help mailing list
> Nagiosplug-help at lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/nagiosplug-help
> ::: Please include plugins version (-v) and OS when reporting any
> issue.
> ::: Messages without supporting info will risk being sent to /dev/null