Hi!
One of our agents, running on a virtualized Linux host, was lost for several
minutes. No measurements are present in the server's database for that time,
yet there is no sign of a problem in the virtualization log, or even in the
agent's own log. There is an exception, but the very same one occurs many times.
According to the agent log, it sent its reports to the server regularly:
2014-01-14 10:17:41,788 INFO [MeasurementManager.sender-1]
(rhq.core.pc.measurement.MeasurementSenderRunner)- Measurement collection
for [14] metrics took 94ms - sending report to Server...
2014-01-14 10:18:09,712 INFO [InventoryManager.availability-1]
(rhq.core.pc.inventory.AvailabilityExecutor)- Scan Starting: Tue Jan 14
10:18:09 CET 2014
2014-01-14 10:18:09,733 INFO [InventoryManager.availability-1]
(rhq.core.pc.inventory.AvailabilityExecutor)- Scan Ended : Tue Jan 14
10:18:09 CET 2014 : Scan [startTime=1389691089712, endTime=1389691089732,
runtime=20, isFull=false, isForced=false, numResources=179,
numGetAvailabilityCalls=7, numScheduledRandomly=0, numPushedByInterval=6,
numAvailabilityChanges=0, numDeferToParent=0]
2014-01-14 10:18:11,788 INFO [MeasurementManager.sender-1]
(rhq.core.pc.measurement.MeasurementSenderRunner)- Measurement collection
for [14] metrics took 67ms - sending report to Server...
2014-01-14 10:18:11,812 INFO [ConfigurationManager.threadpool-1]
(rhq.core.pc.configuration.ConfigurationCheckExecutor)- Starting
configuration update check...
2014-01-14 10:18:11,814 INFO [ResourceContainer.invoker.daemon-56351]
(org.rhq.plugins.jmx.MBeanResourceComponent)- Configuration management is
currently disabled for the ResourceType[id=0, name=java.util.logging,
plugin=RHQAgent, category=Service] Resource with key
[java.util.logging:type=Logging] - it can be enabled in the Resource's
Inventory > Connection tab.
2014-01-14 10:18:12,793 WARN [ConfigurationManager.threadpool-1]
(rhq.core.pc.configuration.ConfigurationCheckExecutor)- An error occurred
while checking for an updated Resource configuration for Resource[id=11763,
uuid=74154a3e-07fc-4b0e-88be-f14e95059b8a, type={OpenSSH}SSHD, key=sshd,
name=OpenSSH sshd, parent=amweb-se, version=1.0].
net.augeas.AugeasException: get failed
at net.augeas.Augeas.processLastCall(Unknown Source)
at net.augeas.Augeas.get(Unknown Source)
at
org.rhq.plugins.sshd.OpenSSHDComponent.getConfig(OpenSSHDComponent.java:189)
at
org.rhq.plugins.sshd.OpenSSHDComponent.loadResourceConfiguration(OpenSSHDComponent.java:168)
at
org.rhq.plugins.sshd.OpenSSHDComponent.loadResourceConfiguration(OpenSSHDComponent.java:119)
at sun.reflect.GeneratedMethodAccessor64.invoke(Unknown Source)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at
org.rhq.core.pc.inventory.ResourceContainer$ComponentInvocation.call(ResourceContainer.java:654)
at
java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
at java.util.concurrent.FutureTask.run(FutureTask.java:138)
at
java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
at java.lang.Thread.run(Thread.java:662)
2014-01-14 10:18:12,793 WARN [ConfigurationManager.threadpool-1]
(rhq.core.pc.configuration.ConfigurationCheckExecutor)- An error occurred
while checking for an updated Resource configuration for Resource[id=11765,
uuid=d9704c88-052e-47e2-8b9e-3495a57bb5c5, type={Apache}Apache HTTP Server,
key=/etc/apache2||/etc/apache2/apache2.conf, name=127.0.1.1:80,
parent=amweb-se, version=2.2.22].
java.lang.IllegalStateException: Configuration and child resource
creation/deletion support for Apache is optional. If you switched it on by
enabling Augeas support in the connection settings of the Apache server
resource and still get this message, it means that either your Apache
version is not supported (only Apache 2.x is supported) or Augeas is not
available on your platform.
at
org.rhq.plugins.apache.ApacheServerComponent.loadResourceConfiguration(ApacheServerComponent.java:369)
at sun.reflect.GeneratedMethodAccessor64.invoke(Unknown Source)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at
org.rhq.core.pc.inventory.ResourceContainer$ComponentInvocation.call(ResourceContainer.java:654)
at
java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
at java.util.concurrent.FutureTask.run(FutureTask.java:138)
at
java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
at java.lang.Thread.run(Thread.java:662)
2014-01-14 10:18:12,794 WARN [ConfigurationManager.threadpool-1]
(rhq.core.pc.configuration.ConfigurationCheckExecutor)- An error occurred
while checking for an updated Resource configuration for Resource[id=11799,
uuid=63150549-2454-4218-aaad-ddcffc071718, type={Apache}Apache Virtual
Host, key=MainServer, name=Main, parent=127.0.1.1:80].
java.lang.IllegalStateException: Configuration and child resource
creation/deletion support for Apache is optional. If you switched it on by
enabling Augeas support in the connection settings of the Apache server
resource and still get this message, it means that either your Apache
version is not supported (only Apache 2.x is supported) or Augeas is not
available on your platform.
at
org.rhq.plugins.apache.ApacheVirtualHostServiceComponent.loadResourceConfiguration(ApacheVirtualHostServiceComponent.java:165)
at sun.reflect.GeneratedMethodAccessor64.invoke(Unknown Source)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at
org.rhq.core.pc.inventory.ResourceContainer$ComponentInvocation.call(ResourceContainer.java:654)
at
java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
at java.util.concurrent.FutureTask.run(FutureTask.java:138)
at
java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
at java.lang.Thread.run(Thread.java:662)
2014-01-14 10:18:12,795 INFO [ConfigurationManager.threadpool-1]
(rhq.core.pc.configuration.ConfigurationCheckExecutor)- Configuration
update check completed in 982ms
2014-01-14 10:18:39,733 INFO [InventoryManager.availability-1]
(rhq.core.pc.inventory.AvailabilityExecutor)- Scan Starting: Tue Jan 14
10:18:39 CET 2014
2014-01-14 10:18:39,735 INFO [InventoryManager.availability-1]
(rhq.core.pc.inventory.AvailabilityExecutor)- Scan Ended : Tue Jan 14
10:18:39 CET 2014 : Scan [startTime=1389691119733, endTime=1389691119735,
runtime=2, isFull=false, isForced=false, numResources=179,
numGetAvailabilityCalls=1, numScheduledRandomly=0, numPushedByInterval=0,
numAvailabilityChanges=0, numDeferToParent=0]
2014-01-14 10:18:41,789 INFO [MeasurementManager.sender-1]
(rhq.core.pc.measurement.MeasurementSenderRunner)- Measurement collection
for [9] metrics took 73ms - sending report to Server...
The exception above occurs every hour, in the same minute.
And this is the server's log:
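To double-check that the reports really went out at regular intervals, the agent log could be scanned for gaps between consecutive lines written by the MeasurementManager.sender thread. The following is only a rough sketch: the function name `find_report_gaps`, the 90-second threshold, and the log path passed on the command line are my own assumptions, and the pattern is based solely on the line format visible in the excerpt above.

```python
import re
import sys
from datetime import datetime, timedelta

# Matches the start of an agent log line written by the measurement sender
# thread, e.g. "2014-01-14 10:17:41,788 INFO [MeasurementManager.sender-1]".
SENDER_LINE = re.compile(
    r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}),\d{3}\s+\w+\s+"
    r"\[MeasurementManager\.sender-\d+\]"
)


def find_report_gaps(lines, max_gap=timedelta(seconds=90)):
    """Return (previous, current) timestamp pairs further apart than max_gap.

    max_gap is an assumed threshold; the excerpt shows reports 30 s apart.
    """
    gaps = []
    previous = None
    for line in lines:
        match = SENDER_LINE.match(line)
        if not match:
            continue  # not a sender line (other thread, stack trace, ...)
        current = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S")
        if previous is not None and current - previous > max_gap:
            gaps.append((previous, current))
        previous = current
    return gaps


if __name__ == "__main__":
    # Hypothetical usage: python report_gaps.py /path/to/agent.log
    with open(sys.argv[1], encoding="utf-8", errors="replace") as log:
        for start, end in find_report_gaps(log):
            print(f"no report sent between {start} and {end} ({end - start})")
```

Note that this only shows whether the agent believed it was sending reports; it cannot tell whether the server actually received them.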
10:20:08,530 INFO [org.rhq.enterprise.server.core.AgentManagerBean]
(RHQScheduler_Worker-5) Have not heard from agent [amweb] since [Tue Jan 14
10:14:17 UTC 2014]. Will be backfilled since we suspect it is down
Any idea how to prevent such events in the future?