...

TAD (Test And Diagnostic) monitors the amount of free memory available in the system at run time. When free memory drops below a configurable threshold, it triggers the memory/Resource Reclamation (RR) process in TDM. The RR process can also be triggered by a memory allocation failure, which results in a notification being sent to TDM to try to reclaim memory.

Self-heal is another feature implemented in the Test And Diagnostic component.

Self-heal monitors:

  • CPU usage
  • Memory Usage
  • Critical RDK-B processes

Self-heal stores Reset Count and Reboot Count.
Self-heal takes the required action, such as rebooting the device or restarting the required process, based on predefined conditions.
Self-heal performs a connectivity test.

...

Resources are monitored periodically (e.g., every 15 minutes). If "Average Memory Used" reaches the threshold value, the necessary action is executed.
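The threshold check described above can be sketched as follows. This is only an illustration, not the Self-heal implementation: the function names are hypothetical, "used" memory is assumed to mean MemTotal minus MemFree from /proc/meminfo-style text, and the average is assumed to be a plain mean over the samples collected in one monitoring period.

```python
from statistics import mean


def memory_used_percent(meminfo_text):
    """Return used memory as a percentage, parsed from /proc/meminfo-style text.

    Assumption: used = MemTotal - MemFree. The real feature may count
    buffers and caches differently.
    """
    fields = {}
    for line in meminfo_text.splitlines():
        key, _, rest = line.partition(":")
        parts = rest.split()
        if parts:
            fields[key.strip()] = int(parts[0])  # value in kB
    total = fields["MemTotal"]
    free = fields["MemFree"]
    return 100.0 * (total - free) / total


def needs_action(samples, threshold_percent):
    """True when the average of the samples reaches the threshold."""
    return bool(samples) and mean(samples) >= threshold_percent


if __name__ == "__main__":
    sample = "MemTotal: 1000 kB\nMemFree: 250 kB\n"
    print(memory_used_percent(sample))
    print(needs_action([70.0, 80.0, 90.0], threshold_percent=80.0))
```

The comparison uses "greater than or equal" because the text says the action is taken when the average reaches the threshold.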

...

Processes are monitored periodically (e.g., every 15 minutes) based on their process IDs (PIDs). Depending on whether a process ID is available, the required action is taken, such as restarting the process or rebooting the device.

The "task_health_monitor.sh" script is used for monitoring all RDK-B processes.
It is located on the device at path: "/fss/gw/usr/ccsp/tad/task_health_monitor.sh"
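As a rough illustration of PID-based monitoring, the sketch below checks whether each process still has a live PID and picks an action for those that do not. It is not a port of task_health_monitor.sh. The process names, the `critical` set, and the rule that a missing critical process leads to a reboot while any other missing process is restarted are all assumptions made for the example; the actual predefined conditions are not given here. It assumes a POSIX system.

```python
import os


def pid_is_alive(pid):
    """Return True if a process with this PID exists (POSIX only)."""
    try:
        os.kill(pid, 0)  # signal 0 performs the existence check only
    except ProcessLookupError:
        return False
    except PermissionError:
        return True  # the process exists but belongs to another user
    return True


def check_processes(pids_by_name, critical, is_alive=pid_is_alive):
    """Map each missing process to an action: "reboot" or "restart".

    pids_by_name: {process name: PID, or None if no PID was found}
    critical: names whose absence is assumed to require a reboot
    """
    actions = {}
    for name, pid in pids_by_name.items():
        if pid is not None and is_alive(pid):
            continue  # process is running, nothing to do
        actions[name] = "reboot" if name in critical else "restart"
    return actions


if __name__ == "__main__":
    # Hypothetical process names, for illustration only.
    observed = {"proc_a": os.getpid(), "proc_b": None, "proc_c": None}
    print(check_processes(observed, critical={"proc_c"}))
```

Passing `is_alive` as a parameter keeps the decision logic testable without real processes.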

Self-heal stores Reset Count and Reboot Count.

Connectivity Test

Self-heal performs a connectivity test by pinging a server IP/URI, which needs to be configured. If no server IP/URI is configured, the ping test is not executed and no action is taken. If a server is configured and the ping test fails, a reboot action is executed.
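The three outcomes described above can be expressed as a small decision function. This is a minimal sketch with hypothetical names. The `ping_once` helper assumes a Linux-style `ping` command that accepts `-c` (count) and `-W` (timeout in seconds), and the returned strings are illustrative labels rather than values used by Self-heal.

```python
import subprocess


def ping_once(host, timeout_s=2):
    """Send one ICMP echo request; return True if the ping command succeeds."""
    result = subprocess.run(
        ["ping", "-c", "1", "-W", str(timeout_s), host],
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0


def connectivity_action(server, ping=ping_once):
    """Decide what to do based on the configured server and the ping result.

    Returns "skipped" when no server is configured, "ok" when the ping
    succeeds, and "reboot" when a configured server cannot be reached.
    """
    if not server:
        return "skipped"  # no server IP/URI configured: no test, no action
    return "ok" if ping(server) else "reboot"


if __name__ == "__main__":
    # A stand-in ping function avoids touching the network in this demo.
    print(connectivity_action("", ping=lambda host: True))
    print(connectivity_action("192.0.2.1", ping=lambda host: False))
```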

...