RDK Resources
[*RDK Preferred*]
Code Management Facility
RDK Forums
[RDK Conferences]
RDK Support
Archives
Papers & Presentations Archive
This Page is under Development
Selfheal is another feature implemented in Test And Diagnostic Component. SelfHeal is a Process monitoring and recovery module.
Self-heal Periodically monitors the below scenarios:
Self-heal stores Reset Count and Reboot Count.
Self-heal takes required action like: Rebooting the device, Restarting required process based on predefined conditions.
Self-heal does connectivity test.
Please ensure that below Self heal scripts are present on the device at the path "/usr/ccsp/tad".
resource_monitor.sh
task_health_monitor.sh
corrective_action.sh
self_heal_connectivity.sh
Resource Monitoring:
"resource_monitor.sh" script is used for monitoring Memory and CPU usage.Monitors the resources periodically (eg: 60 seconds). If "Average Memory Used" reaches threshold value, reboot action will be executed.
Process Monitoring:
<Architecture Diagram>
<Describe the communication between the sub-systems. (Diagrams may be used to illustrate communications). >
< Identify input interfaces, function call protocol, and the nature of the data structures passed across the interface between the sub-modules>
< Identify output interfaces, function call protocol, and the nature of the data structures passed across the interface. >
S.NO | Module | DMCLI COMMANDS | Description |
---|---|---|---|
1. | TDM(TestandDiagnostic) XML Mapper - TestAndDiagnostic.XML | Device.SelfHeal.X_RDKCENTRAL-COM_Enable | Used to enable/disable self heal functionality |
2. | TDM(TestandDiagnostic) XML Mapper - TestAndDiagnostic.XML | Device.SelfHeal.X_RDKCENTRAL-COM_MaxRebootCount | Used to set the maximum reboot count for rebooting the rpi device once the cpu and memory threshold value was reached as 100(default value). By default, it set as 3. If it reaches 3 ..after that it doesn't do the reboot functionality. If we want, we can increase the reboot count also. |
3. | TDM(TestandDiagnostic) XML Mapper - TestAndDiagnostic.XML | Device.SelfHeal.X_RDKCENTRAL-COM_MaxResetCount | Used to set the maximum reset count for connectivity test. for example, if it reaches 3(3 times it stops the lan functionality), after that it doesn't stop the lan functionality. If we want , we can increase the reset count also. |
4. | TDM(TestandDiagnostic) XML Mapper - TestAndDiagnostic.XML | Device.SelfHeal.X_RDKCENTRAL-COM_DNS_PINGTEST_Enable | Using this command to enable the PING function for connectivity tests.By default, it set as TRUE. |
5. | TDM(TestandDiagnostic) XML Mapper - TestAndDiagnostic.XML | Device.SelfHeal.X_RDKCENTRAL-COM_DNS_URL | Using this command to set the DNS url for PING function for connectivity test , By default, it set as www.google.com |
6. | TDM(TestandDiagnostic) XML Mapper - TestAndDiagnostic.XML | Device.SelfHeal.ConnectivityTest.X_RDKCENTRAL-COM_PingInterval | Using this command to set the PING interval time for connectivity test . By default, it set as 60. Range of ping interval is min 15 to max 1440. |
7. | TDM(TestandDiagnostic) XML Mapper - TestAndDiagnostic.XML | Device.SelfHeal.ConnectivityTest.X_RDKCENTRAL-COM_CorrectiveAction | Using this command to enable/disable for Corrective Action for self heal scripts. By default, it set as TRUE. |
8. | TDM(TestandDiagnostic) XML Mapper - TestAndDiagnostic.XML | Device.SelfHeal.ResourceMonitor.X_RDKCENTRAL-COM_UsageComputeWindow | Using this command to set the resource monitor interval time. By default, it set as 1. |
9. | TDM(TestandDiagnostic) XML Mapper - TestAndDiagnostic.XML | Device.SelfHeal.ResourceMonitor.X_RDKCENTRAL-COM_AvgCPUThreshold | Using this command to set the AVG CPU threshold value. By default, it set as 100 |
10. | TDM(TestandDiagnostic) XML Mapper - TestAndDiagnostic.XML | Device.SelfHeal.ResourceMonitor.X_RDKCENTRAL-COM_AvgMemoryThreshold | Using this command to set the AVG Memory threshold value. By default, it set as 100 |
11. | TDM(TestandDiagnostic) XML Mapper - TestAndDiagnostic.XML | Device.SelfHeal.ConnectivityTest.X_RDKCENTRAL-COM_RebootInterval | Using this command to set the reboot interval time for connectivity test. By default, it can be set as 28800. If DNS or WAN_IP gets down, device will stop the LAN functionality. If device will stop the LAN functionality..Internally PING functionality will check the diff of current time and last reboot time will be greater than the reboot interval time..then only device will stop the LAN functionality. |
< Describe how the current design is suitable for future enhancement without completely modifying existing design . >