sonic-buildimage/files/image_config
yozhao101 04cd1d61e8
[Monit] Monitoring the running status of containers. (#6251)
**- Why I did it**
This PR aims to monitor the running status of each container. Currently the auto-restart feature was enabled. If a critical process exited unexpected, the container will be restarted. If the container was restarted 3 times during 20 minutes, then it will not run anymore unless we cleared the flag using the command `sudo systemctl reset-failed <container_name>` manually. 

**- How I did it**
We will employ Monit to monitor a script. This script will generate the expected running container list and compare it with the current running containers. If there are containers which were expected to run but were not running, then an alerting message will be written into syslog.

**- How to verify it**
I tested this feature on a lab device `str-a7050-acs-3` which has single ASIC and `str2-n3164-acs-3` which has a Multi-ASIC. First I manually stopped a container by running the command `sudo systemctl stop <container_name>`, then I checked whether there was an alerting message in the syslog.

Signed-off-by: Yong Zhao <yozhao@microsoft.com>
2021-01-07 19:52:22 -08:00
..
apt change image apt source list from stretch to buster for arm 2020-05-25 13:15:19 +00:00
bash [bash.bashrc] Add reverse SSH script to bash.bashrc (#5438) 2020-11-24 14:11:53 +08:00
config-chassisdb [ChassisDB]: bring up ChassisDB service (#5283) 2020-10-14 15:15:24 -07:00
config-setup Take a copy of existing TACACS credentials and restore it during upgrade (#6285) 2021-01-07 16:45:38 -08:00
constants [bgpcfgd]: Fixes for BBR (#5956) 2020-11-19 00:07:58 -08:00
copp Copp Manager Changes (#4861) 2020-11-23 09:31:42 -08:00
corefile_uploader [Python] Align files in root dir, dockers/ and files/ with PEP8 standards (#6109) 2020-12-03 15:57:50 -08:00
cron.d [core_cleanup] Fix issue where core_cleanup job runs too frequently (#3659) 2019-10-23 15:55:47 -07:00
ebtables [ebtbles] Replace binary config file to text config file for ebtables (#5252) 2020-09-03 17:27:07 -07:00
environment [image]: Update login message (#706) 2017-06-14 15:18:02 -07:00
fstrim [sonic-utilities] Build and install as a Python wheel package (#5409) 2020-09-20 20:16:42 -07:00
hostname [hostname-config] improve hostname-config process (#3676) 2019-10-29 08:30:27 -07:00
interfaces Set preference for forced mgmt routes (#5844) 2020-11-10 14:20:13 -08:00
kdump [kdump] Add more kernel panic conditions for vmcore dump (#6095) 2020-12-15 08:54:13 -08:00
kubernetes [baseimage]: Install Kubernetes packages if enabled in image (#4374) 2020-04-13 08:41:18 -07:00
logrotate Move frr logs from syslog to /var/log/frr/*.log (#5988) 2020-12-10 08:44:34 -08:00
misc [Python] Align files in root dir, dockers/ and files/ with PEP8 standards (#6109) 2020-12-03 15:57:50 -08:00
monit [Monit] Monitoring the running status of containers. (#6251) 2021-01-07 19:52:22 -08:00
ntp [ntp]: Source interface support for NTP (#6033) 2020-12-21 05:34:13 -08:00
pcie-check Fix bug with pcie-check.service (#5368) 2020-09-15 15:21:31 -07:00
platform [kdump]: Kdump usability and reliability improvements (#6113) 2020-12-10 01:32:37 -08:00
rsyslog Move frr logs from syslog to /var/log/frr/*.log (#5988) 2020-12-10 08:44:34 -08:00
secureboot [platform] Add Support For Environment Variable File (#5010) 2020-07-31 17:59:09 -07:00
snmp mvrf_avoid_snmp_yml_config: made changes to pass SNMP config from con… (#4057) 2020-01-28 17:41:21 -08:00
sudoers Add the vtysh command with newly added "-n" option for multi asic to the read_only_cmds (#5845) 2020-11-10 12:18:49 -08:00
sysctl Set sock rx Buf size to 3MB. (#5566) 2020-10-15 14:40:59 -07:00
syslog [baseimage]: /host unmount timeout issue during reboot. (#5032) 2020-07-25 01:27:58 -07:00
system-health [system-health] Add support for monitoring system health (#4835) 2020-10-12 11:12:49 +03:00
systemd [services] Restart SwSS service upon unexpected critical process exit (#2845) 2019-05-01 08:02:38 -07:00
topology [platform] Add Support For Environment Variable File (#5010) 2020-07-31 17:59:09 -07:00
updategraph [platform] Add Support For Environment Variable File (#5010) 2020-07-31 17:59:09 -07:00
warmboot-finalizer fix error: interface counters is mismatch after warm-reboot (#5346) 2020-11-20 10:37:45 -08:00
watchdog-control [sonic-utilities] Build and install as a Python wheel package (#5409) 2020-09-20 20:16:42 -07:00