sonic-buildimage/files
yozhao101 37863ac854
[Monit] Restart telemetry container if memory usage is beyond the threshold (#7645)
Signed-off-by: Yong Zhao <yozhao@microsoft.com>

Why I did it
This PR monitors the memory usage of the streaming telemetry container and restarts the container if its memory usage exceeds a pre-defined threshold.

How I did it
I used the system tool Monit to run a script, memory_checker, which periodically checks the memory usage of the streaming telemetry container. If the memory usage of the telemetry container exceeds the pre-defined threshold 10 times within 20 cycles, an alert message is written to syslog and, at the same time, Monit runs the script restart_service to restart the streaming telemetry container.
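
The check described above can be sketched in Python as follows. This is only an illustration under stated assumptions, not the memory_checker script from this PR: the function parse_mem_usage, the class ThresholdWindow, and the "used / limit" input string (modeled on what `docker stats` typically prints) are hypothetical names and formats chosen for the example. In the real setup, Monit itself counts the failing cycles; the window class here only mimics that "10 times within 20 cycles" rule.

```python
import re
from collections import deque

# Multipliers for binary unit suffixes (assumed input format, e.g. "400MiB").
_UNITS = {"B": 1, "KiB": 1024, "MiB": 1024 ** 2, "GiB": 1024 ** 3}


def parse_mem_usage(text):
    """Convert a 'used / limit' string such as '400MiB / 7.8GiB' to bytes used."""
    used = text.split("/")[0].strip()
    match = re.fullmatch(r"([0-9]*\.?[0-9]+)\s*([KMG]iB|B)", used)
    if match is None:
        raise ValueError("unrecognized memory usage: %r" % text)
    value, unit = match.groups()
    return int(float(value) * _UNITS[unit])


class ThresholdWindow:
    """Report True once a check has failed `times` times in the last `cycles` runs."""

    def __init__(self, times=10, cycles=20):
        self.times = times
        # A bounded deque drops the oldest result automatically.
        self.results = deque(maxlen=cycles)

    def record(self, exceeded):
        """Store one check result and say whether a restart would be due."""
        self.results.append(bool(exceeded))
        return sum(self.results) >= self.times


def exceeds_threshold(usage_text, threshold_bytes):
    """Return True when the parsed usage is above the threshold."""
    return parse_mem_usage(usage_text) > threshold_bytes
```

A caller would sample the container's memory usage once per cycle, pass the result of exceeds_threshold to ThresholdWindow.record, and trigger the restart only when record returns True. Requiring several failures within a window, rather than a single one, avoids restarting the container on a short-lived spike.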

How to verify it
I verified this implementation on device str-7260cx3-acs-1.
2021-05-28 11:13:44 -07:00
Name | Last commit | Date
Aboot | [Arista] Fix dockerd issue on Arista platforms (#7376) | 2021-04-20 14:55:14 -07:00
apt | Add mirrors for reproducible build (#6813) | 2021-02-18 14:59:52 +08:00
build_scripts | [hostcfgd]: Add Ability To Configure Feature During Run-time (#6700) | 2021-03-13 05:56:27 -08:00
build_templates | [Monit] Restart telemetry container if memory usage is beyond the threshold (#7645) | 2021-05-28 11:13:44 -07:00
dhcp | ZTP infrastructure changes to support DHCP discovery provisioning data (#3298) | 2019-12-10 08:16:56 -08:00
docker | Enabling ipv6 support on docker container network. This is needed (#5418) | 2020-09-22 08:32:17 -07:00
image_config | [Monit] Restart telemetry container if memory usage is beyond the threshold (#7645) | 2021-05-28 11:13:44 -07:00
initramfs-tools | [kdump] Fix OOM events in crashkernel (#6447) | 2021-02-02 01:55:09 -08:00
scripts | [Mux] Do not clean-up HW_MUX_CABLE_TABLE from State DB (#7710) | 2021-05-26 09:12:34 -07:00
sshd | [sshd]: Create /run/sshd under systemd using RuntimeDirectory | 2020-04-17 04:51:51 +00:00