This repository has been archived on 2025-03-20. You can view files and clone it, but cannot push or open issues or pull requests.
sonic-buildimage/dockers
yozhao101 37863ac854
[Monit] Restart telemetry container if memory usage is beyond the threshold (#7645)
Signed-off-by: Yong Zhao yozhao@microsoft.com

Why I did it
This PR aims to monitor the memory usage of streaming telemetry container and restart streaming telemetry container if memory usage is larger than the pre-defined threshold.

How I did it
I borrowed the system tool Monit to run a script memory_checker which will periodically check the memory usage of streaming telemetry container. If the memory usage of telemetry container is larger than the pre-defined threshold for 10 times during 20 cycles, then an alerting message will be written into syslog and at the same time Monit will run the script restart_service to restart the streaming telemetry container.

How to verify it
I verified this implementation on device str-7260cx3-acs-1.
2021-05-28 11:13:44 -07:00
..
docker-base [arm] support compile sonic arm image on arm server (#7285) 2021-04-18 08:17:57 -07:00
docker-base-buster [arm] support compile sonic arm image on arm server (#7285) 2021-04-18 08:17:57 -07:00
docker-base-stretch [arm] support compile sonic arm image on arm server (#7285) 2021-04-18 08:17:57 -07:00
docker-basic_router [supervisord]: use abspath as supervisord entrypoint (#5995) 2020-11-22 21:18:44 -08:00
docker-config-engine [docker-base-buster][docker-config-engine-buster] No longer install Python 2 (#6162) 2020-12-25 21:29:25 -08:00
docker-config-engine-buster [docker-base-buster][docker-config-engine-buster] No longer install Python 2 (#6162) 2020-12-25 21:29:25 -08:00
docker-config-engine-stretch [docker-base-buster][docker-config-engine-buster] No longer install Python 2 (#6162) 2020-12-25 21:29:25 -08:00
docker-database Fix python version (#7658) 2021-05-20 15:47:46 -07:00
docker-dhcp-relay dhcpmon: support dual tor in docker template (#7470) 2021-05-03 10:51:34 -07:00
docker-fpm-frr Fix the type issue in rvtysh (#7648) 2021-05-20 21:35:23 +08:00
docker-fpm-gobgp [dockers][supervisor] Increase event buffer size for process exit listener; Set all event buffer sizes to 1024 (#7083) 2021-03-27 21:14:24 -07:00
docker-fpm-quagga Support readonly vtysh for sudoers (#7383) 2021-04-25 16:32:02 +08:00
docker-iccpd [dockers][supervisor] Increase event buffer size for process exit listener; Set all event buffer sizes to 1024 (#7083) 2021-03-27 21:14:24 -07:00
docker-lldp docker-lldp:intermittent DB errors will result in Client termination (#6119) 2021-05-18 09:51:02 -07:00
docker-macsec [dockers][supervisor] Increase event buffer size for process exit listener; Set all event buffer sizes to 1024 (#7083) 2021-03-27 21:14:24 -07:00
docker-nat [dockers][supervisor] Increase event buffer size for process exit listener; Set all event buffer sizes to 1024 (#7083) 2021-03-27 21:14:24 -07:00
docker-orchagent [docker-orchagent]: Increase ndppd kernel poll interval (#7456) 2021-04-30 16:30:30 -07:00
docker-platform-monitor [pmon]: add psmisc to bring fuser that dentifies processes that are using files or sockets (#7509) 2021-05-06 22:24:07 -07:00
docker-ptf [arm] support compile sonic arm image on arm server (#7285) 2021-04-18 08:17:57 -07:00
docker-router-advertiser [radv] fix bug for radv can't startup if DEVICE_METADATA.localhost.type is NULL (#7651) 2021-05-25 08:17:44 -07:00
docker-sflow [dockers][supervisor] Increase event buffer size for process exit listener; Set all event buffer sizes to 1024 (#7083) 2021-03-27 21:14:24 -07:00
docker-snmp [dockers][supervisor] Increase event buffer size for process exit listener; Set all event buffer sizes to 1024 (#7083) 2021-03-27 21:14:24 -07:00
docker-sonic-mgmt [docker-sonic-mgmt]: Upgrade Tgen version in SONiC mgmt docker (#7472) 2021-04-29 12:31:46 -07:00
docker-sonic-mgmt-framework [dockers][supervisor] Increase event buffer size for process exit listener; Set all event buffer sizes to 1024 (#7083) 2021-03-27 21:14:24 -07:00
docker-sonic-restapi Fix supervisor-proc-exit-listener startup issue in restapi (#7681) 2021-05-26 18:28:10 +08:00
docker-sonic-sdk [sonic-sdk] add sonic sdk and sonic sdk buildenv (#6712) 2021-05-28 10:16:02 -07:00
docker-sonic-sdk-buildenv [sonic-sdk] add sonic sdk and sonic sdk buildenv (#6712) 2021-05-28 10:16:02 -07:00
docker-sonic-telemetry [Monit] Restart telemetry container if memory usage is beyond the threshold (#7645) 2021-05-28 11:13:44 -07:00
docker-teamd [dockers][supervisor] Increase event buffer size for process exit listener; Set all event buffer sizes to 1024 (#7083) 2021-03-27 21:14:24 -07:00
dockerfile-macros.j2 [sonic-config-engine] Clean up dependencies, pin versions; install Python 3 package in Buster container (#5656) 2020-10-26 13:48:50 -07:00