sonic-buildimage/files/image_config
Feng-msft a5043bfc84
Fix monit false alarm issue, which locates in process_checker and it (#16907)
Fix monit false alarm issue, which located in process_checker and it missed "disk-sleep" status check, thus some 201911 SONiC box report "pmon|sensord" error coincidently.

#### Why I did it
Currently psutil library returns below detail process status:
running: The process is currently running.
sleeping: The process is sleeping or waiting for an event to occur.
disk-sleep: The process is waiting for I/O operations to complete.
stopped: The process has been stopped (e.g. via the SIGSTOP signal).
zombie: The process has terminated but is still listed in the process table.
dead: The process has terminated and has been removed from the process table.

We should regard running/sleeping/disk-sleep as normal case and not alert in monit process.

Now once the disk-sleep occurs during monit cycle, below syslog will be paged, so get rid of syslog output meanwhile.

yslog.2.gz:Feb 24 06:12:17.394619 MEL23-0101-0301-04T1 ERR monit[6040]: 'pmon|sensord' status failed (1) -- '/usr/sbin/sensord -f daemon' is not running in host
syslog.2.gz:Feb 24 06:13:17.932531 MEL23-0101-0301-04T1 ERR monit[6040]: 'pmon|sensord' status failed (1) -- '/usr/sbin/sensord -f daemon' is not running in host
syslog.2.gz:Feb 24 06:14:18.502505 MEL23-0101-0301-04T1 ERR monit[6040]: 'pmon|sensord' status failed (1) -- '/usr/sbin/sensord -f daemon' is not running in host

Then I tried to reproduce the issue by triggering process_checker for sensord frequently and observed it's under "disk-sleep" status once the alert is raised.

##### Work item tracking
- Microsoft ADO **(number only)**:17663589

#### How I did it
Fix process_checker script code for adding "disk-sleep" case handling.

#### How to verify it
Verified in local DUT.
2023-10-26 18:23:24 -07:00
..
apt [Build][201911] Fix the stretch/jessie mirror removed issue (#15083) 2023-05-17 22:52:26 -07:00
bash Close console session due to user inactivity (#9890) 2023-01-30 18:36:11 +00:00
caclmgrd [multi-asic] Enhanced iptable default rules (#6765) 2021-02-25 18:39:43 -08:00
config-setup Add service to restore TACACS from old config (#7560) (#7865) 2021-06-15 10:52:31 -07:00
constants [bgpcfgd]: Fixes for BBR (#5956) 2020-11-19 10:42:42 -08:00
corefile_uploader corefile uploader: Updates per review comments offline (#3915) 2019-12-31 14:42:01 -08:00
cron.d [core_cleanup] Fix issue where core_cleanup job runs too frequently (#3659) 2019-10-23 15:55:47 -07:00
ebtables [ebtbles] Replace binary config file to text config file for ebtables (#5252) 2021-01-27 16:57:41 -08:00
environment [201911] Change submodule path from Azure to sonic-net (#12313) 2022-10-12 21:07:22 +08:00
fstrim [baseimage]: Add fstrim service and fstrim timer by default (#2804) 2019-04-21 14:21:16 -07:00
hostcfgd hostcfgd: Handle missed tacacs updates between load & listen (#8223) 2021-08-06 10:38:37 -07:00
hostname Fix to improve hostname handling (#12064) 2023-01-30 18:39:09 +00:00
interfaces [interfaces] Reduce Calls to SONiC Cfggen (#5174) 2020-12-22 09:51:54 -08:00
kubernetes [baseimage]: Install Kubernetes packages if enabled in image (#4374) (#4432) 2020-04-16 21:54:45 -07:00
logrotate [Multi Asic] support of swss.rec and sairedis.rec for multi asic (#6310) 2021-01-27 17:12:32 -08:00
misc Fix to remove the import of APIClient (#5724) 2020-10-27 08:32:37 -07:00
monit Fix monit false alarm issue, which locates in process_checker and it (#16907) 2023-10-26 18:23:24 -07:00
ntp [ntp][apparmor] Allow apparmor read permission for ntpd under rw mount path of rootfs (#6040) 2022-10-16 05:42:35 +00:00
platform [baseimage]: Updates for Ebtables and support for multi-asic (#6542) 2021-01-27 16:59:10 -08:00
procdockerstatsd [201911][procdockerstatsd] fix typo for variable name (#7183) 2021-03-29 19:22:03 -07:00
process-reboot-cause [201911] Add hardware reboot cause when software reboot failed (#11753) 2022-08-25 12:30:53 -07:00
rsyslog Move frr logs from syslog to /var/log/frr/*.log (#5988) 2020-12-22 10:53:16 -08:00
snmp mvrf_avoid_snmp_yml_config: made changes to pass SNMP config from con… (#4057) 2020-02-03 15:38:38 -08:00
sudoers Improve sudo cat command for RO user. (#14428) (#14437) 2023-04-05 15:29:58 -07:00
sysctl Set sock rx Buf size to 3MB. (#5566) 2020-11-24 11:21:56 -08:00
syslog [baseimage]: /host unmount timeout issue during reboot. (#5032) 2020-08-09 10:38:33 -07:00
systemd [services] Restart SwSS service upon unexpected critical process exit (#2845) 2019-05-01 08:02:38 -07:00
topology [multi-asic][vs]: Update topology script to retrieve hwsku from minigraph (#6219) 2021-02-25 18:42:44 -08:00
updategraph [platform] Add Support For Environment Variable File (#5010) 2020-09-28 21:14:39 +00:00
warmboot-finalizer [warm boot finalizer] only wait for enabled components to reconcile (#6454) 2022-03-31 12:01:25 -07:00
watchdog-control Add disabling HW watchdog during boot for fast-reboot and warm-reboot (#4927) 2020-08-09 11:25:31 -07:00