sonic-buildimage

Author	SHA1	Message	Date
Joe LeVeque	d9b8bed916	[caclmgrd] Don't limit connection tracking to TCP (#4796 ) Don't limit iptables connection tracking to TCP protocol; allow connection tracking for all protocols. This allows services like NTP, which is UDP-based, to receive replies from an NTP server even if the port is blocked, as long as it is in reply to a request sent from the device itself.	2020-06-19 04:33:50 +00:00
Ying Xie	4cd54ed58c	[ntp] disable ntp long jump (#4748 ) Found another syncd timing issue related to clock going backwards. To be safe disable the ntp long jump. Signed-off-by: Ying Xie <ying.xie@microsoft.com>	2020-06-11 22:03:22 +00:00
Joe LeVeque	7ae30d7898	[caclmgrd] Get first VLAN host IP address via next() (#4685 ) I found that with IPv4Network types, calling list(ip_ntwrk.hosts()) is reliable. However, when doing the same with an IPv6Network, I found that the conversion to a list can hang indefinitely. This appears to me to be a bug in the ipaddress.IPv6Network implementation. However, I could not find any other reports on the web. This patch changes the behavior to call next() on the ip_ntwrk.hosts() generator instead, which returns the IP address of the first host.	2020-06-09 16:30:45 +00:00
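The change in 7ae30d7898 can be illustrated with a minimal sketch. It is not the caclmgrd code itself, and the helper name `first_host` is hypothetical. A /64 IPv6 prefix contains 2^64 addresses, so materializing `hosts()` into a list is impractical whether or not the library has a bug, while `next()` pulls only the first value.

```python
import ipaddress


def first_host(prefix):
    """Return the first host address of an IPv4 or IPv6 prefix.

    Hypothetical helper for illustration only.
    """
    # strict=False accepts an interface address such as 192.168.0.1/24.
    network = ipaddress.ip_network(prefix, strict=False)
    # hosts() is lazy for most prefix lengths; iter() also covers the
    # cases where it returns a list. next() takes a single element
    # instead of enumerating the whole prefix.
    return next(iter(network.hosts()))


print(first_host("192.168.0.1/24"))
print(first_host("fc02:1000::1/64"))
```

The same call works for both address families, which is why it avoids the IPv6 hang without special-casing.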
Joe LeVeque	494701a0ee	[caclmgrd] Allow more ICMP types (#4625 )	2020-06-09 16:07:51 +00:00
Joe LeVeque	7da0c15af5	[caclmgrd] Ignore keys in interface-related tables if no IP prefix is present (#4581 ) Since the introduction of VRF, interface-related tables in ConfigDB will have multiple entries, one of which only contains the interface name and no IP prefix. Thus, when iterating over the keys in the tables, we need to ignore the entries which do not contain IP prefixes.	2020-06-09 16:05:40 +00:00
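The filtering described in 7da0c15af5 might look like the sketch below. It assumes, purely for illustration, that table keys arrive either as a bare interface name or as an `(interface, prefix)` tuple; the log above does not state the real key representation, and `keys_with_ip_prefix` is a hypothetical name.

```python
def keys_with_ip_prefix(table_keys):
    """Yield only the keys that carry an IP prefix.

    Assumption: a key is either "Ethernet0" (name only) or
    ("Ethernet0", "10.0.0.1/31") (name plus prefix).
    """
    for key in table_keys:
        # Name-only entries have nothing to build a rule from, so skip them.
        if isinstance(key, tuple) and len(key) == 2:
            yield key


keys = ["Loopback0", ("Loopback0", "10.1.0.32/32"), ("Loopback0", "fc00:1::32/128")]
for name, prefix in keys_with_ip_prefix(keys):
    print(name, prefix)
```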
Joe LeVeque	3ee9c5d1e3	[caclmgrd] Add some default ACCEPT rules and lastly drop all incoming packets (#4412 ) Modified caclmgrd behavior to enhance control plane security as follows: Upon starting or receiving notification of ACL table/rule changes in Config DB: 1. Add iptables/ip6tables commands to allow all incoming packets from established TCP sessions or new TCP sessions which are related to established TCP sessions 2. Add iptables/ip6tables commands to allow bidirectional ICMPv4 ping and traceroute 3. Add iptables/ip6tables commands to allow bidirectional ICMPv6 ping and traceroute 4. Add iptables/ip6tables commands to allow all incoming Neighbor Discovery Protocol (NDP) NS/NA/RS/RA messages 5. Add iptables/ip6tables commands to allow all incoming IPv4 DHCP packets 6. Add iptables/ip6tables commands to allow all incoming IPv6 DHCP packets 7. Add iptables/ip6tables commands to allow all incoming BGP traffic 8. Add iptables/ip6tables commands for all ACL rules for recognized services (currently SSH, SNMP, NTP) 9. For all services which we did not find configured ACL rules, add iptables/ip6tables commands to allow all incoming packets for those services (allows the device to accept SSH connections before the device is configured) 10. Add iptables rules to drop all packets destined for loopback interface IP addresses 11. Add iptables rules to drop all packets destined for management interface IP addresses 12. Add iptables rules to drop all packets destined for point-to-point interface IP addresses 13. Add iptables rules to drop all packets destined for our VLAN interface gateway IP addresses 14. Add iptables/ip6tables commands to allow all incoming packets with TTL of 0 or 1 (This allows the device to respond to tools like tcptraceroute) 15. If we found control plane ACLs in the configuration and applied them, we lastly add iptables/ip6tables commands to drop all other incoming packets	2020-06-09 04:21:27 +00:00
Ying Xie	f52e59a032	[ntp] enable/disable NTP long jump according to reboot type (#4582 ) - Enable NTP long jump after cold reboot. - Disable NTP long jump after warm/fast reboot. Signed-off-by: Ying Xie <ying.xie@microsoft.com>	2020-05-12 12:23:47 -07:00
Joe LeVeque	ceb878414d	[process-reboot-cause] If software reboot cause is unknown add note if first boot into new image (#4538 )	2020-05-08 20:37:22 +00:00
SuvarnaMeenakshi	fba321ae6c	[ntp]: Add "tinker panic 0" in ntp.conf to avoid ntpd from panic (#4263 ) - What I did Add configuration to prevent ntpd from panicking and exiting if the drift between the new time and the current system time is large. - How I did it Added "tinker panic 0" in ntp.conf file. - How to verify it [this assumes that there is a valid NTP server IP in config_db/ntp.conf] Change the current system time to a bad time with a large drift from the time on the NTP server; drift should be greater than 1000s. Reboot the device. Before the fix: 3. Upon reboot, ntp-config service comes up fine, ntp service goes to active (exited) state without any error message. This is because the offset between the new time (from the NTP server) and the current system time is very large, so ntpd goes into panic mode and exits. The system continues to show the bad time. After the fix: 3. Upon reboot, ntp-config comes up fine, ntp service comes up fine and stays in active (running) state. The system clock gets synced with the NTP server time.	2020-04-03 19:42:17 +00:00
Joe LeVeque	cbf7c7d80d	[rsyslog] Suppress duplicate messages from base image and all Docker containers (#2497 )	2020-04-02 21:42:01 +00:00
yozhao101	1cc6141a93	[Monit] Delay start of monitoring for 5 minutes (#4281 )	2020-03-19 22:49:04 +00:00
zhenggen-xu	19c1ad36a5	[201811] [interfaces-config.sh] Flush the loopback interface addresses (#4234 ) * [interfaces-config.sh] Flush the loopback interface before configure it Without this, you may end up with more and more ip addresses on loopback interface after you change the loopback ip and do config reload Signed-off-by: Zhenggen Xu <zxu@linkedin.com>	2020-03-09 16:14:59 -07:00
Stephen Sun	726fecaf8b	[process-reboot-cause] Clean up the process-reboot-cause as required in issue 3927 (#4128 )	2020-02-14 19:37:30 +00:00
Joe LeVeque	4af3e5066d	[interfaces-config.sh] Force lo interface down (#4149 ) Force "lo" interface down in interfaces-config.sh to prevent interface-config.service from failing with the following error: ``` -- The result is failed. systemd[1]: networking.service: Unit entered failed state. systemd[1]: networking.service: Failed with result 'exit-code'. interfaces-config.sh[29232]: Job for networking.service failed because the control process exited with error code. interfaces-config.sh[29232]: See "systemctl status networking.service" and "journalctl -xe" for details. interfaces-config.sh[29232]: ifdown: interface lo not configured interfaces-config.sh[29232]: RTNETLINK answers: File exists interfaces-config.sh[29232]: ifup: failed to bring up lo systemd[1]: interfaces-config.service: Main process exited, code=exited, status=1/FAILURE systemd[1]: Failed to start Update interfaces configuration. -- Subject: Unit interfaces-config.service has failed ``` Failure to bring down the interface will result in a failure to subsequently bring the interface back up.	2020-02-13 22:38:21 -08:00
Joe LeVeque	2e43e6bc6c	[caclmgrd] Fix application of IPv6 service ACL rules (part 2) (#4036 )	2020-01-18 01:44:42 +00:00
Sujin Kang	956b8fd7c7	[reboot cause]: Delay process-reboot-cause service until network connection is stable (#4003 )	2020-01-11 01:09:08 +00:00
yozhao101	27a2e0692b	[Monit] Change the monitoring period from 120 seconds to 60 seconds. (#3974 ) * [Monit] Change the monitoring period of monit from 120 seconds to 60 seconds and also at the same time double the interval for existing sonic monit config file in host. Signed-off-by: Yong Zhao <yozhao@microsoft.com>	2020-01-11 01:01:34 +00:00
Joe LeVeque	640023ec57	[caclmgrd] Fix application of IPv6 service ACL rules (#3917 )	2020-01-06 21:04:52 +00:00
Renuka Manavalan	da7db51259	corefile uploader: Updates per review comments offline (#3915 ) * Updates per review comments 1) core_uploader service waits for syslog.service 2) core_uploader service enabled for restart on failure 3) Use mtime instead of file size + ample time to be robust. * Avoid reloading already uploaded file, by marking the names with a prefix. * Updated failing path. 1) If rc file is missing or required data missing, it periodically logs error in forever loop. 2) If upload fails, retry every hour with a error log, forever. * Fix few bugs * The binary update_json.py will come from sonic-utilities.	2020-01-06 21:03:40 +00:00
Renuka Manavalan	6db0c76a06	Corefile uploader service (#3887 ) * Corefile uploader service 1) A service is added to watch /var/core and upload to Azure storage 2) The service is disabled on boot. One may enable explicitly. 3) The .rc file to be updated with acct credentials and http proxy to use. 4) If service is enabled with no credentials, it would sleep, with periodic log messages 5) For any update in .rc, the service has to be restarted to take effect. * Remove rw permission for .rc file for group & others. * Changes per review comments. Re-ordered .rc file per JSON.dump order. Added a script to enable partial update of .rc, which HWProxy would use to add acct key. * Azure storage upload requires python module futures, hence added it to install list. * Removed trailing spaces. * A mistake in name corrected. Copy the .rc updater script to /usr/bin.	2020-01-06 21:02:14 +00:00
Joe LeVeque	9ee8eba77c	[monit] Build from source and patch to use MemAvailable value if available on system (#3875 )	2020-01-06 20:59:32 +00:00
Stephen Sun	49869aa6fa	[process-reboot-cause]Address the issue: Incorrect reboot cause returned when warm reboot follows a hardware caused reboot (#3880 ) * [process-reboot-cause]Address the issue: Incorrect reboot cause returned when warm reboot follows a hardware caused reboot 1. check whether /proc/cmdline indicates warm/fast reboot. if yes the software reboot cause file will be treated as the reboot cause. finish 2. check whether platform api returns a reboot cause. if yes it is treated as the reboot cause. finish. 3. check whether /hosts/reboot-cause contains a cause. if yes it is treated as the cause otherwise return unknown. * [process-reboot-cause]Fix review comments * [process-reboot-cause]address comments 1. use "with" statement 2. update fast/warm reboot BOOT_ARG * [process-reboot-cause]address comments * refactor the code flow * Remove escape * Remove extra ':'	2019-12-14 17:44:02 +00:00
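The three-step decision order listed in 49869aa6fa can be sketched as a pure function. This is an illustration under stated assumptions, not the script's actual code: the boot-argument markers in `ASSUMED_BOOT_ARGS` are hypothetical placeholders (the log does not give the real tokens), and the function name and parameters are invented for the example.

```python
UNKNOWN = "Unknown"

# Hypothetical kernel command-line markers for warm/fast reboot.
# The real BOOT_ARG values are not given in the commit log.
ASSUMED_BOOT_ARGS = ("SONIC_BOOT_TYPE=warm", "SONIC_BOOT_TYPE=fast")


def pick_reboot_cause(cmdline, software_cause, platform_cause):
    """Choose a reboot cause following the order in the commit message.

    cmdline:        contents of /proc/cmdline
    software_cause: text of the software reboot-cause file, or None
    platform_cause: cause reported by the platform API, or None
    """
    # 1. Warm/fast reboot: the software cause file wins outright.
    if any(arg in cmdline for arg in ASSUMED_BOOT_ARGS):
        return software_cause or UNKNOWN
    # 2. Otherwise prefer a hardware cause reported by the platform API.
    if platform_cause:
        return platform_cause
    # 3. Fall back to the software cause file, else unknown.
    return software_cause or UNKNOWN
```

Checking the command line first is what stops a stale hardware cause from being reported after a later warm reboot.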
Sujin Kang	0510fc7258	Correct the watch-control service to call the right script (#3906 ) * Correct the watch-control service to call the right script * make watchdog-control.sh executable (chmod +x)	2019-12-14 09:42:36 -08:00
Ying Xie	ca1c5bc0c4	[hostcfgd] avoid in place editing config file contents (#3904 ) In-place editing (sed -i) seems to have some issues with filesystem interaction. It could leave a zero-size or corrupted file behind. It would be safer to sed the file contents into a new file and then swap the new file in for the old one. Signed-off-by: Ying Xie <ying.xie@microsoft.com>	2019-12-14 03:27:39 +00:00
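The write-then-swap approach from ca1c5bc0c4 can be sketched in Python as follows. The commit itself uses sed and a file move; this sketch swaps in `os.replace` to show the same idea, and `rewrite_file` is a hypothetical helper name.

```python
import os
import re
import shutil
import tempfile


def rewrite_file(path, pattern, replacement):
    """Apply a regex substitution to a file without editing it in place."""
    with open(path) as src:
        new_contents = re.sub(pattern, replacement, src.read(), flags=re.MULTILINE)

    # Create the temporary file in the same directory so the final rename
    # stays on one filesystem.
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory)
    try:
        with os.fdopen(fd, "w") as tmp:
            tmp.write(new_contents)
            tmp.flush()
            os.fsync(tmp.fileno())  # push the data to disk before swapping
        shutil.copymode(path, tmp_path)  # mkstemp creates the file as 0600
        os.replace(tmp_path, path)  # readers see the old or the new file
    except BaseException:
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise
```

If the process dies before the final rename, the original file is still intact, which is the failure mode the commit is trying to guarantee.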
Sujin Kang	aea18165a8	Add watchdog-control service to disable watchdog during bootup (#3877 ) * Add watchdog-control service to disable watchdog during bootup Disable only if it's applicable and the watchdog is enabled. * Address the review comment * Correct the watchdog start script name * Change to call common watchdog api instead of platform specific * Start watchdog control service after swss starts * advance sonic-utility submodule	2019-12-13 12:44:11 -08:00
Neetha John	6d23e4c8d7	[pfcwd]: Do not start pfc watchdog on Management Tor (#3719 ) Signed-off-by: Neetha John <nejo@microsoft.com>	2019-11-07 21:41:32 +00:00
lguohan	9167f9da46	[aboot]: preserve snmp.yml and acl.json for eos to sonic fast reboot (#3716 )	2019-11-07 21:40:20 +00:00
Ying Xie	f764a167ac	[hostname-config] improve hostname-config process (#3676 ) We noticed in tests/production that there is a low-probability failure where /etc/hosts could have some garbage characters before the entry for the local host name. The consequence is that all sudo commands would be very slow. In extreme cases it would prevent some services from starting properly. I suspect that the /etc/hosts file might be opened by some process, causing the issue. Editing the contents in a new file and then replacing the whole file should be safer. Signed-off-by: Ying Xie <ying.xie@microsoft.com>	2019-10-29 15:42:23 +00:00
Prabhu Sreenivasan	ff137a8e56	[baseimage]: Avoid removing localhost entry from /etc/hosts file (#2452 ) - What I did This fix removes the possibility of the 'localhost' entry getting removed from the /etc/hosts file by the hostname-config service. Without this change, whenever we change the hostname from 'localhost' to any other name in config_db.json and reload the config, the /etc/hosts file will only have the new hostname in it. But there are multiple sonic utilities (e.g. swssconfig) which rely on the hard-coded 'localhost' name, and they tend to stop working. - How I did it Added a new check to the hostname-config.sh script to avoid blindly deleting the line containing the old hostname from the /etc/hosts file. Now it will delete the old hostname only if it is not localhost or when the hostname is not changing. - How to verify it Bring up SONiC on a device with hostname as localhost Edit /etc/sonic/config_db.json to update the 'hostname' field under DEVICE_METADATA from "hostname" : "localhost" --> "hostname" : "sonic" Run config reload -y to reflect the hostname change made in the config_db.json file. cat /etc/hosts and check whether both the 127.0.0.1 localhost and 127.0.0.1 sonic entries are present in the file. ping localhost should work fine. - Description for the changelog Make hostname-config service more robust in handling SONiC hostname change from localhost to anything else.	2019-10-29 15:42:04 +00:00
Danny Allen	818ab7fdaa	[core_cleanup] Fix issue where core_cleanup job runs too frequently (#3659 ) Signed-off-by: Danny Allen <daall@microsoft.com>	2019-10-24 17:04:16 +00:00
Ying Xie	c7a096b6b9	[201811][ntp] removed undefined filter (#3594 ) pfx_filter is not defined in 201811 branch. Signed-off-by: Ying Xie <ying.xie@microsoft.com>	2019-10-11 19:46:14 -07:00
pavel-shirshov	53ec9124bc	[ntp]: Use loopback address when we don't have MGMT interface (#3566 ) Added configuration to use Loopback ip if a switch doesn't have MGMT_PORT.	2019-10-07 16:56:00 +00:00
Ying Xie	37b78826ee	[updategraph] enhance update graph handling (#3549 ) - after reloading minigraph, write latest version string in the DB. - if old config_db.json file exists, use it and migrate to latest version. - only reload minigraph when config_db.json doesn't exist and minigraph exists. Signed-off-by: Ying Xie <ying.xie@microsoft.com>	2019-10-02 21:04:39 +00:00
Ying Xie	e4f8a3946c	[first boot] sync file system after moving/copying files (#3550 ) Signed-off-by: Ying Xie <ying.xie@microsoft.com>	2019-10-02 21:04:39 +00:00
Prince Sunny	4ef5ce74e4	Install Iptables rules to set TCPMSS for 'lo' interface (#3452 ) * Install Iptables rules to set TCPMSS for lo interface * Moved implementation to hostcfgd to maintain at one place	2019-09-19 01:08:44 +00:00
Danny Allen	ba77de12ac	[cron.d] Add cron job to periodically clean-up core files (#3449 ) * [cron.d] Create cron job to periodically clean-up core files * Create script to scan /var/core and clean-up older core files * Create cron job to run clean-up script Signed-off-by: Danny Allen <daall@microsoft.com> * Update interval for running cron job * Respond to feedback * Change syslog id	2019-09-13 17:52:10 +00:00
lguohan	87cb1e307e	[baseimage]: fix monit configuration (#3448 ) - monit config was broken by a monit upgrade - abandon the sed approach since it is susceptible to monit config changes - use unixsocket instead of httpd due to a bug in 5.20.0	2019-09-13 06:08:30 +00:00
sridhar-ravindran	d4758afdde	[DELL] S6100 Add PowerCycle Support for Last Reset Reason (#3402 ) * [DELL] S6100 Add PowerCycle Support for Last Reset Reason * handle first time boot properly * S6000 Last Reboot Reason Fix	2019-09-09 22:33:32 -07:00
Joe LeVeque	aee7d86fc9	[201811] Log message containing SONiC version to syslog at boot (#3417 )	2019-09-08 12:33:08 -07:00
Ying Xie	2b8eca5ebb	[control plane assistant] stop control plane assistant after warm reboot (#3337 ) Delay saving configuration so that the control assistant configurations won't be persisted. Signed-off-by: Ying Xie <ying.xie@microsoft.com>	2019-08-15 20:28:42 +00:00
Renuka Manavalan	b80d60c277	Fix to ensure that tacacs servers are ordered (reverse) by priority in pam.d's config. (#3322 ) Present: Servers are listed in the same order as in redis-db Fix: Save the sort o/p, hence use sorted list to write into pam.d's conf. As well convert priority to integer for use by sort.	2019-08-14 21:20:01 +00:00
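The ordering fix in b80d60c277 can be illustrated with a short sketch. The dictionary layout and the name `order_servers` are assumptions for the example, not the actual hostcfgd data structures. It shows the two points the commit mentions: keep the sorted result, and compare priorities as integers rather than strings.

```python
def order_servers(servers):
    """Return (address, config) pairs sorted by priority, highest first.

    Assumption: `servers` maps an address to a dict whose "priority"
    value is stored as a string, as values read from a database often are.
    """
    # sorted() returns a new list; the result must be kept and used.
    # int() avoids string ordering, where "10" would sort before "9".
    return sorted(
        servers.items(),
        key=lambda item: int(item[1]["priority"]),
        reverse=True,
    )


servers = {
    "10.0.0.1": {"priority": "9"},
    "10.0.0.2": {"priority": "10"},
    "10.0.0.3": {"priority": "1"},
}
for address, config in order_servers(servers):
    print(address, config["priority"])
```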
Joe LeVeque	da57e8db36	Revert back to 'import sonic_platform' (#3249 )	2019-07-31 16:44:17 -07:00
Joe LeVeque	29bbd86862	[services] Restart SwSS service upon unexpected critical process exit (#2845 ) (#2852 )	2019-07-29 18:10:26 -07:00
Ying Xie	7cf90ec441	[warm reboot] save configuration after warm reboot (#3200 ) * [warm reboot] save configuration after warm reboot After warm reboot, save a copy of in memory database to config_db.json, upgrade procedure might have removed config_db.json to force new image to reload minigraph. However, reload minigraph is skipped during warm reboot. Missing config_db.json would cause device to fault in next non-upgrading cold/fast reboot. Signed-off-by: Ying Xie <ying.xie@microsoft.com> * Update finalize-warmboot.sh	2019-07-24 17:45:07 +00:00
Stephen Sun	7a9d04ee73	[Mellanox] Backporting reboot cause to 201811 (#3198 ) * backport new platform api to 201811, reboot cause part * install new platform api on host * 1. remove chassis's dependency on sonic_platform_daemon. 2. add some mellanox-specific hardware reboot causes. 3. fix typo in files/image_config/process-reboot-cause/process-reboot-cause. * 1. add dependency of sonic_platform for base image 2. handle the case of reboot cause file not found * adjust log message.	2019-07-23 07:05:35 -07:00
zzhiyuan	0869fd3925	[baseimage]: Fix process-reboot-cause possibly throwing OSError (#3159 ) In case of going from previous iteration of SONiC, and the last reboot was hardware, REBOOT_CAUSE_FILE may not be present and the service may throw an error.	2019-07-16 21:38:46 +00:00
Joe LeVeque	c3932e501b	[process-reboot-cause] Handle case if platform does not yet have sonic_platform implementation (#3126 )	2019-07-10 23:06:43 +00:00
Joe LeVeque	1115c8431d	[reboot-cause]: Move reboot cause processing to its own service, 'process-reboot-cause' (#3102 )	2019-07-10 23:02:57 +00:00
Joe LeVeque	02fc1306b0	[baseimage]: Increase TMOUT for serial port connections to 15 minutes (#3032 ) Increase TMOUT value in order to close inactive serial console connections after 900 seconds (15 minutes) of inactivity	2019-06-19 19:07:36 +00:00
Joe LeVeque	8ae67c4c5d	[logrotate] Enhance robustness (#2942 ) * [logrotate] Decrease frequency to every 10 minutes; kill any lingering logrotate processes * [logrotate] Delete all *.1.gz files as firstaction; Remove note about init-system-helpers < 1.47 workaround However, continue to send SIGHUP directly to rsyslogd process because 'service rsyslog rotate' still doesn't work properly with init-system-helpers version 1.48	2019-05-29 00:53:13 +00:00

200 Commits