sonic-buildimage

Author	SHA1	Message	Date
Kebo Liu	2b568ec136	Add with_i2cdev for mst start to have I2C device loaded properly (#4790 )	2020-06-21 16:27:05 +03:00
Joe LeVeque	4d2d95e8e6	[hostcfgd] Synchronize all feature statuses once upon start (#4714 ) - Ensure all features (services) are in the configured state when hostcfgd starts - Better functionalization of code - Also replace calls to deprecated `has_key()` method in `tacacs_server_handler()` and `tacacs_global_handler()` with `in` keyword. This PR depends on https://github.com/Azure/sonic-utilities/pull/944, otherwise `config load_minigraph` will fail when trying to restart disabled services.	2020-06-20 12:09:29 -07:00
padmanarayana	95e3cda5da	[DELL]: FTOS to SONiC fast conversion fixes (#4807 ) While migrating to SONiC 20181130, identified a couple of issues: 1. union-mount needs /host/machine.conf parameters for vendor specific checks : however, in case of migration, the /host/machine.conf is extracted from ONIE only in https://github.com/Azure/sonic-buildimage/blob/master/files/image_config/platform/rc.local#L127. 2. Since grub.cfg is updated to have net.ifnames=0 biosdevname=0, 70-persistent-net.rules changes are no longer required.	2020-06-19 11:02:08 -07:00
Joe LeVeque	1f8a78cef1	[build] No longer install Python 'click-default-group' package (#4811 ) All dependencies upon the Python 'click-default-group' package have been removed from sonic-utilities as of https://github.com/Azure/sonic-utilities/pull/903. The submodule was updated to include this patch as of https://github.com/Azure/sonic-buildimage/pull/4601, therefore we no longer need to install this package in the SONiC image.	2020-06-19 10:54:10 -07:00
Joe LeVeque	6960477cc2	[caclmgrd] Don't limit connection tracking to TCP (#4796 ) Don't limit iptables connection tracking to TCP protocol; allow connection tracking for all protocols. This allows services like NTP, which is UDP-based, to receive replies from an NTP server even if the port is blocked, as long as it is in reply to a request sent from the device itself.	2020-06-18 00:18:20 -07:00
abdosi	30d7ce0004	[build] Ensure /usr/lib/systemd/system/ directory exists before referencing (#4788 ) * Fix the Build on 201911 (Stretch) where the directory /usr/lib/systemd/system/ does not exist so creating manually. Change should not harm Master (buster) where the directory is created by Linux * Fix as per review comments	2020-06-17 09:16:58 -07:00
xumia	76a395cdbf	[secure boot] Support rw files allowlist (#4585 ) * Support rw files allowlist for Sonic Secure Boot * Improve the performance * fix bug * Move the config description into a md file * Change to use a simple way to remove the blank line * Support chmod a-x in rw folder * Change function name * Change some unnecessary words	2020-06-13 00:10:13 -07:00
Renuka Manavalan	edeb40ffcf	[k8s]: switching to Flannel from Calico. (#4768 ) Switching to Flannel from Calico which brings down the image size by around 500+MB.	2020-06-12 18:06:08 -07:00
Joe LeVeque	4e482c16ba	[build] Enable telemetry service by default (#4760 ) - Why I did it To ensure telemetry service is enabled by default after installing a fresh SONiC image - How I did it Set telemetry feature status to "enabled" when generating init_cfg.json file	2020-06-12 16:20:31 -07:00
Ying Xie	ae7bf3db52	[ntp] disable ntp long jump (#4748 ) Found another syncd timing issue related to clock going backwards. To be safe disable the ntp long jump. Signed-off-by: Ying Xie <ying.xie@microsoft.com>	2020-06-11 13:01:21 -07:00
yozhao101	4ea2e5e6dc	[docker-syncd] Add timeout to force stop syncd container (#4617 ) - Why I did it When I tested the auto-restart feature of the swss container by manually killing one of its critical processes, swss was stopped. The syncd container, as its peer container, should then also be stopped. However, I found that syncd sometimes stopped and sometimes did not. When it did not, the process (/usr/local/bin/syncd.sh stop) executing the stop() function was stuck between lines 164–167. Systemd waits for 90 seconds and then kills this process. 164 # wait until syncd quit gracefully 165 while docker top syncd$DEV | grep -q /usr/bin/syncd; do 166 sleep 0.1 167 done The first thing I did was to profile how long this while loop spins when the syncd container stops normally after the swss container is stopped. The result is 5 or 6 seconds. If the syncd container stops normally, two messages are written to syslog: str-a7050-acs-3 NOTICE syncd#dsserve: child /usr/bin/syncd exited status: 134 str-a7050-acs-3 INFO syncd#supervisord: syncd [5] child /usr/bin/syncd exited status: 134 The second thing I did was to add a timer to the condition of the while loop so that the loop is forced to exit after 20 seconds. After that, testing showed that the syncd container stops normally if swss is stopped first. One more thing to mention: if the syncd container stops within 5 or 6 seconds, the two log messages still appear in syslog. However, if the while loop runs longer than 20 seconds and is forced to exit, the syncd container is stopped but I did not see these two messages in syslog. Further, although I observed that the auto-restart feature of the swss container works correctly now, I cannot be sure that the issue of the syncd container failing to stop will not occur in the future. 
- How I did it I added a timer around the while loop in the stop() function. This while loop will exit after spinning for 20 seconds. Signed-off-by: Yong Zhao <yozhao@microsoft.com>	2020-06-04 15:17:28 -07:00
Joe LeVeque	7b8037770d	[caclmgrd] Get first VLAN host IP address via next() (#4685 ) I found that with IPv4Network types, calling list(ip_ntwrk.hosts()) is reliable. However, when doing the same with an IPv6Network, I found that the conversion to a list can hang indefinitely. This appears to me to be a bug in the ipaddress.IPv6Network implementation. However, I could not find any other reports on the web. This patch changes the behavior to call next() on the ip_ntwrk.hosts() generator instead, which returns the IP address of the first host.	2020-06-02 02:11:21 -07:00
Joe LeVeque	eff8a89523	[hostcfgd] Get service enable/disable feature working (#4676 ) Fix hostcfgd so that changes to the "FEATURE" table in ConfigDB are properly handled. Three changes here: 1. Fix indenting such that the handling of each key actually occurs in the for key in status_data.keys(): loop 2. Add calls to sudo systemctl mask and sudo systemctl unmask as appropriate to ensure changes persist across reboots 3. Substitute returns with continues so that even if one service fails, we still try to handle the others Note that the masking is persistent, even if the configuration is not saved. We may want to consider only calling systemctl enable/disable in hostcfgd when the DB table changes, and only call systemctl mask/unmask upon calling config save.	2020-06-02 02:07:22 -07:00
Joe LeVeque	1e369b0998	[systemd] Relocate all SONiC unit files to /usr/lib/systemd/system (#4673 ) This will allow us to disable services and have it persist across reboots by using the `systemctl mask` operation	2020-05-30 13:46:44 -07:00
Qi Luo	65e7a84509	[baseimage]: Build and install redis-dump-load Python 3 package in host image (#4661 ) Fix #4656	2020-05-30 05:52:27 -07:00
Samuel Angebault	d35a8a3800	[arista]: Add SmartsvilleDDBK and SmartsvilleBkMs (#4662 ) Co-authored-by: Boyang Yu <byu@arista.com>	2020-05-28 14:59:00 -07:00
taocy	4cd36175ce	arm arch: 1. install required libraries; 2. umount /proc after dockerfs.	2020-05-25 13:15:19 +00:00
taocy	ea2dd9541d	change image apt source list from stretch to buster for arm	2020-05-25 13:15:19 +00:00
Praveen Chaudhary	0ccdd70671	[sonic-yang-mgmt]: sonic-yang-mgmt package for configuration validation. (#3861 ) - What I did #### wheel package Makefiles - wheel package Makefiles for sonic-yang-mgmt package. #### libyang Python APIs: - python APIs based on libyang - functions to load/merge yang models and Yang data files - function to validate data trees based on Yang models - functions to merge yang data files/trees - add/set/delete node in schema and data trees - find data/schema nodes from xpath from the Yang data/schema tree in memory - find dependencies - dump the data tree in json/xml #### Extension of libyang Python APIs: -- Cropping input config based on Yang Model. -- Translate input config based on Yang Model. -- rev Translate input config based on Yang Model. -- Find xpath of port, portleaf and a yang list. -- Find if node is key of a list while deletion if yes, then delete the parent. Signed-off-by: Praveen Chaudhary pchaudhary@linkedin.com Signed-off-by: Ping Mao pmao@linkedin.com	2020-05-21 16:27:57 -07:00
simonJi2018	0b6253baa1	[platform/nephos] Optimize the code to reduce changes due to the kernel upgrade (#4332 ) - bug fix : Fixed an issue in which the nps ko file was not loaded due to the wrong service file name - Optimize the code to reduce changes due to the kernel upgrade - Remove nephos ko file loading from swss.service.j2 because it is already loaded in syncd.service.j2	2020-05-21 02:21:07 -07:00
anand-kumar-subramanian	34586032dc	[mgmt-framework] removed requires dependency on swss (#4548 ) fixes #4473	2020-05-20 20:47:09 -07:00
Joe LeVeque	bce42a7595	[caclmgrd] Allow more ICMP types (#4625 )	2020-05-20 17:45:07 -07:00
abdosi	a44fc07e78	Changes to support config-setup service for multi-npu (#4609 ) * Changes to support config-setup service for multi-npu platforms. For multi-npu, config initialization and ZTP are not supported as of now. It will support creating config db from minigraph or using config db from the previous file system Signed-off-by: Abhishek Dosi <abdosi@microsoft.com> * Address Review Comments. * Address Review comments * Address Review Comments of using python based config load_minigraph/ config save/config reload from shell scripts so that we don't duplicate code. Also while running from shell we will skip stop/start services done by those commands. * Updated to use python command so no code duplication.	2020-05-20 16:32:33 -07:00
rkdevi27	32f58b5864	Fix "/host unmount failure" during reboot (#4558 )	2020-05-20 11:18:11 -07:00
Ying Xie	cdfb1ced44	[ntp] enable/disable NTP long jump according to reboot type (#4577 ) * [ntp] enable/disable NTP long jump according to reboot type - Enable NTP long jump after cold reboot. - Disable NTP long jump after warm/fast reboot. Signed-off-by: Ying Xie <ying.xie@microsoft.com> * fix typo * further refactoring * use sonic-db-cli instead	2020-05-20 10:57:21 -07:00
rajendra-dendukuri	9c7105b5f3	Install swsssdk-py3 in the base Debian image for python3 based apps (#4542 ) Signed-off-by: Rajendra Dendukuri <rajendra.dendukuri@broadcom.com>	2020-05-19 11:15:05 -07:00
Joe LeVeque	5150e7b655	[caclmgrd] Ignore keys in interface-related tables if no IP prefix is present (#4581 ) Since the introduction of VRF, interface-related tables in ConfigDB will have multiple entries, one of which only contains the interface name and no IP prefix. Thus, when iterating over the keys in the tables, we need to ignore the entries which do not contain IP prefixes.	2020-05-12 18:16:55 -07:00
abdosi	5fe2216ea3	Fix for issue where image is compiled with flag ENABLE_DHCP_GRAPH_SERVICE (#4573 ) and then we load the image and reboot: even if there was an existing config_db.json, we will look for the DHCP service. We should disable update_graph in such cases. This behaviour is similar to what we have in the 201811 image.	2020-05-12 14:49:56 -07:00
lguohan	1066f238ba	[baseimage]: pin down package version for azure-storage, watchdog and futures (#4575 ) Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-05-11 23:17:47 -07:00
Joe LeVeque	5e8e0d76fc	[caclmgrd] Add some default ACCEPT rules and lastly drop all incoming packets (#4412 ) Modified caclmgrd behavior to enhance control plane security as follows: Upon starting or receiving notification of ACL table/rule changes in Config DB: 1. Add iptables/ip6tables commands to allow all incoming packets from established TCP sessions or new TCP sessions which are related to established TCP sessions 2. Add iptables/ip6tables commands to allow bidirectional ICMPv4 ping and traceroute 3. Add iptables/ip6tables commands to allow bidirectional ICMPv6 ping and traceroute 4. Add iptables/ip6tables commands to allow all incoming Neighbor Discovery Protocol (NDP) NS/NA/RS/RA messages 5. Add iptables/ip6tables commands to allow all incoming IPv4 DHCP packets 6. Add iptables/ip6tables commands to allow all incoming IPv6 DHCP packets 7. Add iptables/ip6tables commands to allow all incoming BGP traffic 8. Add iptables/ip6tables commands for all ACL rules for recognized services (currently SSH, SNMP, NTP) 9. For all services which we did not find configured ACL rules, add iptables/ip6tables commands to allow all incoming packets for those services (allows the device to accept SSH connections before the device is configured) 10. Add iptables rules to drop all packets destined for loopback interface IP addresses 11. Add iptables rules to drop all packets destined for management interface IP addresses 12. Add iptables rules to drop all packets destined for point-to-point interface IP addresses 13. Add iptables rules to drop all packets destined for our VLAN interface gateway IP addresses 14. Add iptables/ip6tables commands to allow all incoming packets with TTL of 0 or 1 (This allows the device to respond to tools like tcptraceroute) 15. If we found control plane ACLs in the configuration and applied them, we lastly add iptables/ip6tables commands to drop all other incoming packets	2020-05-11 12:36:47 -07:00
abdosi	a96f9ecee9	Changes for LLDP docker to support multi-npu platforms (#4530 ) * Changes for LLDP for Multi NPU Platforms:- a) Enable LLDP for Host namespace for Management Port b) Make sure Management IP is available in per asic namespace needed for LLDP Chassis configuration c) Make sure chassis mac-address is correct in per asic namespace d) Do not run lldp on eth0 of per asic namespace and avoid chassis configuration for same e) Use Linux hostname instead of the one from Device Metadata for lldp chassis configuration since in multi-npu platforms device metadata hostname will be different Signed-off-by: Abhishek Dosi <abdosi@microsoft.com> * Address Review Comment with following changes: a) Use Device Metadata hostname even in per namespace container. Updated minigraph parsing for same to have hostname as system hostname and add new key for asic name b) Minigraph changes to have MGMT_INTERFACE Key in per asic/namespace config also as needed for LLDP for setting chassis management IP. Signed-off-by: Abhishek Dosi <abdosi@microsoft.com> * Address Review Comments	2020-05-11 11:05:44 -07:00
Neetha John	286aa35ac6	[qos]: Alpha and ECN settings change for Th (#4564 ) Dynamic threshold setting changed to 0 and WRED profile green min threshold set to 250000 for Tomahawk devices Changed the dynamic threshold settings in pg_profile_lookup.ini Added a macro for WRED profiles in qos.json.j2 for Tomahawk devices Necessary changes made in qos.config.j2 to use the macro if present Signed-off-by: Neetha John <nejo@microsoft.com>	2020-05-09 11:21:18 -07:00
judyjoseph	acf465b43b	Multi DB with namespace support, Introducing the database_global.json… (#4477 ) * Multi DB with namespace support, Introducing the database_global.json file for supporting accessing DB's in other namespaces for service running in linux host * Updates based on comments * Adding the j2 templates for database_config and database_global files. * Updating to retrieve the redis DIR's to be mounted from database_global.json file. * Additional check to see if asic.conf file exists before sourcing it. * Updates based on PR comments discussion. * Review comments update * Updates to the argument "-n" for namespace used in both context of parsing minigraph and multi DB access. * Update with the attribute "persistence_for_warm_boot" that was added to database_config.json file earlier. * Removing the database_config.json file to avoid confusion in future. We use the database_config.json.j2 file to generate database_config.json files dynamically. * Update the comments for sudo usage in docker_image_ctrl.j2 * Update with the new logic in PING PONG tests using sonic-db-cli. With this we wait till the PONG response is received when redis server is up. * Similar changes in swss and syncd scripts for the PING tests with sonic-db-cli * Updated with a missing , in the database_config.json.j2 file, Do pip install of j2cli in docker-base-buster.	2020-05-08 21:24:05 -07:00
Akhilesh Samineni	86627dfd35	[NAT] : Removed requires dependency on swss (#4551 ) Signed-off-by: Akhilesh Samineni <akhilesh.samineni@broadcom.com>	2020-05-08 00:01:48 -07:00
Joe LeVeque	dfdd94d8ad	[process-reboot-cause] If software reboot cause is unknown add note if first boot into new image (#4538 )	2020-05-06 22:48:33 -07:00
wangshengjun	bed4a799df	[ebtables]add the filter rule for ARP packets with vlan tag: (#3945 ) 1. ebtables -t filter -A FORWARD -p 802_1Q --vlan-encap 0806 -j DROP The ARP packet with vlan tag can't match the default rule. Signed-off-by: wangshengjun <wangshengjun@asterfusion.com>	2020-05-06 20:03:09 -07:00
Dong Zhang	340cf826a6	[MultiDB] use sonic-db-cli PING and fix wrong multiDB API in NAT (#4541 )	2020-05-06 15:41:28 -07:00
rkdevi27	4511216789	Ssd mitigation changes (#4214 ) * ssd_mitigation_changes * ssd_mitigation_changes * ssd_mitigation_changes * ssd_mitigation_changes	2020-04-30 22:58:09 -07:00
lguohan	86bc8aec5f	[vs]: dynamically create front panel ports in vs docker (#4499 ) Currently, vs docker always creates 32 front panel ports. With this change, when vs docker starts, it first detects the peer links in the namespace and then sets up as many front panel interfaces as there are peer links. Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-04-30 12:50:59 -07:00
Olivier Singla	799f22d4c7	[baseimage]: Run fsck filesystem check support prior mounting filesystem (#4431 ) * Run fsck filesystem check support prior mounting filesystem If the filesystem becomes non clean ("dirty"), SONiC does not run fsck to repair it and mark it as clean again. This patch adds the functionality to run fsck on each boot, prior to the filesystem being mounted. This allows the filesystem to be repaired if needed. Note that if the filesystem is marked as clean, fsck does nothing and simply returns, so it is perfectly fine to call fsck every time prior to mounting the filesystem. How to verify this patch (using bash): Using an image without this patch: Make the filesystem "dirty" (not clean) [we are making the assumption that filesystem is stored in /dev/sda3 - Please adjust depending on the platform] [do this only on a test platform!] dd if=/dev/sda3 of=superblock bs=1 count=2048 printf "$(printf '\\x%02X' 2)" | dd of="superblock" bs=1 seek=1082 count=1 conv=notrunc &> /dev/null dd of=/dev/sda3 if=superblock bs=1 count=2048 Verify that filesystem is not clean tune2fs -l /dev/sda3 | grep "Filesystem state:" reboot and verify that the filesystem is still not clean Redo the same test with an image with this patch, and verify that at next reboot the filesystem is repaired and becomes clean. fsck log is stored in syslog, using the string FSCK as a marker.	2020-04-30 00:33:20 -07:00
pavel-shirshov	057ced0391	[bgpcfgd]: Split one bgp mega-template to chunks. (#4143 ) The one big bgp configuration template was split into chunks. Currently we have three types of bgp neighbor peers: general bgp peers. They are represented by CONFIG_DB::BGP_NEIGHBOR table entries dynamic bgp peers. They are represented by CONFIG_DB::BGP_PEER_RANGE table entries monitors bgp peers. They are represented by CONFIG_DB::BGP_MONITORS table entries This PR introduces three templates for each peer type: bgp policies: represent policies that will be applied to the bgp peer-group (ip prefix-lists, route-maps, etc) bgp peer-group: represent bgp peer group which has common configuration for the bgp peer type and uses bgp routing policy from the previous item bgp peer-group instance: represent bgp configuration, which will be used to instantiate a bgp peer-group for the bgp peer-type. Usually this one is simple and consists of a reference to the bgp peer-group, the bgp peer description and the bgp peer ip address. This PR redefines the constant.yml file. Now this file has a setting for whether or not to use bgp_neighbor metadata. This file has more parameters for now, which are not used. They will be used in the next iteration of bgpcfgd. Currently all tests have been disabled. I'm going to create the next PR with the tests right after this PR is merged. I'm going to introduce a better bgpcfgd in a short time. It will include support for dynamic changes to the templates. FIX:: #4231	2020-04-23 09:42:22 -07:00
arlakshm	3a82ade3ef	[docker]: Enabled ipv6 in dockers when using docker bridge network (#4426 ) Signed-off-by: Arvindsrinivasan Lakshmi Narasimhan <arlakshm@microsoft.com>	2020-04-21 17:09:41 -07:00
byu343	e7075907f9	[arista]: Change kernel param for smartsville (#56 )	2020-04-20 07:34:43 +00:00
Stephen	e95504fe72	[Mellanox]WA to avoid fsroot being corrupted by "dpkg --extract"	2020-04-17 04:51:51 +00:00
bsun-sudo	012c832ce5	[ntp] add ntp support in buster with mgmt vrf (#55 ) - create a file in files/image_config/ntp/ntp-systemd-wrapper to add mgmt vrf related start cmd for ntp service. So that the default /usr/lib/ntp/ntp-systemd-wrapper can be overridden during build time. - modify build_debian.sh to cp files/image_config/ntp/ntp-systemd-wrapper to /usr/lib/ntp/ntp-systemd-wrapper during build time. Co-authored-by: Bing Sun <Bing_Sun@dell.com>	2020-04-17 04:51:51 +00:00
bsun-sudo	2a237c57e6	[mgmt-vrf]: mgmt vrf related change for Buster (#53 ) Co-authored-by: Bing Sun <Bing_Sun@dell.com>	2020-04-17 04:51:51 +00:00
Guohan Lu	65dfe75903	[build]: umount target directory properly Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-04-17 04:51:51 +00:00
Stephen Sun	aec51c8aaf	[docker-wait-any] Use APIClient instead of Client according to API update due to the upgrade from docker-py (1.6.0) to docker (4.1.0)	2020-04-17 04:51:51 +00:00
Guohan Lu	6f5ac4b282	[initramfs]: move mke2fs to /usr/local/sbin /usr/sbin/mke2fs is now from busybox which does not support the operation we need	2020-04-17 04:51:51 +00:00
Guohan Lu	d0a3fa4487	[sshd]: Create /run/sshd under systemd using RuntimeDirectory backport upstream patch `4c771b9c7f` Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-04-17 04:51:51 +00:00
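
Commit 7b8037770d above replaces `list(ip_ntwrk.hosts())` with `next()` on the `hosts()` generator. A minimal sketch of that idea with the standard `ipaddress` module follows; the helper name `first_host` is hypothetical and is not taken from caclmgrd.

```python
import ipaddress


def first_host(prefix):
    """Return the first usable host address of a network, or None if it has none.

    hosts() is consumed lazily, so only one address is ever produced.
    Materialising it with list() would try to enumerate every host,
    which for an IPv6 /64 is 2**64 - 1 addresses.
    """
    network = ipaddress.ip_network(prefix, strict=False)
    # iter() makes this work whether hosts() yields a generator or a list.
    return next(iter(network.hosts()), None)


if __name__ == "__main__":
    print(first_host("192.168.0.0/24"))
    print(first_host("2001:db8::/64"))
```

Passing a default to `next()` avoids a `StopIteration` if a network yields no hosts.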

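
Commit eff8a89523 above describes three fixes to the hostcfgd feature handling: per-key handling inside the loop, `systemctl mask`/`unmask` calls, and `continue` instead of `return` on failure. The sketch below illustrates that control flow only. The function name, the shape of `status_data`, the `.service` suffix and the exact command order are assumptions, and the command runner is injected so nothing is executed on import.

```python
import subprocess


def sync_feature_states(status_data, run):
    """Apply the configured state of every feature; return the names that failed.

    status_data: assumed shape {"feature": {"status": "enabled" | "disabled"}}.
    run: callable that executes one command (a list of strings) and raises on failure.
    """
    failed = []
    for key in status_data.keys():
        state = status_data[key].get("status")
        unit = "{}.service".format(key)  # assumed unit naming
        if state == "enabled":
            commands = [["sudo", "systemctl", "unmask", unit],
                        ["sudo", "systemctl", "enable", unit]]
        elif state == "disabled":
            commands = [["sudo", "systemctl", "disable", unit],
                        ["sudo", "systemctl", "mask", unit]]
        else:
            failed.append(key)
            continue  # unknown state: skip this feature, keep handling the rest
        try:
            for command in commands:
                run(command)
        except (OSError, subprocess.CalledProcessError):
            failed.append(key)
            continue  # one failing service must not stop the others
    return failed
```

On a real system `run` could be `subprocess.check_call`; in a dry run it can simply record the commands.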

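
Commit 5150e7b655 above notes that, since the introduction of VRF, interface tables contain entries with only an interface name and no IP prefix, and that those entries must be ignored. A minimal sketch of such a filter follows. It assumes keys arrive either as a bare interface name or as an `(interface, prefix)` tuple; that key shape and the helper name are assumptions for illustration.

```python
import ipaddress


def keys_with_ip_prefix(table_keys):
    """Keep only the (interface, network) entries that carry an IP prefix."""
    result = []
    for key in table_keys:
        # Entries that hold just the interface name have no prefix: skip them.
        if not isinstance(key, tuple) or len(key) != 2:
            continue
        iface_name, prefix = key
        # strict=False accepts an interface address such as 10.0.0.1/31.
        result.append((iface_name, ipaddress.ip_network(prefix, strict=False)))
    return result
```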
576 Commits
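
Commit 4ea2e5e6dc above bounds a polling loop with a 20-second timer so that a stuck wait cannot block the stop sequence. The original loop is shell code in syncd.sh; the following is only a Python sketch of the same pattern, with a hypothetical helper name and injectable clock and sleep functions.

```python
import time


def wait_until(condition, timeout_s, interval_s=0.1,
               clock=time.monotonic, sleep=time.sleep):
    """Poll condition() until it is true or timeout_s seconds have elapsed.

    Returns True if the condition became true, False if the deadline passed.
    """
    deadline = clock() + timeout_s
    while not condition():
        if clock() >= deadline:
            return False  # forced exit: give up instead of spinning forever
        sleep(interval_s)
    return True
```

A caller could pass a check such as "the process is no longer listed" as `condition` and fall back to a forced stop when the function returns `False`.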