sonic-buildimage

Author	SHA1	Message	Date
Renuka Manavalan	f8a9a1b805	[k8s]: switching to Flannel from Calico. (#4768 ) Switching to Flannel from Calico which brings down the image size by around 500+MB.	2020-06-16 08:18:54 -07:00
Joe LeVeque	c625e0e3e6	[build] Enable telemetry service by default (#4760 ) - Why I did it To ensure telemetry service is enabled by default after installing a fresh SONiC image - How I did it Set telemetry feature status to "enabled" when generating init_cfg.json file	2020-06-16 08:17:47 -07:00
Ying Xie	aecebac86b	[ntp] disable ntp long jump (#4748 ) Found another syncd timing issue related to clock going backwards. To be safe disable the ntp long jump. Signed-off-by: Ying Xie <ying.xie@microsoft.com>	2020-06-16 08:15:00 -07:00
Joe LeVeque	ed0e6aed1c	[hostcfgd] Get service enable/disable feature working (#4676 ) Fix hostcfgd so that changes to the "FEATURE" table in ConfigDB are properly handled. Three changes here: 1. Fix indenting such that the handling of each key actually occurs in the for key in status_data.keys(): loop 2. Add calls to sudo systemctl mask and sudo systemctl unmask as appropriate to ensure changes persist across reboots 3. Substitute returns with continues so that even if one service fails, we still try to handle the others Note that the masking is persistent, even if the configuration is not saved. We may want to consider only calling systemctl enable/disable in hostcfgd when the DB table changes, and only call systemctl mask/unmask upon calling config save.	2020-06-16 08:13:32 -07:00
Joe LeVeque	42bc14f44c	[systemd] Relocate all SONiC unit files to /usr/lib/systemd/system (#4673 ) This will allow us to disable services and have it persist across reboots by using the `systemctl mask` operation	2020-06-16 08:12:47 -07:00
Olivier Singla	18bbbb3c02	[baseimage]: Run fsck filesystem check support prior mounting filesystem (#4431 ) * Run fsck filesystem check support prior mounting filesystem If the filesystem become non clean ("dirty"), SONiC does not run fsck to repair and mark it as clean again. This patch adds the functionality to run fsck on each boot, prior to the filesystem being mounted. This allows the filesystem to be repaired if needed. Note that if the filesystem is maked as clean, fsck does nothing and simply return so this is perfectly fine to call fsck every time prior to mount the filesystem. How to verify this patch (using bash): Using an image without this patch: Make the filesystem "dirty" (not clean) [we are making the assumption that filesystem is stored in /dev/sda3 - Please adjust depending of the platform] [do this only on a test platform!] dd if=/dev/sda3 of=superblock bs=1 count=2048 printf "$(printf '\\x%02X' 2)" \| dd of="superblock" bs=1 seek=1082 count=1 conv=notrunc &> /dev/null dd of=/dev/sda3 if=superblock bs=1 count=2048 Verify that filesystem is not clean tune2fs -l /dev/sda3 \| grep "Filesystem state:" reboot and verify that the filesystem is still not clean Redo the same test with an image with this patch, and verify that at next reboot the filesystem is repaired and becomes clean. fsck log is stored on syslog, using the string FSCK as markup.	2020-06-16 08:12:11 -07:00
Joe LeVeque	913d380f6b	[caclmgrd] Get first VLAN host IP address via next() (#4685 ) I found that with IPv4Network types, calling list(ip_ntwrk.hosts()) is reliable. However, when doing the same with an IPv6Network, I found that the conversion to a list can hang indefinitely. This appears to me to be a bug in the ipaddress.IPv6Network implementation. However, I could not find any other reports on the web. This patch changes the behavior to call next() on the ip_ntwrk.hosts() generator instead, which returns the IP address of the first host.	2020-06-03 15:38:11 -07:00
Joe LeVeque	f2c0ed8e21	[caclmgrd] Allow more ICMP types (#4625 )	2020-06-03 15:35:49 -07:00
Joe LeVeque	1e59be8941	[caclmgrd] Ignore keys in interface-related tables if no IP prefix is present (#4581 ) Since the introduction of VRF, interface-related tables in ConfigDB will have multiple entries, one of which only contains the interface name and no IP prefix. Thus, when iterating over the keys in the tables, we need to ignore the entries which do not contain IP prefixes.	2020-06-03 15:35:10 -07:00
Joe LeVeque	ac957a0c7a	[caclmgrd] Add some default ACCEPT rules and lastly drop all incoming packets (#4412 ) Modified caclmgrd behavior to enhance control plane security as follows: Upon starting or receiving notification of ACL table/rule changes in Config DB: 1. Add iptables/ip6tables commands to allow all incoming packets from established TCP sessions or new TCP sessions which are related to established TCP sessions 2. Add iptables/ip6tables commands to allow bidirectional ICMPv4 ping and traceroute 3. Add iptables/ip6tables commands to allow bidirectional ICMPv6 ping and traceroute 4. Add iptables/ip6tables commands to allow all incoming Neighbor Discovery Protocol (NDP) NS/NA/RS/RA messages 5. Add iptables/ip6tables commands to allow all incoming IPv4 DHCP packets 6. Add iptables/ip6tables commands to allow all incoming IPv6 DHCP packets 7. Add iptables/ip6tables commands to allow all incoming BGP traffic 8. Add iptables/ip6tables commands for all ACL rules for recognized services (currently SSH, SNMP, NTP) 9. For all services which we did not find configured ACL rules, add iptables/ip6tables commands to allow all incoming packets for those services (allows the device to accept SSH connections before the device is configured) 10. Add iptables rules to drop all packets destined for loopback interface IP addresses 11. Add iptables rules to drop all packets destined for management interface IP addresses 12. Add iptables rules to drop all packets destined for point-to-point interface IP addresses 13. Add iptables rules to drop all packets destined for our VLAN interface gateway IP addresses 14. Add iptables/ip6tables commands to allow all incoming packets with TTL of 0 or 1 (This allows the device to respond to tools like tcptraceroute) 15. If we found control plane ACLs in the configuration and applied them, we lastly add iptables/ip6tables commands to drop all other incoming packets	2020-06-03 09:41:52 -07:00
Ying Xie	14b3f0022b	[ntp] enable/disable NTP long jump according to reboot type (#4577 ) * [ntp] enable/disable NTP long jump according to reboot type - Enable NTP long jump after cold reboot. - Disable NTP long jump after warrm/fast reboot. Signed-off-by: Ying Xie <ying.xie@microsoft.com> * fix typo * further refactoring * use sonic-db-cli instead	2020-05-20 22:44:14 -07:00
abdosi	bb60e2b670	Changes to support config-setup service for multi-npu (#4609 ) * Changes to support config-setup service for multi-npu platforms. For Multi-npu we are not supporting as of now config initializtion and ZTP. It will support creating config db from minigraph or using config db from previous file system Signed-off-by: Abhishek Dosi <abdosi@microsoft.com> * Address Review Comments. * Address Review comments * Address Review Comments of using pyhton based config load_minigraph/ config save/config reload from shell scripts so that we don't duplicate code. Also while running from shell we will skip stop/start services done by those commands. * Updated to use python command so no code duplication.	2020-05-20 22:44:14 -07:00
abdosi	508f6bfa02	Fix for issue where image is compile with flag ENABLE_DHCP_GRAPH_SERVICE (#4573 ) and then we load image and reboot even if there was existing config_db.json we will look for DHCP Service. we should disbale update_graph in such cases. This behaviour is silimar to what we have in 201811 image.	2020-05-20 07:53:23 -07:00
abdosi	9ea746e25f	Changes for LLDP docker to support multi-npu platforms (#4530 ) * Changes for LLDP for Multi NPU Platoforms:- a) Enable LLDP for Host namespace for Management Port b) Make sure Management IP is avaliable in per asic namespace needed for LLDP Chassis configuration c) Make sure chassis mac-address is correct in per asic namespace d) Do not run lldp on eth0 of per asic namespace and avoid chassis configuration for same e) Use Linux hostname instead from Device Metadata for lldp chassis configuration since in multi-npu platforms device metadata hostname will be differnt Signed-off-by: Abhishek Dosi <abdosi@microsoft.com> * Address Review Comment with following changes: a) Use Device Metadata hostname even in per namespace conatiner. updated minigraph parsing for same to have hostname as system hostname and add new key for asic name b) Minigraph changes to have MGMT_INTERFACE Key in per asic/namespace config also as needed for LLDP for setting chassis management IP. Signed-off-by: Abhishek Dosi <abdosi@microsoft.com> * Address Review Comments	2020-05-20 07:51:49 -07:00
lguohan	710d176162	[baseimage]: pin down package version for azure-storage, watchdog and futures (#4575 ) Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-05-12 06:19:05 +00:00
judyjoseph	c808640f4e	Multi DB with namespace support, Introducing the database_global.json… (#4477 ) * Multi DB with namespace support, Introducing the database_global.json file for supporting accessing DB's in other namespaces for service running in linux host * Updates based on comments * Adding the j2 templates for database_config and database_global files. * Updating to retrieve the redis DIR's to be mounted from database_global.json file. * Additional check to see if asic.conf file exists before sourcing it. * Updates based on PR comments discussion. * Review comments update * Updates to the argument "-n" for namespace used in both context of parsing minigraph and multi DB access. * Update with the attribute "persistence_for_warm_boot" that was added to database_config.json file earlier. * Removing the database_config.json file to avioid confusion in future. We use the database_config.json.j2 file to generate database_config.json files dynamically. * Update the comments for sudo usage in docker_image_ctrl.j2 * Update with the new logic in PING PONG tests using sonic-db-cli. With this we wait till the PONG response is received when redis server is up. * Similar changes in swss and syncd scripts for the PING tests with sonic-db-cli * Updated with a missing , in the database_config.json.j2 file, Do pip install of j2cli in docker-base-buster.	2020-05-09 21:33:07 -07:00
Santhosh Kumar T	1e3df476e5	[DellEMC] S6100 Last Reboot Reason Thermal Support (#3767 )	2020-05-09 18:37:31 -07:00
wangshengjun	18e51088a0	[ebtables]add the filter rule for ARP packets with vlan tag: (#3945 ) 1. ebtables -t filter -A FORWARD -p 802_1Q --vlan-encap 0806 -j DROP The ARP packet with vlan tag can't match the default rule. Signed-off-by: wangshengjun <wangshengjun@asterfusion.com>	2020-05-09 18:36:36 -07:00
Joe LeVeque	9bdd2ef014	[process-reboot-cause] If software reboot cause is unknown add note if first boot into new image (#4538 )	2020-05-09 18:17:31 -07:00
Dong Zhang	3faa4e936e	[MultiDB] use sonic-db-cli PING and fix wrong multiDB API in NAT (#4541 )	2020-05-09 18:16:48 -07:00
Akhilesh Samineni	3be7c5786b	[NAT] : Removed requires dependency on swss (#4551 ) Signed-off-by: Akhilesh Samineni <akhilesh.samineni@broadcom.com>	2020-05-09 18:16:02 -07:00
Neetha John	596bec1b32	[qos]: Alpha and ECN settings change for Th (#4564 ) Dynamic threshold setting changed to 0 and WRED profile green min threshold set to 250000 for Tomahawk devices Changed the dynamic threshold settings in pg_profile_lookup.ini Added a macro for WRED profiles in qos.json.j2 for Tomahawk devices Necessary changes made in qos.config.j2 to use the macro if present Signed-off-by: Neetha John <nejo@microsoft.com>	2020-05-09 18:13:10 -07:00
arlakshm	542f722055	[docker]: Enabled ipv6 in dockers when using docker bridge network (#4426 ) Signed-off-by: Arvindsrinivasan Lakshmi Narasimhan <arlakshm@microsoft.com>	2020-04-27 08:50:23 -07:00
pavel-shirshov	2f44bcd071	[bgpcfgd]: Split one bgp mega-template to chunks. (#4143 ) The one big bgp configuration template was splitted into chunks. Currently we have three types of bgp neighbor peers: general bgp peers. They are represented by CONFIG_DB::BGP_NEIGHBOR table entries dynamic bgp peers. They are represented by CONFIG_DB::BGP_PEER_RANGE table entries monitors bgp peers. They are represented by CONFIG_DB::BGP_MONITORS table entries This PR introduces three templates for each peer type: bgp policies: represent policieas that will be applied to the bgp peer-group (ip prefix-lists, route-maps, etc) bgp peer-group: represent bgp peer group which has common configuration for the bgp peer type and uses bgp routing policy from the previous item bgp peer-group instance: represent bgp configuration, which will be used to instatiate a bgp peer-group for the bgp peer-type. Usually this one is simple, consist of the referral to the bgp peer-group, bgp peer description and bgp peer ip address. This PR redefined constant.yml file. Now this file has a setting for to use or don't use bgp_neighbor metadata. This file has more parameters for now, which are not used. They will be used in the next iteration of bgpcfgd. Currently all tests have been disabled. I'm going to create next PR with the tests right after this PR is merged. I'm going to introduce better bgpcfgd in a short time. It will include support of dynamic changes for the templates. FIX:: #4231	2020-04-25 09:41:28 +00:00
Renuka Manavalan	9b017a83b5	[baseimage]: Install Kubernetes packages if enabled in image (#4374 ) (#4432 ) Install kubeadm, which transparently installs kubelet & kubectl As well download required Kubernetes images required to run as kubernetes node. The kubelet service is intentionally kept in disabled state, as it would otherwise continuously restart wasting resources, until join to master.	2020-04-16 21:54:45 -07:00
SuvarnaMeenakshi	2f66b4c545	[sonic-netns-exec]: use "$@" to reflects all positional parameters as they were set initially (#4375 ) sonic-netns-exec fails to execute below command in swss.sh: sonic-netns-exec "$NET_NS" sonic-db-cli $1 EVAL " local tables = {$2} for i = 1, table.getn(tables) do local matches = redis.call('KEYS', tables[i]) for j,name in ipairs(matches) do redis.call('DEL', name) end end" 0 This command fails with error " redis.exceptions.ResponseError: value is not an integer or out of range" . Root cause: When sonic-netns-exec executes the above function, argument passed to sonic-db-cli is NOT executed as a single script. The argument is passed as separate keywords to sonic-db-cli, as below: ['EVAL', 'local', 'tables', '=', "{'PORT_TABLE'}", 'for', 'i', '=', '1,', 'table.getn(tables)', 'do', 'local', 'matches', '=', "redis.call('KEYS',", 'tables[i])', 'for', 'j,name', 'in', 'ipairs(matches)', 'do', "redis.call('DEL',", 'name)', 'end', 'end', '0'] - How I did it To make sure that the parameters are passed as they were set initially, fix sonic-netns-exec to use double quoted "$@", where "$@" is "$1" "$2" "$3" ... "${N}" After fix, the argument passed to sonic-db-cli is as below: Argument passed to sonic-db-cli: ['EVAL', "\n local tables = {'PORT_TABLE'}\n for i = 1, table.getn(tables) do\n local matches = redis.call('KEYS', tables[i])\n for j,name in ipairs(matches) do\n redis.call('DEL', name)\n end\n end", '0'] Signed-off-by: SuvarnaMeenakshi <sumeenak@microsoft.com>	2020-04-15 13:13:31 -07:00
SuvarnaMeenakshi	0099305475	Multi-ASIC implementation (#3888 ) Changes made to support multi-asic platform. Added multi-instance support for swss, syncd, database, bgp, teamd and lldp.	2020-04-15 13:08:34 -07:00
Nazarii Hnydyn	0b35fcf3bf	[mellanox]: Add SSD FW update tool (#4351 ) * [mellanox]: Add SSD FW update tool. Signed-off-by: Nazarii Hnydyn <nazariig@mellanox.com> * [mellanox]: Align Platform API. Signed-off-by: Nazarii Hnydyn <nazariig@mellanox.com> * [mellanox]: Fix firmware description. Signed-off-by: Nazarii Hnydyn <nazariig@mellanox.com> * [mellanox]: Update SSD tool. Signed-off-by: Nazarii Hnydyn <nazariig@mellanox.com>	2020-04-15 13:02:36 -07:00
rajendra-dendukuri	a97b73e79c	Fix typo in config-setup service (#4388 )	2020-04-10 21:23:07 -07:00
Abhishek Dosi	249265ad99	Revert "Multi-ASIC implementation (#3888 )" This reverts commit `2e87a16941`.	2020-04-03 14:34:38 -07:00
Samuel Angebault	8819322210	[Arista] Update drivers submodules (#4353 ) * Update arista drivers submodules * Add device configs for 7060CX2-32S * Update boot0 and union-mount for 7060CX2-32S * Add 7170-32C and 7170-32CD support in boot0 * Sync after writting boot configs * Add 7170-32C and 7170-32CD device configurations Co-authored-by: Boyang Yu <byu@arista.com> Co-authored-by: Boyang Yu <byu@arista.com>	2020-04-01 23:26:42 -07:00
SuvarnaMeenakshi	2e87a16941	Multi-ASIC implementation (#3888 ) Changes made to support multi-asic platform. Added multi-instance support for swss, syncd, database, bgp, teamd and lldp.	2020-04-01 23:21:49 -07:00
Kebo Liu	2fd1641feb	copy spc3 fw file to image (#4328 )	2020-03-29 22:48:10 -07:00
Garrick He	a059d7ec0e	[procdockerstatsd] Fix CMD field in dB (#4335 ) * Fix the CMD for the PROCESSSTATS entries so that there is a space between the command name and the arguments. Signed-off-by: Garrick He <garrick_he@dell.com>	2020-03-29 22:47:05 -07:00
Stepan Blyshchak	ee84dca683	[docker_image_ctl.j2] Share UTS namespace with host OS (#4169 ) Instead of updating hostname manualy on Config DB hostname change, simply share containers UTS namespace with host OS. Ideally, instead of setting `--uts=host` for every container in SONiC, this setting can be set per container if feature requires. One behaviour change is introduced in this commit, when `--privileged` or `--cap-add=CAP_SYS_ADMIN` and `--uts=host` are combined, container has privilege to change host OS and every other container hostname. Such privilege should be fixed by limiting containers capabilities. Signed-off-by: Stepan Blyschak <stepanb@mellanox.com>	2020-03-22 23:04:02 -07:00
SuvarnaMeenakshi	7b4b1245bd	[ntp]: Add "tinker panic 0" in ntp.conf to avoid ntpd from panic (#4263 ) - What I did Add configuration to avoid ntpd from panic and exit if the drift between new time and current system time is large. - How I did it Added "tinker panic 0" in ntp.conf file. - How to verify it [this assumes that there is a valid NTP server IP in config_db/ntp.conf] Change the current system time to a bad time with a large drift from time in ntp server; drift should be greater than 1000s. Reboot the device. Before the fix: 3. upon reboot, ntp-config service comes up fine, ntp service goes to active(exited) state without any error message. This is because the offset between new time (from ntp server) and the current system time is very large, ntpd goes to panic mode and exits. The system continues to show the bad time. After the fix: 3. Upon reboot, ntp-config comes up fine, ntp services comes up from and stays in active (running) state. The system clock gets synced with the ntp server time.	2020-03-22 23:00:40 -07:00
yozhao101	358570324b	[Monit] Delay start of monitoring for 5 minutes (#4281 )	2020-03-22 22:58:57 -07:00
Andriy Kokhan	39889a3c35	[Service] Added NAT entry into CONTAINER_FEATURE. Fixes #4247 . (#4250 ) * [Service] Added NAT entry into CONTAINER_FEATURE. Fixes #4247. Signed-off-by: Andriy Kokhan <akokhan@barefootnetworks.com>	2020-03-19 22:18:13 -07:00
Joe LeVeque	8e36068237	[sonic-cfggen] Loading the configuration from init_cfg.json and then from config_db.json (#4148 )	2020-03-15 08:54:05 -07:00
Olivier Singla	a8baca0d6e	[kernel]: security kernel update to 4.9.189 (#3913 ) This patch upgrade the kernel from version 4.9.0-9-2 (4.9.168-1+deb9u3) to 4.9.0-11-2 (4.9.189-3+deb9u2) Co-authored-by: rajendra-dendukuri <47423477+rajendra-dendukuri@users.noreply.github.com>	2020-03-15 08:52:29 -07:00
Joe LeVeque	102cb83097	[Services] Restart NAT service upon unexpected critical process exit. (#4208 )	2020-03-14 18:03:29 -07:00
Stephen Sun	c700127101	[Mellanox]Take advantage of sdk variable to customize the location where sdk_socket exists. (#4223 ) Take advantage of an SDK environment variable to customize the location where sdk_socket exists. In the latest SDK sdk_socket has been moved from /tmp to /var/run which is a better place to contain this kind of file. However, this prevents the subdirs under /var/run from being mapped to different volumes. To resolve this, we take advantage of an SDK variable to designate the location of sdk_socket. This requires every process that requires to access sdk_socket have this environment variable defined. However, to define environment variable for each process is less scalable. We take advantage of the docker scope environment variable to avoid that. It depends on PR 4227	2020-03-14 18:02:43 -07:00
byu343	950926a837	[arista]: Add support for Arista Lodoga (#4232 ) Backport the support of Arista Lodoga to 201911	2020-03-11 13:12:39 -07:00
Abhishek Dosi	cc2d497aa4	Fixing Bad Cherry-pick	2020-03-04 10:46:45 -08:00
rajendra-dendukuri	8581a52571	ZTP infrastructure changes to support DHCP discovery provisioning data (#3298 ) * ZTP infrastructure changes to support DHCP discovery provisioning data - Dynamically generate DHCP client configuration based on current ZTP state - Added support to request and process hostname when using DHCPv6 - Do not process graphservice url dhcp option if ZTP is enabled, ZTP service will process it - Generate /e/n/i file with all active interfaces seeking address assignment via DHCP. Only interfaces that are created in Linux will be added to /e/n/i. Also DHCP is started only on linked up in-band interfaces. Signed-off-by: Rajendra Dendukuri <rajendra.dendukuri@broadcom.com>	2020-03-03 22:23:59 -08:00
yozhao101	5c8c4b2a50	[Services] Restart BGP service upon unexpected critical process exit. (#4207 )	2020-03-03 19:19:44 -08:00
rajendra-dendukuri	1edb69647e	[sonic-ztp]: Build sonic-ztp package (#3299 ) * Build sonic-ztp package - Add changes in make rules to conditionally include sonic-ztp package Signed-off-by: Rajendra Dendukuri <rajendra.dendukuri@broadcom.com>	2020-02-24 14:27:24 -08:00
Stepan Blyshchak	398929c622	[mgmt-framework] start after syncd (#4174 ) every service starts after syncd to start the most critical parts first Signed-off-by: Stepan Blyschak <stepanb@mellanox.com>	2020-02-24 11:04:51 -08:00
Prince Sunny	20510d58d3	Sleep done before mismatch handler (#4165 ) * Sleep done before mismatch handler	2020-02-24 10:25:56 -08:00
Prince Sunny	6740b2d3df	Fix service and container name to be same (#4151 )	2020-02-24 10:24:11 -08:00

1 2 3 4 5 ...

539 Commits