sonic-buildimage

Author	SHA1	Message	Date
Saikrishna Arcot	aafb3d00e2	Start haveged before systemd-random-seed (#10328 ) The haveged service file in Debian Buster specifies that haveged should start after systemd-random-seed starts (this was removed in Bullseye after systemd changes caused a bootloop). This is a bit counterproductive, since haveged is meant to be used in environments with minimal sources of entropy, but one of the checks that systemd-random-seed does is to verify that entropy is present. Therefore, override the default .service file for haveged that moves systemd-random-seed to the Before list, allowing it to start before systemd-random-seed checks the system entropy level. (systemd doesn't allow removing items from dependency/ordering entries such as After= and Before=, so the entire .service file has to be overwritten.) Note that despite this, haveged takes up to two seconds to actually start working, so systemd-random-seed may still block for about two seconds. However, this still allows other work (such as running rc.local) to proceed a bit sooner. Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>	2022-03-24 14:28:42 -07:00
noaOrMlnx	4f021c44c2	Update docker-sonic-vs infrastructure in order to run CoPP UT (#10230 ) *Changes to run CoPP UT in docker-sonic-vs	2022-03-21 21:55:24 -07:00
xumia	67312ff635	[Build]: Use one debian mirror config (#10281 ) Why I did it Use one debian mirror config. The empty config in https://github.com/Azure/sonic-buildimage/blob/master/files/image_config/apt/sources.list overrides the file https://github.com/Azure/sonic-buildimage/blob/master/files/apt/sources.list.amd64 (armhf/arm64), it does not make sense. All the content in files/image_config/apt is no use, any one wants to add mirror config, please add in files/apt. How I did it Remove files/image_config/apt and the reference.	2022-03-21 17:04:19 +08:00
gechiang	a984757b9d	[202012 BRCM SAI 4.3.5.3-3] Picked up fixes that makes up BRCM SAI version 4.3.5.3-3 (#10255 )	2022-03-19 17:18:50 -07:00
xumia	413ee3e219	[Build]: Fix /proc not mounted issue (#10164 ) (#10256 ) [Build]: Fix /proc not mounted issue	2022-03-19 22:19:06 +08:00
mssonicbld	03d058efe4	[ci/build]: Upgrade SONiC package versions (#10283 ) [ci/build]: Upgrade SONiC package versions	2022-03-19 11:09:51 +08:00
Stepan Blyshchak	8ce5e4e77b	[teamd.sh] kill teamd docker on warm shutdown for faster shutdown (#10219 ) This can save 6 sec for teamd LAG restoration - the time between: ``` Mar 9 13:51:10.467757 r-panther-13 WARNING teamd#teamd_PortChannel1[28]: Got SIGUSR1. Mar 9 13:52:33.310707 r-panther-13 INFO teamd#teamd_PortChannel1[27]: carrier changed to UP ``` - Why I did it Optimize warm boot. Specifically reduce the time needed for LAG restoration. - How I did it Kill teamd docker after graceful shutdown of teamd processes. - How to verify it Run warm reboot. Signed-off-by: Stepan Blyschak <stepanb@nvidia.com>	2022-03-16 22:22:26 +00:00
wenyiz2021	5878cfdb06	Update container_checker for multi-asic devices when state is 'always_enabled' (#10067 ) * Update container_checker for multi-asic devices Update container_checker for multi-asic devices to add database containers in always_running_containers. Previous change was made for single-asic, and that database containers were not considered as feature when writing to state_db. * Update container_checker Update an indent	2022-03-14 23:01:43 +00:00
mssonicbld	1c4364222d	[ci/build]: Upgrade SONiC package versions (#10214 )	2022-03-11 15:13:09 +00:00
Santhosh Kumar T	e83955599d	[202012] Refactoring DELL platform init to reduce rc.local processing time (#10171 ) Why I did it To reduce the processing time of rc.local, refactoring s6100 platform initialization. Fixing [warm-upgrade][202012] Slow DELL platform init in rc.local causes lacp-teardown #10150 How I did it On branch 202012-s6100-rclocalChanges to be committed: (use "git restore --staged <file>..." to unstage) modified: ../../../../files/image_config/platform/rc.local modified: ../debian/platform-modules-s6100.install modified: scripts/fast-reboot_plugin modified: scripts/s6100_platform.sh renamed: scripts/s6100_i2c_enumeration.sh -> scripts/s6100_platform_startup.sh renamed: systemd/s6100-i2c-enumerate.service -> systemd/s6100-platform-startup.service	2022-03-10 18:51:07 -08:00
mssonicbld	7fe1489061	[ci/build]: Upgrade SONiC package versions (#10194 )	2022-03-09 22:41:51 +00:00
mssonicbld	063882cf87	[ci/build]: Upgrade SONiC package versions (#10069 ) [ci/build]: Upgrade SONiC package versions (#10069)	2022-03-08 21:32:36 +08:00
xumia	a8d844c83d	[build]: Fix marvell-armhf build hung issue (#10156 ) The marvel-armhf build is hung, it does not exist after waiting for a long time. It is caused by the process /etc/entropy.py which is started by the postinst script in target/debs/buster/sonic-platform-nokia-7215_1.0_armhf.deb $ cat postinst sh /usr/sbin/nokia-7215_plt_setup.sh ... $ cat usr/sbin/nokia-7215_plt_setup.sh \| tail python /etc/entropy.py & $ cat etc/entropy.py if path.exists("/proc/sys/kernel/random/entropy_avail"): while 1: while avail() < 2048: with open('/dev/urandom', 'rb') as urnd, open("/dev/random", mode='wb') as rnd: d = urnd.read(512) t = struct.pack('ii', 4 * len(d), len(d)) + d fcntl.ioctl(rnd, RNDADDENTROPY, t) time.sleep(30) It is a workaround to fix the build issue, need to fix debian package, and revert the change.	2022-03-07 08:00:56 -08:00
roman_savchuk	4d6f9f2de7	[ BFN ] update SDE package for BFN platform (#10049 ) Updated SDE package for Barefoot platform with fixes for: - NAT - VRF	2022-03-04 20:43:08 -08:00
Qi Luo	04925df451	[build] Fix the urllib3 version in sonic-mgmt-framework (#10149 ) Fix the urllib3 version in sonic-mgmt-framework constrain file because it is already updated in Dockerfile	2022-03-04 20:34:23 -08:00
gechiang	7fb546dce4	[202012]BRCM SAI 4.3.5.3-2 Fixes CS00012228504, SONIC-55963:SID, CS00012209080, CS00012220761, and CS00012222414 (#10155 )	2022-03-04 16:24:59 -08:00
Lawrence Lee	4d1abbc09b	[write_standby]: Increase timeout to 60s (#10065 ) - Avoid scenarios where script times out before orchagent can establish IPinIP tunnel Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2022-03-01 22:49:17 +00:00
noaOrMlnx	7a35504ff7	[202012] [CoPP] Add always_enabled field (#9999 ) Add the "always_enabled" field to copp_cfg.j2 file, in order to allow traps without an entry in features table, to be installed automatically. This is a cherry-pick of https://github.com/Azure/sonic-buildimage/pull/9302 - Why I did it In order to allow traps without an entry in features table, to be installed automatically. - How I did it Add always_enabled field to traps without a feature	2022-02-20 12:42:39 +02:00
mssonicbld	a23aac25d3	[ci/build]: Upgrade SONiC package versions (#10023 ) [ci/build]: Upgrade SONiC package versions	2022-02-19 08:10:17 +08:00
Samuel Angebault	b32d7eedaf	Add emmc quirks to boot0 (#9989 ) Why I did it Fix some unreliability seen on emmc device with some AMD CPUs How I did it Added a kernel parameter to add quirks to It depends on a sonic-linux-kernel change to work properly but will be a no-op without it. Description for the changelog Add emmc quirks for Upperlake	2022-02-17 08:55:01 -08:00
vmittal-msft	304ec5b0cd	Updated traffic scheduler settings for HWSKUs : DellEMC-Z9332f-O32 & DellEMC-Z9332f-M-O16C64 (#9927 )	2022-02-15 16:15:20 -08:00
mssonicbld	f746d27c7d	[ci/build]: Upgrade SONiC package versions (#9933 )	2022-02-09 00:59:47 +00:00
Prince George	c1a0871fe9	Close console session due to user inactivity (#9890 ) Signed-off-by: Prince George <prgeor@microsoft.com>	2022-02-08 19:07:29 +00:00
tbgowda	78dc2d8a7b	Enable SAI_SWITCH_ATTR_UNINIT_DATA_PLANE_ON_REMOVAL attribute (#9419 ) Why I did it Fixes #8980 partly. The corresponding changes in sonic-sairedis is here : Azure/sonic-sairedis#975 How I did it Include changes from both repos and build an image for verification. How to verify it Trigger fast-reboot with the changes, see the attribute SAI_SWITCH_ATTR_UNINIT_DATA_PLANE_ON_REMOVAL being set at the SAI level. Signed-off-by: Thushar Gowda <24815472+tbgowda@users.noreply.github.com>	2022-02-08 19:07:08 +00:00
vmittal-msft	7435613216	[202012] BRCM SAI 4.3.5.3-1 Fix for CS00012218555 (#9923 )	2022-02-07 08:02:57 -08:00
Shi Su	4191889803	[bgpcfgd] Add bgpcfgd support to advertise routes (#9197 ) (#9697 ) Why I did it Cherry pick changes in #9197 to 202012 branch Add bgpcfgd support to advertise routes. How I did it Make bgpcfgd subscribe to the ADVERTISE_NETWORK table in STATE_DB and configure route advertisement accordingly. How to verify it Added unit tests in bgpcfgd and verify on KVM about route advertisement.	2022-01-26 14:38:04 -08:00
mssonicbld	3dae536de4	[ci/build]: Upgrade SONiC package versions (#9834 )	2022-01-23 22:13:50 +00:00
mssonicbld	ae7514b1bd	[ci/build]: Upgrade SONiC package versions (#9832 )	2022-01-22 16:01:17 +00:00
dflynn-Nokia	c715bdbf56	[firsttime boot] suppress error message on platforms not supporting kdump (#9521 ) Why I did it Eliminate benign firsttime boot error reported when running on platforms that do not support kdump. How I did it Change rc.local to check for presence of the file /etc/default/kdump-tools before referencing it. How to verify it Install a new image on an armhf or arm64 platform and check for a failed reference to /etc/default/kdump-tools on firsttime boot.	2022-01-21 02:39:17 +00:00
gechiang	090ef33ca2	[202012]BRCM SAI 4.3.5.3 Fixes CS00012218100,CS00012215529,CS00012208995,CS00012220761,CS00012211718,CS00012208995,CS00012220761, and CS00012225760 (#9815 )	2022-01-20 15:28:34 -08:00
mssonicbld	2eb8fe3a2c	[ci/build]: Upgrade SONiC package versions (#9799 )	2022-01-19 22:46:23 +00:00
gechiang	bdc7ce86de	[202012] BRCM SAI 4.3.5.2 Fixes CS00012205357, CS00012214196, CS00012213974 (#9754 )	2022-01-13 11:40:43 -08:00
mssonicbld	a0376a6e59	[ci/build]: Upgrade SONiC package versions (#9680 )	2022-01-07 22:12:12 +00:00
mssonicbld	9b1a3971bd	[ci/build]: Upgrade SONiC package versions (#9645 )	2021-12-26 23:30:40 +00:00
mssonicbld	813a6387c5	[ci/build]: Upgrade SONiC package versions (#9543 )	2021-12-24 17:05:45 +00:00
vmittal-msft	724037ebc3	BRCM SAI 4.3.5.1-9 for enabling SAI_SWITCH_ATTR_QOS_DSCP_TO_TC_MAP capability (#9463 )	2021-12-14 09:56:21 -08:00
Lawrence Lee	b3a3aa0c38	[mux]: Fix `mark_dhcp_packet` (#9373 ) - Consolidate the two [Service] sections by moving the ExecStartPre line for mark_dhcp_packet.py to the first section and removing the second. - Make the mark_dhcp_packet.py file executable - Also clean up mark_dhcp_packet.py - Remove unused imports - Fix spacing and line lengths to conform to PEP8 Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2021-12-01 02:28:56 +00:00
Stephen Sun	fafd5327bd	[Reclaim buffer] Common infrastructure update for reclaiming buffer (#9133 ) - Why I did it This is to update the common sonic-buildimage infra for reclaiming buffer. - How I did it Render zero_profiles.j2 to zero_profiles.json for vendors that support reclaiming buffer The zero profiles will be referenced in PR [Reclaim buffer] Reclaim unused buffers by applying zero buffer profiles #8768 on Mellanox platforms and there will be test cases to verify the behavior there. Rendering is done here for passing azure pipeline. Load zero_profiles.json when the dynamic buffer manager starts Generate inactive port list to reclaim buffer Signed-off-by: Stephen Sun <stephens@nvidia.com>	2021-12-01 02:28:46 +00:00
gechiang	a5f4780c64	[202012] BRCM SAI 4.3.5.1-8 Pick up fix for PFCWD getting continuously triggered/restored when pause frames are sent continuously to both queues of a port (#9296 ) 1. CS00012211718 [4.3] Pfcwd getting continuously triggered/restored when pause frames are sent continuously to both queues of a port (TD2/Th/Th2/TD3) MSFT Default Preliminary tests look fine. BGP neighbors were all up with proper routes programmed interfaces are all up Manually ran the following test cases on 7050CX3 (TD3) T0 DUT and all passed: ``` fib/test_fib.py vxlan/test_vxlan_decap.py fdb/test_fdb.py decap/test_decap.py ipfwd/test_dip_sip.py ipfwd/test_dir_bcast.py acl/test_acl.py vlan/test_vlan.py platform_tests/test_reboot.py ```	2021-11-17 21:30:10 -08:00
trzhang-msft	19008889de	update DHCP_PACKET_MARK schema (#9077 ) - update DHCP_PACKET_MARK schema in state_db - this is an update over PR: Add service mark_dhcp_packet to mux container #9015	2021-11-15 21:37:08 +00:00
trzhang-msft	86fa5eede2	Add service mark_dhcp_packet to mux container (#9015 ) - add a new service "mark_dhcp_packet" to mux container - apply packet marks on a per-interface basis in ebtables - write packet marks to "DHCP_PACKET_MARK" table in state_db	2021-11-15 21:36:29 +00:00
Renuka Manavalan	6cb7af73d9	add arista.log to logrotate (#9245 )	2021-11-15 21:32:03 +00:00
mssonicbld	36f1a547b1	[ci/build]: Upgrade SONiC package versions (#9255 )	2021-11-14 23:26:35 +00:00
mssonicbld	4d15a1c1f6	[ci/build]: Upgrade SONiC package versions (#9221 )	2021-11-13 23:37:09 +00:00
gechiang	7ac5b40f4b	[202012]BRCM SAI 4.3.5.1-7 Picked up fixes for CS00012209390, CS00012212995, SONIC-51583, CS00012215744, and SONIC-51638 (#9252 ) This is to pick up BRCM SAI 4.3.5.1-7 fixes which contains the following fixes: 1. CS00012209390: SONIC-50037, Used SAI_SWITCH_ATTR_QOS_DSCP_TO_TC_MAP as a default decap map for IPinIP tunnels. 2. CS00012212995: SONIC-50948 SAI_API_QUEUE:_brcm_sai_cosq_stat_get:1353 egress Min limit get failed with error Invalid parameter 3. SONIC-51583: Fixed acl group member creation failure with priority of -1 4. CS00012215744:SONIC-51395 [TH, TH2] WB 3.5 to 4.3 fails at APPLY_VIEW while setting SAI_PORT_ATTR_EGRESS_ACL 5. SONIC-51638: SDK-249337 ERROR: AddressSanitizer: heap-buffer-overflow in _tlv_print_array Preliminary tests look fine. BGP neighbors were all up with proper routes programmed interfaces are all up Manually ran the following test cases on 7050CX3 (TD3) T0 DUT and all passed: ``` fib/test_fib.py vxlan/test_vxlan_decap.py fdb/test_fdb.py decap/test_decap.py ipfwd/test_dip_sip.py ipfwd/test_dir_bcast.py acl/test_acl.py vlan/test_vlan.py platform_tests/test_reboot.py ```	2021-11-13 10:45:46 -08:00
Mykhailo Onipko	a7117b905f	[BFN]: Updated SDK packages to 20211112 (#9244 ) Signed-off-by: Mykhailo Onipko <monipko@barefootnetworks.com>	2021-11-12 21:47:56 -08:00
Lawrence Lee	b027e87ffb	[mux.service]: Remove pmon dependency (#9211 ) Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2021-11-11 02:56:27 +00:00
Lawrence Lee	f317d93cb0	Merged PR 4679112: [write_standby]: Ignore non-auto interfaces [write_standby]: Ignore non-auto interfaces * In the event that `write_standby.py` is used to automatically switchover interfaces when linkmgrd or bgp crashes, ignore any interfaces that are not configured to auto-switch Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2021-11-10 18:54:33 -08:00
Lawrence Lee	57ad50cfd9	Merged PR 4559560: [bgp]: Switch to standby if BGP container exits [bgp]: Switch mux to standby if BGP container exits Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2021-11-10 18:54:33 -08:00
Lawrence Lee	6a9c709336	[write_standby]: Improve logging Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2021-11-10 18:54:33 -08:00
Lawrence Lee	77378b4364	[mux]: Call write_standby from host only Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2021-11-10 18:54:33 -08:00
Lawrence Lee	25712c712e	[mux]: Make write_standby available on host Signed-off-by: Lawrence Lee <lawlee@microsoft.com> [write_standby]: Cleanup and fix build Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2021-11-10 18:54:33 -08:00
Tamer Ahmed	18d1f65339	Merged PR 4813977: [mux] Update Service Install With SONiC Target [mux] Update Service Install With SONiC Target Recent PR grouped all SONiC service into sonic.taget. The install section of mux.service was not update and this causes delays when using config reload as the service failed state is not being reset. signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2021-11-10 18:54:33 -08:00
Lawrence Lee	70fbd6826c	Merged PR 4366316: [mux.service]: Bind to sonic.target [mux.service]: Bind to sonic.target Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2021-11-10 18:54:33 -08:00
Tamer Ahmed	b42aef68f3	Merged PR 4234524: [mux] Start Mux on Only Dual-ToR Platform [mux] Start Mux on Only Dual-ToR Platform mux docker depends on the presence of mux cable hardware and is supposed to run only Gemini ToRs. This PR change the mux feature config in order to enable mux docker based on device configuration. signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2021-11-10 18:54:33 -08:00
Tamer Ahmed	b8f70f8986	Merged PR 3845699: [linkmgrd]: Introduce MUX cable linkmgrd Linkmgrd monitors link status, mux status, and link state. Has the link becomes unhealthy, linkmgrd will trigger mux switchover on a standby ToR ensuring uninterrupted service to servers/blades. This PR is initial implementation of linkmgrd. Also, docker-mux container hold packages related to maintaining and managing mux cable. It currently runs linkmgrd binary that monitor and switches the mux if needed. This PR also introduces mux-container and starts linkmgrd as startup when build is configured with INCLUDE_MUX=y Edit: linkmgrd PR will follow. signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com> Related work items: #2315, #3146150	2021-11-10 18:54:33 -08:00
tjchadaga	9a1b1bc44e	Fix for additional intf flap during fast-reboot (#9166 )	2021-11-09 23:20:06 +00:00
mssonicbld	c15bae7c84	[ci/build]: Upgrade SONiC package versions (#9128 )	2021-11-09 22:52:26 +00:00
gechiang	400e40f255	[202012] BRCM SAI 4.3.5.1-6 Picked up fixes for CS00012213351, CS00012182162, and CS00012210826 (#9158 ) This is to pick up BRCM SAI 4.3.5.1-6 fixes which contains the following fixes: 1. CS00012213351 SONIC-50679: [TH, TH2] Warm-reboot from 3.5 to 4.3 fails due to null objects discovered 2. CS00012182162: SONIC-49805 TD3 MMU config profile optimization changes 3. CS00012210826:SONIC-50205/760c60fc: Should read MMU_INTFI_MMU_PORT_TO_MMU_QUEUES_FC_BKP for TH3 Preliminary tests looks fine. BGP neighbors were all up with proper routes programmed interfaces are all up Manually ran the following test cases on 7050CX3 (TD3) T0 DUT and all passed: ``` fib/test_fib.py vxlan/test_vxlan_decap.py fdb/test_fdb.py decap/test_decap.py ipfwd/test_dip_sip.py ipfwd/test_dir_bcast.py acl/test_acl.py vlan/test_vlan.py platform_tests/test_reboot.py ```	2021-11-03 07:24:33 -07:00
Sumukha Tumkur Vani	65626c8925	Flush RESTAPI DB upon config reload (#9093 )	2021-10-28 09:31:38 -07:00
Nazarii Hnydyn	0cbda8d362	[teamd]: Send USR1/USR2 only to subscribers. (#8856 ) To fix teamd signal handling, without which Process 'tlm_teamd' exited unexpectedly	2021-10-27 03:54:58 +00:00
mssonicbld	1c86196411	[ci/build]: Upgrade SONiC package versions (#9050 )	2021-10-25 17:09:12 +00:00
gechiang	c95178157d	[202012]BRCM SAI 4.5.3.1-5 picked up SAI fixes for several CSP cases (#9003 )	2021-10-19 14:08:31 -07:00
Ying Xie	f1d5aaced0	[copp] bind copp-config.service to sonic.target (#8969 ) copp-config service needs to be started after sonic.target so that it could render the copp-config with the latest information. It also needs to be restarted when config reload or load_minigraph is invoked. Signed-off-by: Ying Xie <ying.xie@microsoft.com>	2021-10-15 00:40:05 +00:00
gechiang	eca9020a48	[202012] BRCM SAI 4.5.3.1-4 Fixes dscp-uniform mode, th3 debug counter bmp crash (#8968 ) * [202012] BRCM SAI 4.5.3.1-4 Fixes dscp-uniform mode, th3 debug counter bmp crash	2021-10-13 08:25:44 -07:00
mssonicbld	b11d6cf5ee	[ci/build]: Upgrade SONiC package versions (#8919 )	2021-10-09 19:12:09 +00:00
mssonicbld	0f48239167	[ci/build]: Upgrade SONiC package versions (#8894 )	2021-10-02 19:01:40 +00:00
mssonicbld	d790caecbc	[ci/build]: Upgrade SONiC package versions (#8867 )	2021-09-29 17:11:32 +00:00
Vaibhav Hemant Dixit	636870d86f	Save DB dump after warm/fast reboot (#8803 ) As a part of warmboot, redis database is dumped: `c97fe546e5/scripts/fast-reboot (L269)` However, this dump file is deleted, after it is loaded back into db post reboot. The DB dump can be useful for debugging purpose, hence taking a backup of it can be useful. Instead of deleting the dump, rename and keep the dump.	2021-09-27 02:29:12 +00:00
gechiang	ac9feadbf1	[202012] BRCMSAI 4.3.5.1-3 fix CS00012203600, CS00012202255, CS00012208537 (#8840 )	2021-09-25 17:09:34 -07:00
mssonicbld	667fe3702c	[ci/build]: Upgrade SONiC package versions (#8829 )	2021-09-23 17:34:56 +00:00
mssonicbld	c988a7766c	[ci/build]: Upgrade SONiC package versions (#8800 )	2021-09-20 12:48:20 +00:00
mssonicbld	7ce529ea35	[ci/build]: Upgrade SONiC package versions (#8795 )	2021-09-19 15:26:49 +00:00
mssonicbld	f716745d76	[ci/build]: Upgrade SONiC package versions (#8637 )	2021-09-17 16:40:09 +00:00
abdosi	7732fa95bb	[baseimage]: Logrotate for wtmp and btmp files. (#8743 ) Added logrotate file for wtmp and btmp to override default conf and set size cap as 100K as done in PR: #865. For buster this is control by separate file wtmp and btmp. Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>	2021-09-17 08:24:10 +00:00
Sudharsan Dhamal Gopalarathnam	9c5917d8dd	Removing execute permission from copp config file (#8680 ) *Removed execute permissions from the systemd copp-config.service file. Without this we will get a warning: "Configuration file /lib/systemd/system/copp-config.service is marked executable. Please remove executable permission bits. Proceeding anyway."	2021-09-14 08:59:21 +00:00
Ying Xie	e8b8012818	[202012][fstrim] delay fstrim timer after sonic.target (#8737 ) Why I did it fstrim has dependency on pmon docker. How I did it start fstrim timer after sonic.target. How to verify it local test and PR test. Signed-off-by: Ying Xie ying.xie@microsoft.com	2021-09-14 08:59:17 +00:00
gechiang	84b5659372	[202012] BRCM SAI 4.3.5.1-2 Fix BRCM SAI regression due to ACL Egress Mirroring Action capability (#8682 )	2021-09-06 22:12:59 -07:00
Samuel Angebault	96f2eaaadb	[Arista] Fix flash size computation for Lodoga (#8622 ) The Lodoga platform also matched crow which was hardcoding the flash size to 3700. This change enables autodetect on Clearlake which in turns allows autodetect for Lodoga. The threshold was bumped from 3700 to 4000 because size computation can differ slightly and report slightly above 3700.	2021-09-01 01:40:45 +00:00
mssonicbld	7eb4a345fa	[ci/build]: Upgrade SONiC package versions (#8584 ) Co-authored-by: mssonicbld <vsts@fv-az232-326.x3jni0md3anuvcz2px3t3ecixa.bx.internal.cloudapp.net>	2021-08-30 16:24:18 +08:00
Samuel Angebault	01117d58b5	[Arista] Rely on automatic flash size detection for Lodoga (#8608 ) Lodoga actually has a 8GB storage device. LodogaSsd variant has a 30GB SSD drive. However, in boot0 both were mishandled and assigned 4GB for legacy reasons. Remove the hardcoding of the flash size and let boot0 autodetect the available space.	2021-08-27 02:27:15 +00:00
dflynn-Nokia	2c91efcd15	[Nokia ixs7215] Add support for changing the console baud rate (#8595 ) This commit adds support for changing the default console baud rate configured within the U-Boot bootloader. That default baud rate is exposed via the value of the U-Boot 'baudrate' environment variable. This commit removes logic that hardcoded the console baud rate to 115200 and instead ensures that the U-Boot 'baudrate' variable is always used when constructing the Linux kernel boot arguments used when booting Sonic. A change is also made to rc.local to ensure that the specified baud rate is set correctly in the serial getty service.	2021-08-27 02:27:06 +00:00
gechiang	fcdd63835b	[202012]BRCM SAI 4.3.5.1-1 Fix configurable drop counter out of resource (#8601 ) * [202012]BRCM SAI 4.3.5.1 Fix for configurable drop counter out of resource	2021-08-26 14:30:22 -07:00
mssonicbld	98dd76c485	[ci/build]: Upgrade SONiC package versions (#8561 )	2021-08-24 14:53:16 +00:00
mssonicbld	8f604998b4	[ci/build]: Upgrade SONiC package versions (#8556 )	2021-08-23 17:32:59 +00:00
Volodymyr Samotiy	8365245122	[monit] Periodically monitor VNET route consistency (#8266 ) To run VNET route consistency check periodically. For any failure, the monit will raise alert based on return code. Signed-off-by: Volodymyr Samotiy <volodymyrs@nvidia.com>	2021-08-23 03:05:16 +00:00
mssonicbld	733c851fc9	[ci/build]: Upgrade SONiC package versions (#8527 )	2021-08-19 18:49:15 +00:00
mssonicbld	aa5d05ed2c	[ci/build]: Upgrade SONiC package versions (#8385 )	2021-08-16 10:48:24 +00:00
Stephen Sun	d599450052	Use predefined macro as vendor information (#8361 ) #### Why I did it Use a predefined variable to get vendor information when the swss docker container is created #### How I did it Use `{{ sonic_asic_platform }}` instead of `$SONIC_CFGGEN -y /etc/sonic/sonic_version.yml -v asic_type` #### How to verify it Manually test.	2021-08-16 07:51:01 +00:00
Sudharsan Dhamal Gopalarathnam	ba2284c4c0	Grouping delayed services under a target for config reload checks (#7846 ) #### Why I did it Create a target for delayed service timers. Few services in sonic have delayed to speed up the bring up of the system and essential services. However there is no way to track when they start. This will be a problem when executing config reload as config reload expects all services to be up. Hence grouped all the timers that trigger the delayed services under one target so that they could be tracked in 'config reload' command #### How I did it Created delay.target service and add created dependency on the delayed targets.	2021-08-16 07:50:56 +00:00
Ying Xie	92fb9c94bd	[aboot] use ram partition for /var/log for devices with 3.7G disks (#8400 ) Master/202012 image size grew quite a bit. 3.7G harddrive can no longer hold one image and safely upgrade to another image. Every bit of harddrive space is precious to save now. Also sh syntax seemingly changed, [ condition ] && action was a legit syntax in 201911 branch but it is an error when condition not met with 202012 or later images. Change the syntax to if statement to avoid the issue. Signed-off-by: Ying Xie ying.xie@microsoft.com	2021-08-14 17:22:01 -07:00
novikauanton	aae4e8dc7c	[build]: Fix bfn package version for reproducible build (#8468 ) Barefoot pipeline is broken, because version has not been update by ci build yet.	2021-08-14 14:27:45 -07:00
Vladyslav Morokhovych	754378f1d8	[swss] Fix arp_update script (#8412 ) Fix #7968 Issue is detected on SONiC.20201231.11 In test_static_route.py::test_static_route_ecmp static routes are configured, but neighbors are not resolved after config reload even after 10 minutes. It looks like the arp_update script is starting to ping when Vlan1000 is not fully configured. When issue is reproduced, stuck ping6 process is observed in swss container : USER PID %CPU %MEM VSZ RSS TTY STAT START TIME COMMAND root 180 0.1 0.0 6296 1272 pts/0 S 17:03 0:03 ping6 -I Vlan1000 -n -q -i 0 -c 1 -W 0 ff02::1 And when arp_update script successfully resolves neighbors, we observe sleep 300 instead of ping process	2021-08-12 23:25:28 -07:00
mssonicbld	1c30a8b0c1	[ci/build]: Upgrade SONiC package versions (#8376 )	2021-08-08 19:05:13 +00:00
Guohan Lu	ceab083fc5	[build]: add sonic_release 202012 Signed-off-by: Guohan Lu <lguohan@gmail.com>	2021-08-07 18:04:28 -07:00
Longxiang Lyu	25f53289eb	[swss][arp_update] Send ipv6 pings over vlan sub interfaces (#8363 ) #### Why I did it * `arp_update` fails to ping those neighbors over vlan sub interfaces. #### How I did it * modify `arp_update_vars.j2` to get vlan sub interfaces with ipv6 addresses assigned. * modify `arp_update` to send ipv6 pings over those retrieved vlan sub interfaces. Signed-off-by: Longxiang Lyu <lolv@microsoft.com>	2021-08-07 12:43:51 +00:00
Guohan Lu	db8cc247e0	[build]: Fix docker pull on armhf platform armhf build uses native dockerd Signed-off-by: Guohan Lu <lguohan@gmail.com>	2021-08-06 23:35:25 -07:00
gechiang	0f3f0c2a1a	[202012] BRCM SAI 4.3.5.1 Fix for TH3 FDB Flush Timeout (#8342 ) This is to pick up BRCM SAI 4.3.5.1 which contains the following fix: CS00012201406: [4.3.3.9] SAI_STATUS_FAILURE on FDB flush after all ports flapped Preliminary tests looks fine. BGP neighbors were all up with proper routes programmed interfaces are all up Manually ran the following test cases on z9332f (TH3) T0 DUT and all passed: ``` ipfwd/test_dir_bcast.py fib/test_fib.py ``` Manually ran the following test cases on S6100 (TH) and all passed: ``` ipfwd/test_dir_bcast.py fdb/test_fdb.py ```	2021-08-05 19:03:06 -07:00
VenkatCisco	cb8ff6dba1	[baseimage]: add j2cli to sonic_debian_extension.j2 (#8019 ) j2cli provides access to jinja library. cisco platform.py requires j2cli to handle jinja template configuration files.	2021-08-05 15:22:57 +00:00
vdahiya12	5e594043ce	[pmon] create and mount firmware directory on PMON for firmware upgrade support on muxcable (#8283 ) This PR creates a directory firmware on the HOST with the path /usr/share/sonic/firmware, as well as this is mounted on PMON container with the same path /usr/share/sonic/firmware. This is required for firmware upgrade support for muxcable as currently by design all Y-Cable API's are called by xcvrd. As such if CLI has to transfer a file to PMON we need to mount a directory from host to PMON just for getting the firmware files. Hence we require this change. Signed-off-by: vaibhav-dahiya <vdahiya@microsoft.com>	2021-08-05 15:22:41 +00:00
mssonicbld	2a41adcdc1	[ci/build]: Upgrade SONiC package versions (#8303 )	2021-08-02 02:21:04 +00:00
mssonicbld	cdbcf7da47	[ci/build]: Upgrade SONiC package versions (#8256 )	2021-07-28 08:44:24 +00:00
mssonicbld	814865b2c5	[ci/build]: Upgrade SONiC package versions (#8206 )	2021-07-24 12:27:11 +00:00
Shilong Liu	6d24cd6c92	Add reproducible build for armhf and arm64 related versions. (#8224 ) * Add version for 202012 branch	2021-07-22 12:31:55 +08:00
Renuka Manavalan	91f611157a	cherry-pick PR #8158 & PR #8205 into 202012 (#8235 )	2021-07-20 20:52:33 -07:00
Shilong Liu	21927ec941	Add some versions for build of sonic-slave-stretch on armhf and arm64 arch. (#8221 )	2021-07-20 11:23:28 +08:00
mssonicbld	763fcd7eeb	[ci/build]: Upgrade SONiC package versions (#8199 )	2021-07-16 14:34:02 +00:00
mssonicbld	b1728187be	[ci/build]: Upgrade SONiC package versions (#8163 )	2021-07-15 14:57:14 +00:00
Blueve	762847c2cf	[port_config] Introduce ad-hoc mport_config.json file (#8066 ) Signed-off-by: Jing Kan jika@microsoft.com	2021-07-15 12:06:47 +00:00
gechiang	514f760793	[202012] BRCM SAI 4.3.3.9 Changes for ISSU support and Dual ToR fixes (#8179 )	2021-07-14 10:36:15 -07:00
Kebo Liu	86d64d2fef	mount 'mellanox' folder only instead of create each sub folder (#7830 ) #### Why I did it Following the discussion in another PR https://github.com/Azure/sonic-buildimage/pull/7708#discussion_r642933510 , since there will be multi subfolders under /var/log/mellanox, so we agreed to only mount this folder and the subfolders will be created afterward on demand. #### How I did it during the syncd docker creation, only mount folder /var/log/mellanox #### How to verify it build an Mellanox image and verify the related folder on the host and docker side.	2021-07-13 11:36:56 +00:00
mssonicbld	18ae4a37d8	[ci/build]: Upgrade SONiC package versions (#8156 )	2021-07-12 14:01:55 +00:00
mssonicbld	a5fe47858a	[ci/build]: Upgrade SONiC package versions (#8152 )	2021-07-11 14:18:53 +00:00
mssonicbld	58b8d5502b	[ci/build]: Upgrade SONiC package versions (#8148 )	2021-07-10 14:37:53 +00:00
mssonicbld	cd5d950c98	[ci/build]: Upgrade SONiC package versions (#8127 )	2021-07-09 13:25:21 +00:00
mssonicbld	3ded393093	[ci/build]: Upgrade SONiC package versions (#8062 )	2021-07-07 13:26:21 +00:00
rajendra-dendukuri	049a23e5a4	[kdump] Fix kdump error message when a reboot is issued (#7985 ) dash doesn't support += operation to append to a variable's value. Use KDUMP_CMDLINE_APPEND="${KDUMP_CMDLINE_APPEND} " instead The below error message is seen when a reboot is issued. [ 342.439096] kdump-tools[13655]: /etc/init.d/kdump-tools: 117: /etc/default/kdump-tools: KDUMP_CMDLINE_APPEND+= panic=10 debug hpet=disable pcie_port=compat pci=nommconf sonic_platform=x86_64-accton_as7326_56x-r0: not found	2021-07-07 09:40:16 +00:00
mssonicbld	de46dc53a3	[ci/build]: Upgrade SONiC package versions (#8054 )	2021-07-04 13:45:42 +00:00
mssonicbld	f44d6cef8b	[ci/build]: Upgrade SONiC package versions (#8051 )	2021-07-03 13:53:50 +00:00
mssonicbld	70cb258b7e	[ci/build]: Upgrade SONiC package versions (#8041 )	2021-07-02 13:52:41 +00:00
Guohan Lu	d3e2983188	Revert "[Kubernetes]: The kube server could be used as http-proxy for docker (#7469 )" This reverts commit `e851a42db7`.	2021-07-01 18:41:21 -07:00
mssonicbld	e611f50b48	[ci/build]: Upgrade SONiC package versions (#7858 ) Upgrade SONiC Versions	2021-07-01 21:42:47 +08:00
xumia	c3018773d7	Fix vtysh shell-ingestion security issue (#7759 ) Fix vtysh shell-ingestion security issue Only expose the limited parameters of the command vtysh show.	2021-06-28 09:38:00 +00:00
gechiang	efb6c1d9cb	[202012][BRCMSAI]Fix two crash issues introduced by SAI 4.3.3.8 (#7979 ) Why I did it There were two regression issues introduced by BRCM SAI 4.3.3.8: CS00012196056 [4.3.3.8][WARMBOOT] syncd[2584]: segfault at 5616ad6c3d80 ip 00007f61e0c6bc65 sp 00007fff0c5a7a90 error 4 in libsai.so.1.0[7f61e0a95000+3cd8000] CS00012195956 [4.3.3.8] [TD3]Syncd Crash at brcm_sai_tnl_mp_create_tunnel() How I did it Patch for CS00012195956 from BRCM was validated to have addressed the tunnel creation issue. Temporary worked around the issue by commenting out a portion of questionable code in BRCM SAI that seems to be the root cause of CS00012196056 . How to verify it See the BRCM cases for details.	2021-06-25 08:06:45 -07:00
gechiang	7e4d42eb88	[202012] Pick up BRCM SAI 4.3.3.8 Changes that fixed several issues (#7918 )	2021-06-20 22:50:32 -07:00
Sujin Kang	d67a5b887f	Support multiple pcie configuration file and change the pcie status table name to match with pcied changes (#7886 ) Why I did it Support multiple pcie configuration file and change the pcie status table name This is to match with below two PRs. Azure/sonic-platform-common#195 Azure/sonic-platform-daemons#189 How I did it Check pcie configuration file with wild card and change the device status table name How to verify it Restart with changes and see if the pcie check works as expected.	2021-06-17 07:09:50 +00:00
Renuka Manavalan	e851a42db7	[Kubernetes]: The kube server could be used as http-proxy for docker (#7469 ) Why I did it The SONiC switches get their docker images from local repo, populated during install with container images pre-built into SONiC FW. With the introduction of kubernetes, new docker images available in remote repo could be deployed. This requires dockerd to be able to pull images from remote repo. Depending on the Switch network domain & config, it may or may not be able to reach the remote repo. In the case where remote repo is unreachable, we could potentially make Kubernetes server to also act as http-proxy. How I did it When admin explicitly enables, the kubernetes-server could be configured as docker-proxy. But any update to docker-proxy has to be via service-conf file environment variable, implying a "service restart docker" is required. But restart of dockerd is vey expensive, as it would restarts all dockers, including database docker. To avoid dockerd restart, pre-configure an http_proxy using an unused IP. When k8s server is enabled to act as http-proxy, an IP table entry would be created to direct all traffic to the configured-unused-proxy-ip to the kubernetes-master IP. This way any update to Kubernetes master config would be just manipulating IPTables, which will be transparent to all modules, until dockerd needs to download from remote repo. How to verify it Configure a switch such that image repo is unreachable Pre-configure dockerd with http_proxy.conf using an unused IP (e.g. 172.16.1.1) Update ctrmgrd.service to invoke ctrmgrd.py with "-p" option. Configure a k8s server, and deploy an image for feature with set_owner="kube" Check if switch could successfully download the image or not.	2021-06-17 07:09:50 +00:00
gechiang	341e15b620	[202012] Bring in BRCM SAI changes from SAI 4.3.3.7 (#7850 )	2021-06-14 17:52:35 -07:00
mssonicbld	99b03cff45	[ci/build]: Upgrade SONiC package versions (#7856 )	2021-06-12 14:22:14 +00:00
mssonicbld	b5551f044e	[ci/build]: Upgrade SONiC package versions (#7805 )	2021-06-11 12:58:22 +00:00
yozhao101	fb2c995f53	[202012][Monit] Deprecate the feature of monitoring the critical processes by Monit (#7823 ) Signed-off-by: Yong Zhao yozhao@microsoft.com Why I did it Currently we leveraged the Supervisor to monitor the running status of critical processes in each container and it is more reliable and flexible than doing the monitoring by Monit. So we removed the functionality of monitoring the critical processes by Monit. How I did it I removed the script process_checker and corresponding Monit configuration entries of critical processes. How to verify it I verified this on the device str-7260cx3-acs-1.	2021-06-09 09:04:22 -07:00
Renuka Manavalan	32e5137ab7	Add service to restore TACACS from old config (#7560 ) Why I did it In upgrade scenarios, where config_db.json is not carry forwarded to new image, it could be left w/o TACACS credentials. Added a service to trigger 5 minutes after boot and restore TACACS, if /etc/sonic/old_config/tacacs.json is present. How I did it By adding a service, that would fire 5 mins after boot. This service apply tacacs if available. How to verify it Upgrade and watch status of tacacs.timer & tacacs.service You may create /etc/sonic/old_config/tacacs.json, with updated credentials (before 5mins after boot) and see that appears in config & persisted too. Which release branch to backport (provide reason below if selected) 201911 202006 202012	2021-06-07 06:02:32 +00:00
mssonicbld	1e9cb30008	[ci/build]: Upgrade SONiC package versions (#7770 )	2021-06-05 14:03:04 +00:00
mssonicbld	eddce4d58b	[ci/build]: Upgrade SONiC package versions (#7755 )	2021-05-31 13:02:58 +00:00
yozhao101	3af05fdffe	[Monit] Restart telemetry container if memory usage is beyond the threshold (#7645 ) Signed-off-by: Yong Zhao yozhao@microsoft.com Why I did it This PR aims to monitor the memory usage of streaming telemetry container and restart streaming telemetry container if memory usage is larger than the pre-defined threshold. How I did it I borrowed the system tool Monit to run a script memory_checker which will periodically check the memory usage of streaming telemetry container. If the memory usage of telemetry container is larger than the pre-defined threshold for 10 times during 20 cycles, then an alerting message will be written into syslog and at the same time Monit will run the script restart_service to restart the streaming telemetry container. How to verify it I verified this implementation on device str-7260cx3-acs-1.	2021-05-31 04:38:18 +00:00
mssonicbld	71d4b17ad0	[ci/build]: Upgrade SONiC package versions (#7746 )	2021-05-29 14:22:37 +00:00
Prince Sunny	7a816ed5fa	[Mux] Do not clean-up HW_MUX_CABLE_TABLE from State DB (#7710 ) Co-authored-by: Ubuntu <prsunny@prince-vm.vzw1i4tqyeburcdz5lrgulxi2c.yx.internal.cloudapp.net>	2021-05-27 22:30:06 +00:00
Lawrence Lee	cb3a9eec58	[swss.service]: Remove ordering with pmon (#7614 ) Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2021-05-27 22:29:35 +00:00
Alexander Allen	bd6096a018	[ntp] Fix ntp.conf template to allow setting of source port in CONFIG_DB (#7586 ) Why I did it Currently, there is a bug in the ntp.conf jinja2 template where it will ignore the src_intf directive in CONFIG_DB if there are multiple IP addresses associated with an interface. This code change fixes that bug and allows the template to select the correct source interface for NTP. How I did it I did this by modifying the macro in ntp.conf.j2 which determines if there is an ip address associated with an interface to set a state variable when it detects a valid interface entry in CONFIG_DB instead of outputting "true" directly (which could result in multiple "trues" outputted for interfaces with multiple valid IP addresses). How to verify it Add two ipv4 addresses to an interface in SONiC Add the following configuration to config_db.json { "NTP": { "global": { "src_intf": "Ethernet1" } } } Replace Ethernet1 with the interface name of the one you assigned the IP addresses to. Run sudo config reload -y Open /etc/ntp.conf and verify that the following line exists ... interface listen Ethernet1 ... The interface specified should be the one set in the previous steps. Description for the changelog [ntp] Fix ntp.conf template to allow setting of source port in CONFIG_DB	2021-05-27 22:29:01 +00:00
Renuka Manavalan	53b3d378c7	Invoke disk check periodically. (#7374 ) Why I did it Helps with periodic scan of disk for RO state. If found, this script makes transient fix and raise error message.	2021-05-27 22:28:44 +00:00
gechiang	5d29a2c2a6	[202012] BRCM SAI 4.3.3.5-3 Enable VFP-based subintf on td2 (#7728 ) Why I did it This SAI change is to Enable TD2 platforms to also be able to handle VFP-based sub-interfaces. This is the feature that is also needed for TD2 platforms. How to verify it Guohan and Team has validated this feature change on TD2 platforms	2021-05-27 15:11:03 -07:00
mssonicbld	5b5131f26d	[ci/build]: Upgrade SONiC package versions (#7701 )	2021-05-26 13:58:58 +00:00
shlomibitton	c53f58e488	Remove 'vm.panic_on_oom=1' (#7678 ) #### Why I did it If a process limits using nodes by mempolicy/cpusets, and those nodes become memory exhaustion status, one process may be killed by oom-killer. No panic occurs in this case, because other node's memory may be free. This means system total status may be not fatal yet. #### How I did it Remove 'vm.panic_on_oom=1' kernel flag from 'vmcore-sysctl.conf '	2021-05-26 02:41:02 +00:00
Neetha John	cb930c30cd	[qos]: modify dot1p to tc mapping (#7661 ) Map priority 0 to TC 1 and priority 1 to TC 0 Send traffic on priority 0 and 1 and verified that it gets mapped correctly in hw Signed-off-by: Neetha John <nejo@microsoft.com>	2021-05-24 22:25:47 +00:00
mssonicbld	a9af05d1a7	[ci/build]: Upgrade SONiC package versions (#7686 )	2021-05-24 13:02:24 +00:00
mssonicbld	460e150187	[ci/build]: Upgrade SONiC package versions (#7664 )	2021-05-22 13:14:06 +00:00
mssonicbld	839527def8	[ci/build]: Upgrade SONiC package versions (#7659 )	2021-05-20 14:05:56 +00:00
Sujin Kang	d1043e3c91	add config-setup.service as dependency for pcie-check.service (#7599 ) Why I did it start pcie-check.service after config-setup.service since pcie_util depends on device_info which is available with config db metadata. How I did it Add config-setup.service as a dependency of pcie-check.service How to verify it Upon reboot, check if the pcie-check.sh throws the platform api error which is dependent on DEVICE_METADATA	2021-05-19 18:14:19 +00:00
mssonicbld	1461c9f98f	[ci/build]: Upgrade SONiC package versions (#7613 )	2021-05-19 14:02:10 +00:00
gechiang	3eaadec21c	[202012] BRCM SAI 4.3.3.5-2 picked up BRCM hsdk-6.5.21-p1.patch and CS00012097141 fix (#7581 ) This is to pick up BRCM SAI 4.3.3.5-2 which contains 2 main changes: 1. hsdk-6.5.21-p1.patch (to address some field problems related to SDK issue. 2. Fix for CS00012097141 (remove some unnecessary debug setting that was causing non functional impacting problem at boot time) Preliminary tests looks fine. BGP neighbors were all up with proper routes programmed interfaces are all up Manually ran the following test cases on S6100 DUT and all passed: ``` ipfwd/test_dir_bcast.py fib/test_fib.py vxlan/test_vxlan_decap.py decap/test_decap.py fdb/test_fdb.py ```	2021-05-12 07:56:46 -07:00
mssonicbld	c8ee27319b	[ci/build]: Upgrade SONiC package versions (#7589 )	2021-05-12 14:11:06 +00:00
mssonicbld	aac5c04d91	[ci/build]: Upgrade SONiC package versions (#7583 )	2021-05-12 07:19:54 +00:00
Qi Luo	262d0948f1	[build]: Update the netifaces pip3 package version (#7574 ) 202012 branch is using reproducible build. The versions-py3 file must contains correct pip3 package version. Otherwise we will get a build error ``` Step 31/74 : RUN pip3 install scapy==2.4.4 pyroute2==0.5.14 netifaces==0.10.9 ---> Running in d7a2401dd21d Collecting scapy==2.4.4 Downloading scapy-2.4.4.tar.gz (1.0 MB) Collecting pyroute2==0.5.14 Downloading pyroute2-0.5.14.tar.gz (873 kB) ERROR: Cannot install netifaces==0.10.9 because these package versions have conflicting dependencies. The conflict is caused by: The user requested netifaces==0.10.9 The user requested (constraint) netifaces==0.10.7 To fix this you could try to: 1. loosen the range of package versions you've specified 2. remove package versions to allow pip attempt to solve the dependency conflict ERROR: ResolutionImpossible: for help visit https://pip.pypa.io/en/latest/user_guide/#fixing-conflicting-dependencies The command '/bin/sh -c pip3 install scapy==2.4.4 pyroute2==0.5.14 netifaces==0.10.9' returned a non-zero code: 1 [ FAIL LOG END ] [ target/docker-sonic-vs.gz ] ```	2021-05-11 09:19:52 -07:00
mssonicbld	a3980f948a	[ci/build]: Upgrade SONiC package versions (#7576 )	2021-05-11 14:35:57 +00:00
Renuka Manavalan	99958304c1	[container_checker] Use Feature table to get running containers (#7474 ) Why I did it Finding running containers through "docker ps" breaks when kubernetes deploys container, as the names are mangled. How I did it The data is is available from FEATURE table, which takes care of kubernetes deployment too. How to verify it Deploy a feature via kubernetes and don't expect error from container_check.	2021-05-10 15:59:57 -07:00
mssonicbld	b95f86722c	[ci/build]: Upgrade SONiC package versions (#7540 )	2021-05-09 04:44:39 +00:00
mssonicbld	b0460c7ce1	[ci/build]: Upgrade SONiC package versions (#7480 ) Co-authored-by: mssonicbld <vsts@fv-az196-264.g5lmldmxpfterdw5iojffagm3c.gx.internal.cloudapp.net>	2021-05-06 06:50:02 +08:00
Nazarii Hnydyn	0e970582c1	[swss_vars]: Add 'resource_type' attribute. (#7188 ) Signed-off-by: Nazarii Hnydyn <nazariig@nvidia.com>	2021-05-03 10:38:11 -07:00
guxianghong	a0fde3a626	[arm] support compile sonic arm image on arm server (#7285 ) - Support compile sonic arm image on arm server. If arm image compiling is executed on arm server instead of using qemu mode on x86 server, compile time can be saved significantly. - Add kernel argument systemd.unified_cgroup_hierarchy=0 for upgrade systemd to version 247, according to #7228 - rename multiarch docker to sonic-slave-${distro}-march-${arch} Co-authored-by: Xianghong Gu <xgu@centecnetworks.com> Co-authored-by: Shi Lei <shil@centecnetworks.com>	2021-05-02 08:11:56 -07:00
Samuel Angebault	30cc959787	[Arista] Fix dockerd issue on Arista platforms (#7376 ) Why I did it Recent systemd upgrade from #7228 requires an extra cmdline parameter for dockerd to start properly. Updating boot0 was missed as part of the systemd upgrade change. How I did it Just added the missing cmdline parameter in files/Aboot/boot0.j2 This change fixes #7372 How to verify it Boot the image and dockerd should start normally.	2021-05-01 19:43:51 -07:00
Stepan Blyshchak	ae574ab000	[systemd] disable default systemd udev rules for interfaces (#7369 ) Fix #7364 99-default.link - was always in SONiC, but previous systemd (<247) had an issue and it did not work due to issue systemd/systemd#3374. Now systemd 247 works. However, such policy overrides teamd provided mac address which causes teamd netdev to use a random mac address. Therefore, needs to be disabled. Signed-off-by: Stepan Blyschak <stepanb@nvidia.com>	2021-05-01 19:43:41 -07:00
xumia	1b05982727	Support readonly vtysh for sudoers (#7383 ) Why I did it Support readonly version of the command vtysh How I did it Check if the command starting with "show", and verify only contains single command in script.	2021-04-29 10:08:55 -07:00
mssonicbld	f7adf9c180	[ci/build]: Upgrade SONiC package versions (#7415 )	2021-04-24 15:17:07 +00:00
Kuanyu Chen	66dedf38c2	[config-setup]: Fix a bug in checking if updategraph is enabled (#7093 ) Encounter error during "config-setup boot" if the updategraph is enabled. How I did it Correct the code inside the config-setup script. Remove the space between the assignment operator. How to verify it Remove the /etc/sonic/config_db.json and reboot the device. Originally, it will return following error after boot up. rv: command not found After modification, it can correctly parse the status of updategraph without error.	2021-04-21 13:58:03 -07:00
yozhao101	c63b59698c	[container_checker] Exclude the 'always_disabled' container from expected running container list (#7217 ) Signed-off-by: Yong Zhao yozhao@microsoft.com Why I did it Since we introduced a new value always_disabled for the state field in FEATURE table, the expected running container list should exclude the always_diabled containers. This bug was found by nightly test and posted at here: issue. This PR fixes #7210. How I did it I added a logic condition to decide whether the value of state field of a container was always_disabled or not. How to verify it I verified this on the device str-dx010-acs-1. Which release branch to backport (provide reason below if selected) 201811 201911 202006 [ x] 202012	2021-04-02 11:52:35 -07:00
mssonicbld	505db8e91a	[ci/build]: Upgrade SONiC package versions (#7042 ) Co-authored-by: mssonicbld <vsts@fv-az80-884.nqsemdo0cabejmrqkclmmohwag.dx.internal.cloudapp.net>	2021-03-29 08:07:44 -07:00
Tamer Ahmed	6fb1600234	[hostcfgd]: Add Ability To Configure Feature During Run-time (#6700 ) Features may be enabled/disabled for the same topology based on run-time configuration. This PR adds the ability to enable/disable feature based on config db data. signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2021-03-15 19:09:31 -07:00
arlakshm	cc6e521b40	[baseimage] add ipintutil in sudoer file (#6845 ) show ip interfaces is enhanced recently to support multi ASIC platforms in this PR- https://github.com/Azure/sonic-utilities/pull/1396 . The ipintutil script as to run as sudo user, to get the ip interface from each namespace. Add this script to the sudoer file so that show ip interface command is available for user with read-only permissions Signed-off-by: Arvindsrinivasan Lakshmi Narasimhan <arlakshm@microsoft.com>	2021-03-13 23:29:24 -08:00
mssonicbld	77477857b4	Update SONiC version files (#6996 ) Co-authored-by: mssonicbld <vsts@fv-az113-375.lunlmptkugju1kgiw3yhqmpbea.bx.internal.cloudapp.net>	2021-03-12 10:01:12 +08:00
Renuka Manavalan	a4d81f3c19	Copy dummy flannel.conf to get around absence of CNI Network (#6985 ) Why I did it We skip install of CNI plugin, as we don't need. But this leaves node in "not ready" state, upon joining master. To fix, we copy this dummy .conf file in /etc/cni/net.d How I did it Keep this file in /usr/share/sonic/templates and copy to /etc/cni/net.d upon joining k8s master. How to verify it Upon configuring master-IP and enable join, watch node join and move to ready state. You may verify using kubectl get nodes command	2021-03-10 09:32:49 -08:00
mssonicbld	0830738503	Update SONiC version files (#6972 ) Co-authored-by: mssonicbld <vsts@fv-az131-135.jj2e24u0tnvezfdztknplege1f.xx.internal.cloudapp.net>	2021-03-08 14:55:35 -08:00
mssonicbld	57085e4a6a	Update SONiC version files (#6963 ) Co-authored-by: mssonicbld <vsts@fv-az124-394.1jx3ho342nguppyzzg0wtvoj2f.bx.internal.cloudapp.net>	2021-03-05 13:45:16 +08:00
yozhao101	7748597fa2	[Supervisord] Deduplicate the alerting messages of critical processes from Supervisord. (#6849 ) Signed-off-by: Yong Zhao yozhao@microsoft.com Why I did it In the configuration of rsyslog, duplicate messages will be suppressed and reported in the format of message repeated n times. Due to this behavior, if a critical process in a container exited unexpectedly, the alerting message will be written into syslog once and not be written into syslog anymore until the second critical process exited. This PR aims to differentiate these alerting messages such that they will not be suppressed by rsyslogd and can appear in the syslog periodically. How I did it This PR adds a counter into the alerting message and shows how many minutes a critical process was not running. How to verify it I verified and test this implementation on a physical DUT.	2021-03-04 21:23:05 +00:00
Sujin Kang	15aed52ef2	[pcie.yaml] Move pcie configuration file path to platform directory (#6475 ) - Why I did it The pcie configuration file location is under plugin directory not under platform directory. #6437 - How I did it Move all pcie.yaml configuration file from plugin to platform directory. Remove unnecessary timer to start pcie-check.service Move pcie-check.service to sonic-host-services - How to verify it Verify on the device	2021-03-04 21:23:05 +00:00
Stepan Blyshchak	7fb5a72d23	[services] introduce sonic.target (#5705 ) - Why I did it Group all SONiC services together and able to manage them together. Will be used in config reload command as much simpler and generic way to restart services. - How I did it Add services to sonic.target - How to verify it Together with Azure/sonic-utilities#1199 config reload -y Signed-off-by: Stepan Blyshchak <stepanb@nvidia.com>	2021-03-04 21:23:05 +00:00
mssonicbld	193fc248a0	Update SONiC version files (#6927 ) Co-authored-by: mssonicbld <vsts@fv-az95-714.mvvw4rc1ki0utgz3kiduxkzutd.ex.internal.cloudapp.net>	2021-03-03 19:05:52 +08:00
dflynn-Nokia	e3ab6b0494	[armhf build] Fix azure-storage dependency on cryptography package (#6780 ) Fix marvell-armhf build break The azure-storage package depends on the cryptography package. Newer versions of cryptography require the rust compiler, the correct version for which is not readily available in buster. Hence we pre-install an older version here to satisfy the azure-storage dependency. Note: This is not a problem for other architectures as pre-built versions of cryptography are available for those. This sequence can be removed after upgrading to debian bullseye.	2021-03-01 09:40:00 -08:00
Samuel Angebault	a654518968	[Arista] Driver and platform update (#6468 ) (#6872 ) - Add support for `DCS-7050SX3-48YC8` and `DCS-7050SX3-48C8` platform - Add support for more variants of `DCS-7280CR3-32[PD]4` - Add Supervisor to Linecard consutil support - Complete Watchdog platform API support - Fix some PSU behavior on `DCS-7050QX-32` and `DCS-7060CX-32S` - Fix SEU management on `DCS-7060CX-32S` - Allow kernel modules to build up to linux 5.10 - Rename led color `orange` to `amber` - Miscellaneous fixes	2021-02-24 10:09:52 -08:00
SuvarnaMeenakshi	b6aaeb979e	[multi_asic][vs]: Add dependency in teamd service to start after topology service(#6594 ) [multi_asic][vs]: Add dependency in teamd service to start after topology service. - Why I did it In multi-asic VS, topology service is run after database service to set up the internal asic topology. swss and syncd have a dependency to start after topology service is run so that the interfaces are moved to right namespace and created in the right namespace. In case of multi-asic vs, during the initial boot up, when there is no configuration added, teamd service starts and swss/syncd do not start as topology service does not start. Upon loading configuration using config_db or minigraph, swss and sycnd start up , but teamd is not restarted as swss is not stopped and started. This causes teamd to be in a bad state and requires a reload of config. - How I did it Add dependency in teamd service to start after topology service is completed. - How to verify it No change in single asic vs or platform. No change in multi-asic regular image. Change only in multi-asic VS. Bring up a multi-asic VS image without any configration, teamd service will fail to start due to dependency failure. Load minigraph, start topology service, load configuration, ensure all services come up. Signed-off-by: SuvarnaMeenakshi <sumeenak@microsoft.com>	2021-02-23 23:56:01 +00:00
arlakshm	d7be5a021a	[Multi Asic] support of swss.rec and sairedis.rec for multi asic (#6310 ) Signed-off-by: Arvindsrinivasan Lakshmi Narasimhan arlakshm@microsoft.com - Why I did it This PR has the changes to support having different swss.rec and sairedis.rec for each asic. The logrotate script is updated as well - How I did it Update the orchagent.sh script to use the logfile name options in these PRs(Azure/sonic-swss#1546 and Azure/sonic-sairedis#747) In multi asic platforms the record files will be different for each asic, with the format swss.asic{x}.rec and sairedis.asic{x}.rec Update the logrotate script for multiasic platform .	2021-02-23 23:56:01 +00:00
Joe LeVeque	d7517a704c	[PDDF] Build and install Python 3 package (#6286 ) - Make PDDF code compliant with both Python 2 and Python 3 - Align code with PEP8 standards using autopep8 - Build and install both Python 2 and Python 3 PDDF packages	2021-02-23 23:56:01 +00:00
xumia	7c00106817	Add the version files for the branch 202012 (#6836 )	2021-02-23 10:36:32 +08:00
xumia	fa7d636226	Add mirrors for reproducible build (#6813 )	2021-02-19 11:51:34 -08:00
shlomibitton	6361d36fb2	Stop teamd service before syncd (#6755 ) - What I did All SWSS dependent services should stop before SWSS service to avoid future possible issues. For example 'teamd' service will stop before to allow the driver unload netdev gracefully. This is to stop all LAG's before restarting syncd service when running 'config reload' command. - How I did it Change the order of dependent services of SWSS. - How to verify it Run 'config reload' command. Previously the operation failed when a large number of PortChannel configured on the system. Signed-off-by: Shlomi Bitton <shlomibi@nvidia.com>	2021-02-16 15:33:10 -08:00
Lawrence Lee	e0efbc1e14	[swss]: Clear MUX-related state DB tables on start (#6759 ) * Add MUX_CABLE_TABLE to set of tables to clear on SWSS start, which will clear HW_MUX_CABLE_TABLE and MUX_CABLE_TABLE * Order swss to start before pmon to ensure that DBs are cleared before xcvrd (running inside pmon) starts and re-populates the tables Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2021-02-16 15:33:03 -08:00
Lior Avramov	80b048d4d1	[systemd] Increase syncd startup script timeout to support FW upgrade on init. (#6709 ) - Why I did it To support FW upgrade on init. - How I did it Change timeout value - How to verify it I manually changed ASIC and Gearbox FW followed by hard reset in order for FW upgrade to take place on init. Signed-off-by: liora <liora@nvidia.com>	2021-02-16 15:31:10 -08:00
judyjoseph	0c17839908	[teamd]: Increase wait timeout for teamd docker stop to clean Port channels. (#6537 ) The Portchannels were not getting cleaned up as the cleanup activity was taking more than 10 secs which is default docker timeout after which a SIGKILL will be send. Fixes #6199 To check if it works out for this issue in 201911 ? #6503 This issue is significantly seen in master branch compared to 201911 because the Portchannel cleanup takes more time in master. Test on a DUT with 8 Port Channels. master admin@str-s6000-acs-8:~$ time sudo systemctl stop teamd real 0m15.599s user 0m0.061s sys 0m0.038s Sonic 201911.v58 admin@str-s6000-acs-8:~$ time sudo systemctl stop teamd real 0m5.541s user 0m0.020s sys 0m0.028s	2021-02-05 16:22:28 -08:00
Joe LeVeque	57a6fb9f39	[pcie-check] Update underlying pcieutil command and add to sudoers file (#6682 ) - Why I did it As of Azure/sonic-utilities#1297, subcommands of pcieutil have changed to remove the redundant pcie- prefix. This PR adapts calling applications (pcie-check) to the new syntax. Resolves #6676 - How I did it Remove pcie- prefix from pcieutil subcommands in calling applications Also add pcieutil * to sudoers file, as pcieutil requires elevated permissions	2021-02-05 15:47:58 -08:00
Guohan Lu	bab136fc8f	[proc-exit-listener]: fix syntax error the bug is introduced in commit `34cca20c` Signed-off-by: Guohan Lu <lguohan@gmail.com>	2021-02-03 10:46:07 -08:00
arlakshm	24d785a64d	[baseimage]: add docker ps to the sudoer file (#6604 ) fixes Azure/sonic-utilities#1389 With the recent changes in sudoer files. The show commands fails for the read-only users. The problem here is the 'docker ps' is failing in the function [get_routing_stack()](`8a1109ed30/show/main.py (L54)`) therefore all the CLI commands are failing. Signed-off-by: Arvindsrinivasan Lakshmi Narasimhan <arlakshm@microsoft.com>	2021-02-03 10:39:00 -08:00
arlakshm	197f75a246	[multi asic] add ip netns identify command to sudoer (#6591 ) Signed-off-by: Arvindsrinivasan Lakshmi Narasimhan <arlakshm@microsoft.com> - Why I did it The command sudo ip netns identify <pid> is used in function get_current_namespace to check in the cli command is running in host context or within a namespace. This function is used for every CLI command and command sudo ip netns identify <pid> needs to be added in sudoer files to allow users with RO access to run show cli commands This problem is not there on single asic platforms. - How I did it Add ip netns identify [0-9]* to sudoers file.	2021-02-03 10:38:24 -08:00
Guohan Lu	f00bb52f7c	[proc-exit-listener]: ignore blank lines make proc-exit-listener more rebust Signed-off-by: Guohan Lu <lguohan@gmail.com>	2021-01-28 09:28:52 -08:00
yozhao101	cc9c3f567e	[supervisord] Monitoring the critical processes with supervisord. (#6242 ) - Why I did it Initially, we used Monit to monitor critical processes in each container. If one of critical processes was not running or crashed due to some reasons, then Monit will write an alerting message into syslog periodically. If we add a new process in a container, the corresponding Monti configuration file will also need to update. It is a little hard for maintenance. Currently we employed event listener of Supervisod to do this monitoring. Since processes in each container are managed by Supervisord, we can only focus on the logic of monitoring. - How I did it We borrowed the event listener of Supervisord to monitor critical processes in containers. The event listener will take following steps if it was notified one of critical processes exited unexpectedly: The event listener will first check whether the auto-restart mechanism was enabled for this container or not. If auto-restart mechanism was enabled, event listener will kill the Supervisord process, which should cause the container to exit and subsequently get restarted. If auto-restart mechanism was not enabled for this contianer, the event listener will enter a loop which will first sleep 1 minute and then check whether the process is running. If yes, the event listener exits. If no, an alerting message will be written into syslog. - How to verify it First, we need checked whether the auto-restart mechanism of a container was enabled or not by running the command show feature status. If enabled, one critical process should be selected and killed manually, then we need check whether the container will be restarted or not. Second, we can disable the auto-restart mechanism if it was enabled at step 1 by running the commnad sudo config feature autorestart <container_name> disabled. Then one critical process should be selected and killed. After that, we will see the alerting message which will appear in the syslog every 1 minute. - Which release branch to backport (provide reason below if selected) 201811 201911 [x ] 202006	2021-01-28 09:28:27 -08:00
Qi Luo	c5b7370a8f	[baseimage]: Cleanup sudoers file (#6518 )	2021-01-21 08:41:23 -08:00
Ying Xie	a1951ea198	[warm boot finalizer] only wait for enabled components to reconcile (#6454 ) * [warm boot finalizer] only wait for enabled components to reconcile Define the component with its associated service. Only wait for components that have associated service enabled to reconcile during warm reboot. Signed-off-by: Ying Xie <ying.xie@microsoft.com>	2021-01-15 08:20:28 -08:00
yozhao101	bfec282a82	[Monit] Monitoring the running status of containers. (#6251 ) - Why I did it This PR aims to monitor the running status of each container. Currently the auto-restart feature was enabled. If a critical process exited unexpected, the container will be restarted. If the container was restarted 3 times during 20 minutes, then it will not run anymore unless we cleared the flag using the command `sudo systemctl reset-failed <container_name>` manually. - How I did it We will employ Monit to monitor a script. This script will generate the expected running container list and compare it with the current running containers. If there are containers which were expected to run but were not running, then an alerting message will be written into syslog. - How to verify it I tested this feature on a lab device `str-a7050-acs-3` which has single ASIC and `str2-n3164-acs-3` which has a Multi-ASIC. First I manually stopped a container by running the command `sudo systemctl stop <container_name>`, then I checked whether there was an alerting message in the syslog. Signed-off-by: Yong Zhao <yozhao@microsoft.com>	2021-01-09 08:27:53 -08:00
Renuka Manavalan	1bdefd16fa	Take a copy of existing TACACS credentials and restore it during upgrade (#6285 ) In scenario where upgrade gets config from minigraph, it could miss tacacs credentials as they are not in minigraph. Hence restore explicitly upon load-minigraph, if present. - Why I did it Upon boot, when config migration is required, the switch could load config from minigraph. The config-load from minigraph would wipe off TACACS key and disable login via TACACS, which would disable all remote user access. This change, would re-configure the TACACS if there is a saved copy available. - How I did it When config is loaded from minigraph, look for a TACACS credentials back up (tacacs.json) under /etc/sonic/old_config. If present, load the credentials into running config, before config-save is called. - How to verify it Remove /etc/sonic/config_db.json and do an image update. Upon reboot, w/o this change, you would not be able ssh in as remote user. You may login as admin and check out, "show tacacs" & "show aaa" to verify that tacacs-key is missing and login is not enabled for tacacs. With this change applied, remove /etc/sonic/config_db.json, but save tacacs & aaa credentials as tacacs.json in /etc/sonic/. Upon reboot, you should see remote user access possible.	2021-01-09 08:27:41 -08:00
Akhilesh Samineni	46c2bf0ed4	After first bootup, the FEATURE table is not present in CONFIG_DB (#5911 ) Fix the After first bootup(onie-install), the FEATURE table is not present in CONFIG_DB. Fix is done by calling config reload.	2021-01-06 06:19:21 -08:00
Joe LeVeque	566ea4f601	[system-health] Convert to Python 3 (#5886 ) - Convert system-health scripts to Python 3 - Build and install system-health as a Python 3 wheel - Also convert newlines from DOS to UNIX	2020-12-29 14:04:09 -08:00
Joe LeVeque	62662acbd5	No longer install some unnecessary Python 2 packages in host (#6301 ) - No longer install Python 2 packages in host: - libpython2.7-dev - docker - ipaddress - netifaces - azure-storage - watchdog - futures - Install Python 3 versions of the following packages in host: - docker - azure-storage - watchdog - redis - swsssdk (install unconditionally)	2020-12-29 13:02:11 -08:00

... 2 3 4 5 6 ...

1100 Commits