Commit Graph

5473 Commits

Author SHA1 Message Date
Jing Zhang
088d9923d9
[202012][sonic-linkmgrd] Submodule Update (#11726)
fc5d424 Jing Zhang Fri Aug 12 14:39:59 2022 -0700 [202012] Cherry-pick flaky unit test fixes (#115)
faceb93 Jing Zhang Thu Aug 11 10:03:05 2022 -0700 Backoff mux probing for server down scenario (#106)
86ddd95 Jing Zhang Fri Aug 12 14:21:37 2022 -0700 Fix race condition caused by strand wrap method (#104) (#110)
f68a03e Jing Zhang Thu Aug 11 15:31:22 2022 -0700 [lgtm]: add uuid-dev to lgtm prepare (#112)

sign-off: Jing Zhang zhangjing@microsoft.com
2022-08-15 09:14:44 -07:00
vdahiya12
fb7c962fbc
[202012][sonic-platorm-common] submodule update (#11712)
Signed-off-by: vaibhav-dahiya vdahiya@microsoft.com
the following commit is added in sonic-platform-common

fd8cf3a (HEAD -> 202012, origin/202012) [Credo][Ycable] fix incorrect uart statistics (#296)

Signed-off-by: vaibhav-dahiya <vdahiya@microsoft.com>
2022-08-12 14:07:33 -07:00
Sumukha Tumkur Vani
5d1b052e4f
[RESTAPI] Update submodule (#11686) 2022-08-11 17:17:35 -07:00
StormLiangMS
00d512577e [bgp-cfgd] BGP allow list enhancement (#11586)
Why I did it
2 things are missing in current allow-prefix list implementation.

In some usecase, need to tell the BGP neighbor and have different allow-prefix list for different neighbors, which is not supported.
for the prefix list, can't support flexible le and ge.
How I did it
To enhance the bgp allow-prefix list feature to have:

To include the neighbor type info for the allow-prefix list.
To support flexible le and ge length for allow-prefix list.
How to verify it
4 new unit test cases are added in this PR to cover changes.
2022-08-11 20:38:35 +00:00
Stepan Blyshchak
8ab448a852 [swss.sh/syncd.sh] Trap only on EXIT (#11590)
When using trap on SIGTERM the script will not react to the SIGTERM signal sent while a child is executing.
I.e, the following script does not react on SIGTERM sent to it if it is
waiting for sleep to finish:

```

trap "echo Handled SIGTERM" 0 2 3 15

echo "Before sleep"
sleep inf
echo "After sleep"
```

Instead, trap only on EXIT which covers also a scenario with exit on
SIGINT, SIGTERM.

Signed-off-by: Stepan Blyschak <stepanb@nvidia.com>
2022-08-11 20:38:20 +00:00
Sumukha Tumkur Vani
cd480810d7 [RESTAPI] Update submodule (#11674) 2022-08-11 20:38:15 +00:00
Jing Zhang
ff4f160570
[202012][sonic-linkmgrd] Submodule Update (#11514)
[202012][sonic-linkmgrd] Submodule Update

d3f5de1 Jing Zhang Tue Aug 9 10:14:48 2022 -0700 Fix build failure introduced by commit 847c1e8 (#105)
847c1e8 Jing Zhang Mon Aug 8 10:36:18 2022 -0700 [Active-Standby]Remove unnecessary handleMuxWaitTimeout logs (#100)
1fdf0e6 Jing Zhang Thu Jul 21 10:33:45 2022 -0700 Add support for reconciliation after warm restart (#76) (#97)

sign-off: Jing Zhang zhangjing@microsoft.com
2022-08-11 10:37:34 -07:00
vdahiya12
feab02df59
[202012][sonic-utilities][sonic-platform-daemons] submodule update (#11684)
The follwing commit is added in sonic-utilities
b034f0c (HEAD -> 202012, origin/202012) [config][muxcable] add support
to enable/disable ycable telemetry (#2… (#2304)

The follwing commit is added in sonic-platform-daemons
978667c (HEAD -> 202012, origin/202012) [ycabled] add capability to
enable/disable telemetry (#279) (#280

Signed-off-by: vaibhav-dahiya <vdahiya@microsoft.com>

Signed-off-by: vaibhav-dahiya <vdahiya@microsoft.com>
2022-08-11 09:55:15 -07:00
zitingguo-ms
5b5bd5e818
[202012 BRCM SAI 4.3.7.0] Pick up fixes and make up BRCM SAI version to 4.3.7.0 (#11681)
Pick upfollowing fixes and update BRCM SAI to 4.3.7.0:

CS00012208537: Add back previous commit 54c5bc4848eb748
CS00012253061,SONIC-63280: WB from 3.5 to 4.3, followed by WB to 4.3
CS00012207978: SDK-296517, time spent for SAI operations
CS00012245601,SONIC-62898: Egress ACL Counted ad Interface TX drops
Update pcbb with Fixes for CS00012243699
Upgrade on pcbb with Fixes for KB0025353, CS00012221689, CS00012221688, KB0025391, CS00012230519
commit of "CS00012221688:PFC frames egressing, PFC storm happens simultaneously on 2 ports" is purposely skipped to be picked up later due to SWSS dependency not ready.
Why I did it
How I did it
How to verify it
Tested build target, successful

Manually run these tests after installing sai binary within image 20201231.73 on 7050CX3 (TD3) T0 DUT, all passed.

vxlan/test_vxlan_decap.py
fdb/test_fdb.py
pfcwd/test_pfcwd_all_port_storm.py
acl/null_route/test_null_route_helper.py
acl/test_acl.py
vlan/test_vlan.py
platform_tests/test_reboot.py


Signed-off-by: zitingguo-ms <zitingguo@microsoft.com>
2022-08-10 15:02:47 -07:00
Liu Shilong
9b952945ca
[actions] Fix automerge actions don't work issue. (#11671) 2022-08-10 11:53:58 +08:00
Jing Zhang
ffd9e190e1 Update WARM START FINALIZER to wait for linkmgrd to reconcile (#11477)
Spanning from sonic-net/sonic-linkmgrd#76, this PR is to update warm restart finalizer to wait for linkmgrd to be reconciled.

sign-off: Jing Zhang zhangjing@microsoft.com

Why I did it
To make sure finalizer save config after linkmgrd's reconciliation.

How I did it
Add linkmgrd to the reconciliation wait list of warmboot finalizer.

How to verify it
Verified on lab device, linkmgrd reconciled as expected.
2022-08-09 21:05:12 +00:00
Ikki Zhu
cf12aa549a [hlx/sfp] fix hlx platform sfp+ tx disable issue (#11532)
Why I did it:
To fix hlx platform sfp+ module tx disable issue

How I did it:
Fix sfp+ tx disable function according SFF-8472 specification

Co-authored-by: Eric Zhu <erzhu@celestica.com>
2022-08-09 21:05:08 +00:00
xumia
1c8c1a6010
[202012] Fix CVE-2017-1000487 security alert (#11635)
Why I did it
Fix CVE-2017-1000487 alert in thrift 0.14.1.
See https://nvd.nist.gov/vuln/detail/CVE-2017-1000487

How I did it
Change the version of org.codehaus.plexus:plexus-utils from 3.0.14 to 3.0.16.
2022-08-08 12:48:30 +08:00
mssonicbld
14f93e15c6
[ci/build]: Upgrade SONiC package versions (#11629)
Why I did it
Upgrade SONiC Versions
2022-08-07 11:27:16 +08:00
Lawrence Lee
04ba6da1ab
[202012][arp_update]: Resolve failed neighbors on dualtor (#11641)
In arp_update, check for FAILED or INCOMPLETE kernel neighbor entries and manually ping them to try and resolve the neighbor

Signed-off-by: Lawrence Lee <lawlee@microsoft.com>
2022-08-05 23:30:04 -07:00
tjchadaga
6d66d9b8fc
Revert "Add load_minigraph option to include traffic-shift-away during config migration (#11403)" (#11625)
This reverts commit 6c2f99a327.
2022-08-06 10:05:45 +05:30
Lawrence Lee
663bf00c22
[swss]: Run tunnel_pkt_handler on dualtor only (#11626)
At SWSS docker init time, check the device subtype and enable tunnel packet handler only if it is dualtor

Signed-off-by: Lawrence Lee <lawlee@microsoft.com>
2022-08-05 11:33:37 -07:00
Kevin Wang
8bb38639e5
[202012][sonic-sairedis]Update sonic-sairedis submodule (#11486)
Remove duplicate cisco-8000 functions and call to those functions (#1077)
Support For CISCO debug-shell in syncd-container (#1070)
Fix submodule SAI branch name (#1031)

Signed-off-by: Kevin Wang <shengkaiwang@microsoft.com>
2022-08-03 13:22:38 +08:00
Vaibhav Hemant Dixit
56ea6d9b62
[submodule advance] Advance sonic-utilities to include warmboot fix (#11558)
To include latest warmboot fixes:

[202012] [cherry-pick] Update db_migrator to support `PORT_QOS_MAP|gl (sonic-net/sonic-utilities/pull/2215)
[202012] Migrate missed config on cross branch warm upgrade to 202012 (sonic-net/sonic-utilities#2277)
[202012] Add db_migrator_constants.py script to setup.py (sonic-net/sonic-utilities#2287)
2022-08-02 13:49:51 -07:00
Liu Shilong
1e812cd990
[ci] Update azp reference to support transfering organization from Azure to sonic-net (#11605)
Why I did it
When transfer repo to another organization, azp reference also need change.
Change azp reference to avoid pipeline failure.
2022-08-02 16:22:26 +08:00
bingwang-ms
84aca00847
[202012]Support different DSCP_TO_TC_MAP for T1 in dualtor deployment (#11580)
Why I did it
This PR is to backport #11569 into 202012 branch.
This PR is to apply different DSCP_TO_TC_MAP to downlink and uplink ports on T1 in dualtor deployment.
For T1 downlink ports (To T0)
The DSCP_TO_TC_MAP is not changed. DSCP2 and DSCP6 are mapped to TC2 and TC6 respectively.
For T1 uplink ports (To T1)
A new DSCP_TO_TC_MAP|AZURE_UPLINK is defined and applied. DSCP2 and DSCP6 are mapped to TC1 to avoid mixing up lossy and lossless traffic from T2.
The extra lossy PG2 and PG6 added in PR #11157 is reverted as well because no traffic from T2 is mapped to PG2 or PG6 now.

How I did it
Define a new map DSCP_TO_TC_MAP|AZURE_UPLINK for 7260 T1.

How to verify it
Verified by test case in test_j2files.py.
2022-08-01 08:59:45 -07:00
Stephen Sun
44ecff1154
Support queue 7 in dual ToR scenario (#11570)
Signed-off-by: Stephen Sun <stephens@nvidia.com>
2022-08-01 09:27:49 +08:00
Ying Xie
84a8bad821
[202012][prtest] use matching sonic-mgmt branch for 202012 branch PR tests (#11575)
Why I did it
202012 PR test is failing due to some recent change in sonic-mgmt master branch.

How I did it
Use matching sonic-mgmt branch to run 202012 branch PR tests.

How to verify it
this PR test.

Signed-off-by: Ying Xie ying.xie@microsoft.com
2022-07-29 11:09:12 -07:00
Nikola Dancejic
c5a5734242
[swss] Adding bgp container as dependent of swss (#11168)
What I did:
Added bgp as a dependent of swss

Why I did it:
bgp container was not restarting on swss crash. When swss crashes, linkmgrd
doesn't initate a switchover because it cannot access the default route from
orchagent. Bringing down bgp with swss will isolate the ToR, causing linkmgrd
to initiate a switchover to the peer ToR avoiding significant packet loss.

Signed-off-by: Nikola Dancejic <ndancejic@microsoft.com>
2022-07-29 09:37:09 -07:00
Kebo Liu
c40435c94c
[202012] [Mellanox] Add new sensor conf to support SN4410 A1 system (#8379) (#11530)
- Why I did it
New SN410 A1 system has a different sensor layout with A0 system, needs a new sensor conf file to support it.

- How I did it
Since the SN4410 A1 system use exactly the same sensor layout as the SN4700 A1 system, so add a symbol link linking to the SN4700 A1 sensor conf file to reuse.

- How to verify it
Run sensor test against the SN4410 A1 system;
Run platform related regression test against the SN4410 A1 system
2022-07-29 17:41:44 +02:00
Dror Prital
3e41a4bd86
[202012][submodule] Update sonic-linux-kernel submodule (#11462)
- Why I did it
Update sonic-linux-kernel submodule pointer to include the following:

add new kernel patch come with hw-mgmt 7.0010.2348 (#285)

- How I did it
Update sonic-linux-kernel submodule pointer

Signed-off-by: dprital <drorp@nvidia.com>
2022-07-29 17:39:40 +02:00
Dror Prital
193f1508bc
[202012][submodule] Update sonic-utilities submodule (#11515)
Update sonic-utilities submodule pointer to include the following:
* Fix test for pfcwd_sw_enable in db_migrator_test ([#2252](https://github.com/Azure/sonic-utilities/pull/2252))
* [DHCPv6] [202012] Update the dhcpv6_relay config/show cli ([#2271](https://github.com/Azure/sonic-utilities/pull/2271))
* [vnet_route_check] Align DB data parse logic with format used by swsscommon API ([#2268](https://github.com/Azure/sonic-utilities/pull/2268))
* [202012] [generate_dump] allow to extend dump with plugin scripts ([#2269](https://github.com/Azure/sonic-utilities/pull/2269))

Signed-off-by: dprital <drorp@nvidia.com>
2022-07-28 21:42:10 -07:00
Dror Prital
b1e0578177
[202012][submodule] Update sonic-platform-daemons submodule (#11500)
Update sonic-platform-daemons submodule pointer to include the following:
* [chassisd] Add script to initialize chassis info in STATE_DB ([#183](https://github.com/Azure/sonic-platform-daemons/pull/183))

Signed-off-by: dprital <drorp@nvidia.com>
2022-07-28 16:08:10 -07:00
Liu Shilong
de00b87161
[ci] Transfer organization from Azure to sonic-net for sonic-mgmt (#11559) (#11563)
Why I did it
Transfer organization from Azure to sonic-net for sonic-mgmt
2022-07-28 15:32:26 +08:00
Lior Avramov
a40aca43b9 [memory_checker] Do not check memory usage of containers if docker daemon is not running (#11476)
Fix in Monit memory_checker plugin. Skip fetching running containers if docker engine is down (can happen in deinit).
This PR fixes issue #11472.

Signed-off-by: liora liora@nvidia.com

Why I did it
In the case where Monit runs during deinit flow, memory_checker plugin is fetching the running containers without checking if Docker service is still running. I added this check.

How I did it
Use systemctl is-active to check if Docker engine is still running.

How to verify it
Use systemctl to stop docker engine and reload Monit, no errors in log and relevant print appears in log.

Which release branch to backport (provide reason below if selected)
The fix is required in 202205 and 202012 since the PR that introduced the issue was cherry picked to those branches (#11129).
2022-07-27 23:28:19 +00:00
xumia
14f67b130d [ci] Fix some not sai package removed issue (#11544)
Only replace the file name starts with "cisco-".
2022-07-27 23:28:15 +00:00
Taylor Cai
c4927e0e68 [device/celestica]:Fix failed test case of Seastone snmp (#11430)
* Update psu.py
* Update thermal.py
2022-07-27 23:28:11 +00:00
kellyyeh
a2e0542356 [dhcpmon] Open different socket for dual tor to enable interface filtering (#11201) 2022-07-27 23:27:58 +00:00
tjchadaga
6c2f99a327 Add load_minigraph option to include traffic-shift-away during config migration (#11403) 2022-07-27 23:27:21 +00:00
Dror Prital
db37325f76
[202012][Mellanox] Update SAI version to 1.22.0.0 and SDK/FW to version 4.5.2318/2010_2318 (#11534)
- Why I did it
Update SAI version - 1.22.0.0
Update SDK/FW version - 4.5.2318/2010_2318

SAI Changes:
1. Port FEC fix for multiple speeds
2. Next hop group optimized bulk API
3. Support BFD remote-disc exchange in negotiation stage
4. Reduce verbosity of shared database already exists print

SDK/FW Fixes:
1. Cr space timeout on Hold and Release GW - at warmboot
2. SPC-1 Port in stuck PHY_UP after peer side rebooted
3. memory leak in sx_api_router_ecmp_update_set

- How I did it
Update pointer for the new SAI and SDK/FW

- How to verify it
Run regression tests
2022-07-26 21:01:36 +03:00
jhli-cisco
66d49231cf
Update cisco-8000.ini (#11522)
update cisco-8000 platform version to 202012-v0.107
2022-07-24 11:43:07 +08:00
anamehra
ee43011748
Update cisco-8000 submodule to v0.106 (#11505)
Signed-off-by: anamehra <anamehra@cisco.com>
2022-07-22 17:01:57 +08:00
VenkatCisco
e2042e2ad6
update cisco-8000 platform version to v106 (#11504) 2022-07-21 08:31:50 -07:00
Kebo Liu
c60bf90590
[202012] [Mellanox] Update hw-mgmt package to V.7.0010.2349 (#11421)
- Why I did it
New changes in this new HW-MGMT package:

1. hw-mgmt: chassis events: Fix voltmon address conflict on connecting
2. hw-mgmt: topology: Add COMEX BRDWL respin support
  a. Removed A2D sensor from all COMEX BRDWL boards
  b. Add COMEX BRDWL boards with register defined (config3)

- How I did it
Advance the hw-mgmt repo pointer and update the hw-mgmt version number

- How to verify it
Run platform-related regression test cases on the new testbed.

Signed-off-by: Kebo Liu <kebol@nvidia.com>
2022-07-20 09:00:17 +03:00
xumia
85c36c5c69 [Build] Cleanup the version deb preference file after build (#11414)
Why I did it
Cleanup the version deb preference file after build.
The version file is no use after build.

How I did it
Remove the no use version file.
2022-07-19 23:09:07 +00:00
bingwang-ms
c5eb031111
[202012] Add flag to control the generation of global level map (#11451)
Why I did it
This PR is to cherry-pick #11448 to 202012 branch after resolving conflicts.
There are conflicts in

files/build_templates/qos_config.j2
src/sonic-config-engine/tests/test_j2files.py
2022-07-15 09:44:45 -07:00
Neetha John
15cc046eda
[202012] Update MMU and ECN settings for Arista-7260CX3-D96C16 (#11427)
Signed-off-by: Neetha John <nejo@microsoft.com>

Why I did it
Missed this sku in the previous PR #11398

How I did it
Update the dynamic threshold to 0 and ECN settings as 2mb/10mb/5%

How to verify it
Updated unit tests to use the modified values for 7260 ecn settings.
2022-07-15 09:33:39 -07:00
Kebo Liu
aa4379ddbe
[202012] [Mellanox] Add sensor conf file for new version of MSN3700/3700C/4600C platforms (#11358)
- Why I did it
MSN3700/3700C/4600C have been re-spined, the new HW version of platforms has different sensors, so need to apply the correct sensor.conf for them.

- How I did it
Add new sensor.conf files for the new re-spined platforms.
Enhance the logic of "get_sensors_conf_path" for the related platforms in order to load the correct sensor.conf for each version of platforms.

- How to verify it
run sensors test on different versions of platforms
Signed-off-by: Kebo Liu <kebol@nvidia.com>
2022-07-14 08:59:10 +03:00
Jing Zhang
81f200fdec
[202012][sonic-linkmgrd] submodule update #11371
[202012][sonic-linkmgrd] submodule update

a2367d0 Jing Zhang Fri Jun 24 09:10:12 2022 -0700 Remove exception throwing when initializing missing loopback interface #90

sign-off: Jing Zhang zhangjing@microsoft.com
2022-07-12 09:38:11 -07:00
Neetha John
4de610af15
[202012] Update 7260 MMU and ECN settings (#11398)
Signed-off-by: Neetha John <nejo@microsoft.com>

Why I did it
Improve throughput and latency for 7260 deployments

How I did it
Update the dynamic threshold to 0 and ECN settings as 2mb/10mb/5%

How to verify it
Updated unit tests to use the modified values for 7260 ecn settings.
2022-07-12 08:46:44 -07:00
Neetha John
c6f9664b2e
[202012] Minigraph parser changes to select mmu profiles based on SonicQosProfile attribute (#11383)
Signed-off-by: Neetha John <nejo@microsoft.com>

Why I did it
There is a need to select different mmu profiles based on deployment type

How I did it
There will be separate subfolders (RDMA-CENTRIC, TCP-CENTRIC, BALANCED) in each hwsku folder which contains deployment specific mmu and qos settings. SonicQosProfile attribute in the minigraph will be used to determine which settings to use. If that attribute is not present, the default settings that exist in the hwsku folder will be used
2022-07-12 08:45:55 -07:00
mssonicbld
550ab26fc7
[ci/build]: Upgrade SONiC package versions (#11422) 2022-07-12 15:39:32 +00:00
Liu Shilong
ed728abb08
[ci] Fix test stage dependency issue. (#11386) (#11391)
Why I did it
When any of the test job failed in the test stage, the rerun will not work, the test stage will be skipped automaticall, so we do not have chance to rerun the test stage again, and the checks of the test will be always in failed status, block the PR to merge forever.

It should be caused by the condition in the Test stage, we should specify the status of the BuildVS stage.

How I did it
Fix stage dependency logic.
2022-07-12 17:55:01 +08:00
Dror Prital
bc935d4002
[202012][submodule] Advance sonic-linux-kernel pointer (#11406)
Update sonic-linux-kernel submodule pointer to include the following:
* [202012][patch] mlxsw: i2c: Prevent transaction execution for special chip states ([#282](https://github.com/Azure/sonic-linux-kernel/pull/282))

Signed-off-by: dprital <drorp@nvidia.com>
2022-07-11 21:40:26 +03:00
Neetha John
ec7cc16199
[202012] Submodule update for sonic-utilities (#11400)
Signed-off-by: Neetha John <nejo@microsoft.com>

This PR contains the following commits
5a54bd7 Added cisco config platform commands (Azure/sonic-utilities#2241)
62c1640 [config/load_mgmt_config] Support load IPv6 mgmt IP (Azure/sonic-utilities#2206)
c061a18 Fix header for the output table following 'show ipv6 interface' command (Azure/sonic-utilities#2219)
ecca18ff [202012] Update load minigraph to load backend acl (Azure/sonic-utilities#2235)
2022-07-11 09:10:02 -07:00