Commit Graph

5692 Commits

Author SHA1 Message Date
Hua Liu
2b39cd61fb
[202012] [sonic-swss-common] Update sonic-swss-common submodule (#13813)
#### Why I did it
Submodule update for sonic-swss-common with following change:
```
3e34309 2023-02-11 | RedisPipeline ignore flush when call dtor from another thread. (#736) [Hua Liu]
```
2023-02-17 10:41:25 -08:00
Samuel Angebault
e01e1860d4 [Arista] Disable ATA NCQ for a few products (#13739)
Why I did it
Some products might experience an occasional IO failure in the communication between CPU and SSD.
Based on some research it could be attributable to some device not handling ATA NCQ (Native Command Queue).

This issue currently affect 4 products:

DCS-7170-32C*
DCS-7170-64C
DCS-7060DX4-32
DCS-7260CX3-64

How I did it
This change disable NCQ on the affected drive for a small set of products.

How to verify it
When the fix is applied, these 2 patterns can be found in the dmesg.
ata1.00: FORCE: horkage modified (noncq)
NCQ (not used)

Test results using: fio --direct=1 --rw=randrw --bs=64k --ioengine=libaio --iodepth=64 --runtime=120 --numjobs=4

with NCQ (ata1.00: 61865984 sectors, multi 1: LBA48 NCQ (depth 32), AA)

   READ: bw=33.9MiB/s (35.6MB/s), 33.9MiB/s-33.9MiB/s (35.6MB/s-35.6MB/s), io=4073MiB (4270MB), run=120078-120078msec
  WRITE: bw=34.1MiB/s (35.8MB/s), 34.1MiB/s-34.1MiB/s (35.8MB/s-35.8MB/s), io=4100MiB (4300MB), run=120078-120078msec
without NCQ (ata1.00: 61865984 sectors, multi 1: LBA48 NCQ (not used))

   READ: bw=31.7MiB/s (33.3MB/s), 31.7MiB/s-31.7MiB/s (33.3MB/s-33.3MB/s), io=3808MiB (3993MB), run=120083-120083msec
  WRITE: bw=31.9MiB/s (33.4MB/s), 31.9MiB/s-31.9MiB/s (33.4MB/s-33.4MB/s), io=3830MiB (4016MB), run=120083-120083msec
Which release branch to backport (provide reason below if selected)
2023-02-16 17:54:16 +00:00
Ikki Zhu
2135c6eb2f [DX010 platform] fix dx010 platform testcase issues (#13595)
Why I did it
1. fix chassis test_set_fans_led case
2. fix chassis get_name case mismatch issue
3. fix fan_drawer test_set_fans_speed
4. fix component test_components test case

How I did it
Add corresponding configuration into chassis json file

How to verify it
Run platform tests cases to verify these failure cases
2023-02-16 17:52:12 +00:00
Qi Luo
9731aa36c2
[sonic-snmpagent] Update submodule (#13832)
#### Why I did it
Include below commits:
```
7147354 2023-02-14 | Fix: zero route may have empty nexthop (#276) [Qi Luo]
e60a64c 2022-11-30 | Use github code scanning instead of LGTM (#274) [Liu Shilong]
```
2023-02-16 09:04:36 -08:00
jhli-cisco
592ce16d05
Update cisco-8000.ini (#13793)
#### Why I did it
1.57.x SDK based incremental drop that addresses:
Fix for MIGSMSFT-158
Support for VxLAN and BFD Serviceability CLI
sfputil reset platform fix to handle 100G optics
Added thermal management feature for ZR optics sensors

#### How I did it
Update cisco-8000 submodule to v0.2.5
2023-02-14 11:05:09 -08:00
jcaiMR
936679ee47 Set 'origin' and 'AS Path' for T1 SLB routes (#13613)
* set origin and as-path prepend for routes from SLB
2023-02-10 18:38:16 +00:00
Lawrence Lee
4e70a2bfbc
[202012][swss]: Submodule update for SWSS (#13722)
Include following commits:
- c98b9f09 [202012][orchagent]: Get bridge port ID from FDBOrch cache instead of SAI API #2657 (#2658)
- 59886b8f [MuxOrch] Enabling neighbor when adding in active state (#2601)
2023-02-09 11:01:58 -08:00
jingwenxie
95893698e2
[202012][sonic-utilites] advance submodule (#13734)
```
39cdb49c7 [202012][show] Add bgpraw to show run all (#2639)
b3ebba2ca [202012][show] add new CLI to show tunnel route objects #2255 (#2659)
d08f59b9f Fixed a bug in "show vnet routes all" causing screen overrun. (#2644) (#2654)
a996abdb5 [202012][show] show logging CLI support for logs stored in tmpfs (#2652)
c60f771c0 [202012][show_bfd] add local discriminator in show bfd command (#2616)
```
2023-02-09 10:04:58 -08:00
xumia
8657fbac53
[Build]: fix some version info missing in version control files issue (#10211) (#13673)
#### Why I did it
Fix some version info missing in version control files issue

#### How I did it
Change to collect version before it cleaned up
2023-02-08 20:25:04 -08:00
xumia
f48734fe50
[Build]: Fix the bin image generated from raw image issue (#10083) (#13672)
#### Why I did it
Cherry pick from #10083
It is to fix the issue #10048
When building .raw image, for instance, target/sonic-broadcom.raw, it will generate a .bin image, target/sonic-broadcom.bin, as the intermediate file. The intermediate file is a build target which may contains different dependencies with the raw one.

#### How I did it
Rename the intermediate file.
2023-02-08 20:24:06 -08:00
andywongarista
ff5a703301
Fix content of platform.json for DCS-7050CX3-32S (#13659)
#### Why I did it
Some tests under platform_tests/api were failing on the 7050CX3 due to outdated facts in platform.json

#### How I did it
Updated platform.json facts with appropriate values

#### How to verify it
Run tests under platform_tests/api to verify no failures
2023-02-08 12:01:07 -08:00
Ikki Zhu
85ca3abc2f [Celestica DX010] fix fan drawer and watchdog platform testcase issues (#13426)
Why I did it
fix DX010 fan drawer and watchdog platform test case issues

How I did it
1. Add fan_drawer get_maximum_consumed_power support
2. Adjust maximum watchdog timeout value check

How to verify it
Run test_fan_drawer and test_watchdog test cases.
2023-02-08 04:59:20 +00:00
Longxiang Lyu
6b4a278378
[dualtor] Let T0 delay 10 seconds before sending BGP updates (#13701)
Cherry-pick PR #12996 into 202012 branch.

Signed-off-by: Longxiang Lyu <lolv@microsoft.com>
2023-02-07 17:49:20 -08:00
Jing Zhang
abbcda67d2
update submodule linkmgrd (#13589)
9551386 Jing Zhang      Mon Jan 23 15:49:52 2023 -0800  [202012] Update link prober stats post logic #159 (#162)
e54f289 Liu Shilong     Wed Nov 30 18:04:15 2022 +0800  Use github code scanning instead of LGTM (#157)
2023-02-07 16:18:28 -08:00
Saikrishna Arcot
d8a299781a
Use tmpfs for /var/log for Arista 7260 (#13587) (#13607)
This is to reduce writes to disk, which then can use the SSD to get worn
out faster.

Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>
(cherry picked from commit ee1c32a802)
2023-02-06 17:09:55 -08:00
kellyyeh
9a26ed957c
Advance sonic-dhcp-relay (#13608)
Why I did it
Advance sonic-dhcp-relay submodule head on 202012 branch

How I did it
Added the following commits:
a4b15d8 jcaiMR Thu Dec 29 14:18:28 2022 +0800 fix relay-reply dhcpv6 packet counter issue (#29)
677543f jcaiMR Sat Dec 17 00:24:32 2022 +0800 fix handleSwssNotification crash in dhcp6relay (#28)
ed86546 jcaiMR Wed Dec 14 14:08:58 2022 +0800 Fix multiple vlan issue (#27)
5ec1f5b Vivek Thu Dec 8 09:44:15 2022 -0800 Made the Error log informative (#22)
063d41b jcaiMR Wed Nov 30 14:41:53 2022 +0800 disable cfg dynamic change (#25)
d4a51f6 kellyyeh Tue Jan 31 18:09:08 2023 -0800 Add unittest infrastructure (#5) (#31)

How to verify it
Ran full dhcpv6 test suite on lab device
2023-02-06 08:50:27 -08:00
Saikrishna Arcot
30fbc609c8 Use tmpfs for /var/log for Arista 7260 (#13587)
This is to reduce writes to disk, which then can use the SSD to get worn
out faster.

Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>
2023-02-02 22:53:50 +00:00
Yoush
7adf2b1c05 [BugFix] Fix the bug that it gets error system-mac of centec platform (#12721)
Why I did it
When getting system mac of centec platform, it would increase by 1 the last byte of mac, but it could not consider the case of carry.

How I did it
Firstly, I would replace the ":" with "" of mac to a string.
And then, I would convert the mac from string to int and increase by 1, at last convert it to string with inserting ":".
2023-02-02 22:53:49 +00:00
Nazarii Hnydyn
83b6518ae2
[202012][mellanox]: Add BIOS upgrade infra (#13571)
- Why I did it
Added BIOS upgrade infra

- How I did it
Added new make target

- How to verify it
Copy msn3800_bios.tar.gz to platform/mellanox/bios
make configure PLATFORM=mellanox
make target/files/buster/msn3800_bios.tar.gz

Signed-off-by: Nazarii Hnydyn <nazariig@nvidia.com>
2023-02-02 10:07:03 +02:00
kellyyeh
1354033f90
[dhcpmon] Fix dhcpmon socket filter and tx count issue (#13065) (#13441)
Why I did it
Fix issue caused by dualtor support PR [dhcpmon] Open different socket for dual tor to enable interface filtering #11201
Improve code
How I did it
On single ToR, packets received count was duplicated due to socket filter set to "inbound"
Tx count not increasing due to filter set to "inbound". Added an outbound socket to count tx packets
Added vlan member interface mapping for Ethernet interface to vlan interface lookup in reference to PR Fix multiple vlan issue sonic-dhcp-relay#27
Exit when socket fails to initialize to allow dhcp_relay docker to restart
How to verify it
Tested on vstestbed single tor and dual tor, sent packets and verify printed out dhcpmon rx and tx counters is correct

Correct number of tx increases
Tx does not increase when ToR is on standby
2023-01-30 09:51:06 -08:00
Prince Sunny
7448f7bc2d
[submodule] update for sonic-swss and sonic-restapi (#13531)
#### Why I did it

Update submodule for sonic-swss:

e739e6c - 2023-01-27 : custom advertised prefix for primary vxlan tunnel [202012]  (#2641) [siqbal1986]

sonic-restapi:
99c467d - 2023-01-24 : Add API support for adv prefix and custom monitoring (#133) [Prince Sunny]
347684a - 2022-11-30 : Use github code scanning instead of LGTM (#132) [Liu Shilong]
2023-01-27 16:27:55 -08:00
Ying Xie
467e01cbdb
[202012][swss] advance submodule head (#13519)
swss:
* f28d6622 2023-01-25 | [portinit] Do not call GET on SAI_PORT_ATTR_SPEED when AUTONEG is enabled (#2484) (#2639) (github/202012, 202012) [Ying Xie]
* 217f7f00 2023-01-21 | Cherry pick of https://github.com/sonic-net/sonic-swss/pull/2589 (#2627) [siqbal1986]

Signed-off-by: Ying Xie <ying.xie@microsoft.com>

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2023-01-26 02:49:50 +00:00
Lawrence Lee
6ff7c75597
[PATCH]: Update FRR patch 0009 to avoid inappropriate error messages (#12912) (#13423)
…(#12912)

Why I did it
As described in detail in #12753, the current FRR patch 0009-ignore-route-from-default-table.patch is causing unwanted FRR/zebra error logs. This change gets rid of the error messages for routes from kernel default table while these routes are ignored in prefix encoding.

How I did it
This fix updates the original 0009 patch by checking if the routes are from table default before printing the error logs. The original patch checks the same condition and ignores the routes from table default in prefix encoding.

How to verify it
Follow the steps to repro as described in #12753.
Also verify the test case ipfwd/test_nhop_count.py no longer fails due to the error messages.

#### Why I did it
Resolve cherry-pick conflict for https://github.com/sonic-net/sonic-buildimage/pull/12912
2023-01-25 15:24:11 -08:00
Richard.Yu
025e77bb5d
[202012] Update SAI version to 4.3.7.1-7 (#13431)
CS00012254651 (SONIC-66820) Fix missing break stmt

Verify

run case test_forward_ip_packet_with_0xffff_chksum_tolerant

Signed-off-by: richardyu-ms <richard.yu@microsoft.com>
2023-01-19 11:30:40 -08:00
Ying Xie
027c831be7 [Arista] add support for hardware sku Arista-7260CX3-D92C16 (#13438)
Signed-off-by: Ying Xie <ying.xie@microsoft.com>

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2023-01-19 19:19:30 +00:00
Ikki Zhu
607cbdefd3 [Celestica Seastone] fix multi sonic platform issues (#13356)
Why I did it
Fix the following issues for Seastone platform:

- system-health issue: show system-health detail will not complete #9530, Celestica Seastone DX010-C32: show system-health detail fails with 'Chassis' object has no attribute 'initizalize_system_led' #11322
- show platform firmware updates issue: Celestica Seastone DX010-C32: show platform firmware updates #11317
- other platform optimization

How I did it
Modify and optimize the platform implememtation.

How to verify it
Manual run the test commands described in these issues.
2023-01-19 19:18:21 +00:00
jhli-cisco
2357eeef46
[cisco]: Update cisco-8000.ini (#13425)
1.57.x SDK based incremental drop that addresses: 
1) orchagent crash
2) Port LED issue
3) Tunnel endpoint stats
4) test_warm_reboot issue
5) nhop test failure
6) "show platform versions' CLI
2023-01-19 09:13:51 -08:00
Junchao-Mellanox
46a774e294
[202012] [Mellanox] Fix select timeout in sfp event (#13347)
- Why I did it
Backport #9795
Python select.select accept a optional timeout value in seconds, however, the value passes to it is a value in millisecond.

- How I did it
Transfer the value to millisecond.

- How to verify it
Manual test
2023-01-19 17:29:41 +02:00
Prince Sunny
21ad9876a3
[Submodule] Update swss-common (#13406)
Swss-common submodule update:
c5ed58a - 2023-01-17 : 202012 cast of PR sonic-net/sonic-swss-common#725 (#738) [siqbal1986]
fe28725 - 2022-11-30 : Use github code scanning instead of LGTM (#718) [Liu Shilong]
2023-01-18 11:10:28 -08:00
Prince Sunny
b500e0ef5a
[Submodule] Update for sonic-swss (#13405)
#### Why I did it

Update for following swss commits:

96180bf - 2023-01-13 : [202012] Bfd default multiplier change  (#2615) [siqbal1986]
07506ac - 2023-01-11 : Add missing parameter to on_switch_shutdown_request method. (#2567) [Hua Liu]
3253cc8 - 2022-11-30 : Use github code scanning instead of LGTM (#2546) [Liu Shilong]
f4df524 - 2023-01-11 : [orchagent]:add local_discriminator to state_db (#2587) [Baorong Liu]
f3cd02d - 2022-12-05 : [202012][muxorch] Adding case for maintaining current state (#2500) [Nikola Dancejic]
2023-01-17 16:37:19 -08:00
Kebo Liu
a569bfc9eb
skip hw reboot cause if warm/fast reboot found from the proc cmdline (#13378)
#### Why I did it
Backport https://github.com/sonic-net/sonic-buildimage/pull/13246 to 202012 branch.

In case of warm/fast reboot, the hardware reboot cause will NOT be cleared because CPLD will not be touched in this flow. To not confuse the reboot cause determine logic, the leftover hardware reboot cause shall be skipped by the platform API, platform API will return the 'REBOOT_CAUSE_NON_HARDWARE' instead of the "hardware" reboot cause.

#### How I did it

Check the proc cmdline to see whether the last reboot is a warm or fast reboot, if yes skip checking the leftover hardware reboot cause.

#### How to verify it

a. Manual test:
> 1. Perform a power loss
> 2. Perform a warm/fast reboot
> 3. check the reboot cause should be "warm-reboot" or "fast-reboot" instead of "power loss"

b. Run reboot cause related regression test.
2023-01-17 13:21:31 -08:00
Nazarii Hnydyn
5193a96895
[202012][Mellanox]: Update ONiE FW tool: manual reboot control. (#13359)
Partial cherry-pick of: [Mellanox] Modified Platform API to support all firmware updates in single boot #9608

- Why I did it
To allow user manual reboot control over ONiE FW upgrade

- How I did it
Added a dedicated script argument handling

- How to verify it
mlnx-onie-fw-update.sh update --no-reboot

Signed-off-by: Nazarii Hnydyn <nazariig@nvidia.com>
2023-01-16 15:27:48 +02:00
Neetha John
94c689fa58
Advance sonic-utilities submodule pointer (#13360)
This PR includes the following commits
```
b44e153 [202012]Revert "[config reload]: On dual ToR systems, cache ARP and FDB table (sonic-net/sonic-utilities#2490)
59f3025 Use github code scanning instead of LGTM (sonic-net/sonic-utilities#2530)
678df5d [202012] avoid printing error if no neighbors are present  (sonic-net/sonic-utilities#2594)
```
2023-01-13 13:56:19 -08:00
xumia
d3a83cf8c7 [Bug] Fix SONiC installation failure caused by pip/pip3 not found (#13284)
The main issue is the pip/pip3 command cannot be found when the package is being installed by apt-get.
When using the dpkg install, the searching path is PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin
When using the apt-get install, the searching path is PATH=/usr/sbin:/usr/bin:/sbin:/bin
But the pip/pip3 default path is at /usr/local/bin, so dpkg works, but apt-get not work.

How I did it
Export the path /usr/local/bin for pip/pip3.
Make the deb packages can be installed by apt-get.
2023-01-12 23:31:08 +00:00
byu343
e2f9f1e452 [Arista]: Add hwSku Arista-7260CX3-D108C10 (#13242)
* [Arista]: Add hwSku Arista-7260CX3-D108C10

* Add buffer-related config for Arista-7260CX3-D108C10
2023-01-12 23:30:33 +00:00
Ikki Zhu
2438025cf9 Seastone add platform capability enhancement config (#13079) 2023-01-12 23:30:29 +00:00
Liu Shilong
2fac7e31e1 [action] Keep 'request for xxx branch' label when finished auto-cherry-pick. (#13107)
Why I did it
To keep 'Request for xxx branch' label when finished auto-cherry-pick.

How I did it
Change logic in post cherry pick action.

How to verify it
2023-01-12 23:30:24 +00:00
Marty Y. Lok
27d798a2b7 [armhf][sonic-installer] Fix issue of the sonic-installer install a image after sonic-installer clean (#12609)
Signed-off-by: mlok <marty.lok@nokia.com>

Signed-off-by: mlok <marty.lok@nokia.com>
2023-01-12 23:30:07 +00:00
Santhosh Kumar T
2081e6f45d [DellEMC] Master: S6100: SSD upgrade status: Moving from smartctl to iSMART (#12784)
Why I did it
smartctl tool is available only in PMON docker. Hence, the tool may be not accessible incase PMON docker goes down.
Using iSMART_64 tool to fetch the SSD firmware version and device model information.

How I did it
Replacing smartctl with iSMART_64.
2023-01-12 23:30:02 +00:00
Liu Shilong
b111cecf02
[build] Fix sonic-host-services dependency deepdiff version issue (#13329)
Why I did it
sonic_host_services depends on deepdiff.
But latest deepdiff version has error.

How I did it
pin deepdiff to previous version.

How to verify it
2023-01-12 13:17:27 +08:00
Neetha John
642c7242f8 Update ECN settings for storage backend (#12855)
Signed-off-by: Neetha John <nejo@microsoft.com>

Why I did it
ECN parameters need to be updated for storage backend

How I did it
Included the check for storage backend devices to update qos configs

How to verify it
Verified that the new ecn settings are applied on storage backend device.
Verified that the old ecn settings are applied for storage frontend, non storage frontend/backend devices
2023-01-10 23:52:39 +00:00
mssonicbld
0e63a94fb6
[ci/build]: Upgrade SONiC package versions (#13249) 2023-01-04 19:49:04 +08:00
Yutong Zhang
e848e3616f
Improve the display of pipeline. (#13124)
The display of azure pipeline is not specific now, such as when the step Run test fails, the display of itself shows successful, but the display of step Kvmdump shows fails, but actually, the step Kvmdump doesn't fail. I improve the display of azure pipeline in this pr, each step has its own success or failure, and is shown in azure pipeline.

Why I did it
The display of azure pipeline is not specific now, such as when the step Run test fails, the display of itself shows successful, but the display of step Kvmdump shows fails, but actually, the step Kvmdump doesn't fail. I improve the display of azure pipeline in this pr, each step has its own success or failure, and is shown in azure pipeline.

How I did it
Each step has its own signature of success or failure.
Using the chain of responsibility pattern to manage all status.
Modify the expected-state in each step.
2023-01-04 17:11:55 +08:00
Liu Shilong
9175ebad0a
Fix sonic slave pipeline to set correct tag on sonic slave image. (#13177) (#13244)
Why I did it
Currently sonic-slave-* tag is confusing. Set correct tag on sonic-slave-* image.
Fix job name to fit the build.

How I did it
build amd image in amd64:
sonic-slave-bullseye:cfe29bff67c
sonic-slave-bullseye:latest
sonic-slave-bullseye:master

build armhf image in amd64:
sonic-slave-bullseye-march-armhf:33614806dc3
sonic-slave-bullseye-march-armhf:latest
sonic-slave-bullseye-march-armhf:master

build arm64 image in amd64:
sonic-slave-bullseye-march-arm64:f3b1b16c801
sonic-slave-bullseye-march-arm64:latest
sonic-slave-bullseye-march-arm64:master

build arm64 image in arm64:
sonic-slave-bullseye:75cb326c9a7
sonic-slave-bullseye-arm64:latest
sonic-slave-bullseye:master

build armhf image in armhf:
sonic-slave-bullseye:64d178951fc
sonic-slave-bullseye-armhf:latest
sonic-slave-bullseye:master

How to verify it
2023-01-04 16:29:42 +08:00
Liu Shilong
933bf06058
[build] Add environment when using dpkg hook to avoid lock loop. (#13233)
Why I did it
In some cases, dpkg will call dpkg to validate version.
dpkg hook will get stuck in a loop to lock.

How I did it
How to verify it
2023-01-04 16:29:25 +08:00
Richard.Yu
33bf592f09
[Cherry-pick][SAIServer]Upgrade SAI server init script (#13175) (#13226)
Why I did it
why
In order to apply different config across different platform, and use the code with a unified format, reuse syncd init script to init saiserver.

How I did it
how
Reuse syncd init script

How to verify it
Test
Test in DUT s6000 and dx010 with sonic 202205
2023-01-03 13:22:32 +08:00
Vivek
3fc76fdb22
[202012] [dhcpv6_relay] Fix dependency of dhcp-mon on VLAN with only v6 cfg is provided (#13006) (#13040)
Backport PR #13006

Signed-off-by: Vivek Reddy Karri <vkarri@nvidia.com>
2023-01-02 13:20:30 +02:00
jhli-cisco
26709ffb86
[cisco]: Update cisco-8000.ini (#13182)
Why I did it
1.50.x SDK based drop to fix MIGSMSFT-120 ([8102] Orchagent crash as addRoutePost failed at SAI")

How I did it
Update cisco-8000 submodule to v0.121
2023-01-01 11:58:59 -08:00
mssonicbld
352dd7ea7f
[ci/build]: Upgrade SONiC package versions (#13188) 2022-12-28 19:56:49 +08:00
Richard.Yu
830102a353
[202012][Submodule][SAI-Redis]Advance SAI Redis head pointer (#13157)
Why I did it
[202012][Submodule][SAI-Redis]Advance SAI Redis head pointer

How I did it
include changes:

sonic-net/sonic-sairedis@dcea4cd
sonic-net/sonic-sairedis@5e9bcb1
sonic-net/sonic-sairedis@8f2a53f
sonic-net/sonic-sairedis@c1d7938 [202012][Submodule][SAI]Advance SAI head pointer sonic-sairedis#1184
remove parameter skip-error, which removed from [202205][Submodule][SAI]Advance SAI head pointer sonic-sairedis#1185
How to verify it
local build
2022-12-27 08:10:42 +08:00