Commit Graph

813 Commits

Author SHA1 Message Date
zitingguo-ms
9f4a4bdc86
upgrade SAI xgs to 4.3.12.2 (#16354)
### Why I did it
Upgrade the SAI version to 4.3.12.2 to include the following changes:
- 4.3.10.2: [JIRA SONIC-68535][CSP CS00012273299] sai_query_attribute_capability for obj type SAI_OBJECT_TYPE_SWITCH
- 4.3.11.2: [JIRA SONIC-76883][CSP CS00012303308] SDK-367579: Skip flex port check when in iBodSync context
- 4.3.12.2: [JIRA SONIC-76472][CSP CS00012299690] Source suppressed broadcast packets count on interface TX discard statistics

##### Work item tracking
- Microsoft ADO **(number only)**: 24962692

#### How I did it
Upgrade the SAI version in sai.mk file.

#### How to verify it
Run SONiC basic test in SAI release pipeline: https://dev.azure.com/mssonic/internal/_build/results?buildId=361078&view=results
2023-09-14 22:43:14 -07:00
Samuel Angebault
a1ad1cf98c
[202012][Arista] Ignore poll errors during get_event_change (#16274)
#### Why I did it

When an exception happens during `get_change_event` it brings the process invoking it down.

##### Work item tracking
- Microsoft ADO **(number only)**: 24599154

#### How I did it

This is a backport of #16112
Handle exceptions gracefully within `get_change_event`
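
A minimal sketch of the idea, not the Arista implementation: poll errors are caught inside the loop and treated as "no events this round", so the caller never sees the exception. The `poll_once` callable, the `timeout_ms` and `retry_delay_s` parameters, and the `(True, {})` return on timeout are assumptions made for illustration.

```python
import time


def get_change_event(poll_once, timeout_ms=0, retry_delay_s=0.0):
    """Return (True, events); poll errors are swallowed and polling continues.

    poll_once is a hypothetical callable that returns a dict of events
    (possibly empty) or raises on a poll error.
    timeout_ms == 0 means wait until an event arrives.
    """
    deadline = None if timeout_ms == 0 else time.monotonic() + timeout_ms / 1000.0
    while True:
        try:
            events = poll_once()
        except Exception:
            # Assumption: a poll error is transient, so report nothing
            # for this round instead of letting the exception propagate.
            events = {}
        if events:
            return True, events
        if deadline is not None and time.monotonic() >= deadline:
            # Assumed convention: a timeout is reported as an empty event dict.
            return True, {}
        time.sleep(retry_delay_s)
```

With this shape, a caller that loops on `get_change_event` keeps running even if a single poll fails.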

#### Description for the changelog

Ignore poll errors during get_event_change on Arista devices
2023-08-28 09:44:47 -07:00
Aravind Mani
7fc5436d73
[202012]Dell s6100 Fix sonic-mgmt platform test failures (#16207)
ADO: 24709703
#### Why I did it
sonic-mgmt platform testcases failed.
#### How I did it
Implement platform API 2.0.
#### How to verify it
Run sonic-mgmt tests.
2023-08-22 11:41:19 -07:00
Zhijian Li
2bafae38d5 [Celestica-E1031] Enable CPU watchdog (#16083)
Enable CPU watchdog on Celestica-E1031.
2023-08-19 14:32:27 +08:00
Ikki Zhu
de97458395 [E1031] fix pca9548 initializes failed occasionally (#15712)
Why I did it
The pca9548 on the E1031 occasionally fails to initialize during stress tests.
When the failure happens, the ismt i2c bus hangs and a power cycle is needed to
recover it.

How I did it
Add a 0.5s delay between setting up and configuring the pca9548 i2c mux.

How to verify it
Reboot stress test at least 100 times without failure.
2023-08-03 12:32:29 +08:00
byu343
b32b5ec5ac
[202012][Arista] Clear all reload cause after reading (#15925)
### Why I did it
On 202012 images, certain reload causes are not cleared after being read. The uncleared causes can confuse the next reboot-cause read after a warm-reboot to master/202305.

##### Work item tracking
- Microsoft ADO **(number only)**: 24378183

#### How I did it
Clear the reload cause even in the case that it is determined as not important. 
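
The change can be pictured with the following sketch. It is only an illustration under assumed names: `store` stands in for wherever the platform keeps its reload causes, and `is_important` is a hypothetical predicate; neither comes from the actual Arista code.

```python
def read_reload_causes(store, is_important):
    """Return the important causes and clear every stored cause.

    store is a hypothetical mutable list of cause strings;
    is_important is a hypothetical predicate on a cause.
    """
    causes = list(store)
    important = [cause for cause in causes if is_important(cause)]
    # Clear unconditionally, so causes judged unimportant cannot leak
    # into the next reboot-cause read.
    store.clear()
    return important
```

Clearing after every read means a later read, for example after a warm-reboot, starts from an empty store.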

#### How to verify it
1) Cold-boot to 202012
2) Warm-reboot to master
3) Check the reboot cause is 'warm-reboot' by 'show reboot-cause'
2023-07-31 15:02:44 -07:00
zitingguo-ms
ddf9f7411e
upgrade SAI to 4.3.8.1 (#15988)
### Why I did it
Upgrade SAI version to 4.3.8.1 to include the following change:
- CS00012288297: Fix TX queue for control packets
##### Work item tracking
- Microsoft ADO **(number only)**: 24669269
#### How I did it
Upgrade the SAI version in sai.mk file
2023-07-31 14:57:07 -07:00
mssonicbld
6bb16f3e54
[celestica/e1031]: enable emc2305 fan controller timeout feature (#14401) (#15571) 2023-06-22 08:27:53 +08:00
Hua Liu
1f3da955b9
[S6100] Improve S6100 serial-getty monitor, wait and re-check when getty not running to avoid false alert. (#14402) (#14755)
[S6100] Improve S6100 serial-getty monitor, wait and re-check when getty not running to avoid false alert. 

This is cherry-pick PR for: https://github.com/sonic-net/sonic-buildimage/pull/14402

#### Why I did it
On S6100, systemd sometimes fails to restart the serial-getty service automatically, so a monit unit checks the serial-getty service status and restarts it.

However, this monit unit reports false alerts, because in most cases when serial-getty is not running, systemd restarts it successfully.

To avoid the false alerts, improve the monitor to wait and re-check.

Steps to reproduce this issue:
1. Log in to the device via the console and keep the connection open.
2. Log in to the device via SSH and check that the serial-getty@ttyS1.service service is running.
3. Run 'monit reload' from the SSH connection.
4. Check syslog 1 minute later; it contains the false alert: ' 'serial-getty' process is not running'

##### Work item tracking
- Microsoft ADO :17424426

#### How I did it
Add a check-getty.sh script that re-checks later when the getty service is not running.
Update the monit unit to check the serial-getty service status with this script to avoid the false alert.
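
The wait-and-re-check logic could look roughly like the sketch below. It is written in Python for illustration and is not the contents of check-getty.sh; the 20-second wait, the function names, and the injectable `probe` and `sleep` parameters are assumptions.

```python
import subprocess
import time


def is_active(unit):
    # `systemctl is-active --quiet` exits with 0 only when the unit is active.
    return subprocess.run(["systemctl", "is-active", "--quiet", unit]).returncode == 0


def check_getty(unit="serial-getty@ttyS1.service", wait_s=20,
                probe=is_active, sleep=time.sleep):
    """Return 0 if the unit is running now or after one delayed re-check, else 1."""
    if probe(unit):
        return 0
    # Give systemd time to restart the service on its own before reporting failure.
    sleep(wait_s)
    return 0 if probe(unit) else 1
```

Returning a non-zero status only after the second check is what lets the monitor skip the cases where systemd recovers the service by itself.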

#### How to verify it
Pass all UT.
Manually check that the fixed code works correctly:


```
admin@***:~$ sudo systemctl stop  serial-getty@ttyS1.service
admin@***:~$ sudo /usr/local/bin/check-getty.sh 
admin@***:~$ echo $?
1
admin@***:~$ sudo systemctl status serial-getty@ttyS1.service
serial-getty@ttyS1.service - Serial Getty on ttyS1
     Loaded: loaded (/lib/systemd/system/serial-getty@.service; enabled-runtime; vendor preset: enabled)
     Active: inactive (dead) since Tue 2023-03-28 07:15:21 UTC; 1min 13s ago

admin@***:~$ sudo /usr/local/bin/check-getty.sh 
admin@***:~$ echo $?
0
admin@***:~$ sudo systemctl status serial-getty@ttyS1.service
serial-getty@ttyS1.service - Serial Getty on ttyS1
     Loaded: loaded (/lib/systemd/system/serial-getty@.service; enabled-runtime; vendor preset: enabled)
```

syslog:
```
Mar 28 07:10:37.597458 *** INFO systemd[1]: serial-getty@ttyS1.service: Succeeded.
Mar 28 07:12:43.010550 *** ERR monit[593]: 'serial-getty' status failed (1) -- no output
Mar 28 07:12:43.010744 *** INFO monit[593]: 'serial-getty' trying to restart
Mar 28 07:12:43.010846 *** INFO monit[593]: 'serial-getty' stop: '/bin/systemctl stop serial-getty@ttyS1.service'
Mar 28 07:12:43.132172 *** INFO monit[593]: 'serial-getty' start: '/bin/systemctl start serial-getty@ttyS1.service'
Mar 28 07:13:43.286276 *** INFO monit[593]: 'serial-getty' status succeeded (0) -- no output
```

#### Tested branch (Please provide the tested image version)

- [x] 20201231.77

#### Description for the changelog
[S6100] Improve S6100 serial-getty monitor.
2023-04-20 23:10:01 -07:00
Ikki Zhu
be46225033 [Seastone] fix dx010 qsfp eeprom data write issue (#13930)
Why I did it
Platform cases test_tx_disable, test_tx_disable_channel, test_power_override failed in dx010.

How I did it
Add i2c access algorithm for CPLD i2c adapters.

How to verify it
Verify it with platform_tests/api/test_sfp.py::TestSfpApi test cases.
2023-03-02 20:06:09 +00:00
Richard.Yu
025e77bb5d
[202012] Update SAI version to 4.3.7.1-7 (#13431)
CS00012254651 (SONIC-66820) Fix missing break stmt

Verify

run case test_forward_ip_packet_with_0xffff_chksum_tolerant

Signed-off-by: richardyu-ms <richard.yu@microsoft.com>
2023-01-19 11:30:40 -08:00
Santhosh Kumar T
2081e6f45d [DellEMC] Master: S6100: SSD upgrade status: Moving from smartctl to iSMART (#12784)
Why I did it
The smartctl tool is available only in the PMON docker, so it may not be accessible in case the PMON docker goes down.
Use the iSMART_64 tool to fetch the SSD firmware version and device model information.

How I did it
Replacing smartctl with iSMART_64.
2023-01-12 23:30:02 +00:00
Richard.Yu
33bf592f09
[Cherry-pick][SAIServer]Upgrade SAI server init script (#13175) (#13226)
Why I did it
To apply different configs across different platforms and keep the code in a unified format, reuse the syncd init script to init saiserver.

How I did it
Reuse the syncd init script.

How to verify it
Tested on s6000 and dx010 DUTs with SONiC 202205.
2023-01-03 13:22:32 +08:00
Richard.Yu
830102a353
[202012][Submodule][SAI-Redis]Advance SAI Redis head pointer (#13157)
Why I did it
[202012][Submodule][SAI-Redis]Advance SAI Redis head pointer

How I did it
include changes:

sonic-net/sonic-sairedis@dcea4cd
sonic-net/sonic-sairedis@5e9bcb1
sonic-net/sonic-sairedis@8f2a53f
sonic-net/sonic-sairedis@c1d7938 [202012][Submodule][SAI]Advance SAI head pointer sonic-sairedis#1184
Remove the skip-error parameter, which was removed in [202205][Submodule][SAI]Advance SAI head pointer sonic-sairedis#1185

How to verify it
Local build
2022-12-27 08:10:42 +08:00
Richard.Yu
e15acb59ff
enable sai-ptf logger in sai_adapter to log all the sai api invcations (#12922)
Why I did it
Enable the sai-ptf logger in sai_adapter to log all the SAI API invocations.

How I did it
Add a build parameter to enable the sai-ptf logger when building the SAI RPC.

How to verify it
local build test
test the generated sai_adapter
test with pipeline
Signed-off-by: richardyu-ms <richard.yu@microsoft.com>
2022-12-04 22:10:28 -08:00
Richard.Yu
acd24d9804
[submodule]Update SAI SDK URL from package storage to public (#12835)
To make SAI updates easier, change the URL pattern to a more unified format, which can later be updated automatically.

Signed-off-by: richardyu-ms <richard.yu@microsoft.com>
2022-12-03 22:37:18 -08:00
zitingguo-ms
c10aa3b826
Add a parameter for libsaithrift to skip error on errno -2 (#12581) (#12617)
Signed-off-by: zitingguo-ms <zitingguo@microsoft.com>
2022-11-07 00:07:37 -08:00
zitingguo-ms
bafbfb5a26
Pickup fix and make up BRCM SAI version to 4.3.7.1-6 (#12486)
Signed-off-by: zitingguo-ms <zitingguo@microsoft.com>
2022-10-26 09:52:48 +08:00
zitingguo-ms
08d1d60ccb
Pick up fixes and make up BRCM SAI version to 4.3.7.1-3 (#12439)
Signed-off-by: zitingguo-ms <zitingguo@microsoft.com>
2022-10-19 12:18:48 +08:00
gechiang
1e6d63a412
[202012][BRCMSAI] 4.3.7.1-2 to back out a change that broke 4.3.7.1-1 (#12298)
This is basically the same as the previous PR (#12275), with the change that was breaking the build backed out.
The same info from that PR is copied here.
2022-10-06 21:25:34 -07:00
gechiang
9c9d902ede
[202012]BRCM SAI 4.3.7.1-1 pick up fix CS00012263713 (mirrored packet with extra VLAN Tag) (#12275)
Pick up fix for CS00012263713 (mirrored packet with extra VLAN Tag) BRCM SAI 4.3.7.1-1

Preliminary tests look fine: BGP neighbors were all up with proper routes programmed, and interfaces were all up.
Manually ran the following test cases on 7050CX3 (TD3) T0 DUT and all passed:

     fib/test_fib.py
     acl/test_acl.py
     arp/test_neighbor_mac_noptf.py
     fdb/test_fdb.py
     decap/test_decap.py
     pc/test_lag_2.py
     pc/test_po_cleanup.py
     pc/test_po_update.py
     everflow/test_everflow_ipv6.py
     everflow/test_everflow_testbed.py
     route/test_default_route.py
     ipfwd/test_dip_sip.py
     copp/test_copp.py
     crm/test_crm.py
2022-10-05 09:40:55 -07:00
Xichen96
a16843a67c Enable swap for haliburton device. (#11746)
Signed-off-by: Xichen Lin <lukelin0907@gmail.com>
2022-10-03 22:31:00 +00:00
zitingguo-ms
95b19bbb46
Pick up fixes and make up BRCM SAI version to 4.3.7.1 (#12069)
Signed-off-by: zitingguo-ms <zitingguo@microsoft.com>
2022-09-22 22:59:51 -07:00
Aravind Mani
37d34ddd34
DellEMC Z9332f: Fix SFP issue (#11819)
* Update sfp

* Update sfp

* Update sfp.py
2022-09-07 09:35:59 -07:00
Arun Saravanan Balachandran
c1712b8c9a
[202012] DellEMC: S6000, S6100, Z9332f - Add capabilities fields in platform.json (#11772) 2022-08-31 09:06:47 -07:00
zitingguo-ms
5b5bd5e818
[202012 BRCM SAI 4.3.7.0] Pick up fixes and make up BRCM SAI version to 4.3.7.0 (#11681)
Pick up the following fixes and update BRCM SAI to 4.3.7.0:

CS00012208537: Add back previous commit 54c5bc4848eb748
CS00012253061,SONIC-63280: WB from 3.5 to 4.3, followed by WB to 4.3
CS00012207978: SDK-296517, time spent for SAI operations
CS00012245601,SONIC-62898: Egress ACL Counted as Interface TX drops
Update pcbb with Fixes for CS00012243699
Upgrade on pcbb with Fixes for KB0025353, CS00012221689, CS00012221688, KB0025391, CS00012230519
The commit "CS00012221688:PFC frames egressing, PFC storm happens simultaneously on 2 ports" is purposely skipped, to be picked up later, because the SWSS dependency is not ready.

How to verify it
Tested build target, successful

Manually ran these tests after installing the SAI binary within image 20201231.73 on 7050CX3 (TD3) T0 DUT; all passed.

vxlan/test_vxlan_decap.py
fdb/test_fdb.py
pfcwd/test_pfcwd_all_port_storm.py
acl/null_route/test_null_route_helper.py
acl/test_acl.py
vlan/test_vlan.py
platform_tests/test_reboot.py


Signed-off-by: zitingguo-ms <zitingguo@microsoft.com>
2022-08-10 15:02:47 -07:00
Zhijian Li
24b90d7556
[cherry-pick][202012] Fix issue where HLX module failed to do postinit (#11351)
* [HLX] Fix issue where HLX module failed to do postinit (#7274)

Signed-off-by: Jing Kan jika@microsoft.com
2022-07-06 17:27:29 +08:00
Santhosh Kumar T
7a7c363548
[DellEMC] S6100 Platform Service optimization (#10989)
Why I did it
- To reduce rc.local script execution time.
- The rc.local script takes around 22 seconds on S6100.
How I did it
- Move platform-modules-s6100.service and s6100-lpc-monitor.service to run asynchronously to the rc.local script.
How to verify it
- Load the image with the changes; the rc.local script time drops from approx. 22 seconds to approx. 14 seconds during warm-/fast-reboot upgrades.
- sonic-mgmt test results.
2022-06-23 12:58:11 -07:00
Eric Zhu
27cd735082 [SONiC-CEL]: fix platform fancontrol testcase failure issue (#10934) 2022-06-08 01:21:53 +00:00
Richard.Yu
8f3edde302
[202012][BRCM SAI 4.3.5.3-5] Update saibcm for pcbb feature (#10998)
Support Tunnel PFC/pcbb feature on Broadcom platform.

How to verify it
Tested build target, successful

make target/docker-syncd-brcm.gz
Manually ran these tests after installing the SAI binary within image 20201231.67 on 7050CX3 (TD3) T0 DUT; all passed.

     fib/test_fib.py
     vxlan/test_vxlan_decap.py
     fdb/test_fdb.py
     decap/test_decap.py
     pfcwd/test_pfcwd_all_port_storm.py
     acl/null_route/test_null_route_helper.py
     acl/test_acl.py
     vlan/test_vlan.py
     platform_tests/test_reboot.py

Signed-off-by: richardyu-ms <richard.yu@microsoft.com>
2022-06-06 09:54:00 -07:00
Arun Saravanan Balachandran
8981ae5cae
[202012][cherry-pick] DellEMC: Z9332f - Component API Fixes (#10997) 2022-06-03 10:27:39 -07:00
Arun Saravanan Balachandran
33c1ba1b2c [DellEMC S5248f] Remove duplicate ipmihelper.py (#10455)
Why I did it
To remove the ipmihelper.py in S5248f directory to prevent the image label being marked 'dirty', due to the file being replaced by the ipmihelper.py in common folder during build.

How I did it
Remove ipmihelper.py in S5248f directory.

How to verify it
Build a broadcom image and verify that the tracked files are not modified.

Description for the changelog
DellEMC S5248f : Remove duplicate ipmihelper.py
2022-05-27 17:28:56 +00:00
Aravind Mani
9caf12859d
DellEMC: S52xx Reboot cause fix (#10783) 2022-05-23 21:03:11 -07:00
Aravind Mani
cceef8e36d
Dell S6100: Addition of SFP type_abbrv_name field (#10846)
* Dell S6100: Addition of SFP type_abbrv_name field

* Update sfp.py

* Update sfp.py

Co-authored-by: Aravind Mani <aravind.m1@dell.com>
2022-05-19 12:14:53 -07:00
Guohan Lu
8c2e04690e Revert "DellEMC: Z9332f - Component API Fixes (#10187)"
This reverts commit 8a38da94d5.
2022-04-02 14:08:28 -07:00
Arun Saravanan Balachandran
8a38da94d5 DellEMC: Z9332f - Component API Fixes (#10187) 2022-03-21 22:17:36 +00:00
Eric Zhu
deac4eed64
sonic-platform-modules-cel dx010: speed up dx010 platform init script (#10237)
Why I did it
To fix issue #10152 for dx010.
A 202012 warm upgrade causes lacp-teardown on the Dx010 TOR: the platform code initializes slowly, causing an LACP timeout.

How I did it
Remove the python2 sonic platform wheel which is deprecated.
Optimize the dx010 sonic platform script to speed up the init process.

How to verify it
Check the warm reboot log, warm reboot is 8-9s faster than before.

Signed-off-by: Eric Zhu <erzhu@celestica.com>
2022-03-21 09:28:25 -07:00
gechiang
a984757b9d
[202012 BRCM SAI 4.3.5.3-3] Picked up fixes that makes up BRCM SAI version 4.3.5.3-3 (#10255) 2022-03-19 17:18:50 -07:00
Santhosh Kumar T
e83955599d
[202012] Refactoring DELL platform init to reduce rc.local processing time (#10171)
Why I did it
To reduce the processing time of rc.local, refactoring s6100 platform initialization.
Fixing [warm-upgrade][202012] Slow DELL platform init in rc.local causes lacp-teardown #10150
How I did it
On branch 202012-s6100-rclocal
Changes to be committed:
  (use "git restore --staged <file>..." to unstage)
        modified:   ../../../../files/image_config/platform/rc.local
        modified:   ../debian/platform-modules-s6100.install
        modified:   scripts/fast-reboot_plugin
        modified:   scripts/s6100_platform.sh
        renamed:    scripts/s6100_i2c_enumeration.sh -> scripts/s6100_platform_startup.sh
        renamed:    systemd/s6100-i2c-enumerate.service -> systemd/s6100-platform-startup.service
2022-03-10 18:51:07 -08:00
Yang Wang
0bfc469d8c
[SAIServerV2] Build SAI Serverv2 docker (#9509) (#9940)
Support saiserver v2 with python3 and thrift 0.13.0 (Cherry-picked from master #9509)

Add variables to support saiserverv2
Build a different thrift in saithrift depending on the saiserver version
Build different versions of saiserver
Build the saiserver and saiserver docker with a version number

Testing done:
Built two different versions of saiserver in a local build environment

Co-authored-by: Richard.Yu <richard.yu@microsoft.com>
2022-03-04 20:45:39 -08:00
Aravind Mani
53f2142928
Dell S6100: Fix xcvrd fix (#10123) 2022-03-04 16:34:06 -08:00
gechiang
7fb546dce4
[202012]BRCM SAI 4.3.5.3-2 Fixes CS00012228504, SONIC-55963:SID, CS00012209080, CS00012220761, and CS00012222414 (#10155) 2022-03-04 16:24:59 -08:00
Xin Wang
179c140ec8 Fix issue of parsing syseeprom value with whitespace in middle (#10021)
Fixes #10020

Why I did it
The platform API for parsing syseeprom information read from STATE DB has an issue
parsing values that contain whitespace in the middle. The current
code assumes that the value does not contain whitespace, so everything after
the whitespace is ignored. As a result, the syseeprom values returned from the platform
API do not match the output of "show platform syseeprom".

How I did it
This change improves the regular expression for parsing syseeprom values to
accommodate whitespace in the value.
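
To illustrate the kind of regular-expression change described, here is a small sketch. The line layout (`<TLV name> <code> <length> <value>`) and both patterns are assumptions made for the example; they are not the expressions used in the platform code.

```python
import re

# Assumed line layout: "<TLV name>  <code>  <length>  <value>"
# NAIVE captures the value with \S+, so it stops at the first whitespace.
NAIVE = re.compile(
    r"^(?P<name>.+?)\s+(?P<code>0x[0-9A-Fa-f]{2})\s+(?P<len>\d+)\s+(?P<value>\S+)"
)
# FIXED captures everything up to the end of the line, minus trailing spaces.
FIXED = re.compile(
    r"^(?P<name>.+?)\s+(?P<code>0x[0-9A-Fa-f]{2})\s+(?P<len>\d+)\s+(?P<value>.*?)\s*$"
)


def parse_value(pattern, line):
    """Return (code, value) for a matching line, or None."""
    match = pattern.match(line)
    if match is None:
        return None
    return match.group("code"), match.group("value")


line = "Manufacture Date     0x25  19 11/03/2020 21:22:56"
print(parse_value(NAIVE, line))  # value is cut at the first space
print(parse_value(FIXED, line))  # value keeps the date and the time
```

The `0x25` date value in the output further down is an example of a field that contains whitespace and would be truncated by a pattern like `NAIVE`.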

How to verify it
Locally updated the code on a dx010 device. Call the platform API:
```
>>> import sonic_platform
>>> platform = sonic_platform.platform.Platform()
>>> chassis = platform.get_chassis()
>>> chassis.get_system_eeprom_info()
{'0x21': 'DX010', '0x22': 'R0872-F0020-02', '0x23': 'DX010B2F030A27BY200002', '0x24': '00:E0:EC:E7:71:0F', '0x25': '11/03/2020 21:22:56', '0x26': '3', '0x27': 'Seastone', '0x28': 'RANGELEY', '0x29': '2014.08', '0x2A': '131', '0x2B': 'CELESTICA', '0x2C': 'THA', '0x2D': 'Celestica', '0x2E': '1.0.5', '0x2F': 'LB', '0xFD': '', '0xFE': '0xAAB39BDB'}
```

Signed-off-by: Xin Wang <xiwang5@microsoft.com>
2022-02-23 22:46:38 +00:00
Richard.Yu
b4e8bcc19c
Generate sai.profile from j2 tempalte when saiserver start (#10045)
Generate sai.profile based on the brcm j2 file if sai.profile
does not exist in the DUT's mounted folder.
Change the supervisor service configuration accordingly.

Testing done:
Add the script and config on the DUT
saiserver can start automatically with [systemctl start saiserver]
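
The "generate only if missing" step might be sketched as follows. To keep the example free of third-party dependencies, Python's `string.Template` stands in for Jinja2 here; the function name, the template text, and the context keys are all hypothetical.

```python
import os
from string import Template


def ensure_profile(profile_path, template_text, context):
    """Render the profile only when it does not exist yet.

    Returns True if a file was written, False if an existing file was kept.
    string.Template is a stand-in for the real j2 rendering.
    """
    if os.path.exists(profile_path):
        # An existing profile in the mounted folder takes precedence.
        return False
    rendered = Template(template_text).substitute(context)
    with open(profile_path, "w") as handle:
        handle.write(rendered)
    return True
```

Keeping an existing file untouched means a profile placed in the mounted folder by hand is never overwritten at start-up.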

Signed-off-by: richardyu-ms <richard.yu@microsoft.com>
2022-02-21 17:55:23 -08:00
Aravind Mani
90fb01e4ab
DellEMC Z9332f: Fix thermalctld warning logs (#9977) 2022-02-14 22:53:30 -08:00
vmittal-msft
7435613216
[202012] BRCM SAI 4.3.5.3-1 Fix for CS00012218555 (#9923) 2022-02-07 08:02:57 -08:00
Samuel Angebault
b2d1dfb944
[202012][Arista] Backport logrotate configuration (#9518)
- Backport logrotate configuration for `arista*.log` files
2022-01-21 23:17:14 +05:30
gechiang
090ef33ca2
[202012]BRCM SAI 4.3.5.3 Fixes CS00012218100,CS00012215529,CS00012208995,CS00012220761,CS00012211718,CS00012208995,CS00012220761, and CS00012225760 (#9815) 2022-01-20 15:28:34 -08:00
gechiang
bdc7ce86de
[202012] BRCM SAI 4.3.5.2 Fixes CS00012205357, CS00012214196, CS00012213974 (#9754) 2022-01-13 11:40:43 -08:00
vmittal-msft
724037ebc3
BRCM SAI 4.3.5.1-9 for enabling SAI_SWITCH_ATTR_QOS_DSCP_TO_TC_MAP capability (#9463) 2021-12-14 09:56:21 -08:00