Commit Graph

1184 Commits

Author SHA1 Message Date
Volodymyr Samotiy
b085fb158f
[202205] [Mellanox] Disable SSD NCQ on Mellanox platforms (#17662) 2024-01-10 14:39:54 -08:00
Vadym Hlushko
7294103e67
[202205][Mellanox] Add mlxtrace to techsupport (#15961) (#15982)
* [mlxtrace] Add mft-fwtrace-cfg.deb which contains fwtrace_cfg files for the mlxtrace utility

Signed-off-by: vadymhlushko-mlnx <vadymh@nvidia.com>

* [mlxtrace] Remove mlxtrace support for SPC4

Signed-off-by: vadymhlushko-mlnx <vadymh@nvidia.com>

---------

Signed-off-by: vadymhlushko-mlnx <vadymh@nvidia.com>
2023-12-18 14:24:02 -08:00
Arvindsrinivasan Lakshmi Narasimhan
63b6dedfcf
change the max lag_id to 1024 (#17336) 2023-11-29 10:09:07 -08:00
mssonicbld
03fd20410a
[Nokia][Nokia-IXR7250E-SUP-10] Update BCM config for supervisor card to reduce the CPU usage (#16790) (#17307) 2023-11-28 05:19:21 +08:00
mssonicbld
b6f783ffa4
Revert "Move /var/log to RAM for Mellanox SN2700, Nokia 7215 and Dell S6100 (#15077)" (#16775) (#16886) 2023-10-14 15:38:25 +08:00
Vadym Hlushko
3ac09d544a
[202205][buffers] Add 'create_only_config_db_buffers.json' file for the Mellanox devices (not MSFT SKU) (#16232)
* [buffers] Add create_only_config_db_buffers.json for MLNX devices (not MSFT SKU), inject it at the start of the swss docker

Signed-off-by: vadymhlushko-mlnx <vadymh@nvidia.com>

* [buffers] Align the sonic-device_metadata.yang

Signed-off-by: vadymhlushko-mlnx <vadymh@nvidia.com>

---------

Signed-off-by: vadymhlushko-mlnx <vadymh@nvidia.com>
2023-10-10 09:28:00 -07:00
Nazarii Hnydyn
214ea08777
[ssm]: Enable Store-And-Forward switching mode for SN2700/SN3800/SN4600C/SN4700. (#16662)
Signed-off-by: Nazarii Hnydyn <nazariig@nvidia.com>
2023-09-28 09:50:28 -07:00
vmittal-msft
9ffa4bdfff [nokia]: Updated total headroom pool size to accommodate 100G ports on T2 uplinks (#16690)
Microsoft ADO (25266920)

sonic-mgmt xoff test was failing for [100g,120km]. Needed to update total headroom pool size when 100G line card is used as T2 uplink.

This size was calculated assuming 100g is used for downlink so cable length was 2km whereas it can also be used for uplink (cable length - 120km). so we need to do calculation based on 120km not 2km. Although it will be some wastage for 2km scenario but it should cover both cases.
2023-09-27 12:32:28 +08:00
Nazarii Hnydyn
7c68be04e8
[Mellanox]: Update SKUs to enable SDK dumps. (#16286)
CHERRY-PICK: #7708

Signed-off-by: Nazarii Hnydyn <nazariig@nvidia.com>
2023-09-23 00:39:23 -07:00
mssonicbld
624a5d489f
[Mellanox] Revise label name and fix typo in sensor.conf of 4600C (#16271) (#16467)
- Why I did it
Revise lable name and fix typo in sensor.conf of 4600C

- How I did it
Revise lable name and fix typo in sensor.conf of 4600C

- How to verify it
Manual test
sonic-mgmt test_sensors.py

Co-authored-by: Junchao-Mellanox <57339448+Junchao-Mellanox@users.noreply.github.com>
2023-09-06 16:08:27 -07:00
mssonicbld
0fe5c9fc7d
[platform]: Disable interrupt for intel i2c-i801 driver (#16309) (#16457)
On S6100 we are seeing almost 100K interrupts per second on intels i801 SMBUS controller which affects systems performance.

We now disable the i801 driver interrupt and instead enable polling

Microsoft ADO (number only): 24910530

How I did it
Disable the interrupt by passing the interrupt disable feature argument to i2c-i801 driver

How to verify it
This fix is NOT applicable for ARM based platforms. Applicable only for intel based platforms:-

- On SN2700 its already disabled in Mellanox hw-mgmt
- Celestica DX010 and E1031
- Dell S6100 verified the interrupts are no longer incrementing.
- Arista 7260CX3

Signed-off-by: Prince George <prgeor@microsoft.com>
Co-authored-by: Prince George <45705344+prgeor@users.noreply.github.com>
2023-09-06 09:49:58 -07:00
mssonicbld
74dbafe728
[Nokia-IXR7250E] Modify the platform_ndk.json for Nokia-IXR7250E platform (#16355) (#16382)
Signed-off-by: mlok <marty.lok@nokia.com>
Co-authored-by: Marty Y. Lok <76118573+mlok-nokia@users.noreply.github.com>
2023-09-01 20:12:14 -07:00
mssonicbld
88d692f987
[Nokia][DeviceData] Update the Nokia platform IXR-7250E device data (#16028) (#16381)
Why I did it
Update the platform_reboot of Nokia Platform IXR-7250E-36x400G to displays the correct reboot-cause history when reboot from supervisor card.

Work item tracking
Microsoft ADO (number only):
How I did it
Modify the platform_reboot script to copy the correct reboo-cause.txt file from NDK to the /host/reboot-cause directory at the down cycle when the reboot is issued from Supervisor (for both reboot right after install a new image and normal reboot)

Signed-off-by: mlok <marty.lok@nokia.com>
Co-authored-by: Marty Y. Lok <76118573+mlok-nokia@users.noreply.github.com>
2023-09-01 16:19:22 -07:00
vmittal-msft
134a22221c
Update CPU transmitted packets to queue 7 for chassis (#16349) 2023-08-31 08:57:02 -07:00
mssonicbld
d264df3984
Dell S6100 Platform API 2.0 fixes (#16208) (#16252)
Why I did it
Dell S6100 Platform components needs to be updated.

How I did it
Modified platform.json to fix the issue.

How to verify it
Run sonic-mgmt component test and check whether it passes.

Co-authored-by: Aravind Mani <53524901+aravindmani-1@users.noreply.github.com>
2023-08-25 17:05:37 -07:00
Pavan-Nokia
5d4a201453
[armhf][Nokia-7215]Add HWSKU files for new SAI (#16175)
Add new easy bringup (EZB) files for new SAI 1.10.2-5
2023-08-18 11:23:23 -07:00
mssonicbld
a5eda5aaa8
Updated PG headroom settings for 40g port speed (#16038) (#16177)
Co-authored-by: vmittal-msft <46945843+vmittal-msft@users.noreply.github.com>
2023-08-17 08:40:00 -07:00
mssonicbld
f02ca9a749
Potential fix for Celestica E1031 device hang (#15822) (#15844) 2023-07-15 05:29:15 +08:00
mssonicbld
365afbb3f0
Move /var/log to RAM for Mellanox SN2700, Nokia 7215 and Dell S6100 (#15077) (#15804) 2023-07-13 05:01:14 +08:00
mssonicbld
ca123ca2a5
[Arista] Update hwsku.json for Arista-7050QX-32S-S4Q31 (#15251) (#15671)
* [Arista] Update hwsku.json for Arista-7050QX-32S-S4Q31

* Change to 3x10G(3)+1x1G(1) on Arista-7050QX-32S-S4Q31

Co-authored-by: byu343 <byu@arista.com>
2023-06-30 13:24:54 -07:00
Marty Y. Lok
e8e8c019c2
[Nokia-IXR7250E][Devicedata] update the device data for Nokia IXR7250E platform (#15605)
Signed-off-by: mlok <marty.lok@nokia.com>
(cherry picked from commit 478ba478f5)
2023-06-23 15:41:46 -07:00
mssonicbld
a1990d4c48
[Arista][x86_64-arista_7050_qx32] Add Components to platform.json (#15252) (#15585) 2023-06-23 03:41:58 +08:00
vmittal-msft
41cd22d04e
Updated default ECN settings for T2 chassis (#15541)
* Updated default ECN settings for T2 chassis (#14388)

Why I did it
Update ECN settings for T2 chassis

How I did it
Updated qos config file to load these settings during switch bootup

How to verify it
Verified on line card on T2 chassis

* Fix for test failures

* Test case failures

* test case fix
2023-06-22 08:17:45 -07:00
Samuel Angebault
3396cca1a3 [Arista] fix platform.json for a few devices (#15308)
Why I did it
sonic-mgmt is failing tests due to invalid test data in platform.json
Fwutil is upset the chassis name in the platform_component.json of the 7060CX-32S

How I did it
Fixed the aforementioned issues
2023-06-16 09:55:06 +08:00
Arvindsrinivasan Lakshmi Narasimhan
2e15db02f2 set the default value for the port fec to RS on J2 based LC (#15346)
Why I did it
Work item tracking
Microsoft ADO (24182162):
How I did it
update the config.bcm to set the default fec RS 100G Linecard

How to verify it
Tests on chassis
2023-06-10 14:32:40 +08:00
vmittal-msft
723c508a30
Update PG headroom settings ports based on port speed/cable length (#15287)
Why I did it
Update cable length for uplink/downlink ports for chassis and and update PG/pool headroom size accordingly.

Work item tracking
17880812

How I did it
Updated cable length as well as buffer config in HWSKU files.
2023-06-02 15:48:11 -07:00
Neetha John
9f66b28068
[202205][brcm] Fix alignment of soc properties (#15240)
Why I did it
To improve readability of config.bcm, fixed the alignment of soc properties

How to verify it
Build sonic_config_engine-1.0-py3-none-any.whl successfully

Signed-off-by: Neetha John <nejo@microsoft.com>
2023-05-30 16:15:32 -07:00
Neetha John
8661e9de2c
[202205] Update SOC properties for DLR_INIT based pfcwd recovery (#15217)
Why I did it
Update soc properties for certain roles that need to use pfcwd dlr init based recovery mechanism

How to verify it
Updated the templates on a 7050cx3 dual tor and 7260 T1 which satisfies these conditions and validated pfcwd recovery which uses DLR_INIT based mechanism. Also validated that this mechanism is not used on 7050cx3 single tor with the updated templates

Signed-off-by: Neetha John <nejo@microsoft.com>
2023-05-26 14:29:34 -07:00
arista-nwolfe
9f8a250f61
Add soc property sai_pfc_dlr_init_capability=0 to missing DNX SKUs (#15155) 2023-05-26 09:08:40 -07:00
Marty Y. Lok
6335f610c8 [Nokia][device-data] Modify the Nokia-7250IXRE platform specific reboot script (#14568)
Why I did it

When reboot the chassis by issuing "sudo reboot" on Supervisor card. The internal midplane communication xe0 should be shutdown to avoid double reboot on the linecard.
Added a udev link rule to disable the autoneg on AMD xgbe port Xe0 and Xe1 and make the setting in sync with the peer Broadcom greyhound ports.

How I did it

Modify the Nokia-7250IXRE specific reboot script on the Supervisor card to shutdown the internal interface xe0. Also move reboot linecard code to the top of the script to make sure the notification has been send to Linecard before shutdown the xe0 interface.
Introduced a new rule 80-net-by-driver.link to disable the autoneg on the AMD size. This change requires the latest NDK which contains the change to set the autoneg on the xe0 and xe1 port on the Greyhound.

Signed-off-by: mlok <marty.lok@nokia.com>
2023-05-17 12:32:42 +08:00
mssonicbld
4cdcda43b8
[devices/arista] Update asic_port_name in Arista LCs (#14234) (#14732)
Updated asic_port_names for all Arista LC SKUs to follow latest naming
conventions to remove redundant ASICx suffix. For
Arista-7800R3-48CQ2-C48, added the asic_port_name mapping.

Co-authored-by: kenneth-arista <93353051+kenneth-arista@users.noreply.github.com>
2023-04-19 23:08:03 -07:00
mssonicbld
01311046ca
[devices/arista]: Added recycle ports required for egress mirroring (#13967) (#14731) 2023-04-20 05:56:32 +08:00
Pavan-Nokia
f6f060f49d
[armhf][Nokia-7215]Add SFP refactor support for Nokia-7215 platform (#14551)
Why I did it
Add support for SFP refactor on Nokia-7215 Marvell armhf platform.

Platform: armhf-nokia_ixs7215_52x-r0
HwSKU: Nokia-7215
ASIC: marvell
Port Config: 48x1G + 4x10G (SFP+)
How I did it
Modify sfp.py to support SFP refactor optoe driver and platform.json to facilitate proper OC test completion.

How to verify it
Build armhf target for Nokia-7215 and verify proper Xcvrd and SFP refactor operation.
2023-04-17 17:26:15 -07:00
Ikki Zhu
a2e244d3bd [Seastone] DX010 platform switch to sfp-refactor based sfp impl (#13972)
Why I did it
sonic-sfp based sfp impl would be deprecated in future, change to sfp-refactor based implementation.

How I did it
Use the new sfp-refactor based sfp implementation for seastone.

How to verify it
Manual test sfp platform api or run sfp platform test cases.
2023-04-03 16:34:16 +08:00
Neetha John
06939ffaec
[202205] [qos] Update RDMA-CENTRIC lossy profile to use static threshold for Th devices (#14399)
Backport #14372 to 202205

Why I did it
For better accounting purposes, updating the ingress lossy traffic profile to use static threshold. This change is only intended for Th devices using RDMA-CENTRIC profiles

How I did it
Update the buffer templates for Th devices in RDMA-CENTRIC folder to use the correct threshold

Signed-off-by: Neetha John <nejo@microsoft.com>
2023-03-24 10:42:15 -07:00
Pavan-Nokia
cb2e03da8e
[Nokia-7215]Update Marvell EZB XML files (#14261)
Update cnc-base counters to 4096 from 0 in the Marvell SAI
Easy bringup XML files
2023-03-17 10:58:55 -07:00
mssonicbld
32ac9b0bce
Update dynamic threshold for TD2 (#14224) (#14273)
Why I did it
Update dynamic threshold to -1 to get optimal performance for RDMA traffic

How I did it
Modified pg_profile_lookup.ini to reflect the correct value

Signed-off-by: Neetha John <nejo@microsoft.com>
Co-authored-by: Neetha John <nejo@microsoft.com>
2023-03-16 17:04:42 -07:00
Samuel Angebault
9ac78190ee [Arista] Add missing platform_components.json (#14067)
Provide platform-components.json for Clearwater2 and Wolverine

These files are needed for fwutil platform sonic-mgmt tests to pass.

Fix PikeZ platform_components.json

Co-authored-by: Patrick MacArthur <pmacarthur@arista.com>
Co-authored-by: Andy Wong <andywong@arista.com>
2023-03-16 04:32:32 +08:00
Jemston Fernando
f39bdaecf8
Fix Belgite platform issues (#14035)
As part of platform hardening this commit fixes several platform issues
in various components like PSU, FAN, Temperature, LED.
Cherrypick PR#13389
2023-03-15 19:29:00 +00:00
Sambath Kumar Balasubramanian
da348ecb79 sonic-buildimage Remove unused SAT port from arista configs. (#14167)
Why I did it
To fix aristanetworks/sonic#85

How I did it
Remove unnecessary SAT ports

How to verify it
Speed change from 400-100g without any error.
2023-03-14 04:32:42 +08:00
Ikki Zhu
3a8305d4b0 [Seastone] fix dx010 qsfp eeprom data write issue (#13930)
Why I did it
Platform cases test_tx_disable, test_tx_disable_channel, test_power_override failed in dx010.

How I did it
Add i2c access algorithm for CPLD i2c adapters.

How to verify it
Verify it with platform_tests/api/test_sfp.py::TestSfpApi test cases.
2023-03-06 16:39:26 +08:00
kenneth-arista
812c1aeecf
sonic-buildimage Make changes to arista config.bcm files to support max cores (#13831) (#14033)
To support 64 cores on arista skus. Fixes aristanetworks/sonic#77
Remapped recycle ports to lowers core port ids and set appl_param_nof_ports_per_modid to 64.

Co-authored-by: Sambath Kumar Balasubramanian <63021927+skbarista@users.noreply.github.com>
2023-03-02 13:08:00 -08:00
mssonicbld
ab6b3cde4e
Add QOS profiles for Arista SKUs (#13829) (#14040) 2023-03-02 14:51:41 +08:00
Stephen Sun
76a5c75b82 [Mellanox] Advance hw-mgmt to v.7.0020.4104 (#13372)
- Why I did it
Advance hw-mgmt service to V.7.0020.4100
Add missing thermal sensors that are supported by hw-mgmt package
Delay system health service before hw-mgmt has started on Mellanox platform in order to avoid reading some sensors before ready.
Depends on sonic-net/sonic-linux-kernel#305

- How I did it
1. Update hw mgmt version
2. Add missing sensors
3. Delay service 

- How to verify it
Regression test.

Signed-off-by: Stephen Sun <stephens@nvidia.com>
2023-03-01 12:38:50 +08:00
mssonicbld
e842241f71
add psu fans status led available config (#13926) (#13936) 2023-02-23 06:29:38 +08:00
Samuel Angebault
3c3a4ac517 [Arista] update sensors.conf to ignore sensors (#12529)
Why I did it
The sensors and sensord processes were reporting data on unused sensors.
This lead to ALARM messages or erroneous values that could be misinterpreted.

How I did it
Ignore the affected sensors in the sensors.conf

How to verify it
Check that there are no longer ALARM messages from sensord in the syslog or in the output of sensors
2023-02-22 04:33:53 +08:00
mssonicbld
e44b255555
[DX010 platform] fix dx010 platform testcase issues (#13595) (#13778)
Why I did it
1. fix chassis test_set_fans_led case
2. fix chassis get_name case mismatch issue
3. fix fan_drawer test_set_fans_speed
4. fix component test_components test case

How I did it
Add corresponding configuration into chassis json file

How to verify it
Run platform tests cases to verify these failure cases

Co-authored-by: Ikki Zhu <79439153+qnos@users.noreply.github.com>
2023-02-10 18:18:00 -08:00
mssonicbld
cdbdf95e70
fix platform.json on Wolverine for thermal sensors (#13524) (#13748)
Why I did it
The current platform.json contains entries for thermals and SFPs that do not exist on Wolverine.

How I did it
I removed the incorrect entries.

How to verify it
Verify using applicable sonic-mgmt platform API tests.

Co-authored-by: Patrick MacArthur <patrick@patrickmacarthur.net>
2023-02-10 14:39:00 -08:00
mssonicbld
268e866c02
[Celestica DX010] fix fan drawer and watchdog platform testcase issues (#13426) (#13747)
Why I did it
fix DX010 fan drawer and watchdog platform test case issues

How I did it
1. Add fan_drawer get_maximum_consumed_power support
2. Adjust maximum watchdog timeout value check

How to verify it
Run test_fan_drawer and test_watchdog test cases.

Co-authored-by: Ikki Zhu <79439153+qnos@users.noreply.github.com>
2023-02-10 09:19:38 -08:00
mssonicbld
06aa8aa11b
[Mellanox] Support DSCP remapping in dual ToR topo on T0 switch (#12605) (#13745)
- Why I did it
Support DSCP remapping in dual ToR topo on T0 switch for SKU Mellanox-SN4600c-C64, Mellanox-SN4600c-D48C40, Mellanox-SN2700, Mellanox-SN2700-D48C8.

- How I did it
Regarding buffer settings, originally, there are two lossless PGs and queues 3, 4. In dual ToR scenario, the lossless traffic from the leaf switch to the uplink of the ToR switch can be bounced back.
To avoid PFC deadlock, we need to map the bounce-back lossless traffic to different PGs and queues. Therefore, 2 additional lossless PGs and queues are allocated on uplink ports on ToR switches.

On uplink ports, map DSCP 2/6 to TC 2/6 respectively
On downlink ports, both DSCP 2/6 are still mapped to TC 1
Buffer adjusted according to the ports information:
Mellanox-SN4600c-C64:
56 downlinks 50G + 8 uplinks 100G
Mellanox-SN4600c-D48C40, Mellanox-SN2700, Mellanox-SN2700-D48C8:
24 downlinks 50G + 8 uplinks 100G

- How to verify it
Unit test.

Signed-off-by: Stephen Sun <stephens@nvidia.com>
Co-authored-by: Stephen Sun <5379172+stephenxs@users.noreply.github.com>
2023-02-10 09:16:56 -08:00