sonic-buildimage/device
Volodymyr Samotiy f1d6655004
[Mellanox] Disable SSD NCQ on Mellanox platforms (#17567)
- Why I did it
Based on some research some products might experience an occasional IO failures in the communication between CPU and SSD because of NCQ.
There seems to be a problem between some kernel versions and some SATA controllers.

Syslog error message examples:

Error "ata1: SError: { UnrecovData Handshk }" - "failed command: WRITE FPDMA QUEUED".
Error "ata1: SError: { RecovComm HostInt PHYRdyChg CommWake 10B8B DevExch }" - "failed command: READ FPDMA QUEUED".
Some vendors already disabled NCQ on their platforms in SONiC due to similar issue:

[Arista] Disable ATA NCQ for a few products #13739 [Arista] Disable ATA NCQ for a few products
[Arista] Disable SSD NCQ on DCS-7050CX3-32S #13964 [Arista] Disable SSD NCQ on DCS-7050CX3-32S
Also there are other discussions on Debian/Ubuntu forums about similar issues and it was suggested to disable NCQ:

https://askubuntu.com/questions/133946/are-these-sata-errors-dangerous

- How I did it
Add a kernel parameter to tell libata to disable NCQ

- How to verify it
Use FIO tool - fio --direct=1 --rw=randrw --bs=64k --ioengine=libaio --iodepth=64 --runtime=120 --numjobs=4
2024-01-28 16:26:07 +02:00
..
accton [Edgecore][device/accton] Fix subprocess.call issue in is_host(). (#13111) 2022-12-20 13:13:42 -08:00
alphanetworks Fix code issue when SonicV2Connector.get() return None. (#13250) 2023-01-09 11:37:43 -08:00
arista [Arista] Update config.bcm of 7060_cx32s for handling 40g optics with unreliable los settings (#17768) 2024-01-16 11:34:19 -08:00
barefoot [BFN] Remove not common entries from pcie yaml configuration (#12816) 2023-02-06 09:54:43 -08:00
broadcom [broadcom]: Set default SYNCD_SHM_SIZE for Broadcom XGS devices (#13297) 2023-01-30 20:23:03 -08:00
celestica [platform]: Disable interrupt for intel i2c-i801 driver (#16309) 2023-09-05 10:23:57 -07:00
centec [device/centec] Replace os.system and remove subprocess with shell=True (#12024) 2022-10-07 10:48:25 -04:00
cig Add platform_asic file to each platform folder in sonic-device-data based package (#8542) 2021-10-08 19:27:48 -07:00
common [Arista] Add support for DCS-7060DX5-32 (#14793) 2023-08-05 22:11:45 +08:00
dell Revert "Move /var/log to RAM for Mellanox SN2700, Nokia 7215 and Dell S6100 (#15077)" (#16775) 2023-10-11 10:36:29 -07:00
delta [device/delta] Mitigation for command injection vulnerability (#11865) 2022-10-13 21:11:51 -07:00
facebook/x86_64-facebook_wedge100-r0 Add platform_asic file to each platform folder in sonic-device-data based package (#8542) 2021-10-08 19:27:48 -07:00
fs/arm64-fs_s5800_48t4s-r0 [FS][arm64] support new boars s5800-48t4s and s5800-48t8s-mars8p (#12994) 2022-12-17 14:48:02 -08:00
ingrasys [BFN] Update configuration files (#9913) 2022-03-09 09:57:08 +05:30
inventec [BRCMSAI 6.0.0.13-1] Fix Cancun file directory at new location causing TD3 platform boot issue (#9922) 2022-02-07 08:56:06 -08:00
juniper [device/juniper] Mitigation for security vulnerability (#11838) 2022-11-22 10:46:12 -05:00
marvell [Marvell-arm64] Add platform support for rd98DX35xx (#16874) 2023-11-20 09:43:02 -08:00
mellanox [Mellanox] Disable SSD NCQ on Mellanox platforms (#17567) 2024-01-28 16:26:07 +02:00
mitac/x86_64-mitac_ly1200_b32h0_c3-r0 Add platform_asic file to each platform folder in sonic-device-data based package (#8542) 2021-10-08 19:27:48 -07:00
netberg [Netberg][Barefoot] Added support for Aurora 750 (#16342) 2023-09-01 22:52:39 -07:00
nokia [Nokia-IXR7250E] Modify the platform_reboot on the IXR7250E for PMON API reboot and Disable all SFPs (#17483) 2024-01-08 11:39:30 -08:00
pegatron/x86_64-pegatron_porsche-r0 Add platform_asic file to each platform folder in sonic-device-data based package (#8542) 2021-10-08 19:27:48 -07:00
pensando/arm64-elba-asic-r0 Add pensando platform (#15978) 2023-12-04 14:41:52 -08:00
quanta [device/quanta] Mitigation for security vulnerability (#11867) 2022-10-19 10:05:36 -04:00
ragile Revert "[Ragile]: Add new centec platform ra-b6010 (#14819)" 2023-09-01 22:43:18 -07:00
ruijie/x86_64-ruijie_b6510-48vs8cq-r0 [device/ruijie] Mitigation for security vulnerability #11779 2022-08-23 09:48:42 -04:00
supermicro/x86_64-supermicro_sse_t7132s-r0 [Supermicro]: Add a new supported device and platform, SSE-T7132S. (#15368) 2023-07-20 10:24:56 -07:00
tencent two platforms supporting S3IP SYSFS (TCS8400, TCS9400) (#12386) 2022-12-18 16:16:53 +08:00
ufispace [Ufispace][PDDF] Add PDDF support on S9110-32X, S8901-54XC, S7801-54XS and S6301-56ST (#16017) 2023-08-14 15:56:03 -07:00
virtual [sonic-vs]: Add fabric port data for vs test, and start fabricmgrd in vs environment (#16791) 2023-11-20 16:21:03 -08:00
wistron [devices]: Update the Wistron platform support in master branch (#12110) 2023-02-23 09:08:13 -08:00
wnc/x86_64-wnc_osw1800-r0 [BFN] Update configuration files (#9913) 2022-03-09 09:57:08 +05:30