6736 Commits

Author SHA1 Message Date
Rajkumar-Marvell
bc70978540
[Marvell] Update armhf sai debian (#13037)
Why I did it
Update Marvell armhf sai debian version. Includes support for:
Autoneg
Everflow

How to verify it
Compile and load SONiC armhf bin

Signed-off-by: rajkumar38 <rpennadamram@marvell.com>
2022-12-14 12:10:09 +08:00
mssonicbld
faaaea0464
[ci/build]: Upgrade SONiC package versions (#13036) 2022-12-13 23:15:57 +08:00
Ying Xie
8b924e99c7
[202205][linkmgrd][sairedis][platform-daemons][platform-common] advance submodule head (#13031)
linkmgrd:
* d7bf12b 2022-12-08 | update link prober stats post logic (#159) (HEAD -> 202205) [Jing Zhang]
* 1f28c08 2022-12-08 | [Active-Active] remove chatty messages (#158) [Jing Zhang]

sairedis:
* 6a64b85 2022-12-07 | Initialize attr variables in Legacy.switch_get and LegacyFdbEntry.fdb_entry_get (#1169) (HEAD -> 202205) [Saikrishna Arcot]

platform-daemons:
* fb4b158 2022-12-08 | [thermalctld] fix some redundant removal of state DB tables (#315) (HEAD -> 202205) [vdahiya12]

platform-common:
* 9170877 2022-12-04 | [syseeprom] Remove the trailing space in the value of VENDOR_EXT field in the eepromTlvInfo decode (#333) (HEAD -> 202205, github/202205) [Marty Y. Lok]
* 79a36e2 2022-12-02 | [202205] Add media assignment options to Application Advertisement (#324) (#329) [Junchao-Mellanox]
* b20896c 2022-12-02 | [Ci] Upgrade to bullseye and fix the branch reference issue (#332) [xumia]

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2022-12-12 19:48:52 -08:00
Sudharsan Dhamal Gopalarathnam
f48706d5e2
[yang] Add collector_vrf to sflow yang model (#12897) (#13025)
- Why I did it
Fixed sflow yang model to include collector_vrf field.

- How I did it
Added leaf for collector_vrf under sflow_collector. Additionally aligned the configuration guide

- How to verify it
Added UT to verify.
2022-12-12 16:33:59 -08:00
mssonicbld
557283a7f8
Fix dependency of dhcp-mon on VLAN with only v6 (#13006) (#13029) 2022-12-13 08:12:33 +08:00
mssonicbld
943ea1b005
[arista] Add platform.json for arista chassis LC5 (#12949) (#13027) 2022-12-13 07:39:07 +08:00
mssonicbld
a42a7679f7
[ci] Disable Shallow fetch in pipeline build. (#12964) (#13028) 2022-12-13 06:40:53 +08:00
Saikrishna Arcot
c725dfb975 Replace logrotate cron file with (adapted) systemd timer file (#12921)
Debian is shipping a systemd timer unit for logrotate, but we're also
packaging in a cron job, which means both of them will run, potentially
at the same time. Remove our cron file, and add an override to the
shipped timer file to have it be run every 10 minutes.
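
The override described above can be sketched as a small generator for a systemd drop-in. This is only an illustration, not the file shipped in the PR: the function name `render_timer_override` is hypothetical, and the `*:0/10` calendar expression is an assumed way of expressing "every 10 minutes".

```python
def render_timer_override(every_minutes=10):
    """Return the text of a systemd drop-in that reschedules a timer.

    OnCalendar is a list setting, so the empty assignment first clears the
    schedule shipped by the distribution before the new one is added.
    """
    if not 1 <= every_minutes <= 59:
        raise ValueError("every_minutes must be between 1 and 59")
    return (
        "[Timer]\n"
        "OnCalendar=\n"
        f"OnCalendar=*:0/{every_minutes}\n"
    )
```

The returned text would typically be written to a drop-in directory for the timer unit, after which `systemctl daemon-reload` picks up the change.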

Fixes #12392.

Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>
2022-12-13 06:34:28 +08:00
mssonicbld
6891aa915a
[ci/build]: Upgrade SONiC package versions (#13017) 2022-12-11 22:19:51 +08:00
mssonicbld
e428afae01
[ci/build]: Upgrade SONiC package versions (#13015) 2022-12-10 22:16:58 +08:00
lixiaoyuner
b0c9013ea1
Add k8s master feature (#11637) (#12984)
Signed-off-by: Yun Li <yunli1@microsoft.com>

* Add k8s master feature

* Update kubernetes version mistake and make variable passing clear

* Add CRI-dockerd package

* Update version variable passing logic

* Upgrade the worker kubernetes version

* Install xml file parse tool
2022-12-09 10:43:54 +08:00
Saikrishna Arcot
083a669635 [build]: Fix docker load image tag not being the expected tag (#12959)
PR #12829 modified the docker tagging scheme such that optional docker
containers would be tagged with the SONiC image version. However, the
docker-image-load macro wasn't updated for these changes. Update it
here.

Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>
2022-12-09 09:58:56 +08:00
byu343
6a718a3685 [Arista] Disable pcie checking on x86_64-arista_7050cx3_32s (#12900)
This change is to disable the pcie firmware check done by Broadcom SAI. The change is needed for the Arista platform x86_64-arista_7050cx3_32s; otherwise, the check will fail, blocking the initialization.

A pcie firmware check was added in the brcm SDK, and certain Arista hardware does not comply with the check, so we originally added disable_pcie_firmware_check for x86_64-arista_7060dx4_32. x86_64-arista_7050cx3_32s was able to pass the check, but a firmware change made in August caused it to fail.
2022-12-09 06:32:36 +08:00
mssonicbld
4147610fc5
[PATCH]: Update FRR patch 0009 to avoid inappropriate error messages (#12912) (#13004) 2022-12-09 06:23:58 +08:00
mssonicbld
1a34cff029
[sflow]: Unblocked psample_*() function calls in BRCM ESW platforms for proper functionality of sflow feature (#12918) (#13001) 2022-12-09 03:40:48 +08:00
Arvindsrinivasan Lakshmi Narasimhan
f1b7b68a52
[202205][chassis]fix to use the correct table in chassis_state_db (#12992)
Why I did it
In PR sonic-net/sonic-platform-daemons#311, the table for updating the fabric asic was changed. This PR updates docker-init.sh to use the correct table to detect the fabric asic.

How I did it
update docker-init.sh

How to verify it
Check on chassis


Signed-off-by: Arvindsrinivasan Lakshmi Narasimhan <arlakshm@microsoft.com>
2022-12-08 11:36:45 -08:00
Junchao-Mellanox
5ea8d0c615 [system-health] Led color shall be controlled by configuration when system is booting (#12487)
* [system-health] Led color shall be controlled by configuration when system is booting

* Fix unit test issue
2022-12-08 17:19:43 +00:00
Stepan Blyshchak
7ed1cd0d68 [services] kill container on stop in warm/fast mode (#10510)
- Why I did it
To optimize stop on warm boot.

- How I did it
Added kill for containers
2022-12-08 17:19:16 +00:00
Michael Li
41858170d8 Limit reload BCM SDK kmods on syncd start to PikeZ platform (#12971)
Why I did it
Limit the #12804 changes to the PikeZ platform only (Arista-720DT-48S). Note that this is a short-term workaround for this platform until the investigation into the SDK init failure on syncd docker restart (due to DMA issues) is resolved.

How I did it
Retrieve platform name from /host/machine.conf and only reload SDK kmods on Arista-720DT-48S platform.
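
A minimal sketch of this kind of platform gating, assuming `/host/machine.conf` holds simple `KEY=VALUE` lines. The key names in `PLATFORM_KEYS`, the helper names, and the platform string used in the usage note are assumptions for illustration; the actual change lives in a shell start script and may read a different key.

```python
# Assumption: the platform string is stored under one of these keys.
PLATFORM_KEYS = ("onie_platform", "aboot_platform")


def parse_machine_conf(text):
    """Parse KEY=VALUE lines, ignoring blank lines and comments."""
    conf = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        conf[key.strip()] = value.strip()
    return conf


def platform_name(conf):
    """Return the first platform value found, or None if no key is present."""
    for key in PLATFORM_KEYS:
        if key in conf:
            return conf[key]
    return None


def should_reload_kmods(conf_text, allowed_platforms):
    """True only when the parsed platform is in the allow-list."""
    return platform_name(parse_machine_conf(conf_text)) in allowed_platforms
```

A caller would read the file contents and pass an allow-list containing only the affected platform, for example `should_reload_kmods(open("/host/machine.conf").read(), {"x86_64-arista_720dt_48s"})`, where the platform string is a placeholder.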

Signed-off-by: Michael Li <michael.li@broadcom.com>
2022-12-08 17:18:00 +00:00
Ying Xie
7da66c2943 Revert "Revert "Reload BCM SDK kmods on syncd start to handle syncd restart issues (#12804)""
This reverts commit 7e910aecad.
2022-12-08 17:17:41 +00:00
Samuel Angebault
7c948a3b42
[202205][Arista] Update platform library submodules (#12968)
- add reboot cause support for linecards
- add back a Wolverine variant removed by mistake
- misc fixes and improvements
2022-12-07 14:59:29 -08:00
Jing Kan
f0f143ea62 [Arista 720DT] Create SKU alias Arista-720DT-G48S4 (#12905) 2022-12-08 04:32:58 +08:00
Stepan Blyshchak
699800bdf1 [swss.sh] optimize macsec feature state query (#12946)
- Why I did it
Bootup is slowed by the execution of a show command during startup of the swss service. show is a fairly heavy command and takes a long time to execute (~2 sec).

- How I did it
I replaced show with sonic-db-cli, which takes about a millisecond to run.
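
The idea of querying the database directly instead of invoking `show` can be sketched as follows. The `FEATURE|<name>` key and `state` field are assumptions about the CONFIG_DB layout, and the function names are hypothetical; the real change is in the `swss.sh` shell script.

```python
import subprocess


def feature_state(feature, run=subprocess.run):
    """Read a feature's state field from CONFIG_DB with a single HGET."""
    # Assumed key layout: FEATURE|<name>, field "state".
    cmd = ["sonic-db-cli", "CONFIG_DB", "HGET", f"FEATURE|{feature}", "state"]
    result = run(cmd, capture_output=True, text=True, check=False)
    return result.stdout.strip()


def is_enabled(feature, run=subprocess.run):
    """True when the stored state is exactly 'enabled'."""
    return feature_state(feature, run=run) == "enabled"
```

The `run` parameter exists only so the lookup can be exercised without a switch; on a device the default `subprocess.run` would be used.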

- How to verify it
Boot the switch and verify swss is active.

Signed-off-by: Stepan Blyschak <stepanb@nvidia.com>
2022-12-08 04:32:54 +08:00
Ikki Zhu
776cb55d96 Seastone: fix platform fan psu and temperature issues (#12567)
Why I did it:
Fix multiple seastone platform issues caused by sonic kernel upgrade.

How I did it:
Get gpio base id with new label path in gpio sys fs.
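
Looking up a GPIO chip's base number by its label in the GPIO sysfs tree could look roughly like the sketch below. The function name and the label passed in are hypothetical; the actual platform code and the label it matches are not shown in this commit message.

```python
import glob
import os


def gpio_base_for_label(label, sysfs_root="/sys/class/gpio"):
    """Return the base GPIO number of the chip whose label matches, else None."""
    for chip_dir in sorted(glob.glob(os.path.join(sysfs_root, "gpiochip*"))):
        try:
            with open(os.path.join(chip_dir, "label")) as f:
                chip_label = f.read().strip()
            if chip_label != label:
                continue
            with open(os.path.join(chip_dir, "base")) as f:
                return int(f.read().strip())
        except (OSError, ValueError):
            # Chip vanished or has unreadable attributes; try the next one.
            continue
    return None
```

Matching on the label rather than a fixed chip number keeps the lookup stable when a kernel upgrade changes GPIO numbering.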

How to verify it:
After the change, show platform fan/psustatus/temperature works well.
2022-12-08 02:37:56 +08:00
Marty Y. Lok
3d9dcf5294 [armhf][sonic-installer] Fix issue of the sonic-installer install a image after sonic-installer clean (#12609)
Signed-off-by: mlok <marty.lok@nokia.com>
2022-12-08 02:37:52 +08:00
Ying Xie
bf890a46bb
[202205][utilities][swss][sairedis] advance submodule head (#12974)
utilities:
* 4b51e41 2022-12-06 | [config] Add check in config interface ip command to block if the interface is portchannel member (#2539) (HEAD -> 202205) [Sudharsan Dhamal Gopalarathnam]
* e53b32e 2022-12-06 | [generate_dump] [Mellanox] Fix the duplicate dfw dump collection problem by adding symlinks (#2536) [Vivek]
* 0391221 2022-12-02 | [GCU] Add RemoveCreateOnlyDependency Validator/Generator (#2500) [jingwenxie]
* e3658e9 2022-04-13 | [scripts/fast-reboot] Shutdown remaining containers through systemd (#2133) [Stepan Blyshchak]

swss:
* 1a4a5d9 2022-12-02 | [ACL] Support ACTION_COUNTER action in custom ACL table type (#2550) [bingwang-ms]
* 33b0a9e 2022-12-05 | [muxorch] Adding case for maintaining current state (#2280) [Nikola Dancejic]

sairedis:
* b29bb45 2022-12-02 | enable cisco8000 SAI bulk API feature (#1153) (HEAD -> 202205) [Keith Lu]

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2022-12-07 08:50:09 -08:00
Saikrishna Arcot
0eeb604258 [build]: Disable stretch slave container (#12868)
The only platforms that currently need the stretch slave container are
innovium and nephos, and both are not building with the current code due
to other issues. All other platforms only need buster and bullseye slave
containers.

Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>
2022-12-07 16:37:43 +08:00
Ikki Zhu
561bf2e076 [Platform/Seastone]: fix syseeprom tlv read issue (#12200)
Why I did it
Fix incorrect reads of the Seastone syseeprom TLV header.

How I did it
Set mux idle_state

How to verify it
i2cdump -y -f 12 0x50 i
2022-12-07 12:38:37 +08:00
Santhosh Kumar T
dad23df0c3 [DellEMC] Master: S6100: SSD upgrade status: Moving from smartctl to iSMART (#12784)
Why I did it
The smartctl tool is available only in the PMON docker, so it may not be accessible if the PMON docker goes down.
Use the iSMART_64 tool to fetch the SSD firmware version and device model information instead.

How I did it
Replacing smartctl with iSMART_64.
2022-12-07 12:38:36 +08:00
Marty Y. Lok
f187febcca [Nokia]Update Nokia platform submodule for Nokia-IXR7250E platform (#12876)
1d53bf4 Skip platform NDK health check two times in watchdog.sh
d68297c Added code to shutdown the channel after the grpc call also fixed the show fp-status command
0769efe Implemented the module API to return the correct eeprom info for fabric card.
171569c Remove explicit logger identifier for transceiver module operations; use inherited id
6c4d651 Corrected the log messages for firmware install

Signed-off-by: mlok <marty.lok@nokia.com>
2022-12-07 12:38:32 +08:00
LuiSzee
fa31cac213 [centec][arm64] fix tsingma bsp compile error (#12774)
fix centec arm64 tsingma bsp compile error caused by linux kernel api change
2022-12-06 18:35:30 +08:00
Yutong Zhang
f1fa8a6c8e
[TestbedV2][202205]Set all jobs mandatory in pipeline. (#12939)
Why I did it
The t0-sonic job has recently been running stably on the 202205 branch, so this PR makes it mandatory in the Azure pipeline.

How I did it
Modify the value of continueOnError in this job from `true` to `false`.

Signed-off-by: Yutong Zhang <yutongzhang@microsoft.com>
2022-12-06 10:18:49 +08:00
mssonicbld
3f8bd9823a
[dockers] save extension dockers with an image tag (#12829) (#12947) 2022-12-06 06:29:39 +08:00
Yutong Zhang
8e4b9bc031
[TestbedV2][202205] Remove timeout in each step. (#12917)
Why I did it
Previously, we set a timeout on each step, such as Lock testbed, Prepare testbed, Run test and KVM dump. When an issue such as a retry occurs in one step, it causes a timeout error even though the step only needs more time to succeed. In this PR, we remove the timeout limit on each step and control the timeout at the job level instead. When a job runs for more than four hours, it is cancelled.

How I did it
Remove the timeout parameter in each step, and control the timeout outside in each job.

How to verify it
Set the timeout of one job to 4 hours, and when timeout happens, azure pipeline will cancel this job.

Signed-off-by: Yutong Zhang <yutongzhang@microsoft.com>
2022-12-05 14:49:02 +08:00
Richard.Yu
a30d4a9008
[SAI-PTF][202205]enable sai-ptf logger in sai_adapter to log all the sai api … (#12923)
Why I did it
Enable the sai-ptf logger in sai_adapter to log all SAI API invocations.

How I did it
Add a build parameter to enable the sai-ptf logger when building the SAI RPC.

How to verify it
local build test
test the generated sai_adapter
test with pipeline
Signed-off-by: richardyu-ms <richard.yu@microsoft.com>
2022-12-04 22:09:57 -08:00
Richard.Yu
517e3eb45e
[202205][submodule][sairedis]Advance sairedis head pointer (#12931)
Advance the submodule head to include this change:
- https://github.com/sonic-net/sonic-sairedis/pull/1166 a1d8af296c

Signed-off-by: richardyu-ms <richard.yu@microsoft.com>
2022-12-04 21:43:03 -08:00
mssonicbld
e533fa4ca8
Fix code irregular issues (#12595) (#12934) 2022-12-05 11:56:38 +08:00
mssonicbld
ae80bab975
Improve feature mode switch process (#12188) (#12919)
* Fix kube mode to local mode long duration issue

* Remove IPV6 parameters which is not necessary

* Fix read node labels bug

* Tag the running image to latest if it's stable

* Disable image_version_higher check

* Change image_version_higher checker test case

Signed-off-by: Yun Li <yunli1@microsoft.com>
Co-authored-by: lixiaoyuner <35456895+lixiaoyuner@users.noreply.github.com>
2022-12-04 23:04:45 +08:00
mssonicbld
7152e84277
Make client identity by AME cert (#11946) (#12908) 2022-12-02 13:13:26 +08:00
Ying Xie
02345963db
[202205][utilities][platform-daemon] advance submodule head (#12910)
utilities:
* 252910a 2022-11-29 | [drop counters] Fix CLI script for unconfigured PGs (#2518) (HEAD -> 202205) [Lior Avramov]
* f2bf7ed 2022-11-30 | Change show kube command default value of insecure key to True (#2517) [lixiaoyuner]
* 0a030ce 2022-12-01 | [QoS] Introduce delay to the qos reload flow (#2503) [DavidZagury]
* d2fa21c 2022-11-02 | Disable "tag as local" when reboot (#2451) [lixiaoyuner]

platform-daemon:
* 5532070 2022-12-01 | Remove the argument that is causing the xcvrd to crash (#318) (HEAD -> 202205) [Vivek]

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2022-12-01 15:05:36 -08:00
Ying Xie
7e910aecad Revert "Reload BCM SDK kmods on syncd start to handle syncd restart issues (#12804)"
This reverts commit 132c6e934a.
2022-12-01 19:47:33 +00:00
Saikrishna Arcot
60afb50c52
[202205] Update Linux kernel from 5.10.103 to 5.10.140 (#12660) #12874
Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>
(cherry picked from commit 672367c33e)
2022-12-01 09:25:40 -08:00
zitingguo-ms
88ad1751da
Publish docker saiserverv2 in 202205 branch (#12882)
Why I did it
Publish docker saiserverv2 in the build pipeline.

How I did it
Cherry-pick #12842 from master to 202205 branch.

How to verify it
Run test in #12843 and it has been built out successfully.
2022-11-30 22:26:07 -08:00
Neetha John
69343494dd Update ECN settings for storage backend (#12855)
Signed-off-by: Neetha John <nejo@microsoft.com>

Why I did it
ECN parameters need to be updated for storage backend

How I did it
Included the check for storage backend devices to update qos configs

How to verify it
Verified that the new ecn settings are applied on storage backend device.
Verified that the old ecn settings are applied for storage frontend, non storage frontend/backend devices
2022-12-01 01:36:46 +00:00
andywongarista
465cd9db8b [Arista] Enable ipv6 128b lpm on 720DT-48S (#12832)
Why I did it
Added to allow test_crm_route to pass; the test tries to add a /126 ipv6 route and this change is required in order for the count of available routes to be updated correctly.
2022-12-01 01:36:29 +00:00
Michael Li
132c6e934a Reload BCM SDK kmods on syncd start to handle syncd restart issues (#12804)
Why I did it
There is an issue on the Arista PikeZ platform (using T3.X2: BCM56274) while running SONiC. If the 'syncd' container in SONiC is restarted, the expected behaviour is that syncd will automatically restart/recover; however it does not and always fails at create_switch due to BCM SDK kmod DMA operation cancellation getting stuck.

Sep 16 22:19:44.855125 pkz208 ERR syncd#syncd: [none] SAI_API_SWITCH:platform_process_command:428 Platform command "init soc" failed, rc = -1.
Sep 16 22:19:44.855206 pkz208 INFO syncd#supervisord: syncd CMIC_CMC0_PKTDMA_CH4_DESC_COUNT_REQ:0x33#015
Sep 16 22:19:44.855264 pkz208 CRIT syncd#syncd: [none] SAI_API_SWITCH:platformInit:1909 initialization command "init soc" failed, rc = -1 (Internal error).
Sep 16 22:19:44.855403 pkz208 CRIT syncd#syncd: [none] SAI_API_SWITCH:sai_driver_init:642 Error initializing driver, rc = -1.
...
Sep 16 22:19:44.855891 pkz208 CRIT syncd#syncd: [none] SAI_API_SWITCH:brcm_sai_create_switch:1173 initializing SDK failed with error Operation failed (0xfffffff5).

Reloading the BCM SDK kmods allows the switch init to continue properly.

How I did it
If BCM SDK kmods are loaded, unload and load them again on syncd docker start script.
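
The "unload, then load again" step can be sketched as a pure planning function over the contents of `/proc/modules`. The module names in `BCM_KMODS` and their order are assumptions, and the use of `rmmod`/`modprobe` is illustrative; the real start script may use different names and commands.

```python
# Assumed module names, listed in load order; the real script may differ.
BCM_KMODS = ["linux_kernel_bde", "linux_user_bde", "linux_bcm_knet"]


def loaded_modules(proc_modules_text):
    """Names of loaded modules, parsed from the text of /proc/modules."""
    return {
        line.split()[0]
        for line in proc_modules_text.splitlines()
        if line.strip()
    }


def reload_plan(proc_modules_text, kmods=BCM_KMODS):
    """Commands to unload (reverse order) and reload the kmods that are loaded."""
    loaded = loaded_modules(proc_modules_text)
    present = [m for m in kmods if m in loaded]
    if not present:
        return []  # Nothing loaded, so there is nothing to reload.
    unload = [["rmmod", m] for m in reversed(present)]
    load = [["modprobe", m] for m in present]
    return unload + load
```

Unloading in reverse order is a conservative choice so that dependent modules are removed before the modules they depend on.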

How to verify it
Steps to reproduce:

In SONiC, run 'docker ps' to see current running containers; 'syncd' should be present.
Run 'docker stop syncd'
Wait ~1 minute.
Run 'docker ps' to see that syncd is missing.
Check logs to see messages similar to the above.

Signed-off-by: Michael Li <michael.li@broadcom.com>
2022-12-01 01:36:18 +00:00
Yutong Zhang
d14621563f
Support passing the instance numbers of a testplan. (#12881)
Why I did it
Previously, we hard-coded the minimum and maximum number of instances in a plan. This PR adds support for passing the instance numbers of a testplan.

How I did it
Use a variable to set the instance number.
2022-12-01 08:48:57 +08:00
Ying Xie
552c8017f6
[202205][platform-daemon][platform-common] advance submodule head (#12870)
platform-daemon:
* f057b9a 2022-11-30 | [ycabled] fix minor appl_db retrieving logic for update (#319) (HEAD -> 202205) [vdahiya12]
* 9983106 2022-11-15 | [chassisd] update chassisd to write fabric and lc asics on separate table (#311) (HEAD -> 202205) [arlakshm]
* 8324c26 2022-11-10 | [ycabled] fix exception-handling logic for ycabled (#306) [vdahiya12]
* eaf73f8 2022-11-07 | [ycabled] move swsscommon API's from subroutines to call them exactly once per task_worker/thread (#303) [vdahiya12]

platform-common:
* 4b528a0 2022-11-21 | Add missing PM and VDM related EEPROM read (#326) (HEAD -> 202205) [mihirpat1]
* 8ca6c17 2022-11-14 | Initial commit (#323) [mihirpat1]
* 47f87d9 2022-11-23 | EEPROM/DOM Info: The Compliance Code will show "unknown" by using FINISAR 10G LR XCVR (#319) [ChiouRung Haung]
* 6273850 2022-06-01 | [ssd_generic] Fix innodisk health regex (#287) [Alexander Allen]

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2022-11-30 15:36:25 -08:00
tjchadaga
7855cd8d15
Update BRCM SAI version to 7.1.25.4 (#12878) 2022-11-30 14:24:29 -08:00
Ying Xie
db5b47e3a3
[202205][sairedis] advance submodule head (#12873)
sairedis:
* 98def2d 2022-11-16 | [Recorder]: Acquire lock for ofstream changes (#1145) (HEAD -> 202205, github/202205) [Lawrence Lee]

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2022-11-30 12:00:08 -08:00