Commit Graph

8531 Commits

Author SHA1 Message Date
ArunSaravananBalachandran
4809c8fe47 [PMON] Update smartmontool version >= 7.4 2024-03-07 10:03:04 +05:30
ArunSaravananBalachandran
92c752888b Update PMON to bookworm
- Use locally built ipmitool
- Use latest version of smartmontools from Bookworm Debian repos
- Update locally built lm-sensors to match version in Bookworm
- Update grpcio and grpcio-tools python packages to match version in
  Bookworm
2024-02-26 11:09:54 +05:30
Mridul Bajpai
54c1a49634
Sensormond support (#16089)
Enable Sensormon daemon in PMON container.
Pls see HLD : sonic-net/SONiC#1394
2024-02-23 22:25:48 -08:00
zitingguo-ms
41aa3295b9
upgrade xgs SAI version to 10.1.7.0 (#18156)
Signed-off-by: zitingguo-ms <zitingguo@microsoft.com>
2024-02-22 08:56:03 -08:00
Oleksandr Ivantsiv
cf9929546d
[build] Use configured architecture in the GRUB config. (#18093)
- Why I did it
The existing grub configuration has a hardcoded "amd64" architecture in the kernel and initramfs names. This causes an error in the attempt to load the image compiled for a different architecture.

- How I did it
Use configured architecture in the GRUB config.

- How to verify it
Compile the image for Arm64 architecture and install it on the device.
2024-02-22 18:04:52 +02:00
Lahav-Nvidia
8a7e38b3a3
[Mellanox] Add N/A as a valid fan direction for Nvidia platforms (#17930)
- Why I did it
On some Nvidia platforms, fan direction could not be determined. Therefore 'N/A' becomes a valid value for those cases.

- How I did it
Add 'N/A' to the valid fan direction mapping, to avoid an error in the log.

- How to verify it
Check fan direction on Nvidia platforms, and make sure there aren't errors in the log.
2024-02-22 11:35:10 +02:00
Xichen96
2244aa2d7b
[dhcp_server] add config dhcp server option (#18013)
* add dhcp server option cli
2024-02-21 16:52:51 -08:00
Saikrishna Arcot
8506826348
Update the database, teamd, and radv containers to Bookworm (#18108)
* Update Redis DB start options for multi-asic and chassis cases

Starting with Redis 7.0 (specifically, redis/redis#9034), setting a
custom `--bind` address on the command line no longer disables protected
mode (which blocks connections from non-localhost IP addresses unless a
password is set). For multi-asic and chassis database DBs, we currently
specify a non-localhost IP address and do not specify any password,
which means this change would break things there.

To work around this, if we are specifying a non-localhost IP address on
the command line, then disable protected mode.

* Clean up debug pacakge list for docker-teamd

The debug pacakges for swss and libswsscommon are already installed by
docker-swss-layer-bookworm and docker-config-engine-bookworm, so they
don't need to be specified here again.

Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>
2024-02-21 13:22:07 -08:00
Marty Y. Lok
3a0a6899eb
[service-checker] Fix the service-checker issue which is caused by PR17836 (#18109)
PR #17836 added the container checking for database-chassis for Supervisor. But the related container_feature_dict[] is missing for the database-chassis. This causes the exception failure shows on Supervisor

Signed-off-by: mlok <marty.lok@nokia.com>
2024-02-21 10:44:58 -08:00
ShiyanWangMS
df70bd304b
Remove Python3 venv from sonic-mgmt-docker build file (#18130)
Why I did it
This is sonic-mgmt-docker Python3 migration.

How I did it
Remove Python3 venv from sonic-mgmt-docker

How to verify it
Compile docker and verify the Python3 venv is NOT there.
2024-02-21 16:39:17 +08:00
Liu Shilong
3634a831d9
[ci] Disable some workflow use github app instead to have stable performance. (#18132)
Why I did it
github workflow is not easy to extend to other repos.
Use github app instead to share the same workflow.

Work item tracking
Microsoft ADO (number only): 26855292
How I did it
How to verify it
2024-02-19 18:27:31 +08:00
mssonicbld
97b3964677
[submodule] Update submodule sonic-platform-common to the latest HEAD automatically (#18124)
#### Why I did it
src/sonic-platform-common
```
* 796e89a - (HEAD -> master, origin/master, origin/HEAD) Change get_transceiver_info_firmware_versions return type to dict (#440) (2 days ago) [mihirpat1]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-19 16:33:15 +08:00
mssonicbld
a7d948ff4e
[submodule] Update submodule sonic-platform-daemons to the latest HEAD automatically (#18125)
#### Why I did it
src/sonic-platform-daemons
```
* f1511c2 - (HEAD -> master, origin/master, origin/HEAD) Move firmware version fields to TRANSCEIVER_FIRMWARE_INFO table (#435) (2 days ago) [mihirpat1]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-19 16:33:11 +08:00
Pavan Naregundi
4b8f172b46
[marvell-armhf] Update MRVL_PRESTERA_DRIVER (#17780)
Changes in MRVL_PRESTERA_DRIVER_1.8:
 * Migrate dtb to kernel 6.1.
 * Fix i2c kernel error log,
	[ 51.331287] i2c i2c-0: mv64xxx: I2C bus locked, block: 1, time_left: 0.

Signed-off-by: Pavan Naregundi <pnaregundi@marvell.com>
2024-02-16 08:51:49 -08:00
mssonicbld
5a34a49828
[submodule] Update submodule sonic-linux-kernel to the latest HEAD automatically (#18098)
#### Why I did it
src/sonic-linux-kernel
```
* f4b1a58 - (HEAD -> master, origin/master, origin/HEAD) Fix kexec kernel panic in mvpp2 (#377) (2 days ago) [Pavan Naregundi]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-16 16:33:10 +08:00
mssonicbld
4b39bf60de
[submodule] Update submodule sonic-platform-daemons to the latest HEAD automatically (#18107)
#### Why I did it
src/sonic-platform-daemons
```
* 13c5049 - (HEAD -> master, origin/master, origin/HEAD) [ycabled][tests] Fix UT for bookworm (#431) (15 hours ago) [Arun Saravanan Balachandran]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-16 16:32:55 +08:00
mssonicbld
de2ee9f96d
[submodule] Update submodule sonic-sairedis to the latest HEAD automatically (#18077)
#### Why I did it
src/sonic-sairedis
```
* a91d51d - (HEAD -> master, origin/master, origin/HEAD) Enable 8 Lanes Support in vslib (#1355) (55 minutes ago) [Deepak Singhal]
* ea8767c - [nvidia] Update syncd start to cover changes in SDK initialization process. (#1343) (6 days ago) [Oleksandr Ivantsiv]
* 46e713d - [syncd] Add workaround for SET operation on SAI_HOSTIF_ATTR_QUEUE (#1345) (7 days ago) [Kamil Cudnik]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-15 16:33:07 +08:00
mssonicbld
0b73937ec5
[submodule] Update submodule sonic-swss to the latest HEAD automatically (#18096)
#### Why I did it
src/sonic-swss
```
* 1221eae4 - (HEAD -> master, origin/master, origin/HEAD) Explicitly initialize two local variables to 0 (#3046) (26 hours ago) [Saikrishna Arcot]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-15 16:32:54 +08:00
mssonicbld
ca258a260e
[submodule] Update submodule sonic-dash-api to the latest HEAD automatically (#18087)
#### Why I did it
src/sonic-dash-api
```
* d033898 - (HEAD -> master, origin/master, origin/HEAD) Generate and install python stubs (2 days ago) [Lawrence Lee]
* 4fbed99 - Generate and install python stubs (8 days ago) [Lawrence Lee]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-15 14:33:40 +08:00
mssonicbld
7b8c135e6c
[submodule] Update submodule sonic-ztp to the latest HEAD automatically (#18097)
#### Why I did it
src/sonic-ztp
```
* 56cebae - (HEAD -> master, origin/master, origin/HEAD) Enhancements to allow custom umask usage while executing plugins (#42) (2 hours ago) [rajendra-dendukuri]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-15 10:33:38 +08:00
Saikrishna Arcot
fe0a5e1285
Install netaddr 0.10.1 in the Python 2 part of sonic-mgmt (#18094)
Recently, netaddr 1.0.0 was released, which dropped Python 2 support
entirely. Because of this, an empty netaddr package is installed for
Python 2. Ansible still uses Python 2, so this needs to remain
functional. The Python 3 part of sonic-mgmt is fine.

Pin the version of netaddr installed in Python 2 to 0.10.1, the last
supported version.

Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>
2024-02-13 20:40:25 -08:00
Zain Budhwani
ff5efe8fb3
[eventd] Fix eventd UT flakiness (#17055)
### Why I did it

Fix flakiness of eventd UT - run sub after capture service starts

##### Work item tracking
- Microsoft ADO **(number only)**:25650744

#### How I did it

Run sub socket after capture socket is initialized

#### How to verify it

Pipeline
2024-02-12 21:52:38 -08:00
Pavan Naregundi
c6602c9585
[Marvell-arm64]: Fix SYNCD_RPC build (#17266)
Change-Id: I0bd4932d03141f3f7bc523b49a1bf3d1809817a8

Signed-off-by: Pavan Naregundi <pnaregundi@marvell.com>
2024-02-12 15:11:19 -08:00
Pavan Naregundi
b31a3030fb
[Marvell-arm64] Fix boot issue on rd98DX35xx_cn9131 (#17277)
Change-Id: I411f12963fb8dc0eb3569faf4df68082b852e3a8

Signed-off-by: Pavan Naregundi <pnaregundi@marvell.com>
2024-02-12 15:11:00 -08:00
Prince George
0564ce48c9
[baseimage]: Update smartmontool version >= v7.4 (#17635)
Why I did it
Update smartmontool verson to 7.4. This is done to prevent smartmontools service to exit with non-zero exit status on platform that does not have a SSD/disk to be monitored.

Until Debian Bullseye (which had smartmontools 7.2), Debian had a patch applied that changed the default quit mode to never exit. A bug report was filed on Debian, saying that the source code patch isn't needed and could just be done via command line options, and also that smartmontools 7.3 has a new built-in option to exit with 0 if there are no monitorable devices found (which prevents systemd from treating it as a service failure). Because of that, Debian Bookworm (which also upgraded to 7.3) removed the patch and restored the default behavior of exiting with exit code 17 if there are no devices found.

Smartmontools v7.3 has this issue, because of which smartd exits with non-zero exit status even with "-q" option.

How I did it
Update the smartmontools to version 7.4 which has the fix for exiting gracefully if no monitoring device is found
Added smartd option "-q nodev0" to allow smartd to exit with status 0 if no monitoring device found
2024-02-12 09:37:12 -08:00
Stepan Blyshchak
cac73d80ca
[bootchart] enable command line recording (#17778)
Signed-off-by: Stepan Blyschak <stepanb@nvidia.com>
2024-02-12 08:36:44 -08:00
Kebo Liu
1b5f72127a
[Mellanox] Remove SFP sensors from sensors.conf (#17631)
- Why I did it
The cable thermal sensors will be deprecated from the kernel driver. When cable host management is enabled, NOS will fetch the cable temperature from cable EEPROM, kernel driver will not provide the sysfs anymore.

- How I did it
Remove the relevant sensor form the conf files

- How to verify it
Run sonic mgmt sensor test

Signed-off-by: Kebo Liu <kebol@nvidia.com>
2024-02-12 16:12:57 +02:00
Saikrishna Arcot
34bdfc8b39
Add Bookworm swss-layer (#18062)
Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>
2024-02-09 15:56:26 -08:00
mssonicbld
bd47fd1559
[submodule] Update submodule sonic-utilities to the latest HEAD automatically (#18078)
#### Why I did it
src/sonic-utilities
```
* 81c5349f - (HEAD -> master, origin/master, origin/HEAD) [chassis] fix show bgp summary when no neighbors are present on one ASIC (#3158) (10 hours ago) [Arvindsrinivasan Lakshmi Narasimhan]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-09 16:32:30 +08:00
snider-nokia
7f3fd1377d
[Nokia-IXR7250E][Devicedata] Update the device data for Nokia IXR7250E platform (thermal logging thresholds) (#18063)
These changes adjust Nokia IXR7250 thermal sensor logging thresholds.

Why I did it
To modify the thermal sensor logging thresholds used on LC and Supervisor.

How I did it
Modified the JSON based thermal logging thresholds used to determine when to log current high sensor temperature and hottest sensor margin fluctuations.

How to verify it
Verify that syslog messages indicating current (high) temperature and margin values are only logged when these respective values fluctuate by at least 5 degrees.
2024-02-08 13:03:05 -08:00
Arvindsrinivasan Lakshmi Narasimhan
4703192d0f
[nokia][chassis][voq] update the sai_post_init soc file with interrupt ids (#18066)
Update/Add the sai_postinit_cmd.soc with the interrupt-ids

Microsoft ADO 26730061:

How to verify it
Verify on the Chassis LCs
2024-02-08 13:01:51 -08:00
dbarashinvd
7a34d4a275
[Mellanox] fix code for warm reboot to work with FW controlled ports (#18065)
- Why I did it
Fix the code to work also after warm reboot to work with FW controlled ports.
In warm reboot the control state sysfs of each port does not change unlike reboot or fast boot.

- How I did it
1. Check procfs cmdline if warm reboot done this is due to the fact pmon don't recognize warm reboot when it's taking place since pmon is loaded after warm reboot is finished.
2. If warm reboot done, check in static detection part for each port if it's FW controlled. If so, leave it this way and stop the state machine flow (set it to final state).

- How to verify it
1. Boot a switch with CMIS host management with at least one FW controlled port (non active cables or non cmis cables) then run warm reboot.
2. Verify no errors of sysfs reading appears for control sysfs
2024-02-08 14:49:56 +02:00
mssonicbld
a554ac40a7
[submodule] Update submodule sonic-sairedis to the latest HEAD automatically (#18068)
#### Why I did it
src/sonic-sairedis
```
* a504933 - (HEAD -> master, origin/master, origin/HEAD) Change dash API pipeline name (#1351) (11 hours ago) [Kamil Cudnik]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-08 16:32:49 +08:00
mssonicbld
bd4bf76163
[submodule] Update submodule sonic-swss to the latest HEAD automatically (#18070)
#### Why I did it
src/sonic-swss
```
* b3b6a838 - (HEAD -> master, origin/master, origin/HEAD) [test_mux] Multi-mux-nh full test coverage (#3028) (25 minutes ago) [Nikola Dancejic]
* 3bd01444 - Bfd support for TSA state. (#2926) (6 hours ago) [siqbal1986]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-08 16:32:42 +08:00
mssonicbld
f49711f246
[submodule] Update submodule sonic-utilities to the latest HEAD automatically (#18071)
#### Why I did it
src/sonic-utilities
```
* a3cf5c02 - (HEAD -> master, origin/master, origin/HEAD) Fix the sfputil treats page number as decimal instead of hexadecimal (#3153) (6 hours ago) [Kebo Liu]
* 167f9966 - [Mellanox] Add support of the nvidia-bluefield platform to generate-dump utility. (#3091) (20 hours ago) [Oleksandr Ivantsiv]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-08 16:32:39 +08:00
jfeng-arista
5a20589415
Start fabric mgr daemon in swss container. (#17473)
The fabricmgr daemon started in vs environment for testing from #16791, we now start the daemon in product code.
2024-02-07 23:45:10 -08:00
mssonicbld
fffd6e6607
[submodule] Update submodule sonic-platform-common to the latest HEAD automatically (#18047)
#### Why I did it
src/sonic-platform-common
```
* 888075d - (HEAD -> master, origin/master, origin/HEAD) [ssd_generic] Add support Transcend ssd-health. (#436) (31 hours ago) [Michael Shih]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-07 16:32:48 +08:00
mssonicbld
f9e510ccfa
[submodule] Update submodule sonic-dash-api to the latest HEAD automatically (#18050)
#### Why I did it
src/sonic-dash-api
```
* da6899b - (HEAD -> master, origin/master, origin/HEAD) Add/update fields needed for private link implementation (9 hours ago) [Prince Sunny]
* 960eab3 - Merge branch 'master' into pl-api (33 hours ago) [Prince Sunny]
* bc29979 - Merge branch 'master' into pl-api (4 days ago) [Lawrence Lee]
* 2d565d3 - Merge branch 'master' into pl-api (4 days ago) [Lawrence Lee]
* df6c512 - remove tunnel_key (4 days ago) [Lawrence Lee]
* 4d5ebda - Update proto files for PL (4 days ago) [Lawrence Lee]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-07 16:32:44 +08:00
mssonicbld
22ac869f55
[submodule] Update submodule sonic-swss to the latest HEAD automatically (#18051)
#### Why I did it
src/sonic-swss
```
* b18cbac6 - (HEAD -> master, origin/master, origin/HEAD) [Ci] Fix the test script naming issue (#3021) (81 minutes ago) [xumia]
* 5fd896f6 - [PortOrch] Add FEC codeword errors in port stats (#3029) (87 minutes ago) [vdahiya12]
* 77d56e6e - Fix the Orchagent crash seen during Port channel OC test cases. (#3042) (9 hours ago) [saksarav-nokia]
* 4d470592 - Fix memory leak and object copying bugs in orchagent (#3017) (10 hours ago) [Saikrishna Arcot]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-07 16:32:41 +08:00
mssonicbld
18bba22f88
[submodule] Update submodule sonic-utilities to the latest HEAD automatically (#18052)
#### Why I did it
src/sonic-utilities
```
* 0408226f - (HEAD -> master, origin/master, origin/HEAD) Fix `sudo config load_mgmt_config` fails with error "File /var/run/dhclient.eth0.pid does not exist" (#3149) (18 hours ago) [Mai Bui]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-07 16:32:37 +08:00
zitingguo-ms
74494010e1
[Broadcom] Upgrade xgs SAI to 10.1.6.0 (#18044)
Why I did it
Upgrade the xgs SAI version to 10.1.6.0 to include the following fix:

10.1.6.0: [CS00012332630][SAI_BRANCH rel_ocp_sai_10_1] SAI - OTHER - [SAI BUG] sflow use psample to send packet, but the psample in linux version is not right.
10.1.4.0: [CS00012329827]ECMP LB traffic polarization, configure hash_offset along with hash_seed attr
10.1.3.0: Double commit test code fixes in EM for 10.1.
10.1.2.0: fix ODP packaging in rel_ocp_sai_10_1
10.1.1.0: Use knet-cb procfs path for DNX port speed sampling rate (does not use new genl)
Work item tracking
Microsoft ADO (number only): 26720003
How I did it
Upgrade xgs SAI version in sai.mk file.

How to verify it
Run full qual on s6100 T1: https://elastictest.org/scheduler/testplan/65c1c2e69e3e72f540cae34b
2024-02-07 09:29:40 +08:00
mssonicbld
c8371422fb
[submodule] Update submodule dhcprelay to the latest HEAD automatically (#18046)
#### Why I did it
src/dhcprelay
```
* 363fa06 - (HEAD -> master, origin/master, origin/HEAD) Skip vlans with no dhcpv6 server configured (#46) (8 hours ago) [kellyyeh]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-06 16:32:27 +08:00
mssonicbld
858107eb28
[submodule] Update submodule sonic-swss to the latest HEAD automatically (#18048)
#### Why I did it
src/sonic-swss
```
* d566e15a - (HEAD -> master, origin/master, origin/HEAD) Allow L4 port range egress ACL rules on DNX (#3014) (9 hours ago) [arista-nwolfe]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-06 16:32:22 +08:00
mssonicbld
3d9cf77c26
[submodule] Update submodule sonic-utilities to the latest HEAD automatically (#18049)
#### Why I did it
src/sonic-utilities
```
* b5487357 - (HEAD -> master, origin/master, origin/HEAD) [route_check.py] account static routes in route_check.py (#3120) (9 hours ago) [Stepan Blyshchak]
* 64e1f9f4 - [Mellanox buffer migrator] Do not touch the buffer model on generic SKUs if the buffer configuration is empty (#3114) (19 hours ago) [Stephen Sun]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-06 16:32:18 +08:00
Oleksandr Ivantsiv
ea02734b8d
[dhcp-server] Change the kea-dhcp4 PID file directory to tmpfs. (#17974) 2024-02-05 10:26:46 -08:00
Yaqiang Zhu
c323ccfa72
[dhcp_server][yang] Update supported option type to string (#18029) 2024-02-05 10:25:55 -08:00
Yevhen Fastiuk
2f35079979
[Mellanox] Fix uninitialized variable on module plug event (#17011)
- Why I did it
To fix uninitialized variable

- How I did it
Add initial value

Signed-off-by: Yevhen Fastiuk <yfastiuk@nvidia.com>
2024-02-05 19:41:16 +02:00
dbarashinvd
0aacc1f28e
[Mellanox] fix sysfs reading that gets garbage end of line using strip (#17830)
- Why I did it
when reading sysfs fd upon python poller events, there's end of line garbage like "# 012" (without space between the 2 parts) trailing the real value of 1 or 0

- How I did it
using python strip() to remove end of line

- How to verify it
run the CMIS host management feature on a switch
wait few minutes until switch completes boot up sequence including CMIS host manager
then disconnect or reconnect a port to create a poller event
2024-02-05 19:39:55 +02:00
mssonicbld
529031210f
[submodule] Update submodule sonic-sairedis to the latest HEAD automatically (#18019)
#### Why I did it
src/sonic-sairedis
```
* e5b8d4e - (HEAD -> master, origin/master, origin/HEAD) Make changes to support compiling on Bookworm (with GCC 12) (#1344) (3 days ago) [Saikrishna Arcot]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-05 16:32:54 +08:00
Stepan Blyshchak
e1a8d2a6e8
[nvidia][syncd] fix incorrect permission of /tmp in syncd container (#17777)
Fixes #16034
2024-02-05 00:00:29 -08:00