Commit Graph

8342 Commits

Author SHA1 Message Date
mssonicbld
d658e7847d
[ci/build]: Upgrade SONiC package versions (#18142) 2024-02-21 10:46:28 -08:00
mssonicbld
ab117b655e
[submodule] Update submodule sonic-swss to the latest HEAD automatically (#18141)
src/sonic-swss

* d322f660 - (HEAD -> 202311, origin/202311) Fix memory leak and object copying bugs in orchagent (#3017) (4 hours ago) [Saikrishna Arcot]
2024-02-21 10:46:04 -08:00
mssonicbld
1a01724845
[submodule] Update submodule sonic-sairedis to the latest HEAD automatically (#18140)
src/sonic-sairedis

* 5c05e23 - (HEAD -> 202311, origin/202311) [SAI] Move SAI submodule to v1.13.3 (#1349) (2 hours ago) [Kamil Cudnik]
2024-02-21 10:45:29 -08:00
Dror Prital
941b541baa
[Build] Fix krb5 package not found issue (#18135) 2024-02-20 08:54:53 -08:00
bktsim
f19fae59e7 [Arista] Remove aggregate port config files for multi-asic devices (#16923)
An aggregate port_config.ini file for Arista multi-asic devices was first introduced by mistake. This PR cleans up these unnecessary files.
2024-02-19 16:33:31 +08:00
Oleksandr Ivantsiv
ab5f9209b5 [dhcp-server] Change the kea-dhcp4 PID file directory to tmpfs. (#17974) 2024-02-19 16:33:27 +08:00
Kebo Liu
20ba1e2406 [Mellanox] Remove SFP sensors from sensors.conf (#17631)
- Why I did it
The cable thermal sensors will be deprecated from the kernel driver. When cable host management is enabled, NOS will fetch the cable temperature from cable EEPROM, kernel driver will not provide the sysfs anymore.

- How I did it
Remove the relevant sensor form the conf files

- How to verify it
Run sonic mgmt sensor test

Signed-off-by: Kebo Liu <kebol@nvidia.com>
2024-02-19 16:33:23 +08:00
mssonicbld
4383c7ff8b
[submodule] Update submodule sonic-platform-common to the latest HEAD automatically (#18126)
#### Why I did it
src/sonic-platform-common
```
* 5430f6f - (HEAD -> 202311, origin/202311) Change get_transceiver_info_firmware_versions return type to dict (#440) (2 days ago) [mihirpat1]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-19 16:33:07 +08:00
mssonicbld
5ec5da734d
[submodule] Update submodule sonic-utilities to the latest HEAD automatically (#18128)
#### Why I did it
src/sonic-utilities
```
* c711b061 - (HEAD -> 202311, origin/202311) [Mellanox buffer migrator] Do not touch the buffer model on generic SKUs if the buffer configuration is empty (#3114) (2 days ago) [Stephen Sun]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-19 16:33:04 +08:00
Zain Budhwani
76ef12ec73 [eventd] Fix eventd UT flakiness (#17055)
### Why I did it

Fix flakiness of eventd UT - run sub after capture service starts

##### Work item tracking
- Microsoft ADO **(number only)**:25650744

#### How I did it

Run sub socket after capture socket is initialized

#### How to verify it

Pipeline
2024-02-19 14:33:32 +08:00
Arvindsrinivasan Lakshmi Narasimhan
6d422d23ec [nokia][chassis][voq] update the sai_post_init soc file with interrupt ids (#18066)
Update/Add the sai_postinit_cmd.soc with the interrupt-ids

Microsoft ADO 26730061:

How to verify it
Verify on the Chassis LCs
2024-02-19 14:33:28 +08:00
abdosi
b9fbfcfa5c [chassis] update service_checker module to handle database-chassis service (#17836)
* Update service_checker.py

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2024-02-19 14:33:24 +08:00
mssonicbld
317c1ad479
[submodule] Update submodule sonic-platform-daemons to the latest HEAD automatically (#18127)
#### Why I did it
src/sonic-platform-daemons
```
* 7792838 - (HEAD -> 202311, origin/202311) Move firmware version fields to TRANSCEIVER_FIRMWARE_INFO table (#435) (22 hours ago) [mihirpat1]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-18 20:41:51 +08:00
Liu Shilong
bacd215779 [ci] Enable cache for marvell-arm64 build in PR checks. (#15449)
Why I did it
Enable build cache for marvell-arm64 build to decrease PR check duration.

Work item tracking
Microsoft ADO (number only): 26340500
How I did it
How to verify it
2024-02-17 12:34:39 +08:00
Yevhen Fastiuk
491cf9a3f8 [Mellanox] Fix uninitialized variable on module plug event (#17011)
- Why I did it
To fix uninitialized variable

- How I did it
Add initial value

Signed-off-by: Yevhen Fastiuk <yfastiuk@nvidia.com>
2024-02-17 12:34:35 +08:00
dbarashinvd
b967cf0b99 [Mellanox] fix sysfs reading that gets garbage end of line using strip (#17830)
- Why I did it
when reading sysfs fd upon python poller events, there's end of line garbage like "# 012" (without space between the 2 parts) trailing the real value of 1 or 0

- How I did it
using python strip() to remove end of line

- How to verify it
run the CMIS host management feature on a switch
wait few minutes until switch completes boot up sequence including CMIS host manager
then disconnect or reconnect a port to create a poller event
2024-02-17 12:34:31 +08:00
mssonicbld
e54c5b4b98
[submodule] Update submodule sonic-platform-daemons to the latest HEAD automatically (#18103)
#### Why I did it
src/sonic-platform-daemons
```
* 121b338 - (HEAD -> 202311, origin/202311) Unable to retrieve media settings with just Vendor name (#419) (10 hours ago) [mihirpat1]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-16 16:33:06 +08:00
dbarashinvd
dcc5a162ec [Mellanox] fix code for warm reboot to work with FW controlled ports (#18065)
- Why I did it
Fix the code to work also after warm reboot to work with FW controlled ports.
In warm reboot the control state sysfs of each port does not change unlike reboot or fast boot.

- How I did it
1. Check procfs cmdline if warm reboot done this is due to the fact pmon don't recognize warm reboot when it's taking place since pmon is loaded after warm reboot is finished.
2. If warm reboot done, check in static detection part for each port if it's FW controlled. If so, leave it this way and stop the state machine flow (set it to final state).

- How to verify it
1. Boot a switch with CMIS host management with at least one FW controlled port (non active cables or non cmis cables) then run warm reboot.
2. Verify no errors of sysfs reading appears for control sysfs
2024-02-16 09:29:06 +08:00
mssonicbld
71014b28e7
[submodule] Update submodule sonic-swss to the latest HEAD automatically (#18075)
#### Why I did it
src/sonic-swss
```
* 2910b0e3 - (HEAD -> 202311, origin/202311) Fix the Orchagent crash seen during Port channel OC test cases. (#3042) (7 days ago) [saksarav-nokia]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-15 16:33:13 +08:00
mssonicbld
8cd9463de0
[submodule] Update submodule sonic-utilities to the latest HEAD automatically (#18081)
#### Why I did it
src/sonic-utilities
```
* b2125761 - (HEAD -> 202311, origin/202311) [chassis] fix show bgp summary when no neighbors are present on one ASIC (#3158) (2 days ago) [Arvindsrinivasan Lakshmi Narasimhan]
* 54595c1e - [202311]Fix the sfputil treats page number as decimal instead of hexadecimal (#3153) (#3160) (5 days ago) [Sudharsan Dhamal Gopalarathnam]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-15 16:33:04 +08:00
mssonicbld
fb3f683282
[submodule] Update submodule sonic-sairedis to the latest HEAD automatically (#18083)
#### Why I did it
src/sonic-sairedis
```
* 23481f0 - (HEAD -> 202311, origin/202311) Skip FABRIC PORT Attributes from sairedis logging (#1339) (2 days ago) [saksarav-nokia]
* 682e860 - Revert "add if statement for module control mode support" (#1341) (4 days ago) [dbarashinvd]
* 3621a18 - SAI submodule update to pick the sai-thrift support added to read VOQ counters (#1332) (4 days ago) [saksarav-nokia]
* 52cd15b - Fix code coverage and ASAN not being enabled (#1338) (5 days ago) [Saikrishna Arcot]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-15 16:33:00 +08:00
snider-nokia
a7daae67e3 [Nokia-IXR7250E][Devicedata] Update the device data for Nokia IXR7250E platform (thermal logging thresholds) (#18063)
These changes adjust Nokia IXR7250 thermal sensor logging thresholds.

Why I did it
To modify the thermal sensor logging thresholds used on LC and Supervisor.

How I did it
Modified the JSON based thermal logging thresholds used to determine when to log current high sensor temperature and hottest sensor margin fluctuations.

How to verify it
Verify that syslog messages indicating current (high) temperature and margin values are only logged when these respective values fluctuate by at least 5 degrees.
2024-02-15 14:33:43 +08:00
Hua Liu
f4b1eb0a5b
Fix IPV6 forced-mgmt-route not work issue (#17299) (#18045)
Fix IPV6 forced-mgmt-route not work issue

Why I did it
IPV6 forced-mgmt-route not work

When add a IPV6 route, should use 'ip -6 rule add pref 32764 address' command, but currently in the template the '-6' parameter are missing, so the IPV6 route been add to IPV4 route table.

Also this PR depends on #17281 , which will fix the IPV6 'default' route table missing in IPV6 route lookup issue. 

Microsoft ADO (number only):24719238
2024-02-07 06:50:57 -08:00
mssonicbld
d8149a1435
[submodule] Update submodule sonic-utilities to the latest HEAD automatically (#18059)
#### Why I did it
src/sonic-utilities
```
* 31a6584c - (HEAD -> 202311, origin/202311) Fix `sudo config load_mgmt_config` fails with error "File /var/run/dhclient.eth0.pid does not exist" (#3149) (16 hours ago) [Mai Bui]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-07 18:32:27 +08:00
mssonicbld
a7af5b4a11
fix the compile issue for slim image (#18015) (#18057)
Why I did it
The PR introduced a bug for slim image build, #17905, by which the sonic_asic_platform is missing when build docker image for slim image.

[ building ] [ target/docker-dhcp-relay.gz ]
/sonic/dockers/docker-dhcp-relay/cli-plugin-tests /sonic
/sonic
Traceback (most recent call last):
  File "/usr/local/bin/j2", line 8, in <module>
    sys.exit(main())
  File "/usr/local/lib/python3.9/dist-packages/j2cli/cli.py", line 202, in main
    output = render_command(
  File "/usr/local/lib/python3.9/dist-packages/j2cli/cli.py", line 186, in render_command
    result = renderer.render(args.template, context)
  File "/usr/local/lib/python3.9/dist-packages/j2cli/cli.py", line 85, in render
    return self._env \
  File "/usr/lib/python3/dist-packages/jinja2/environment.py", line 1090, in render
    self.environment.handle_exception()
  File "/usr/lib/python3/dist-packages/jinja2/environment.py", line 832, in handle_exception
    reraise(*rewrite_traceback_stack(source=source))
  File "/usr/lib/python3/dist-packages/jinja2/_compat.py", line 28, in reraise
    raise value.with_traceback(tb)
  File "/sonic/dockers/docker-dhcp-relay/Dockerfile.j2", line 48, in top-level template code
    {% if build_reduce_image_size != "y" or sonic_asic_platform != "broadcom" %}
jinja2.exceptions.UndefinedError: 'sonic_asic_platform' is undefined
make: *** [slave.mk:1072: target/docker-dhcp-relay.gz] Error 1
make: *** Waiting for unfinished jobs....
[ finished ] [ target/docker-swss-layer-bullseye.gz ]
[ finished ] [ target/docker-syncd-brcm-dnx.gz ]
make[1]: *** [Makefile.work:608: target/sonic-broadcom.bin] Error 2
make[1]: Leaving directory '/data/work/1/s'
make: *** [Makefile:41: target/sonic-broadcom.bin] Error 2
And why it slipped the PR test? PR test doesn't compile with slim option, it won't check sonic_asic_platform != "broadcom" for PR build.

Work item tracking
Microsoft ADO (number only):
How I did it
Export sonic_asic_platform for docker build in slave.mk

How to verify it
build with slim image option.

Co-authored-by: StormLiangMS <89824293+StormLiangMS@users.noreply.github.com>
2024-02-06 23:58:46 -08:00
zitingguo-ms
05ae1fa285
upgrade xgs SAI version to 10.1.6.0 (#18055)
Signed-off-by: zitingguo-ms <zitingguo@microsoft.com>
2024-02-06 23:11:58 -08:00
Volodymyr Samotiy
e13ef9d9b2
[202311] [Mellanox] Disable SSD NCQ on Mellanox platforms (#18040)
Signed-off-by: Volodymyr Samotiy <volodymyrs@nvidia.com>
2024-02-06 16:31:50 -08:00
Yaqiang Zhu
0611bdfb39
[202311][dhcp_server][yang] Update supported option type to string (#18029) (#18043) 2024-02-06 16:31:19 -08:00
Ye Jianquan
5352135776
[202311, PR] deprecate DPU (#18035) 2024-02-05 10:07:20 -08:00
mssonicbld
627d9cb3bb
[submodule] Update submodule linkmgrd to the latest HEAD automatically (#18016)
#### Why I did it
src/linkmgrd
```
* 70b6d15 - (HEAD -> 202311, origin/202311) [active-standby] Fix `show mux status` inconsistency introduced by orchagent rollback  (#225) (3 days ago) [Jing Zhang]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-05 16:33:00 +08:00
mssonicbld
0155096300
[submodule] Update submodule sonic-host-services to the latest HEAD automatically (#18018)
#### Why I did it
src/sonic-host-services
```
* 054aa7a - (HEAD -> 202311, origin/202311) Fixed ip6table internal_docker_ip_traffic rule command for multi-asic (#94) (3 days ago) [anamehra]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-05 16:32:57 +08:00
mssonicbld
36ba782dfa
[submodule] Update submodule sonic-platform-common to the latest HEAD automatically (#18020)
#### Why I did it
src/sonic-platform-common
```
* 9bf5a17 - (HEAD -> 202311, origin/202311) Implementing set_optoe_write_timeout API (#422) (3 days ago) [mihirpat1]
* c8617b8 - APIs to help in finding NPU SI settings (#410) (3 days ago) [mihirpat1]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-05 16:32:50 +08:00
mssonicbld
dae74cc7de
[submodule] Update submodule sonic-platform-daemons to the latest HEAD automatically (#18022)
#### Why I did it
src/sonic-platform-daemons
```
* dbaa079 - (HEAD -> 202311, origin/202311) Support 800G ifname in xcvrd (#416) (2 days ago) [Anoop Kamath]
* e4272c1 - 400ZR not linking up with latest SONiC master image (#410) (3 days ago) [mihirpat1]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-05 16:32:47 +08:00
mssonicbld
ae59ff2ef7
[submodule] Update submodule sonic-snmpagent to the latest HEAD automatically (#18023)
#### Why I did it
src/sonic-snmpagent
```
* 5d5cfe5 - (HEAD -> 202311, origin/202311) Set the execute bit on sysDescr_pass.py (#306) (3 days ago) [Andre Kostur]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-05 16:32:43 +08:00
mssonicbld
de6ad3fd71
[submodule] Update submodule sonic-swss to the latest HEAD automatically (#18024)
#### Why I did it
src/sonic-swss
```
* 55d53b79 - (HEAD -> 202311, origin/202311) [copporch] Add safeguard during policer attribute update (#2977) (3 days ago) [Vivek]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-05 16:32:40 +08:00
mssonicbld
59c378da10
[submodule] Update submodule sonic-utilities to the latest HEAD automatically (#18025)
#### Why I did it
src/sonic-utilities
```
* 2046e66c - (HEAD -> 202311, origin/202311) Reduce generate_dump mem usage for cores (#3052) (3 days ago) [davidm-arista]
* fbd6c916 - Disable Key Validation feature during sonic-installation for Cisco Platforms (#3115) (3 days ago) [selvipal]
* 88c027f0 - [Techsupport]Adding more FRR and BGP dumps (#3118) (3 days ago) [Sudharsan Dhamal Gopalarathnam]
* 555ecf64 - [chassis]: Support show ip bgp summary to display without error when no external neighbors are configured on chassis LC (#3099) (3 days ago) [Arvindsrinivasan Lakshmi Narasimhan]
* 1515edcb - [db_migrator]Remove route migration (#3068) (3 days ago) [Sudharsan Dhamal Gopalarathnam]
* 8862c114 - Modify teamd retry count script to base BGP status on default BGP status (#3069) (3 days ago) [Saikrishna Arcot]
* f4b5ef21 - Add all SKUs to the generic config update list (#3131) (3 days ago) [Stephen Sun]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-05 16:32:37 +08:00
mssonicbld
0aa67d4670
[submodule] Update submodule sonic-utilities to the latest HEAD automatically (#18011)
#### Why I did it
src/sonic-utilities
```
* be6224a3 - (HEAD -> 202311, origin/202311) [202311] Migrate GNMI table (#3138) (10 hours ago) [ganglv]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-03 10:32:41 +08:00
Liping Xu
36028893a3 handle json load exception in bgpmon (#17856)
Why I did it
ICM reported due to "BGPMon Process exited" which was caused by json load exception.

Work item tracking
Microsoft ADO (number only):
25916773
How I did it
Add an exception handle during json load.

How to verify it
Verified locally, add debug log to modify the output string of cmd to make it not with json formation, then check the syslog.
2024-02-03 09:27:18 +08:00
Ze Gan
2b08a783f6 [Azp]: Add dash-api dependencies on building Azp ubuntu20.04 (#17507)
Signed-off-by: Ze Gan <ganze718@gmail.com>
2024-02-03 04:32:21 +08:00
xumia
66b469249e [Security] Fix the krb5 vulnerability issue (#17914)
### Why I did it
Fix the krb5 vulnerable issue
CVE-2021-36222  allows remote attackers to cause a NULL pointer dereference and daemon crash
CVE-2021-37750  NULL pointer dereference in kdc/do_tgs_req.c via a FAST inner body that lacks a server field
DSA 5286-1  remote code execution

##### Work item tracking
- Microsoft ADO **(number only)**: 26577929

#### How I did it
Upgrade the krb5 version to 1.18.3-6+deb11u14+fips.
2024-02-03 04:32:18 +08:00
Yaqiang Zhu
f97b53322f [dhcp_server] Add field not exist checks in dhcp_cfggen (#17645)
* [dhcp_server] Add field not exist checks in dhcp_cfggen
2024-02-03 02:32:40 +08:00
kellyyeh
a110e625a2 Only add to DHCP_RELAY if dhcpv6 servers exist (#17770) 2024-02-03 02:32:36 +08:00
mssonicbld
5cd18eeda7
[ci/build]: Upgrade SONiC package versions (#17956) 2024-02-02 08:15:51 -08:00
Stepan Blyshchak
1672ce81fc [config-topology] use cached variables (#17343)
- Why I did it
Improve  boot performance mostly needed for fast and warmboot

- How I did it
Use cached variable.

- How to verify it
Boot the system. Simply do "systemd-analyze blame" and look at service start time.

Signed-off-by: Stepan Blyschak <stepanb@nvidia.com>
2024-02-02 18:32:29 +08:00
Ze Gan
9f1bebbdba [ci]: Enable daily building for ubuntu20.04 to every branch (#17520)
- The ubuntu 2004 is needed by 202311
- Because the artifacts of ubuntu2004 are used by other repos, a daily building is needed without an updating of this repo for a long time.

Signed-off-by: Ze Gan <ganze718@gmail.com>
2024-02-02 18:32:19 +08:00
Hua Liu
009b0dd7ec Change orchagent stuck message from ERR to WARNING (#17872)
Change orchagent stuck message from ERR to WARNING

#### Why I did it
During switch initialization, sometime Orchagent will busy for more than 40seconds and will trigger process stuck workdog error.
To improve this issue, change watchdog error message to warning message.

##### Work item tracking
- Microsoft ADO: 26517622

#### How I did it
Change orchagent stuck message from ERR to WARNING.

#### How to verify it
Pass all UT.

### Description for the changelog
Change orchagent stuck message from ERR to WARNING.
2024-02-02 18:32:16 +08:00
mssonicbld
a467ff71a2
[submodule] Update submodule sonic-platform-common to the latest HEAD automatically (#18008)
#### Why I did it
src/sonic-platform-common
```
* 7c2ad66 - (HEAD -> 202311, origin/202311) Tx/Rx power values should be rounded up to 3 decimal places (#432) (4 hours ago) [mihirpat1]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2024-02-02 18:32:12 +08:00
Feng-msft
69478a6b85 Fix dialout build flag issue. (#17715)
### Why I did it
Fix ENABLE_DIALOUT flag issue.

##### Work item tracking
- Microsoft ADO **(number only)**: 21326000

#### How I did it
Update Makefile.work and add debug string.

#### How to verify it
![image](https://github.com/sonic-net/sonic-buildimage/assets/97083744/960d75d1-618c-4734-acb5-7a32a28c262b)
2024-02-02 16:32:34 +08:00
Zain Budhwani
fe07450a26 Disable eventd and rsyslog plugin in slim images (#17905)
### Why I did it

Disable eventd at buildtime for slim images

##### Work item tracking
- Microsoft ADO **(number only)**:26386286

#### How I did it

Add flags for disabling eventd and only copy rsyslog conf files when eventd is included and not slim image

#### How to verify it

Manual testing
2024-02-02 16:32:26 +08:00
abdosi
c83f81ea0f [chassis] Added support of isolating given LC in Chassis with TSA mode (#16732)
What I did:
Added support when TSA is done on Line Card make sure it's completely
isolated from all e-BGP peer devices from this LC or remote LC

Why I did:
Currently when TSA is executed on LC routes are withdrawn from it's connected e-BGP peers only. e-BGP peers on remote LC can/will (via i-BGP) still have route pointing/attracting traffic towards this isolated LC.

How I did:

When TSA is applied on LC all the routes that are advertised via i-BGP are set with community tag of no-export so that when remote LC received these routes it does not send over to it's connected e-BGP peers.

Also once we receive the route with no-export  over iBGP match on it and and set the local preference of that route to lower value (80) so that we remove that route from the forwarding database. Below scenario explains why we do this:

- LC1 advertise R1 to LC3
- LC2 advertise R1 to LC3
- On LC3 we have multi-path/ECMP over both LC1 and LC2
- On LC3 R1 received from LC1 is consider best route over R1 over received from LC2 and is send to LC3 e-BGP peers
- Now we do TSA on LC2
- LC3 will receive R1 from LC2 with community no-export and from LC1 same as earlier (no change)
- LC3 will still get traffic for R1 since it is still advertised to e-BGP peers (since R1 from LC1 is best route)
- LC3 will forward to both LC1 and LC2 (ecmp) and this causes issue as LC2 is in TSA mode and should not receive traffic

To fix above scenario we change the preference to lower value of R1 received from LC2 so that it is removed from Multi-path/ECMP group.

How I verfiy:

UT has been added to make sure Template generation is correct
Manual Verification of the functionality
sonic-mgmt test case will be updated accordingly.
Please note this PR is on top of this :#16714 which needs to be merged first.

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2024-02-02 14:33:18 +08:00