Commit Graph

7360 Commits

Author SHA1 Message Date
Saikrishna Arcot
3316dcc76d
[202211] Update Linux kernel to 5.10.179 (#17324)
* Update Linux kernel to 5.10.179


Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>
2023-11-30 13:09:57 -08:00
JunhongMao
a3a4253ac0 [VOQ][saidump] Install rdbtools into the docker base related containers. (#16466)
Fix #13561

The existing saidump uses the https://github.com/sonic-net/sonic-swss-common/blob/master/common/table_dump.lua script, which loops over the ASIC_DB for more than 5 seconds and blocks other processes' access.

This solution uses the Redis SAVE command to save the snapshot of DB each time and recover later, instead of looping through each entry in the table.

Related PRs:
sonic-net/sonic-utilities#2972
sonic-net/sonic-sairedis#1288
sonic-net/sonic-sairedis#1298

How I did it
Use the Redis SAVE command to save a snapshot of the DB each time and recover later, instead of looping through each entry in the table and saving it.

1. Updated dockers/docker-base-bullseye/Dockerfile.j2 to install the Python library rdbtools into all the docker-base-bullseye containers.

2. Updated sonic-buildimage/src/sonic-sairedis/saidump/saidump.cpp to add a new option -r, which updates the format of the JSON files output by rdbtools.

3. Added a new script file, syncd/scripts/saidump.sh, to the sairedis repo. This shell script does the following steps:

  For each ASIC, such as ASIC0,

  3.1. Configure the Redis consistency directory.
  redis-cli -h $hostname -p $port CONFIG SET dir $redis_dir > /dev/null

  3.2. Save the Redis data.
  redis-cli -h $hostname -p $port SAVE > /dev/null

  3.3. Run rdb command to convert the dump files into JSON files
    rdb --command json $redis_dir/dump.rdb | tee $redis_dir/dump.json > /dev/null

  3.4. Run saidump -r to convert the JSON files to the same format as the previous saidump output.
       The saidump result is then available on standard output.
       saidump -r $redis_dir/dump.json -m 100

  3.5. Clear the temporary files.
   rm -f $redis_dir/dump.rdb
   rm -f $redis_dir/dump.json

4. Updated sonic-buildimage/src/sonic-utilities/scripts/generate_dump to check the ASIC DB size: if it is larger than ROUTE_TAB_LIMIT_DIRECT_ITERATION entries (default value 24000), use the Redis SAVE method; otherwise, use the old method of looping through each entry of the Redis DB.
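
The size check in step 4 can be sketched as follows. This is only an illustrative Python sketch, not the actual generate_dump script; the function name `choose_dump_method` and the returned labels are hypothetical, and only the threshold name and its default of 24000 come from the description above.

```python
# Default threshold named in the commit description.
ROUTE_TAB_LIMIT_DIRECT_ITERATION = 24000


def choose_dump_method(asic_db_entries, limit=ROUTE_TAB_LIMIT_DIRECT_ITERATION):
    """Pick the dump strategy from the number of ASIC DB entries.

    The labels 'redis_save' and 'direct_iteration' are hypothetical names
    for the new SAVE-based path and the old per-entry loop.
    """
    if asic_db_entries > limit:
        # Large DB: snapshot with Redis SAVE and convert the dump offline.
        return "redis_save"
    # Small DB: looping over each entry is cheap enough.
    return "direct_iteration"
```

With the default limit, a DB holding 96000 entries would take the SAVE path, while one holding exactly 24000 entries would still be iterated directly.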

How to verify it
On a T2 setup with more than 96K routes, execute the CLI command generate_dump.
No error should be shown
Download the generate_dump result and verify the saidump file after unpacking it.
2023-11-21 12:34:10 +08:00
Junhua Zhai
153b838b4f [gearbox] use credo sai v0.9.3 (#16860)
Update credo sai package to the latest v0.9.3, which fixes the issue aristanetworks/sonic#92.
2023-11-21 10:41:54 +08:00
Lawrence Lee
795ac1a751 [tph]: Detect LAG flaps from APPL_DB (#16879)
Why I did it
A race condition exists while the TPH is processing a netlink message - if a second netlink message arrives during processing it will be missed since TPH is not listening for other messages.
Another bug was found where TPH was unnecessarily restarting since it was checking admin status instead of operational status of portchannels.

How I did it
Subscribe to APPL_DB for updates on LAG operational state
Track currently sniffed interfaces

How to verify it
Send tunnel packets with destination IP of an unresolved neighbor, verify that ping commands are run
Shut down a portchannel interface, verify that sniffer does not restart
Send tunnel packets, verify ping commands are still run
Bring up portchannel interface, verify that sniffer restarts
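
The verification steps above suggest a restart rule along these lines. This is a guess at the behavior, written as a Python sketch under stated assumptions: the function name `on_lag_oper_change` and the plain set used to track sniffed interfaces are hypothetical and are not taken from the TPH source.

```python
def on_lag_oper_change(sniffed, portchannel, oper_status):
    """Update the tracked set and report whether the sniffer should restart.

    Assumption: a restart is only needed when a portchannel becomes
    operationally up and is not already being sniffed.
    """
    if oper_status == "up":
        if portchannel in sniffed:
            return False  # already sniffed, nothing to do
        sniffed.add(portchannel)
        return True  # new interface to sniff, restart needed
    # Operationally down: stop tracking it, but do not restart.
    sniffed.discard(portchannel)
    return False
```

Under this sketch, shutting down a tracked portchannel returns False and removes it from the set, and bringing it back up returns True once.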

Signed-off-by: Lawrence Lee <lawlee@microsoft.com>
2023-11-15 12:33:19 +08:00
mssonicbld
6696a2612b
[submodule] Update submodule sonic-linux-kernel to the latest HEAD automatically (#17033) 2023-11-15 10:18:46 +08:00
mssonicbld
923b2a3844
Add special rsyslog filter for MSN2700 platform (#16684) (#17013) 2023-11-15 09:59:25 +08:00
mssonicbld
295b46b923
[Mellanox] Revise label name and fix typo in sensor.conf of 4600C (#16271) (#17136) 2023-11-15 09:53:15 +08:00
mssonicbld
0fb81fcda2
[Build] Deprecate the mirror packages.trafficmanager.net/debian (#17113) (#17129) 2023-11-15 09:43:16 +08:00
mssonicbld
36a2180eff
[submodule] Update submodule sonic-sairedis to the latest HEAD automatically (#16961) 2023-11-14 15:58:40 +08:00
mssonicbld
7fb9821ca1
[submodule] Update submodule sonic-utilities to the latest HEAD automatically (#16998) 2023-11-14 15:50:57 +08:00
mssonicbld
50d40b4bc6
[submodule] Update submodule sonic-swss to the latest HEAD automatically (#17086) 2023-11-14 15:47:24 +08:00
Saikrishna Arcot
b2088b4f72
[202211] Fix build failures after OpenSSH update to 1:8.4p1-5+deb11u2 (#17154)
Why I did it
This backports #16859 and #16896 to the 202211 branch to fix the subsequent build failures after the OpenSSH update to 1:8.4p1-5+deb11u2. This backport is being done in a single PR to avoid breakages in PR builds.

Work item tracking
Microsoft ADO (number only): 25517845
How I did it
How to verify it
2023-11-14 10:40:16 +08:00
xumia
cf66a45b8d [Security] Upgrade the OpenSSL/OpenSSH to fix CVE alerts (#16902)
### Why I did it
[Security] Upgrade the OpenSSL/OpenSSH to fix CVE alerts

Upgrade OpenSSL to 1.1.1n-0+deb11u5
Fix CVEs:
      CVE-2023-0464 (Excessive Resource Usage Verifying X.509 Policy
      CVE-2023-0465 (Invalid certificate policies in leaf certificates are
      CVE-2023-0466 (Certificate policy check not enabled).
      CVE-2022-4304 (Timing Oracle in RSA Decryption).
      CVE-2023-2650 (Possible DoS translating ASN.1 object identifiers).

Upgrade OpenSSH to 8.4p1-5+deb11u2
Fix CVEs:
    CVE-2023-38408 (Lacks SSH agent restriction)

##### Work item tracking
- Microsoft ADO **(number only)**: 25506776

#### How I did it
Upgrade the OpenSSL/OpenSSH package version and fix the UT failure.

#### How to verify it
Verified by UTs with and without FIPS enabled.
2023-10-20 14:34:16 +08:00
Zhijian Li
763f846e23 [YANG SONIC-ACL] Fix Yang definition of ACL_TABLE_TYPE (#16247)
How I did it
Update Yang definition of ACL_TABLE_TYPE.
Update existing testcase.
Add new testcase to cover lowercase key scenario.

How to verify it
Verified by building sonic_yang_models-1.0-py3-none-any.whl. While building the target package, unit tests were run and passed.
2023-10-20 12:34:25 +08:00
Vadym Hlushko
638cded2f6 [Mellanox] Remove mlxtrace support for SPC4 (#16373)
- Why I did it
Because the Spectrum4 devices don't support the mlxtrace utility.

- How I did it
Edit sai.profile and remove mlxtrace_spectrum4_itrace_*.cfg.ext files

Signed-off-by: vadymhlushko-mlnx <vadymh@nvidia.com>
2023-10-20 12:34:20 +08:00
Saikrishna Arcot
2818e25969 Fix potentially not having any loopback address on lo interface (#16490)
In #15080, there was a command added to re-add 127.0.0.1/8 to the lo
interface when the networking configuration is being brought down.
However, the trigger for that command is `down`, which, looking at
ifupdown2 configuration files, runs immediately after 127.0.0.1/16 is
removed. This means there may be a period of time where there are no
loopback addresses assigned to the lo interface, and redis commands will
fail.

Fix this by changing this to pre-down, which should run well before
127.0.0.1/16 is removed, and should always leave lo with a loopback
address.

Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>
2023-10-20 12:34:15 +08:00
anamehra
a47e79d584 Chassis: fix pmon docker failure when DEVICE_METADATA is not available (#16527)
Signed-off-by: anamehra anamehra@cisco.com

Added a check for DEVICE_METADATA before accessing the data. This prevents the j2 failure when the variable is not available.
2023-10-20 12:34:10 +08:00
Sudharsan Dhamal Gopalarathnam
5bc3dc5f75 [ctnmgd]: Fixing netaddr build issue (#16668)
Fixing the following build issue

[2023-09-20T04:42:00.004Z] [ FAIL LOG START ] [ target/python-wheels/bullseye/sonic_bgpcfgd-1.0-py3-none-any.whl ]
[2023-09-20T04:42:00.004Z] Build start time: Wed Sep 20 04:41:54 UTC 2023
[2023-09-20T04:42:00.004Z] [ REASON ] :      target/python-wheels/bullseye/sonic_bgpcfgd-1.0-py3-none-any.whl does not exist   NON-EXISTENT PREREQUISITES: target/python-wheels/bullseye/sonic_config_engine-1.0-py3-none-any.whl-install target/python-wheels/bullseye/sonic_yang_mgmt-1.0-py3-none-any.whl-install target/python-wheels/bullseye/sonic_yang_models-1.0-py3-none-any.whl-install target/debs/bullseye/libyang_1.0.73_amd64.deb-install target/debs/bullseye/libyang-cpp_1.0.73_amd64.deb-install target/debs/bullseye/python3-yang_1.0.73_amd64.deb-install target/debs/bullseye/python3-swsscommon_1.0.0_amd64.deb-install 
[2023-09-20T04:42:00.004Z] [ FLAGS  FILE    ] : [] 
[2023-09-20T04:42:00.005Z] [ FLAGS  DEPENDS ] : [mellanox amd64 bullseye] 
[2023-09-20T04:42:00.005Z] [ FLAGS  DIFF    ] : [mellanox amd64 bullseye ] 
[2023-09-20T04:42:00.005Z] /sonic/src/sonic-bgpcfgd /sonic
[2023-09-20T04:42:00.005Z] running pytest
[2023-09-20T04:42:00.005Z] Searching for netaddr==0.8.0
[2023-09-20T04:42:00.005Z] Best match: netaddr 0.8.0
[2023-09-20T04:42:00.005Z] 
[2023-09-20T04:42:00.005Z] Using /var/sw-r2d2-bot/.local/lib/python3.9/site-packages
[2023-09-20T04:42:00.005Z] running egg_info
[2023-09-20T04:42:00.005Z] writing sonic_bgpcfgd.egg-info/PKG-INFO
[2023-09-20T04:42:00.005Z] writing dependency_links to sonic_bgpcfgd.egg-info/dependency_links.txt
[2023-09-20T04:42:00.005Z] writing entry points to sonic_bgpcfgd.egg-info/entry_points.txt
[2023-09-20T04:42:00.005Z] writing requirements to sonic_bgpcfgd.egg-info/requires.txt
[2023-09-20T04:42:00.005Z] writing top-level names to sonic_bgpcfgd.egg-info/top_level.txt
[2023-09-20T04:42:00.005Z] reading manifest file 'sonic_bgpcfgd.egg-info/SOURCES.txt'
[2023-09-20T04:42:00.005Z] writing manifest file 'sonic_bgpcfgd.egg-info/SOURCES.txt'
[2023-09-20T04:42:00.005Z] running build_ext
[2023-09-20T04:42:00.005Z] Traceback (most recent call last):
[2023-09-20T04:42:00.005Z]   File "/sonic/src/sonic-bgpcfgd/setup.py", line 3, in <module>
[2023-09-20T04:42:00.005Z]     setuptools.setup(
[2023-09-20T04:42:00.005Z]   File "/usr/local/lib/python3.9/dist-packages/setuptools/__init__.py", line 163, in setup
[2023-09-20T04:42:00.005Z]     return distutils.core.setup(**attrs)
[2023-09-20T04:42:00.005Z]   File "/usr/lib/python3.9/distutils/core.py", line 148, in setup
[2023-09-20T04:42:00.005Z]     dist.run_commands()
[2023-09-20T04:42:00.006Z]   File "/usr/lib/python3.9/distutils/dist.py", line 966, in run_commands
[2023-09-20T04:42:00.006Z]     self.run_command(cmd)
[2023-09-20T04:42:00.006Z]   File "/usr/lib/python3.9/distutils/dist.py", line 985, in run_command
[2023-09-20T04:42:00.006Z]     cmd_obj.run()
[2023-09-20T04:42:00.006Z]   File "/usr/local/lib/python3.9/dist-packages/ptr.py", line 208, in run
[2023-09-20T04:42:00.006Z]     with self.project_on_sys_path():
[2023-09-20T04:42:00.006Z]   File "/usr/lib/python3.9/contextlib.py", line 117, in __enter__
[2023-09-20T04:42:00.006Z]     return next(self.gen)
[2023-09-20T04:42:00.006Z]   File "/usr/local/lib/python3.9/dist-packages/setuptools/command/test.py", line 168, in project_on_sys_path
[2023-09-20T04:42:00.006Z]     require('%s==%s' % (ei_cmd.egg_name, ei_cmd.egg_version))
[2023-09-20T04:42:00.006Z]   File "/usr/local/lib/python3.9/dist-packages/pkg_resources/__init__.py", line 899, in require
[2023-09-20T04:42:00.006Z]     needed = self.resolve(parse_requirements(requirements))
[2023-09-20T04:42:00.006Z]   File "/usr/local/lib/python3.9/dist-packages/pkg_resources/__init__.py", line 790, in resolve
[2023-09-20T04:42:00.006Z]     raise VersionConflict(dist, req).with_context(dependent_req)
[2023-09-20T04:42:00.006Z] pkg_resources.ContextualVersionConflict: (netaddr 0.9.0 (/var/sw-r2d2-bot/.local/lib/python3.9/site-packages), Requirement.parse('netaddr==0.8.0'), {'sonic-bgpcfgd'})
[2023-09-20T04:42:00.007Z] [  FAIL LOG END  ] [ target/python-wheels/bullseye/sonic_bgpcfgd-1.0-py3-none-any.whl ]
[2023-09-20T04:42:00.007Z] make: *** [slave.mk:881: target/python-wheels/bullseye/sonic_bgpcfgd-1.0-py3-none-any.whl] Error 1
[2023-09-20T04:42:00.007Z] make: *** Waiting for unfinished jobs....
2023-10-20 12:34:05 +08:00
abdosi
366d558dd9 [chassis/multi-asic] Enable Sending BGP Community over internal neighbors over iBGP Session (#16705)
What I did:
Enable Sending BGP Community over internal neighbors over iBGP Session

Microsoft ADO: 25268695

Why I did:
Without this change, BGP communities sent by e-BGP peers are not carried forward to other e-BGP peers.


str2-xxxx-lc1-2# show bgp ipv6  20c0:a801::/64
BGP routing table entry for 20c0:a801::/64, version 52141
Paths: (1 available, best #1, table default)
  Not advertised to any peer
  65000 65500
    2603:10e2:400::6 from 2603:10e2:400::6 (3.3.3.6)
      Origin IGP, localpref 100, valid, internal, best (First path received)
      Last update: Tue Sep 26 16:08:26 2023
str2-xxxx-lc1-2# show ip bgp 192.168.35.128/25
BGP routing table entry for 192.168.35.128/25, version 52688
Paths: (1 available, best #1, table default)
  Not advertised to any peer
  65000 65502
    3.3.3.6 from 3.3.3.6 (3.3.3.6)
      Origin IGP, localpref 100, valid, internal, best (First path received)
      Last update: Tue Sep 26 15:45:51 2023

After the change

str2-xxxx-lc2-2(config)# router bgp 65100
str2-xxxx-lc2-2(config-router)# address-family ipv4
str2-xxxx-lc2-2(config-router-af)# neighbor INTERNAL_PEER_V4 send-community
str2-xxxx-lc2-2(config-router-af)# exit
str2-xxxx-lc2-2(config-router)# address-family ipv6
str2-xxxx-lc2-2(config-router-af)# neighbor INTERNAL_PEER_V6 send-community
str2-xxxx-lc1-2# show bgp ipv6  20c0:a801::/64
BGP routing table entry for 20c0:a801::/64, version 52400
Paths: (1 available, best #1, table default)
  Not advertised to any peer
  65000 65500
    2603:10e2:400::6 from 2603:10e2:400::6 (3.3.3.6)
      Origin IGP, localpref 100, valid, internal, best (First path received)
      **Community: 1111:1111**
      Last update: Tue Sep 26 16:10:19 2023
str2-xxxx-lc1-2# show ip bgp 192.168.35.128/25
BGP routing table entry for 192.168.35.128/25, version 52947
Paths: (1 available, best #1, table default)
  Not advertised to any peer
  65000 65502
    3.3.3.6 from 3.3.3.6 (3.3.3.6)
      Origin IGP, localpref 100, valid, internal, best (First path received)
      **Community: 1111:1111**
      Last update: Tue Sep 26 16:10:09 2023

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2023-10-20 12:34:00 +08:00
Vaibhav Hemant Dixit
08742080d1 [fast-reboot] Fix regression: set FAST_REBOOT state_db flag to support fast-reboot from older images (#16733)
Why I did it
Fix: #16699

Fast reboot is failing from old OS versions (e.g., 201911 image) to the latest (e.g., master branch) after PR #15685.

The system-wide flag for FAST_REBOOT is still required when the base OS version does not support the new fast-reboot reconciliation logic (no db dump).
2023-10-20 12:33:53 +08:00
Vadym Hlushko
4e865e97a4 [memory_checker] Add a specific log message in a case when the docker service is not running. (#16018)
#### Why I did it
To fix the logic introduced by [[memory_checker] Do not check memory usage of containers which are not created #11129](https://github.com/sonic-net/sonic-buildimage/pull/11129).
There could be a scenario before the reboot, where
1. The `docker service` has stopped
2. In a very short period of time, the monit service performs the `root@sonic:/home/admin# monit status container_memory_telemetry`

In such a scenario, the `memory_checker` script will throw an error to the syslog:
```
ERR memory_checker: Failed to retrieve the running container list from docker daemon! Error message is: 'Error while fetching server API version: ('Connection aborted.', FileNotFoundError(2, 'No such file or directory'))'
```
But this is actually correct behavior: when the docker service is stopped, the Unix socket is destroyed, which is why the `FileNotFoundError(2, 'No such file or directory')` exception appears in the syslog.

#### How I did it
Changed the log severity to warning and changed the return value.
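
A minimal sketch of this kind of handling, assuming a hypothetical `list_containers` callable that stands in for the query to the docker daemon. The commit does not say what the new return value is; returning an empty list here is an assumption for illustration only.

```python
import logging

logger = logging.getLogger("memory_checker")


def get_running_containers(list_containers):
    """Return the running container names, or [] if the daemon is unreachable.

    `list_containers` is a hypothetical callable; the empty-list return
    value is an assumption, not the script's documented behavior.
    """
    try:
        return list_containers()
    except Exception as err:  # e.g. the Unix socket is gone after docker stops
        # Logged as a warning rather than an error, since a stopped docker
        # service is an expected state shortly before reboot.
        logger.warning(
            "Failed to retrieve the running container list from docker daemon! "
            "Error message is: '%s'", err
        )
        return []
```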

#### How to verify it
It is really hard to catch the exact moment described in the `Why I did it` section.
In order to check the logic:
1. Change the Unix socket path to non-existing in [/usr/bin/memory_checker](47742dfc2c/files/image_config/monit/memory_checker (L139)) file on the switch.
2. Execute the `root@sonic:/home/admin# monit restart container_memory_telemetry`
3. Check the syslog for such messages:
```
WARNING memory_checker: Failed to retrieve the running container list from docker daemon! Error message is: 'Error while fetching server API version: ('Connection aborted.', FileNotFoundError(2, 'No such file or directory'))'

INFO memory_checker: [memory_checker] Exits without checking memory usage since container 'telemetry' is not running!
```
2023-10-20 12:33:46 +08:00
abdosi
9a63b9df8d [chassisd]: Updated the API get_platform_info() to return running/detected ASIC's count (#16539)
Previously, get_num_asics() returned the maximum number of ASICs. However, asic_count
should be the actual number of ASICs populated, which can be obtained from get_asic_presence_list().
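
The difference can be illustrated with a small Python sketch. The two stub functions below only mimic the names mentioned above with made-up values (a 3-slot platform with 2 ASICs present); they are not the real SONiC helpers.

```python
def get_num_asics():
    """Stub: maximum number of ASICs the platform can hold (made-up value)."""
    return 3


def get_asic_presence_list():
    """Stub: IDs of the ASICs actually detected (made-up values)."""
    return [0, 2]


def detected_asic_count():
    """Count only the ASICs that are present, not the platform maximum."""
    return len(get_asic_presence_list())
```

With these stubs, `get_num_asics()` gives 3 while `detected_asic_count()` gives 2, which is the value asic_count is meant to carry.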

ADO: 25158825

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2023-10-20 12:33:41 +08:00
Longxiang Lyu
3090d2671a [snmp] Check intfmgrd running before start (#16588)
Add a pre-start check to ensure intfmgrd is running.
The check will run for 20 seconds at most.

Signed-off-by: Longxiang Lyu <lolv@microsoft.com>
2023-10-20 12:33:36 +08:00
Samuel Angebault
cf4f06d1c5 Disable CPU C-States other than C1 (#16703)
Why I did it
Networking devices need to be responsive. Such responsiveness is harmed when the CPU changes state.
There is a latency penalty when a CPU is idle (e.g., C2) and needs to exit this state to come back to the C1 state.
To prevent this from happening, the CPU should be forced to remain in the C1 state.

How I did it
Generalize the cstate forcing to C1 to all Arista products.
This is done by adding processor.max_cstate=1 to the kernel cmdline for all CPUs.
Additionally, Intel CPUs also need intel_idle.max_cstate=0 to fall back to the acpi_idle driver.

How to verify it
Check that processor.max_cstate=1 is present on the cmdline for AMD CPUs
Check that both processor.max_cstate=1 and intel_idle.max_cstate=0 are present on the cmdline for Intel CPUs
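
The two checks above could be scripted roughly as follows. This is a sketch only; the helper name `missing_cstate_params` and the `intel_cpu` flag are hypothetical, and how the CPU vendor is detected is left out.

```python
def missing_cstate_params(cmdline, intel_cpu):
    """Return the expected C-state parameters that are absent from cmdline."""
    required = ["processor.max_cstate=1"]
    if intel_cpu:
        # Intel CPUs additionally need the intel_idle driver disabled.
        required.append("intel_idle.max_cstate=0")
    present = set(cmdline.split())
    return [param for param in required if param not in present]
```

On a running switch, the `cmdline` argument would typically be the contents of /proc/cmdline; an empty result means every expected parameter is present.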
2023-10-20 12:33:31 +08:00
Saikrishna Arcot
d504600c9f [baseimage]: Update openssh to 1:8.4p1-5+deb11u2 (#16826)
OpenSSH in Debian Bullseye has been updated to 1:8.4p1-5+deb11u2 to fix CVE-2023-38408.
Since we're building openssh with some patches, we need to update our version as well.

Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>
2023-10-20 12:33:26 +08:00
mssonicbld
466f689e78
[submodule] Update submodule sonic-linux-kernel to the latest HEAD automatically (#16654)
#### Why I did it
src/sonic-linux-kernel
```
* 9534615 - (HEAD -> 202211, origin/202211) arm64: ac5: Fix watchdog timeleft (#334) (5 days ago) [pavannaregundi]
* 70c4df8 - [marvell-arm64]: Add support for 98DX35xx and 98CX85xx platform (#311) (6 days ago) [pavannaregundi]
* aab079e - [Mellanox] Upstream kernel patches with HW-MGMT 7.0030.1011 (#327) (4 weeks ago) [Kebo Liu]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2023-10-19 16:34:30 +08:00
mssonicbld
c6a1838a32
[submodule] Update submodule linkmgrd to the latest HEAD automatically (#16855)
#### Why I did it
src/linkmgrd
```
* abb22d2 - (HEAD -> 202211, origin/202211) [warmboot] config all interfaces back to `auto` if reconciliation times out  (#220) (7 days ago) [Jing Zhang]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2023-10-19 14:34:44 +08:00
mssonicbld
6d3cd99217
[submodule] Update submodule sonic-swss to the latest HEAD automatically (#16695)
#### Why I did it
src/sonic-swss
```
* 9647b81f - (HEAD -> 202211, origin/202211) [muxorch] Reorder the neighbor disable operations (#2917) (12 hours ago) [Longxiang Lyu]
* 30cea968 - Support type7 encoded CAK key for macsec in config_db (#2892) (5 days ago) [judyjoseph]
* 8d76a4e7 - [202211][ppi]: General code cleanup: remove unused methods. (#2868) (3 weeks ago) [Nazarii Hnydyn]
```
#### How I did it
#### How to verify it
#### Description for the changelog
2023-10-19 10:33:44 +08:00
Volodymyr Samotiy
29926587be
[202211][Mellanox] Update SAI version to SAIBuild2211.25.1.6 (#16522)
Signed-off-by: Volodymyr Samotiy <volodymyrs@nvidia.com>
2023-10-10 14:38:04 +08:00
Kebo Liu
b56b34aa2c
[202211][Mellanox] Update HW-MGMT package to new version V.7.0030.1011(#16239) (#16295)
* [Mellanox] Update HW-MGMT package to new version V.7.0030.1010

Signed-off-by: Kebo Liu <kebol@mellanox.com>

* Update hw-mgmt version to 7.0030.1011

Signed-off-by: Kebo Liu <kebol@nvidia.com>

---------

Signed-off-by: Kebo Liu <kebol@mellanox.com>
Signed-off-by: Kebo Liu <kebol@nvidia.com>
2023-10-10 14:30:56 +08:00
mssonicbld
2b738c53fa
Fix the dependency grpcio-tools version (#16776) (#16810) 2023-10-09 18:01:59 +08:00
Junchao-Mellanox
5c74ecb87f [Mellanox] wait reset cause ready (#16722)
Why I did it
The SONiC service determine-reboot-cause might run before the driver creates the reset cause files. In that case, the reset cause will be "Unknown". This PR introduces a mechanism to wait until the reset cause sysfs files are ready.

How I did it
/run/hw-management/config/reset_attr_ready is the file that indicates all reset cause files are ready. The chassis.get_reboot_cause function waits for /run/hw-management/config/reset_attr_ready for up to 45 seconds.
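
A bounded wait of this kind might look like the sketch below. The function name `wait_for_file`, the one-second polling interval, and the boolean return value are assumptions for illustration; only the file path and the 45-second limit come from the description above.

```python
import os
import time

RESET_ATTR_READY = "/run/hw-management/config/reset_attr_ready"


def wait_for_file(path=RESET_ATTR_READY, timeout_s=45.0, interval_s=1.0):
    """Poll until `path` exists; give up once `timeout_s` seconds have passed.

    Returns True if the file appeared in time, False otherwise.
    The polling interval is an assumed value.
    """
    deadline = time.monotonic() + timeout_s
    while not os.path.exists(path):
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval_s)
    return True
```

A caller could read the reset cause files when this returns True and fall back to its existing behavior otherwise.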

How to verify it
Manual test on master/202211/202205
2023-10-05 09:33:21 +08:00
mssonicbld
7049b6f788
[Ci] Change the package upgrade PR title (#16674) (#16728) 2023-09-27 22:05:31 +08:00
mssonicbld
2cfa8b2d93
[build] Fix build issue in docker-ptf-sai caused by setuptools_scm new release (#16636) (#16680)
Why I did it
When SUPERVISOR_PROC_EXIT_LISTENER_SCRIPT changed, almost all dockers need to be built again.
But currently it will be loaded by cache.

Work item tracking
Microsoft ADO (number only): 25123348
How I did it
Add $(DOCKER)_FILES into dependencies.

How to verify it
2023-09-26 18:47:01 +08:00
mssonicbld
5b6fcb7711
[ci/build]: Upgrade SONiC package versions (#15614) 2023-09-23 00:34:57 -07:00
Yoush
330d0780fd
[centec]: update sonic centec-sai reference to v1.11.0-2 for 202211 (#16241)
Change the makefile to reference the new SAI Debian package v1.11.0-2 for Centec on 202211.

Signed-off-by: yoush <yoush@centec.com>
2023-09-23 00:29:16 -07:00
Stephen Sun
7ea54f53d5
Add yang model for scheduler in PORT_QOS_MAP (#16244) (#16359)
Signed-off-by: Stephen Sun <stephens@nvidia.com>
2023-09-19 10:06:32 +08:00
vganesan-nokia
9ffa92cc61
[swss] Chassis db clean up optimization and bug fixes (#16454) (#16540)
* [swss] Chassis db clean up optimization and bug fixes

This commit includes the following changes:
    - Fix for regression failure due to error in finding CHASSIS_APP_DB in
    pizzabox (#PR 16451)
    - After attempting to delete the system neighbor entries from
    chassis db, before starting clearing the system interface entries,
    wait for some time only if some system neighbors were deleted.
    If there are no system neighbor entries deleted for the asic coming up,
    no need to wait.
    - Similar changes for system lag delete. Before deleting the
    system lag, wait for some time only if some system lag members were
    deleted. If there are no system lag members deleted, no need to wait.
    - Flush the SYSTEM_NEIGH_TABLE from the local STATE_DB. While asic
    is coming up, when system neigh entries are deleted from chassis app
    db (as part of chassis db clean up), there is no orchs/process running to
    process the delete messages from chassis redis. Because of this, stale system
    neigh entries are present in the local STATE_DB. The stale entries result in
    creation of orphan (no corresponding data path/asic db entry) kernel neigh
    entries during STATE_DB:SYSTEM_NEIGH_TABLE entries processing by nbrmgr (after
    the swss service came up). This is avoided by flushing the SYSTEM_NEIGH_TABLE from
    the local STATE_DB when the service comes up.
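
The "wait only if something was deleted" rule from the list above can be sketched like this. It is a Python illustration under assumptions, not the swss cleanup code: `cleanup_stage`, the `delete_entries` callable, and the wait length are all hypothetical.

```python
import time


def cleanup_stage(delete_entries, wait_s, sleep=time.sleep):
    """Run one cleanup stage and pause only when entries were removed.

    `delete_entries` is a hypothetical callable returning the number of
    chassis db entries it deleted (e.g. system neighbors or lag members).
    """
    deleted = delete_entries()
    if deleted > 0:
        # Give the delete messages time to be processed before the next stage.
        sleep(wait_s)
    return deleted
```

When a stage deletes nothing, the next stage starts immediately, which is the optimization the commit describes.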

Signed-off-by: vedganes <veda.ganesan@nokia.com>

* [swss] Chassis db clean up bug fixes review comment fix - 1

Debug logs added for deletion of other tables (SYSTEM_INTERFACE and SYSTEM_LAG_TABLE)

Signed-off-by: vedganes <veda.ganesan@nokia.com>

---------

Signed-off-by: vedganes <veda.ganesan@nokia.com>
(cherry picked from commit b13b41fc22)
2023-09-14 14:06:52 -07:00
SuvarnaMeenakshi
379f256bcf
[202211][SNMP][IPv6]: Revert PRs to support SNMP over IPv6 (#16278)
* Revert "[SNMP][IPv6]: Fix to use link local IPv6 address as snmp agentAddress (#16013)"

This reverts commit 803c71c86a.

* Revert "[SNMP][IPv6]: Fix SNMP IPv6 reachability issue in certain scenarios (#15487)"

This reverts commit 9864dfeaa1.
2023-09-10 22:18:17 +08:00
Dror Prital
4e67c18c11
[202211][Mellanox] Update SDK/FW to 4.6.1062/2012.1062 Update SDK/FW/SAI to 4.6.1062/2012.1062/SAIBuild2211.25.1.4 (#16434)
SAI bug Fixes

- When creating an ACL rule with SAI_ACL_ENTRY_ATTR_FIELD_SRC_IP/SAI_ACL_ENTRY_ATTR_FIELD_DST_IP enabled, and then disabling the field by setting enable=false, a match on L3_type=IPv4 will remain programmed for the rule. Issue resolved after the fix.
- Allow the max scale of virtual routers to be configured for SPC-1, SPC-2, SPC-3, which is 255 when fastboot is enabled and 511 when fastboot is disabled
- Remove default hash key of SRC_MAC, DST_MAC and ETH_TYPE

SDK/FW bug fixes

- When performing fast boot from an old SDK version (currently installed) to a newer one (target version), and the system was initially loaded with a new SDK version (past version), and the system has not been wiped, under specific conditions, the fast boot would use the past version's data and may fail.
2023-09-08 23:55:42 -07:00
mssonicbld
4390159698
Update macsec CAK keys in profile for tests to change to type7 encoded format (#16388) (#16500) 2023-09-09 04:37:17 +08:00
Yaqiang Zhu
3310592d8f [yang] Add Bmc to Device Neighbor Metadata element type list (#16188)
Bmc is a valid neighbor type in minigraph, but it was missing from the YANG model definition. Usually, a Bmc type device can be a neighbor of BmcMgmtToRRouter. This PR introduces this type.
2023-09-07 12:33:20 +08:00
mssonicbld
d0325862a8
[Mellanox] set select timeout to no more than 1 sec to make sure fast shutdown (#13611) (#16448) 2023-09-06 04:23:23 +08:00
mssonicbld
67ea31edc8
[submodule] Update submodule sonic-utilities to the latest HEAD automatically (#16442) 2023-09-05 18:45:48 +08:00
Zain Budhwani
71678dc355 [eventd]: Remove unnecessary log (#16166)
Work item tracking
Microsoft ADO (number only): 16789053
2023-09-03 16:33:03 +08:00
Senthil Kumar Guruswamy
7ab6be8440 Handle service start-limit-hit failure event case in sysmonitor (#16174) 2023-09-03 16:32:58 +08:00
mssonicbld
1909f019ab
[P4RT]Disabling p4rt by default to overcome build issues (#16343) (#16426) 2023-09-03 16:06:31 +08:00
Stephen Sun
72aab2b58e Fix issue: unprintable character is rendered when handling comments in j2 (#16287)
Use "{#-" and "-#}" to mark comments in jinja template

Signed-off-by: Stephen Sun <stephens@nvidia.com>
2023-09-03 14:32:27 +08:00
mssonicbld
07b5677095
[Nokia][DeviceData] Update the Nokia platform IXR-7250E device data (#16028) (#16380) 2023-09-02 07:32:54 +08:00
mssonicbld
26e1d59867
[chassis] Chassis DB cleanup when asic comes up (#16213) (#16379) 2023-09-02 07:18:17 +08:00