Commit Graph

5661 Commits

Author SHA1 Message Date
Kevin Wang
83780549c7 [Buffer] Separate buffer profile for Arista-7060CX-32S-D48C8
Signed-off-by: Kevin Wang <shengkaiwang@microsoft.com>
2022-07-07 14:09:01 -07:00
Kevin Wang
e8f04cd2e6 [Buffer] Separate buffer profile for Arista-7060CX-32S-Q32
Signed-off-by: Kevin Wang <shengkaiwang@microsoft.com>
2022-07-07 14:09:01 -07:00
Kevin Wang
2c81f02b13 [Buffer] Separate buffer profile for Celestica-DX010-D48C8
Signed-off-by: Kevin Wang <shengkaiwang@microsoft.com>
2022-07-07 14:09:01 -07:00
Kevin Wang
4dbdc8e0a0 [Buffer] Separate buffer profile for Force10-S6100
Signed-off-by: Kevin Wang <shengkaiwang@microsoft.com>
2022-07-07 14:09:01 -07:00
Ying Xie
2ed29da38d [buffer] create infrastructure to enable buffer/QoS profiles
Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2022-07-07 14:09:01 -07:00
Vivek
729b4b4c40
Update sonic-swss submodule (#11310)
Update sonic-swss submodule

```
639d10e [PFC_WD] [202012] Avoid applying ZeroBuffer Profiles to ingress PG when a PFC storm is detected (https://github.com/Azure/sonic-buildimage/pull/2310)
475ae19 [202012] [cherry-pick] Apply `DSCP_TO_TC_MAP` from `PORT_QOS_MAP|global` to switch level (https://github.com/Azure/sonic-buildimage/issues/2328)
aa6f855 [ci] Change artifact reference pipeline to common lib pipeline. (https://github.com/Azure/sonic-buildimage/pull/2294)
752f8c5 [ci] Use correct branch when downloading artifact. (https://github.com/Azure/sonic-buildimage/pull/2292)
b3fcc5d [ci] Improve azp trigger settings to automaticlly support new release branch. (https://github.com/Azure/sonic-buildimage/pull/2289)
```
2022-07-06 17:43:35 -07:00
Zhijian Li
24b90d7556
[cherry-pick][202012] Fix issue where HLX module failed to do postinit (#11351)
* [HLX] Fix issue where HLX module failed to do postinit (#7274)

Signed-off-by: Jing Kan <jika@microsoft.com>
2022-07-06 17:27:29 +08:00
mssonicbld
9a86fa9264
[ci/build]: Upgrade SONiC package versions (#11074)
Upgrade SONiC Versions
2022-07-06 11:00:50 +08:00
Alexander Allen
851bd9bff8 [Mellanox] Add arch folder to SDK binary location (#11278)
- Why I did it
This is for the eventual support of multiple architectures for the mellanox platform.

- How I did it
Change the location of the binaries in Switch-SDK-drivers so that the path specifies the target architecture in addition to the target distribution that the debians are built for.

This is the most straightforward way to separate binaries built against different architectures and selectively target them for installation in the mellanox SONiC image.

- How to verify it
Build SONiC for mellanox and verify it compiles successfully.
2022-07-05 20:58:01 +00:00
yozhao101
c1ab4c6831 [tunnel_packet_handler] Add a whitespace in the warning syslog message. (#11232)
* This PR aims to add a whitespace to the warning syslog message of the tunnel_packet_handler process.
Signed-off-by: Yong Zhao <yozhao@microsoft.com>
2022-07-05 20:57:57 +00:00
vmittal-msft
6ada55439d Updated buffer profile settings for TD3 based HWSKUs (Arista-7050CX3-32S-C32, Arista-7050CX3-32S-D48C8) (#11202)
* Updated buffer profile settings for TD3 based HWSKUs (Arista-7050CX3-32S-C32, Arista-7050CX3-32S-D48C8)
2022-07-05 20:57:53 +00:00
xumia
32cda89f93 [Build]: Support to use symbol links for lazy installation targets to reduce the image size (#10923)
Why I did it
Support using symbolic links in the platform folder to reduce the image size.
The current solution copies each lazy installation target (xxx.deb file) into every folder under the platform folder. The size keeps growing as more packages are added to the platform folder. For cisco-8000, for example, the size reaches up to 2G, and most of that is duplicate packages in the platform folder.

How I did it
Create a new folder, platform/common, and copy all the deb packages into it. Every other folder that uses the packages holds symbolic links to the common folder.

Why platform.tar?
We implemented a patch for this, see #10775, but the problem is that ONIE uses a really old unzip version that cannot support symbolic links.
The current solution is similar to PR 10775, but packs the platform folder into a tar package, which ONIE supports. During installation, the package.tar is extracted to the original folder and then removed.
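
The idea described above can be sketched roughly as follows. This is a minimal illustration, not the build system's actual implementation: the function names, the vendor folder names, and the placeholder package payload are all hypothetical. It stores each package once in a common folder, symlinks it from the per-platform folders, and packs the result with tar, which keeps symbolic links as links.

```python
import os
import tarfile
import tempfile


def link_lazy_packages(platform_root, debs_by_platform):
    """Store each .deb once under <platform_root>/common and symlink it from
    every platform folder that needs it (folder names are hypothetical)."""
    common = os.path.join(platform_root, "common")
    os.makedirs(common, exist_ok=True)
    for platform, debs in debs_by_platform.items():
        platform_dir = os.path.join(platform_root, platform)
        os.makedirs(platform_dir, exist_ok=True)
        for deb in debs:
            stored = os.path.join(common, deb)
            if not os.path.exists(stored):
                # Stand-in for copying the real package into the common folder.
                with open(stored, "wb") as handle:
                    handle.write(b"placeholder package payload")
            link = os.path.join(platform_dir, deb)
            if not os.path.lexists(link):
                # Relative link, so it stays valid wherever the tree is extracted.
                os.symlink(os.path.join("..", "common", deb), link)


def pack_platform(platform_root, tar_path):
    """Pack the folder into a tar archive. tarfile keeps symlinks as links by
    default (dereference=False) instead of duplicating the file content."""
    with tarfile.open(tar_path, "w") as tar:
        tar.add(platform_root, arcname="platform")


def demo():
    """Build a tiny tree in a temp dir and return the symlink members of the tar."""
    with tempfile.TemporaryDirectory() as tmp:
        root = os.path.join(tmp, "platform")
        link_lazy_packages(
            root, {"vendor-a": ["pkg_1.0.deb"], "vendor-b": ["pkg_1.0.deb"]}
        )
        tar_path = os.path.join(tmp, "platform.tar")
        pack_platform(root, tar_path)
        with tarfile.open(tar_path) as tar:
            return sorted(m.name for m in tar.getmembers() if m.issym())
```

Calling demo() lists the archive members that were stored as symbolic links; the single real copy of the package lives under platform/common.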
2022-07-05 20:57:49 +00:00
yozhao101
4487a962e3 [memory_checker] Do not check memory usage of containers which are not created (#11129)
Signed-off-by: Yong Zhao <yozhao@microsoft.com>

Why I did it
This PR aims to fix an issue (#10088) by enhancing the script memory_checker.

Specifically, if a container is not created successfully when the device is booted or rebooted, then memory_checker does not need to check its memory usage.

How I did it
In the script memory_checker, a function is added to get the names of running containers. If the specified container name is not in the current list of running containers, the script exits without checking its memory usage.
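
A rough sketch of that check, for illustration only. The function names and the log message wording are assumptions, and the container list is read here through the docker CLI, which may differ from how the real memory_checker script obtains it.

```python
import subprocess
import sys


def get_running_container_names():
    """Return the names of running containers by asking the docker CLI."""
    result = subprocess.run(
        ["docker", "ps", "--format", "{{.Names}}"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.split()


def should_check_memory(container_name, running_containers):
    """Only containers that are currently running get a memory check."""
    return container_name in running_containers


def main(container_name):
    if not should_check_memory(container_name, get_running_container_names()):
        # Hypothetical message; the real syslog text may differ.
        print(f"Exiting without checking memory usage: '{container_name}' is not running")
        sys.exit(0)
    # The actual memory-usage check would follow here.
```

Keeping the decision in should_check_memory makes it testable without a docker daemon.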

How to verify it
I tested on a lab device by following the steps:

1. Stop the telemetry container with the command sudo systemctl stop telemetry.service
2. Remove the telemetry container with the command docker rm telemetry
3. Check whether the memory_checker script run by Monit generates a syslog message saying it will exit without checking the memory usage of telemetry.
2022-07-05 20:57:45 +00:00
Samuel Angebault
d15a484dfa
[202012][Arista] Fix cmdline generation during warm-reboot from 201811/201911 (#11161)
Issue fixed: when performing a warm-reboot or fast-reboot from 201811 or 201911 to 202012, the kernel command line contains duplicate information. This issue is related to a change that was made to make the 202012 boot0 file more future-proof.
A cold reboot brings everything back to a clean slate, though that is not always desirable.

Changes done:
Added some logic to properly detect the end of the Aboot cmdline when the cmdline-aboot-end delimiter is not set (clean case)
Added some logic to regenerate the Aboot cmdline when cmdline-aboot-end is set but duplicate parameters exist before it (dirty case). Reorganized some code to handle duplicate parameters in the allowlist.
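
As a rough illustration of the duplicate handling only: the sketch below drops repeated kernel command-line parameters while keeping their first occurrence. It is an assumption about the general technique, not the boot0 logic itself (which is a shell script), and the function name is hypothetical.

```python
def dedupe_cmdline(cmdline):
    """Drop repeated parameters from a kernel command line, keeping the
    first occurrence of each and preserving the original order."""
    seen = set()
    kept = []
    for token in cmdline.split():
        if token not in seen:
            seen.add(token)
            kept.append(token)
    return " ".join(kept)
```

For example, dedupe_cmdline("console=ttyS0 quiet console=ttyS0") keeps a single console=ttyS0 entry.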
2022-07-04 11:01:03 -07:00
Stephen Sun
fe6be5da92
[202012] Configure different map between uplink and downlink on t1 switch in dual ToR scenario (#11299)
- Why I did it
Configure different DSCP_TO_TC_MAP between uplink and downlink on T1 switch in dual ToR scenario
On T1 uplink, both DSCP 2/6 will be mapped to TC 1 for the purpose of avoiding such traffic occupying lossless buffers.
On T1 downlink, they will be mapped to TC 2/6 respectively. (unchanged)

- How I did it
Vendors who want to configure a different DSCP_TO_TC_MAP for uplinks and downlinks on T1 should:
1. Define the generate_dscp_to_tc_map macro in the SKU's qos.json.j2 file
2. Define the map AZURE for downlinks and AZURE_UPLINK for uplinks
3. Define the jinja2 variable different_dscp_to_tc_map as True
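
The resulting pair of maps can be pictured with the following sketch. It is illustrative only: the real maps are rendered from the jinja2 template, the helper name is hypothetical, and the base entries used in the usage note are placeholders rather than the actual default mapping. Only the treatment of DSCP 2 and 6 comes from the description above.

```python
def build_dscp_to_tc_maps(base_map):
    """Return the downlink (AZURE) and uplink (AZURE_UPLINK) maps.

    Downlink keeps DSCP 2 -> TC 2 and DSCP 6 -> TC 6 (unchanged);
    uplink sends both DSCP 2 and DSCP 6 to TC 1.
    """
    downlink = dict(base_map)
    downlink.update({"2": "2", "6": "6"})
    uplink = dict(base_map)
    uplink.update({"2": "1", "6": "1"})
    return {"AZURE": downlink, "AZURE_UPLINK": uplink}
```

With a placeholder base such as {"0": "1", "3": "3", "4": "4"}, the two maps differ only in the entries for DSCP 2 and 6.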

Signed-off-by: Stephen Sun <stephens@nvidia.com>
2022-07-03 15:58:06 +03:00
xumia
d766e7022e
[Build] Add the missing debian security mirrors in slave images (#11304)
Why I did it
The build below was broken because one of the required Debian mirrors was missing.
https://dev.azure.com/mssonic/build/_build/results?buildId=116719&view=logs&j=88ce9a53-729c-5fa9-7b6e-3d98f2488e3f&t=88f376cf-c35d-5783-0a48-9ad83a873284

 libpci-dev : Depends: libudev-dev (>= 196) but it is not going to be installed
 libsystemd-dev : Depends: libsystemd0 (= 232-25+deb9u14) but 232-25+deb9u13 is to be installed
How I did it
Add the missing mirrors for buster and stretch.
2022-07-01 21:17:03 +08:00
Jing Kan
2b76288a24
[minigraph] Support parse IPv6 in device_desc.xml (#11095) (#11272)
Signed-off-by: Jing Kan <jika@microsoft.com>
2022-07-01 16:23:56 +08:00
Nathan Cohen
94e3eb0474
Updated sonic-utilities to most recent 202012 (#11249)
2022-06-30 15:03:31 +08:00
Sumukha Tumkur Vani
82d59f6b41
[202012][RESTAPI] Update submodule (#11234)
2022-06-23 16:13:17 -07:00
Santhosh Kumar T
7a7c363548
[DellEMC] S6100 Platform Service optimization (#10989)
Why I did it
- To reduce rc.local script execution time.
- Time consumption of rc.local script is around 22 seconds in S6100.
How I did it
- Run platform-modules-s6100.service and s6100-lpc-monitor.service asynchronously to the rc.local script.
How to verify it
- Load the image with the changes; the execution time of the rc.local script is reduced from approx. 22 seconds to approx. 14 seconds during warm-/fast-reboot upgrades.
- sonic-mgmt test results.
2022-06-23 12:58:11 -07:00
Prince Sunny
86f5dfc1f9
[Submodule] update sonic-swss (#11203)
Commits:

bee3684 - 2022-06-20 : Add BGP profile to Vnet routes (#2339) [Prince Sunny]
f9af510 - 2022-06-16 : [intfmgr]: Set proxy_arp kernel param (#2334) [Lawrence Lee]
725071f - 2022-06-08 : Fix test_warm_reboot issues blocking PR merge (#2309) [Vaibhav Hemant Dixit]
0db6f15 - 2021-11-16 : [orchagent] Flush pipeline every 1 second, not only when select will timeout (#2003) [Kamil Cudnik]
2022-06-21 15:38:10 -07:00
Stephen Sun
307d0e2aca
[Mellanox][202012] Support Mellanox-SN4600C-C64 as T1 switch in dual-ToR scenario (#11032)
Why I did it
Support Mellanox-SN4600C-C64 as T1 switch in dual-ToR scenario

1. Support additional queue and PG in buffer templates, including both traditional and dynamic model
2. Support mapping DSCP 2/6 to lossless traffic in the QoS template.
3. Add macros to generate additional lossless PG in the dynamic model
4. Adjust the order in which the generic/dedicated (with additional lossless queues) macros are checked and called to generate buffer tables in common template buffers_config.j2
  - Buffer tables are rendered via using macros.
  - Both generic and dedicated macros are defined on our platform. Currently, the generic one is called as long as it is defined, which causes the generic one to always be called on our platform. To avoid this, the dedicated macro is checked and called first, and then the generic ones.
5. Support MAP_PFC_PRIORITY_TO_PRIORITY_GROUP on ports with additional lossless queues.

On Mellanox-SN4600C-C64, buffer configuration for t1 is calculated as:
40 * 100G downlink ports with 4 lossless PGs/queues, 1 lossy PG, and 3 lossy queues
16 * 100G uplink ports with 2 lossless PGs/queues, 1 lossy PG, and 5 lossy queues

Signed-off-by: Stephen Sun <stephens@nvidia.com>

How to verify it
Run regression test.
2022-06-21 10:04:49 -07:00
Liu Shilong
54de99ba23
[build] Add version files to docker image dependencies (#11192)
* [build] Add version files to docker image dependencies

* [ci] Support to skip vstest using include/exclude config file. (#11086)

example:
├── folderA
│  ├──  fileA (skip vstest)
│  ├──  fileB
│  └──  fileC
If we want to skip vstest when changing /folderA/fileA, and not skip vstest when changing fileB or fileC.

vstest-include:
^folderA/fileA

vstest-exclude:
^folderA
2022-06-21 20:28:50 +08:00
bingwang-ms
6ddf5cd7dc
[202012] [cherry-pick] Generate switch level dscp_to_tc_map entry from qos_config template (#11132)
* Generate switch level dscp_to_tc_map

Signed-off-by: bingwang <wang.bing@microsoft.com>
2022-06-17 20:49:56 +08:00
Jing Kan
5b2261da37
Revert "[202012][openssh] openssh: Upgrade from 7.9 to 8.4, to match version in buster-backports (#10910)" (#11136)
This reverts commit 14fdcc815a.
2022-06-17 20:46:43 +08:00
Nazarii Hnydyn
05ff95fdfc
[Mellanox]: Advance SAI submodule. (#11164)
Fix #3074227 - don't disable used tunnel underlay interfaces
fix bfd - notify Sonic for admin-down event
2022-06-16 18:09:59 -07:00
Saikrishna Arcot
044570c42e
Remove SSH host keys after installing the custom version of sshd (#10633) (#11140)
* Remove SSH host keys after installing the custom version of sshd

Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>

* Use an override for sshd instead of overwriting the service file

Don't overwrite upstream's .service file, and instead use an override
file for making sure the host key(s) are generated.

Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>
2022-06-16 11:47:04 -07:00
Jing Zhang
a99debadc1
[sonic-linkmgrd][202012] submodule update (#11144)

0839af2 Longxiang Lyu Wed Jun 15 08:46:21 2022 +0800 [202012] Fix IP header checksum in handleSendSwitchCommand (#89)
afc4972 Jing Zhang Wed Jun 1 10:33:12 2022 -0700 Revert "Update log level for mux probing and mux state chance (#23)" (#85)
ed52d0a Longxiang Lyu Tue May 31 10:28:30 2022 +0800 Add a command line option to store logs into a separate file (#83)

sign-off: Jing Zhang zhangjing@microsoft.com
2022-06-15 14:40:31 -07:00
Vaibhav Hemant Dixit
dec5dcde01
[Advance submodule] Update sonic-sairedis pointer to latest (#11142)
Update sonic-sairedis submodule to include cross-branch warmboot fix:

Azure/sonic-sairedis#1065: Support ACL action data object in remove dep tree:
This is required if some of the attributes on the ACL action data
object are OIDs and we need to catch their reference when
removing the object and its dependency tree.
2022-06-14 22:40:41 -07:00
Richard.Yu
27cfcf2c5d
[202012][Tunnel PFC][Fix bug] Fix bug and Tests for adding property 'sai_remap_prio_on_tnl_egress' (#11099)
* [202012][Tunnel PFC][Fix bug] Fix bug and Tests for adding property 'sai_remap_prio_on_tnl_egress'

manual cherry-pick from https://github.com/Azure/sonic-buildimage/pull/11027
Fix a bug in the template
Add tests for adding property 'sai_remap_prio_on_tnl_egress', this
property should only be added in dual tor environment.

* Merge change after sync from 202012 and remove mistaken changes from the cherry-pick

Signed-off-by: richardyu-ms <richard.yu@microsoft.com>
2022-06-14 18:50:18 -07:00
Jon Goldberg
159613153f [Nokia ixs7215] change var/log size to 4GB (#11122)
This makes use of #11121 to add support for configuration of VAR_LOG_SIZE on Nokia IXS7215
2022-06-14 09:02:14 -07:00
Jon Goldberg
efdb507795 [installer]: fix armhf for installer.conf usage (#11121)
This fixes the build for armhf to be able to use '/device///installer.conf' files. Specifically, armhf needs support to be able to change the size of /var/log/ directory. It is hardcoded to 512 bytes on all armhf platforms currently. This change will allow any armhf platform to be able to use an installer.conf file to customize the installed image.
2022-06-14 09:02:01 -07:00
Shilong Liu
c27aa88acf
[build] Add dependencies for sonic-config-engine to block bad PR. (#10770) (#11054)
Add dependencies device/* for sonic-config-engine
2022-06-14 13:49:22 +08:00
Nikola Dancejic
49a2d8558d
Update submodule sonic-swss (#11109)
Signed-off-by: Nikola Dancejic <nikola.dancejic@live.com>
2022-06-10 20:45:06 -07:00
Lukas Stockner
ab10005729
[swss] Clear VXLAN tunnel table from State DB on startup (#11078)
*Clear VXLAN tunnel table from State DB on startup
Signed-off-by: Lukas Stockner <lstockner@genesiscloud.com>
2022-06-10 11:50:56 -07:00
Guohan Lu
b0c48f9b31 [devices]: fix j2 syntax error for the config.bcm in Arista-7260CX3-D108C8
Signed-off-by: Guohan Lu <lguohan@gmail.com>
2022-06-10 11:23:28 -07:00
Neetha John
5ceb25e21b
[202012] [sonic-config-engine] Generate expected output with different cable len (#11082)
* Generate expected output with 300m cable len and compare against it

Signed-off-by: Neetha John <nejo@microsoft.com>

* Rewrite assert logic

Signed-off-by: Neetha John <nejo@microsoft.com>
2022-06-10 18:06:44 +08:00
bingwang-ms
68f4e0b738
Update submodule sonic-utilities (#11088)
Signed-off-by: bingwang <wang.bing@microsoft.com>
2022-06-10 08:14:24 +08:00
StormLiangMS
52962f8f59
cherry-pick [bgpcfgd] ECMP overlay VxLan with BGP support to 202012 (#11079)
Why I did it
This is a cherry-pick PR from master to the 202012 branch.
The original PR, which was merged to master, is below.
#10716

How I did it
How to verify it
2022-06-09 17:28:12 +08:00
bingwang-ms
adfe20b5f6
[202012] Automatically enable tunnel_qos_remap on T1 and T0 in DualToR deployment (#11056)
* Automatically enable tunnel_qos_remap on T1 and T0 in DualToR deployment

Signed-off-by: bingwang <wang.bing@microsoft.com>
2022-06-09 16:13:17 +08:00
Shilong Liu
4f19945b48
[ci] Enable default ACR in official/PR/UpgradeVersion pipelines. (#11052)
2022-06-09 15:58:39 +08:00
shlomibitton
2a9aa0836c
[202012] [Mellanox] [pmon] Fix for PMON service not starting when restarting SWSS service after fast/warm reboot (#10902)
- Why I did it
A recent change to delay the PMON service in case of fast/warm reboot introduced an issue when restarting only the SWSS service after a fast/warm reboot on Nvidia platforms.
Since the timer is triggered only when the system boots, if the system has gone through a fast/warm reboot and the user then restarts the SWSS service, the PMON service is stopped as part of the syncd.sh script but the timer does not start again.

- How I did it
In the syncd.sh script, in case of a fast/warm indication, check whether pmon.timer is running.
If it is running, we are at the first boot and continue normally.
If it is not running, the service was restarted, so start the timer to keep the system behavior consistent.
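
The decision above can be summarized in a short sketch. This is only an illustration in Python; the actual change lives in the syncd.sh shell script. The function names and the returned action labels are hypothetical, and returning "none" for other boot types is an assumption, since the description covers only the fast/warm case.

```python
import subprocess


def pmon_timer_action(boot_type, timer_active):
    """Decide what to do about pmon.timer when syncd.sh runs.

    boot_type    -- e.g. "fast", "warm" or "cold"
    timer_active -- whether pmon.timer is currently running
    """
    if boot_type not in ("fast", "warm"):
        return "none"  # assumption: the logic only applies to fast/warm
    if timer_active:
        return "continue"  # first boot, the timer is already pending
    return "start-timer"  # service restart after boot, re-arm the timer


def is_unit_active(unit):
    """`systemctl is-active --quiet` exits with 0 only when the unit is active."""
    completed = subprocess.run(["systemctl", "is-active", "--quiet", unit])
    return completed.returncode == 0
```

On a device, the two pieces would combine as pmon_timer_action("warm", is_unit_active("pmon.timer")).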

- How to verify it
Run fast/warm reboot.
service swss restart.
Observe PMON service starting.
2022-06-08 09:46:54 +03:00
Eric Zhu
27cd735082 [SONiC-CEL]: fix platform fancontrol testcase failure issue (#10934)
2022-06-08 01:21:53 +00:00
xumia
1a6cb0f409 [Ci]: Fix the target directory not empty issue when publishing artifacts #10972
Why I did it
Fix the target directory not empty issue when publishing artifacts.
Some of the artifacts are published to $(Build.ArtifactStagingDirectory)/target/ before the source code is checked out.
2022-06-08 01:21:49 +00:00
mssonicbld
855ae0491f
[ci/build]: Upgrade SONiC package versions (#11051)
Upgrade SONiC Versions #11051
2022-06-08 08:42:04 +08:00
Neetha John
881796f376
[202012] Adjust 7260 buffer sizes to accommodate extra lossless queues (#11050)
Backport changes from #11018

Signed-off-by: Neetha John <nejo@microsoft.com>

Why I did it
As part of PCBB changes, we need to enable 2 extra lossless queues. The changes in this PR are done to adjust only the reserved sizes on Th2 for the additional 2 lossless queues
Calculations are done based on 40 downlinks for T1 and 16 uplinks for dual ToR

How to verify it
Verified that the rendering works fine on a Th2 DUT.
Unit tests have been updated to reflect the modified buffer sizes when pcbb is enabled. There are existing testcases that will test the original buffer sizes when pcbb is disabled. With these changes, I was able to build the sonic-config-engine wheel successfully.
2022-06-06 18:13:16 -07:00
bingwang-ms
2faabf43eb
[202012] Automatically generate extra attributes for tunnel qos remapping (#11045)
* Automatically generate extra attributes for tunnel qos remapping

Signed-off-by: bingwang <wang.bing@microsoft.com>
2022-06-07 08:43:04 +08:00
bingwang-ms
05884d1182
[202012][submodule] Advance sonic-utilities submodule (#11044)
* Update submodule sonic-utilities

Signed-off-by: bingwang <bingwang@microsoft.com>
2022-06-07 08:41:27 +08:00
Kevin Wang
a442391c7d
Update cisco-8000 ref to release: 202012-v0.97 (#11038)
Important fixes since 202012-v0.97:
V0.102:
Hwsku changes to Cisco-8102-C64
Fix for watermark clear issue
V0.101:
Fix for dhcp_relay test issue
V0.100:
Fix for container_autorestart test issue
V0.99:
Fix for everflow test issue
Fix for pfcwd test issue
Fix for copp test issue
V0.98:
Fix for qos_sai test issue
RDMA enhancements dev complete and content included in this drop (flow based VoQ, ECN, Alpha)

Signed-off-by: Kevin Wang <shengkaiwang@microsoft.com>
2022-06-07 08:26:46 +08:00
Richard.Yu
8f3edde302
[202012][BRCM SAI 4.3.5.3-5] Update saibcm for pcbb feature (#10998)
Support Tunnel PFC/pcbb feature on Broadcom platform.

How to verify it
The build target was tested successfully:

make target/docker-syncd-brcm.gz
The following tests were run manually after installing the SAI binary in image 20201231.67 on a 7050CX3 (TD3) T0 DUT; all passed:

     fib/test_fib.py
     vxlan/test_vxlan_decap.py
     fdb/test_fdb.py
     decap/test_decap.py
     pfcwd/test_pfcwd_all_port_storm.py
     acl/null_route/test_null_route_helper.py
     acl/test_acl.py
     vlan/test_vlan.py
     platform_tests/test_reboot.py

Signed-off-by: richardyu-ms <richard.yu@microsoft.com>
2022-06-06 09:54:00 -07:00