Commit Graph

3826 Commits

Author SHA1 Message Date
Prince Sunny
68b717b556 Vxlan src port range for breakout SKU (#7612)
*Extended Vxlan src port range for lab breakout SKU - Mellanox-SN3800-D112C8
2021-05-14 11:42:58 -07:00
shlomibitton
b869ad1122 [Mellanox] Update FW to xx.2008.2526 (#7511)
- Why I did it
Updated FW to xx.2008.2526 version.

Fixed issues:
1. Spectrum-2, Spectrum-3 | sFlow | High CPU load and high on fully loaded switch.
2. Spectrum-2, Spectrum-3 | Fine grain LAG | in rare cases doesn’t update the right entry

- How I did it
Updated submodule pointer and version in a Makefile.

- How to verify it
Full regression and bugs validation

Signed-off-by: Shlomi Bitton <shlomibi@nvidia.com>
2021-05-14 11:42:22 -07:00
Andriy Yurkiv
eb87eb664d [Mellanox] Add support to VXLAN src port range setting via SAI profile for r SN3800-D28C49S1 (#7500)
- Why I did it
Enable VXLAN src port range configuration via SAI profile for Mellanox-SN3800-D28C49S1 SKU

- How I did it
Added SAI_VXLAN_SRCPORT_RANGE_ENABLE=1 configuration to appropriate sai.profile

Signed-off-by: Andriy Yurkiv <ayurkiv@nvidia.com>
2021-05-14 11:41:35 -07:00
Shilong Liu
f3d3c94df7 remove unused file
Signed-off-by: Shilong Liu <shilongliu@microsoft.com>
2021-05-12 15:25:36 +08:00
shilongliu
cec269c582 Update azp files with master branch 2021-05-12 15:25:36 +08:00
xumia
0f27cdd388 Export the azure pipeline build id for SONiC version (#7406)
Improve the SONiC version, fix the "azure pipeline build id" part

<target branch name>-<pullrequest id>.<azure pipelines build id>-<merge commit id>
Example: master-7381.11668-43df5c87
2021-05-12 15:25:36 +08:00
xumia
7ece5de700 [ci]: Fix official build not existing issue (#7408)
When submitting a new official build for broadcom, vs, it prompts a error message, which says the job is not defined.
It was caused by the default option "[]", which is not empty, it is used as the jobGroups parameter.
2021-05-12 15:25:36 +08:00
xumia
a5a381df7e Improve the PR build version (#7381)
Why I did it
Improve the version of the Pull Request build by changing the local branch name.

How I did it
Change the default branch name merge to [target_branch_name]-[pullrequestid].

How to verify it
For official build, the version is not changed.
For pull request build, the version as below:
2021-05-12 15:25:36 +08:00
xumia
8ef37946cd [ci] Fix the boolean value case sensitive issue in Azure Pipelines (#7399)
Why I did it
Fix the boolean value case sensitive issue in Azure Pipelines

When passing parameters to a template, the "true" or "false" will have case sensitive issue, it should be a type casting issue.
To fix it, we change the true/false to yes/no, to escape the trap.

Support to override the job groups in the template, so PR build has chance to use different build parameters, only build simple targets. For example, for broadcom, we only build target/sonic-broadcom.bin, the other images, such as swi, debug bin, etc, will not be built.
2021-05-12 15:25:36 +08:00
liushilongbuaa
fc08716732 [CI] Use the same template in PR-build pipeline and official-build pipeline
Signed-off-by: liushilongbuaa <shilongliu@microsoft.com>
2021-05-12 15:25:36 +08:00
Shilong Liu
ffdddf7346 [CI] Support 202012 branch generating version files for reproducible build (#7303)
for reproducible build azp

Signed-off-by: Shilong Liu <shilongliu@microsoft.com>
2021-05-12 15:25:36 +08:00
liushilongbuaa
14c79ee9f0 [CI] Fix pipeline template artifacts struct bug and cleanup bug
Signed-off-by: liushilongbuaa <shilongliu@microsoft.com>
2021-05-12 15:25:36 +08:00
xumia
d686a9a259 [ci] Fix no enough space issue in docker root in multi arch build (#7321)
Why I did it
Failed to build the centec-arm64 for no space in docker data root.

How I did it
Change to use the data root to the folder under /data.
See detail info about DOCKER_DATA_ROOT_FOR_MULTIARCH in the file Makefile.work.

How to verify it
Set the environment variable, then the /data used as docker root.
2021-05-12 15:25:36 +08:00
xumia
122ac7791a Fix the SONiC version starting with HEAD issue (#7319) 2021-05-12 15:25:36 +08:00
xumia
3653cb0d70 [ci] Fix the wrong build options issue (#7287)
Why I did it
Fix the wrong build options and improve display name for some tasks
1. Fix the wrong build architecture for arm64 and armhf
2. Fix the build timeout parameter
2021-05-12 15:25:36 +08:00
Shilong Liu
9ffc5466d0 [CI] Fix the wrong environment variable PLATFORM passing into the slave container issue (#7262)
* fix

* fix
2021-05-12 15:25:36 +08:00
Shilong Liu
2f794be816 [CI] Update sonic image official build template. (#7198)
* draft

* fix

* Update new-build-template.yml

* Update azure-pipelines-build.yml

* Update azure-pipelines-job-groups.yml

* fix

* fix

* draft

* fix

* fix

* fix

* fix

* fix

* fix

Signed-off-by: Shilong Liu <shilongliu@microsoft.com>

Co-authored-by: Sonic Automation <svc-acs@microsoft.com>
2021-05-12 15:25:36 +08:00
lguohan
2e7c3286da [ci]: get build logs when build fails (#7219)
Signed-off-by: Guohan Lu <lguohan@gmail.com>
2021-05-12 15:25:36 +08:00
lguohan
cde88ce946 [ci]: gzip the vm image disk and memdmp (#7131)
Signed-off-by: Guohan Lu <lguohan@gmail.com>
2021-05-12 15:25:36 +08:00
yozhao101
f0bbd9d1e9
[Monit] Install the unit of generate_monit_config.service. (#7558)
Signed-off-by: Yong Zhao yozhao@microsoft.com

Why I did it
The service file generate_monit_config.service is used to generate the Monit configuration file from template. I also should install this service file and enable it.

How I did it
I appended this service file name at the end of /etc/sonic/generated_services.conf.

How to verify it
I verified this on the device str2-7260cx3-acs-1.

Which release branch to backport (provide reason below if selected)
 201811
[x ] 201911
 202006
 202012
2021-05-07 13:37:09 -07:00
Junchao-Mellanox
e592984991
[Mellanox] [201911] Support new sensor conf file for MSN4700 A1/A0 (#7536)
#### Why I did it

MSN4700 A1/A0 used different sensor chip but keep the existing platform name *x86_64-mlnx_msn4700-r0*, this is a workaround to replace the sensor conf on MSN4700 A1/A0

#### How I did it

Use a shell script to get the sensor conf path and copy that files to /etc/sensors.d/sensors.conf
2021-05-06 15:01:35 -07:00
Abhishek Dosi
dd4e18ded0 [Submodule update] sonic-utilities
d898b03e4ec91f964f0e1fcba535ea33a78c838e (HEAD -> 201911, origin/201911) Create mappings using existing tunnel (#1593)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-05-05 22:26:17 -07:00
Abhishek Dosi
89d7299203 [Submodule update] sonic-sairedis
e438b0db6a8912b50f7acddf93d4dc2157f53ecf (HEAD -> 201911, origin/201911) Increase Syncd operation timeout from 1 min to 6 min. (#828)
17974adb369111b44dd56837547806918ed4b1ed Update syncd_flex_counter.cpp (#798)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-05-05 07:47:01 -07:00
anish-n
4c9d3599bc
Add downstreamsubrole parsing to minigraph.py (#7193) (#7514)
Add downstreamsubrole parsing to minigraph.py So that downstreamsubrole values can be used for policies. Backport PR, same as #7193
2021-05-05 07:43:25 -07:00
Abhishek Dosi
49cfed2916 [submodule update] sonic-swss
dd01491e4d167993b3a80517f737188151443a75 (HEAD -> 201911, origin/201911) [Monitor Vlan] Fix a typo in hostif (#1722)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-05-04 19:32:25 -07:00
Abhishek Dosi
459b0f0c2f [submodule update] sonic-utilities
1f249282e8066a5837f2b34478eb4e0f6b4a654c (HEAD -> 201911, origin/201911) [201911] soft-reboot - support ssd_fw_update  (#1518)
30a3cb3c085a7f208a44b58060ba797e4299214a [route_check] Filter out VNET routes (#1582)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-05-04 19:27:39 -07:00
Santhosh Kumar T
6204a1d809
[201911] DellEMC S6100 SSD Monitor additional changes (#7291)
Why I did it
Added soft-reboot plugin support.
Added SSD version s16425cq check
Added error message to display in console/SSH in case reboot is called in faulty/non-upgraded devices.
2021-05-04 09:48:04 -07:00
Abhishek Dosi
39e978bf75 [submodule upate] sonic-utilities
[201911]: add show bgp neigh/network support for multi asic (#1587)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-05-03 19:03:10 -07:00
Andriy Yurkiv
4ff9f29350 [devices][hwsku] add support to VXLAN src port range feature (#7394)
Enable VXLAN src port range configuration via SAI profile
2021-05-03 18:30:41 -07:00
Stephen Sun
9a163fe87b
Install haveged package on 201911/stretch to accelerate the entropy collect process (#7308)
Recently, we found on some of our testbeds the entropy collecting process finishes more than 60 seconds after system started.
This results in swss not able to start sporadically.
To install haveged can accelerate the entropy collect process.

Signed-off-by: Stephen Sun <stephens@nvidia.com>
2021-05-02 08:16:24 -07:00
Junchao-Mellanox
6128ff6def
[Mellanox] [201911] Upgrade hw-mgmt to 7.0100.2303 (#7418)
- Update hw-mgmt pointer
- Remove unused patches
- Fix existing patch to make sure it apply successfully
2021-05-01 10:30:51 -07:00
Volodymyr Samotiy
efe0515519
[Mellanox] [201911] Update SAI submodule pointer (#7498)
* Set monitoring VLAN hostif up dy default (for VNET ping tool)

Signed-off-by: Volodymyr Samotiy <volodymyrs@nvidia.com>
2021-05-01 10:23:57 -07:00
Junchao-Mellanox
afbdcf257a
[submodule] [201911] Update submodule for sonic-linux-kernel (#7491)
20e1589 [Mellanox] [201911] backport kernel patches for hw-management 7.0100.2303 (#210)
2021-04-30 12:10:47 -07:00
abdosi
55cd1648c4
[201911] [sonic-slave-jessie]: use pip to install nose package directly (#7501)
Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-30 12:09:40 -07:00
Dror Prital
b773412a96
[mellanox]: Integrate SAI version 1.18.3.1 into 201911 branch (#7426)
- Fix ACL ANY debug counter to correctly track ACL drops
- Add VXLAN source port hard coded range, controlled by K/V

Signed-off-by: Dror Prital <drorp@nvidia.com>
2021-04-28 23:01:58 -07:00
yozhao101
a8d2d0b5cd
[201911][Monit] Monitor critical processes in PMon contianer. (#7438)
Signed-off-by: Yong Zhao yozhao@microsoft.com

Why I did it
This PR aims to monitor the critical processes in PMon container by Monit in 201911 branch.

How I did it
I created a template configuration file of Monit and it will be rendered to generate Monit configuration file of PMon container
by a service generate_monit_config.service.

How to verify it
I verified this on a Mellanox device str-msn2700-03 and an Arista device str-a7050-acs-1.

Which release branch to backport (provide reason below if selected)
 201811
[x ] 201911
 202006
 202012
2021-04-28 17:12:21 -07:00
Kebo Liu
80f0836643 [Mellanox] Update SDK to 4.4.2522 and FW to 2008.2520 (#7391)
New features and fixes in the new SDK/FW:

SN4600C | AN/LT support
SN2700 | AN/LT bugs fixes
WJH | FID_MISS support

Signed-off-by: Kebo Liu <kebol@nvidia.com>
2021-04-28 16:06:37 -07:00
abdosi
c6c8c934e1
[buildfix-201911] Fix the snmp docker build error. (#7452)
Issue is get_pip.py is moved to pip 21.1 (https://github.com/pypa/get-pip/commits/main) which is not compatible with 3.6.
Issue of pip itself is fixed as part of 21.1.1 in pip community (pypa/pip#9835).
However get-pip.py is still not updated to latest pip. Also get.pip.py does not support python 3.6 version explicitly (pypa/get-pip#88)

Step 15/29 : RUN curl https://bootstrap.pypa.io/get-pip.py | python3.6
 ---> Running in bece31f49267
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed

  0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0
100 1891k  100 1891k    0     0  9564k      0 --:--:-- --:--:-- --:--:-- 9600k
Traceback (most recent call last):
  File "<stdin>", line 24298, in <module>
  File "<stdin>", line 139, in main
  File "<stdin>", line 115, in bootstrap
  File "<stdin>", line 96, in monkeypatch_for_cert
  File "/tmp/tmp5fnxrz0a/pip.zip/pip/_internal/commands/__init__.py", line 9, in <module>
  File "/tmp/tmp5fnxrz0a/pip.zip/pip/_internal/cli/base_command.py", line 12, in <module>
  File "/tmp/tmp5fnxrz0a/pip.zip/pip/_internal/cli/cmdoptions.py", line 30, in <module>
  File "/tmp/tmp5fnxrz0a/pip.zip/pip/_internal/utils/hashes.py", line 2, in <module>
ImportError: cannot import name 'NoReturn'
The command '/bin/sh -c curl https://bootstrap.pypa.io/get-pip.py | python3.6' returned a non-zero code: 1
How I did:

Got the file from https://github.com/pypa/get-pip/tree/21.0 and added to the buildimage
pin pip to the previous release 21.0.1. (Similar is done in other public repos eg: grpc/grpc-java#8115)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-28 01:28:55 -07:00
Abhishek Dosi
a754e4b178 [Submodule update] sonic-utilities
[techsupport] Update show ip interface command (#1562)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-23 15:29:25 -07:00
madhanmellanox
da6cf08917
[201911] Created a new 3800 SKU - D28C49S1 (#7312) 2021-04-23 14:48:50 -07:00
Qi Luo
e0f2d9a93b
[submodule] Update submodule of sonic-utilities (#7405)
a364614 2021-04-22 | [201911][acl] Use a list instead of a comma-separated string for ACL port list (#1576) [Danny Allen]
391e524 2021-04-15 | [201911] Fix Multi-ASIC show specific resursive route (#1563) [gechiang]
2021-04-23 09:49:33 -07:00
yozhao101
aeae87d1b5
[201911][Monit] Use VLAN name to differentiate each Monit service of dhcp_relay (#7378)
#### Why I did it
Since we will have multiple `dhcrelay` processes if there exists different VLANs in the table `VLAN_INTERFACE` of `CONIFG_DB`, 
we should use unique service name for each `dhcrelay` process in Monit configuration file. Otherwise, Monit service will fail to work.

#### How I did it
I append the VLAN name to the end of each service name such that they are unique.

Signed-off-by: Yong Zhao <yozhao@microsoft.com>
2021-04-22 18:04:29 -07:00
Aravind Mani
89adef3032
[201911] Dell S6100: Modify transceiver change event from interrupt to poll mode (#7334)
#### Why I did it

- xcvrd crash was seen in latest 201811 images.
- For Dell S6100,API 2.0 uses poll mode while 1.0 was still using interrupt mode.

#### How I did it

- Modified get_transceiver_change_event in 1.0 to poll mode in all the related branches.

Backport of https://github.com/Azure/sonic-buildimage/pull/7309 to the 201911 branch
2021-04-20 15:23:48 -07:00
yozhao101
528543bc6a
[201911][Monit] Monitor critical processes in radv and dhcp_relay containers. (#7340)
Signed-off-by: Yong Zhao yozhao@microsoft.com

Why I did it
This PR aims to monitor critical processes in router advertiser and dhcp_relay containers by Monit.

How I did it
Router advertiser container only ran on T0 device and the T0 device should have at least one VLAN interface
which was configured an IPv6 address. At the same time, router advertiser container will not run on devices of which
the deployment type is 8.

As such, I created a service which will dynamically generate Monit configuration file of router advertiser from a
template.

Similarly Monit configuration file of dhcp_relay was also generated from a template since the number of dhcrelay process in dhcp_relay container is depended on number of VLANs.

How to verify it
I verified this implementation on a DuT.
2021-04-16 08:40:06 -07:00
Abhishek Dosi
b375053a36 [submodule update] sonic-utilities
Fix Bad Merge

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-12 15:34:54 -07:00
Abhishek Dosi
16d9181d4a [submodule update] sonic-utilities
Fix show interface status Ethernet* (#1559)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-12 15:17:18 -07:00
Abhishek Dosi
7f80287dec [submodule update] sonic-utilities
209b7ddec109587ddeb90071ca23ae6a288b1442 (HEAD -> 201911, origin/201911) Fixed the possibility of using uninitialized variable in route_check.py (#1551)
e30387cbebaaccbf9385059b1e501955c40be338 route_check: Fix hanging & logging level (#1520)
3c8de6950615a4608a80e3d47ea678f8e8487186 Add self timeout and crash if exceeded. (#1502)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-12 15:15:03 -07:00
abdosi
feb7121d9e
[201911] Fix easy_install error when installing pip (#7272)
see below error:

+ sudo https_proxy= LANG=C chroot ./fsroot easy_install pip==20.3.3
Searching for pip==20.3.3
Reading https://pypi.python.org/simple/pip/
Couldn't find index page for 'pip' (maybe misspelled?)
Scanning index of all packages (this may take a while)
Reading https://pypi.python.org/simple/
No local packages or working download links found for pip==20.3.3
error: Could not find suitable distribution for Requirement.parse('pip==20.3.3')

How I fix:

Install python-pip via apt-get
Pin the version to 20.3.3
Master has same changes.

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-08 18:02:31 -07:00
judyjoseph
b9f8348a5d Fixes for errors seen in staging devices (#7171)
With the latest 201911 image, the following error was seen on staging devices with TSB command ( for both single asic, multi asic ). Though this err message doesn't affect the TSB functionality, it is good to fix.

admin@STG01-0101-0102-01T1:~$ TSB
BGP0 : % Could not find route-map entry TO_TIER0_V4 20
line 1: Failure to communicate[13] to zebra, line: no route-map TO_TIER0_V4 permit 20
% Could not find route-map entry TO_TIER0_V4 30
line 2: Failure to communicate[13] to zebra, line: no route-map TO_TIER0_V4 deny 30

In addition, in this PR I am fixing the message displayed to user when there are no BGP neighbors configured on that BGP instance. In multi-asic device there could be case where there are no BGP neighbors configured on a particular ASIC.
2021-04-08 15:30:46 -07:00
Abhishek Dosi
27600d9ccb [Submodule update] sonic-swss
5ff3d8c3cdc7516398f3c80f8cc3cd9ee02ef660 (HEAD -> 201911, origin/201911)
[vlan] Add support of VLAN host interface (#1645)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-08 15:30:11 -07:00