Commit Graph

3763 Commits

Author SHA1 Message Date
xumia
122ac7791a Fix the SONiC version starting with HEAD issue (#7319) 2021-05-12 15:25:36 +08:00
xumia
3653cb0d70 [ci] Fix the wrong build options issue (#7287)
Why I did it
Fix the wrong build options and improve display name for some tasks
1. Fix the wrong build architecture for arm64 and armhf
2. Fix the build timeout parameter
2021-05-12 15:25:36 +08:00
Shilong Liu
9ffc5466d0 [CI] Fix the wrong environment variable PLATFORM passing into the slave container issue (#7262)
* fix

* fix
2021-05-12 15:25:36 +08:00
Shilong Liu
2f794be816 [CI] Update sonic image official build template. (#7198)
* draft

* fix

* Update new-build-template.yml

* Update azure-pipelines-build.yml

* Update azure-pipelines-job-groups.yml

* fix

* fix

* draft

* fix

* fix

* fix

* fix

* fix

* fix

Signed-off-by: Shilong Liu <shilongliu@microsoft.com>

Co-authored-by: Sonic Automation <svc-acs@microsoft.com>
2021-05-12 15:25:36 +08:00
lguohan
2e7c3286da [ci]: get build logs when build fails (#7219)
Signed-off-by: Guohan Lu <lguohan@gmail.com>
2021-05-12 15:25:36 +08:00
lguohan
cde88ce946 [ci]: gzip the vm image disk and memdmp (#7131)
Signed-off-by: Guohan Lu <lguohan@gmail.com>
2021-05-12 15:25:36 +08:00
yozhao101
f0bbd9d1e9
[Monit] Install the unit of generate_monit_config.service. (#7558)
Signed-off-by: Yong Zhao yozhao@microsoft.com

Why I did it
The service file generate_monit_config.service is used to generate the Monit configuration file from template. I also should install this service file and enable it.

How I did it
I appended this service file name at the end of /etc/sonic/generated_services.conf.

How to verify it
I verified this on the device str2-7260cx3-acs-1.

Which release branch to backport (provide reason below if selected)
 201811
[x ] 201911
 202006
 202012
2021-05-07 13:37:09 -07:00
Junchao-Mellanox
e592984991
[Mellanox] [201911] Support new sensor conf file for MSN4700 A1/A0 (#7536)
#### Why I did it

MSN4700 A1/A0 used different sensor chip but keep the existing platform name *x86_64-mlnx_msn4700-r0*, this is a workaround to replace the sensor conf on MSN4700 A1/A0

#### How I did it

Use a shell script to get the sensor conf path and copy that files to /etc/sensors.d/sensors.conf
2021-05-06 15:01:35 -07:00
Abhishek Dosi
dd4e18ded0 [Submodule update] sonic-utilities
d898b03e4ec91f964f0e1fcba535ea33a78c838e (HEAD -> 201911, origin/201911) Create mappings using existing tunnel (#1593)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-05-05 22:26:17 -07:00
Abhishek Dosi
89d7299203 [Submodule update] sonic-sairedis
e438b0db6a8912b50f7acddf93d4dc2157f53ecf (HEAD -> 201911, origin/201911) Increase Syncd operation timeout from 1 min to 6 min. (#828)
17974adb369111b44dd56837547806918ed4b1ed Update syncd_flex_counter.cpp (#798)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-05-05 07:47:01 -07:00
anish-n
4c9d3599bc
Add downstreamsubrole parsing to minigraph.py (#7193) (#7514)
Add downstreamsubrole parsing to minigraph.py So that downstreamsubrole values can be used for policies. Backport PR, same as #7193
2021-05-05 07:43:25 -07:00
Abhishek Dosi
49cfed2916 [submodule update] sonic-swss
dd01491e4d167993b3a80517f737188151443a75 (HEAD -> 201911, origin/201911) [Monitor Vlan] Fix a typo in hostif (#1722)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-05-04 19:32:25 -07:00
Abhishek Dosi
459b0f0c2f [submodule update] sonic-utilities
1f249282e8066a5837f2b34478eb4e0f6b4a654c (HEAD -> 201911, origin/201911) [201911] soft-reboot - support ssd_fw_update  (#1518)
30a3cb3c085a7f208a44b58060ba797e4299214a [route_check] Filter out VNET routes (#1582)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-05-04 19:27:39 -07:00
Santhosh Kumar T
6204a1d809
[201911] DellEMC S6100 SSD Monitor additional changes (#7291)
Why I did it
Added soft-reboot plugin support.
Added SSD version s16425cq check
Added error message to display in console/SSH in case reboot is called in faulty/non-upgraded devices.
2021-05-04 09:48:04 -07:00
Abhishek Dosi
39e978bf75 [submodule upate] sonic-utilities
[201911]: add show bgp neigh/network support for multi asic (#1587)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-05-03 19:03:10 -07:00
Andriy Yurkiv
4ff9f29350 [devices][hwsku] add support to VXLAN src port range feature (#7394)
Enable VXLAN src port range configuration via SAI profile
2021-05-03 18:30:41 -07:00
Stephen Sun
9a163fe87b
Install haveged package on 201911/stretch to accelerate the entropy collect process (#7308)
Recently, we found on some of our testbeds the entropy collecting process finishes more than 60 seconds after system started.
This results in swss not able to start sporadically.
To install haveged can accelerate the entropy collect process.

Signed-off-by: Stephen Sun <stephens@nvidia.com>
2021-05-02 08:16:24 -07:00
Junchao-Mellanox
6128ff6def
[Mellanox] [201911] Upgrade hw-mgmt to 7.0100.2303 (#7418)
- Update hw-mgmt pointer
- Remove unused patches
- Fix existing patch to make sure it apply successfully
2021-05-01 10:30:51 -07:00
Volodymyr Samotiy
efe0515519
[Mellanox] [201911] Update SAI submodule pointer (#7498)
* Set monitoring VLAN hostif up dy default (for VNET ping tool)

Signed-off-by: Volodymyr Samotiy <volodymyrs@nvidia.com>
2021-05-01 10:23:57 -07:00
Junchao-Mellanox
afbdcf257a
[submodule] [201911] Update submodule for sonic-linux-kernel (#7491)
20e1589 [Mellanox] [201911] backport kernel patches for hw-management 7.0100.2303 (#210)
2021-04-30 12:10:47 -07:00
abdosi
55cd1648c4
[201911] [sonic-slave-jessie]: use pip to install nose package directly (#7501)
Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-30 12:09:40 -07:00
Dror Prital
b773412a96
[mellanox]: Integrate SAI version 1.18.3.1 into 201911 branch (#7426)
- Fix ACL ANY debug counter to correctly track ACL drops
- Add VXLAN source port hard coded range, controlled by K/V

Signed-off-by: Dror Prital <drorp@nvidia.com>
2021-04-28 23:01:58 -07:00
yozhao101
a8d2d0b5cd
[201911][Monit] Monitor critical processes in PMon contianer. (#7438)
Signed-off-by: Yong Zhao yozhao@microsoft.com

Why I did it
This PR aims to monitor the critical processes in PMon container by Monit in 201911 branch.

How I did it
I created a template configuration file of Monit and it will be rendered to generate Monit configuration file of PMon container
by a service generate_monit_config.service.

How to verify it
I verified this on a Mellanox device str-msn2700-03 and an Arista device str-a7050-acs-1.

Which release branch to backport (provide reason below if selected)
 201811
[x ] 201911
 202006
 202012
2021-04-28 17:12:21 -07:00
Kebo Liu
80f0836643 [Mellanox] Update SDK to 4.4.2522 and FW to 2008.2520 (#7391)
New features and fixes in the new SDK/FW:

SN4600C | AN/LT support
SN2700 | AN/LT bugs fixes
WJH | FID_MISS support

Signed-off-by: Kebo Liu <kebol@nvidia.com>
2021-04-28 16:06:37 -07:00
abdosi
c6c8c934e1
[buildfix-201911] Fix the snmp docker build error. (#7452)
Issue is get_pip.py is moved to pip 21.1 (https://github.com/pypa/get-pip/commits/main) which is not compatible with 3.6.
Issue of pip itself is fixed as part of 21.1.1 in pip community (pypa/pip#9835).
However get-pip.py is still not updated to latest pip. Also get.pip.py does not support python 3.6 version explicitly (pypa/get-pip#88)

Step 15/29 : RUN curl https://bootstrap.pypa.io/get-pip.py | python3.6
 ---> Running in bece31f49267
  % Total    % Received % Xferd  Average Speed   Time    Time     Time  Current
                                 Dload  Upload   Total   Spent    Left  Speed

  0     0    0     0    0     0      0      0 --:--:-- --:--:-- --:--:--     0
100 1891k  100 1891k    0     0  9564k      0 --:--:-- --:--:-- --:--:-- 9600k
Traceback (most recent call last):
  File "<stdin>", line 24298, in <module>
  File "<stdin>", line 139, in main
  File "<stdin>", line 115, in bootstrap
  File "<stdin>", line 96, in monkeypatch_for_cert
  File "/tmp/tmp5fnxrz0a/pip.zip/pip/_internal/commands/__init__.py", line 9, in <module>
  File "/tmp/tmp5fnxrz0a/pip.zip/pip/_internal/cli/base_command.py", line 12, in <module>
  File "/tmp/tmp5fnxrz0a/pip.zip/pip/_internal/cli/cmdoptions.py", line 30, in <module>
  File "/tmp/tmp5fnxrz0a/pip.zip/pip/_internal/utils/hashes.py", line 2, in <module>
ImportError: cannot import name 'NoReturn'
The command '/bin/sh -c curl https://bootstrap.pypa.io/get-pip.py | python3.6' returned a non-zero code: 1
How I did:

Got the file from https://github.com/pypa/get-pip/tree/21.0 and added to the buildimage
pin pip to the previous release 21.0.1. (Similar is done in other public repos eg: grpc/grpc-java#8115)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-28 01:28:55 -07:00
Abhishek Dosi
a754e4b178 [Submodule update] sonic-utilities
[techsupport] Update show ip interface command (#1562)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-23 15:29:25 -07:00
madhanmellanox
da6cf08917
[201911] Created a new 3800 SKU - D28C49S1 (#7312) 2021-04-23 14:48:50 -07:00
Qi Luo
e0f2d9a93b
[submodule] Update submodule of sonic-utilities (#7405)
a364614 2021-04-22 | [201911][acl] Use a list instead of a comma-separated string for ACL port list (#1576) [Danny Allen]
391e524 2021-04-15 | [201911] Fix Multi-ASIC show specific resursive route (#1563) [gechiang]
2021-04-23 09:49:33 -07:00
yozhao101
aeae87d1b5
[201911][Monit] Use VLAN name to differentiate each Monit service of dhcp_relay (#7378)
#### Why I did it
Since we will have multiple `dhcrelay` processes if there exists different VLANs in the table `VLAN_INTERFACE` of `CONIFG_DB`, 
we should use unique service name for each `dhcrelay` process in Monit configuration file. Otherwise, Monit service will fail to work.

#### How I did it
I append the VLAN name to the end of each service name such that they are unique.

Signed-off-by: Yong Zhao <yozhao@microsoft.com>
2021-04-22 18:04:29 -07:00
Aravind Mani
89adef3032
[201911] Dell S6100: Modify transceiver change event from interrupt to poll mode (#7334)
#### Why I did it

- xcvrd crash was seen in latest 201811 images.
- For Dell S6100,API 2.0 uses poll mode while 1.0 was still using interrupt mode.

#### How I did it

- Modified get_transceiver_change_event in 1.0 to poll mode in all the related branches.

Backport of https://github.com/Azure/sonic-buildimage/pull/7309 to the 201911 branch
2021-04-20 15:23:48 -07:00
yozhao101
528543bc6a
[201911][Monit] Monitor critical processes in radv and dhcp_relay containers. (#7340)
Signed-off-by: Yong Zhao yozhao@microsoft.com

Why I did it
This PR aims to monitor critical processes in router advertiser and dhcp_relay containers by Monit.

How I did it
Router advertiser container only ran on T0 device and the T0 device should have at least one VLAN interface
which was configured an IPv6 address. At the same time, router advertiser container will not run on devices of which
the deployment type is 8.

As such, I created a service which will dynamically generate Monit configuration file of router advertiser from a
template.

Similarly Monit configuration file of dhcp_relay was also generated from a template since the number of dhcrelay process in dhcp_relay container is depended on number of VLANs.

How to verify it
I verified this implementation on a DuT.
2021-04-16 08:40:06 -07:00
Abhishek Dosi
b375053a36 [submodule update] sonic-utilities
Fix Bad Merge

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-12 15:34:54 -07:00
Abhishek Dosi
16d9181d4a [submodule update] sonic-utilities
Fix show interface status Ethernet* (#1559)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-12 15:17:18 -07:00
Abhishek Dosi
7f80287dec [submodule update] sonic-utilities
209b7ddec109587ddeb90071ca23ae6a288b1442 (HEAD -> 201911, origin/201911) Fixed the possibility of using uninitialized variable in route_check.py (#1551)
e30387cbebaaccbf9385059b1e501955c40be338 route_check: Fix hanging & logging level (#1520)
3c8de6950615a4608a80e3d47ea678f8e8487186 Add self timeout and crash if exceeded. (#1502)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-12 15:15:03 -07:00
abdosi
feb7121d9e
[201911] Fix easy_install error when installing pip (#7272)
see below error:

+ sudo https_proxy= LANG=C chroot ./fsroot easy_install pip==20.3.3
Searching for pip==20.3.3
Reading https://pypi.python.org/simple/pip/
Couldn't find index page for 'pip' (maybe misspelled?)
Scanning index of all packages (this may take a while)
Reading https://pypi.python.org/simple/
No local packages or working download links found for pip==20.3.3
error: Could not find suitable distribution for Requirement.parse('pip==20.3.3')

How I fix:

Install python-pip via apt-get
Pin the version to 20.3.3
Master has same changes.

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-08 18:02:31 -07:00
judyjoseph
b9f8348a5d Fixes for errors seen in staging devices (#7171)
With the latest 201911 image, the following error was seen on staging devices with TSB command ( for both single asic, multi asic ). Though this err message doesn't affect the TSB functionality, it is good to fix.

admin@STG01-0101-0102-01T1:~$ TSB
BGP0 : % Could not find route-map entry TO_TIER0_V4 20
line 1: Failure to communicate[13] to zebra, line: no route-map TO_TIER0_V4 permit 20
% Could not find route-map entry TO_TIER0_V4 30
line 2: Failure to communicate[13] to zebra, line: no route-map TO_TIER0_V4 deny 30

In addition, in this PR I am fixing the message displayed to user when there are no BGP neighbors configured on that BGP instance. In multi-asic device there could be case where there are no BGP neighbors configured on a particular ASIC.
2021-04-08 15:30:46 -07:00
Abhishek Dosi
27600d9ccb [Submodule update] sonic-swss
5ff3d8c3cdc7516398f3c80f8cc3cd9ee02ef660 (HEAD -> 201911, origin/201911)
[vlan] Add support of VLAN host interface (#1645)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-08 15:30:11 -07:00
Abhishek Dosi
0f28dd84b4 [submodule update] sonic-dbsyncd
4a497407c8697a8c531ab999da95936ac1e71c9b (HEAD -> 201911, origin/201911) Fix the LLDP_LOC_CHASSIS not getting populated if no remote neighbors are present (#39)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-08 10:00:07 -07:00
lguohan
1b92fc8e1b [sonic-slave-stretch]: use pip to install nose package directly (#7257)
Fix #7248

Issue is similiar to martinblech/xmltodict#47

The correct solution is to change mockredispy to move
nose from setup_requirement to test_requirement.

The quick workaround is to install nose explicitly.

fix build issue:

05:09:37    Downloading mockredispy-2.9.3.tar.gz (17 kB)
05:09:39  ?[91m    ERROR: Command errored out with exit status 1:
05:09:39       command: /usr/bin/python3 -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'/tmp/pip-install-sypos2ry/mockredispy_ab86cd14995544df90f78a63ab7041a3/setup.py'"'"'; __file__='"'"'/tmp/pip-install-sypos2ry/mockredispy_ab86cd14995544df90f78a63ab7041a3/setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(__file__);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, __file__, '"'"'exec'"'"'))' egg_info --egg-base /tmp/pip-pip-egg-info-ymhn19ne
05:09:39           cwd: /tmp/pip-install-sypos2ry/mockredispy_ab86cd14995544df90f78a63ab7041a3/
05:09:39      Complete output (23 lines):
05:09:39      Couldn't find index page for 'nose' (maybe misspelled?)
05:09:39      No local packages or working download links found for nose
05:09:39      Traceback (most recent call last):
05:09:39        File "<string>", line 1, in <module>
05:09:39        File "/tmp/pip-install-sypos2ry/mockredispy_ab86cd14995544df90f78a63ab7041a3/setup.py", line 29, in <module>
05:09:39          'with_redis = mockredis.noseplugin:WithRedis'
05:09:39        File "/usr/lib/python3.5/distutils/core.py", line 108, in setup
05:09:39          _setup_distribution = dist = klass(attrs)
05:09:39        File "/usr/lib/python3/dist-packages/setuptools/dist.py", line 317, in __init__
05:09:39          self.fetch_build_eggs(attrs['setup_requires'])
05:09:39        File "/usr/lib/python3/dist-packages/setuptools/dist.py", line 372, in fetch_build_eggs
05:09:39          replace_conflicting=True,
05:09:39        File "/usr/lib/python3/dist-packages/pkg_resources/__init__.py", line 846, in resolve
05:09:39          dist = best[req.key] = env.best_match(req, ws, installer)
05:09:39        File "/usr/lib/python3/dist-packages/pkg_resources/__init__.py", line 1118, in best_match
05:09:39          return self.obtain(req, installer)
05:09:39        File "/usr/lib/python3/dist-packages/pkg_resources/__init__.py", line 1130, in obtain
05:09:39          return installer(requirement)
05:09:39        File "/usr/lib/python3/dist-packages/setuptools/dist.py", line 440, in fetch_build_egg
05:09:39          return cmd.easy_install(req)
05:09:39        File "/usr/lib/python3/dist-packages/setuptools/command/easy_install.py", line 693, in easy_install
05:09:39          raise DistutilsError(msg)
05:09:39      distutils.errors.DistutilsError: Could not find suitable distribution for Requirement.parse('nose')
05:09:39      ----------------------------------------

Signed-off-by: Guohan Lu <lguohan@gmail.com>
2021-04-08 09:56:32 -07:00
Tamer Ahmed
86ea554d4a [radv] Fix Script Name Change (#7254)
PR https://github.com/Azure/sonic-buildimage/pull/4599 changed startup
script name from wait_for_intf.sh.j2 to wait_for_link.sh.j2, however
when PR https://github.com/Azure/sonic-buildimage/pull/5178 was cherry-
picked, the script name was not changed to wait_for_link.sh.

signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>
2021-04-08 09:56:31 -07:00
Renuka Manavalan
2276e2de25 [baseimage]: specify gid for redis group. (#7249)
Problem:
Default groupadd for redis, takes 1000 by default. This forces, subsequently created admin group to get 1001.
As all TACACS users are created with 1000 as their gid, they end up in redis group.

Fix:
Create redis group *after* admin group is created
Add a check that admin group id is 1000
2021-04-08 09:42:29 -07:00
xumia
31849a5f02 Fix CVE-2017-1000487 security alert (#7173)
#### Why I did it
Plexus-utils before 3.0.16 is vulnerable to command injection because it does not correctly process the contents of double quoted strings.

#### How I did it
Upgrade to 3.0.16
2021-04-07 18:09:56 -07:00
Nazarii Hnydyn
8eb26cf18c [install.sh] Fix CPU cstates configuration. (#7222)
The motivation of these changes is to fix (#6051):

- Why I did it
To fix CPU cstates configuration

- How I did it
Updated code to be POSIX compatible

- How to verify it
root@sonic:/home/admin# sonic_installer install sonic-mellanox.bin

Signed-off-by: Nazarii Hnydyn <nazariig@nvidia.com>
2021-04-07 18:08:41 -07:00
Sumukha Tumkur Vani
b70e47f2c8 [submodule]: Update restapi (#7213)
Feb 17 Fix tests failing due to duplicate vxlan tunnel creation (#75)
Mar 11 Update route api to specify limitation (#77)
Apr 01 Add host_ifname field while adding entry in VLAN table (#80)
2021-04-07 18:07:23 -07:00
Volodymyr Samotiy
d35c31b8fc [Mellanox] Update SDK to 4.4.2508 and FW to xx.2008.2508 (#7141)
Fix the following issues:

Spectrum-2, Spectrum-3 | Port | Fix link issue when using 25 GbE rate between two ports while one is on Spectrum-2-based system and the other is on Spectrum-3-based system
All | warmboot | fail to upgrade from earlier SONiC versions with official SDK/FW 4.4.2306 (was on SONiC 201911)
All | What-Just-Happened | When enabling or disabling WJH under high traffic load to the host CPU, in very specific and low probability conditions, an error could occur, that may result in loss of data, channel failure or in extreme cases SW failure

Signed-off-by: Volodymyr Samotiy <volodymyrs@nvidia.com>
2021-04-07 18:06:46 -07:00
abdosi
16b97f1859 [multi-asic] Make sure Everflow tables always gets classified as Mirror table (#7241)
Make sure Everflow always gets classified as Mirror table and not as Control Plane on multi-asic platforms.

Why I did:
In Multi-asic platforms we generate Everflow acl table data from minigraph for both host and namespace.
It is possible in multi-asic minigraph if there are no external port-channel (Only Router Port IP Interface) then Everflow table will have no binded interface in host and will gets classified as Control Plane ACL while in namespace gets classified as Mirror Table.
For ACL Rule generation we read global db as source of truth for acl table information and so for everflow rule generation if tables gets classified as Control plane we can generate rules with invalid action causing orchagent to throw runtime error.

How I did:
If the table is attach to erspan interface in minigraph then it always gets classified as mirror table.
2021-04-07 15:16:02 -07:00
roman_savchuk
840f19af18
[BFN] Updated SAI/SDK packages to 20210405 (#7229)
Updated SDE due to issue in driver part
2021-04-07 14:18:49 -07:00
Qi Luo
5bcd280b16
[submodule] Update submodule of sonic-utilities (#7235)
b39dbbc 2021-04-06 | [fast-reboot] Fix dump script to support PortChannels in a VLAN group [201911] (#1547) [shlomibitton]
2021-04-06 19:07:05 -07:00
Abhishek Dosi
b121b939dd [Submodule update] sonic-utilities
ecc1f9b1bb0ad18843e0f969fe8564cf37bf2080 (HEAD -> 201911, origin/201911)
[acl_loader]: add iptype match to the rules for dataplane acl

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-04-05 18:58:59 -07:00
abdosi
e94bc63057 [multi-asic] optimize api that checks port/port-channel/bgp is internal (#7232)
Optimize the API's is_port_internal() , is_portchannel_internal() and is_bgp_internal ()to call get_entry(Redis hget) instead of get_table (Redis keys *)
2021-04-05 18:58:59 -07:00