Commit Graph

5867 Commits

Author SHA1 Message Date
Judy Joseph
e0d9c02fc5 Update sonic-swss
b9c509d Fix test_warm_reboot issues blocking PR merge (#2309)
2022-06-14 08:50:21 -07:00
Aravind Mani
7bdf9986c9
[202111]DellEMC S5248f platform bug fixes (#11096)
* update files

* Update s5248f_platform.sh

Co-authored-by: Aravind Mani <aravind.m1@dell.com>
2022-06-13 09:31:38 -07:00
Judy Joseph
dc3d2c2263 Update submodules
sonic-utilities
e2dd672 [yang] remove mistakenly added parameter for 'get_module_name' (#2193)
2b12a39 Add check to not allow deleting PO if its member of vlan. (#2141)

sonic-platform-common

309d169 [ssd_generic] Fix innodisk health regex (#287)
2022-06-12 22:25:14 -07:00
xumia
f2c39cfd00
[Bug]: fix the version file name issue (#11073)
Why I did it
[Bug]: fix the version file name issue
The version file version-deb not found, it should be versions-deb
2022-06-09 08:25:09 +08:00
mssonicbld
94e8be646a
[ci/build]: Upgrade SONiC package versions (#10973)
Co-authored-by: mssonicbld <vsts@fv-az48-122.y11my21s2nfuzmiq0sccgy5und.cx.internal.cloudapp.net>
[ci/build]: Upgrade SONiC package versions (#10973)
2022-06-07 14:11:41 +08:00
Aravind Mani
eeac8ce969
[DellEMC]: S52xx platform bug fixes (#10981)
* S52xx bug fixes

* update sfp.py

Co-authored-by: Aravind Mani <aravind.m1@dell.com>
2022-06-06 09:14:32 -07:00
Aravind Mani
a8e1e479a6
[submodule update] sonic-platform-common submodule update (#11011)
Co-authored-by: Aravind Mani <aravind.m1@dell.com>
2022-06-06 09:11:43 -07:00
xumia
146b6328a1
[Build]: Fix the version files for armhf/arm64 not used issue (#11022)
Why I did it
[Build]: Fix the version files in host-base-image for armhf/arm64 not used issue
2022-06-06 19:36:38 +08:00
Myron Sosyak
ca284a1bf2
Fix syncd-rpc build (#10602) 2022-06-01 20:29:49 +08:00
xumia
03532c8ef9
[Build]: Support to use the base image version when a package version not specified (#10971) (#10974)
Why I did it
It is to fix issue: #10952
[Build]: Support to use the base image version when a package version not specified
2022-05-31 10:27:29 +08:00
xumia
5bf840d103 [Build]: Fix pip version constraint conflict issue (#10525)
Why I did it
[Build]: Fix pip version constraint conflict issue
When a version is specified in the constraint file, if upgrading the version in build script, it will have conflict issue.

How I did it
If a specified version has specified in pip command line, then the version constraint will be skipped.
2022-05-30 09:49:53 -07:00
Vivek R
5bd7cf9931
[202111] Update sonic-utilities submodule (#10939)
Update submodule ptr for sonic-utilities to include

[202111] [portchannel] Added ACL/PBH binding checks to the port before getting added to portchannel (#2186)

Signed-off-by: Vivek Reddy Karri <vkarri@nvidia.com>
2022-05-27 09:28:23 -07:00
Judy Joseph
45b4f49eb2 Update sonic-swss, sonic-utilities
swss

081d47b Update netlink messages handler (#2233)
de7c3eb [IntfMgrd] Retry adding ipv6 prefix by setting disabled_ipv6 flag  (#2267)

utilities

2716ff2 [yang] extend ConfigMgmt constructor to pass YANG options (#2118)
9fbe2ef [debug dump] dump interface module added (#2070)
a86da2d Add sonic-delayed.target to Application Extension .timer file generator (#2176)
90611dd [portconfig] Allow to configure interface mtu for physical ports only
2022-05-25 16:36:48 -07:00
Neetha John
0d62aad665 [sonic-config-engine] Change hwsku for sample graph in unit tests (#10889)
#### Why I did it
To ensure that some internal testcases do not break due to external changes

#### How to verify it
Ran test_cfggen.py with the changes and it passed
2022-05-25 16:32:09 -07:00
Lawrence Lee
0a3211ddd2 [scapy]: Patch scapy 2.4.5 for sniffing on intfs (#10644)
Apply scapy fix (https://github.com/secdev/scapy/pull/3240) since it is not available in release yet

Signed-off-by: Lawrence Lee <lawlee@microsoft.com>
2022-05-25 16:32:05 -07:00
xumia
7e5aed28bc [Ci]: Fix to trigger the publish pipeline in failure build issue (#10847)
Why I did it
It is not necessary to trigger the publish pipeline when build is failed.

How I did it
Remove the condition in the azp task, change to use template condition.
2022-05-25 16:32:00 -07:00
kellyyeh
980f6e9015 [dhcp6relay] Fix option parsing and add dhcpv6 client messages (#10819) 2022-05-25 16:31:55 -07:00
Shilong Liu
cd2c522a05
[build] Add dependencies for sonic-config-engine to block bad PR. (#10770) (#10796)
Add dependencies device/*
2022-05-24 14:58:55 +08:00
Shilong Liu
0a15f8177b
[ci] Add arm artifacts in common lib azure pipeline (#10890)
cherry-pick PR from master:
https://github.com/Azure/sonic-buildimage/pull/10817
2022-05-24 14:57:31 +08:00
mssonicbld
bb0c71246d
[ci/build]: Upgrade SONiC package versions (#10906) 2022-05-23 21:40:47 +00:00
Volodymyr Samotiy
1944f309de
[202111] [Mellanox] Update SAI to 1.21.1.1 and SDK/FW to 4.5.2262/xx.2010.2262 (#10881)
- Why I did it
To include latest fixes:
1. Warmboot | When trying to reconfigure the Flex Parser header and Flex transition parameters after ISSU, the switch will returned an error even if the configuration was identical to that done before performing the ISSU.
2. Link Up | When toggling many ports of the Spectrum devices while raising 10GbE link up and link maintenance is enabled, the switch may get stuck and may need to be rebooted.
3. Shared buffer | While moving from lossless to lossy while shared headroom was used, reduction of the shared headroom can only be done prior to pool type change and when shared headroom is not utilized.
4. Added support for Finisar DR4 (FTCD4523E2PCM) on Spectrum-2 and Spectrum-3 systems.

SAI
1. ECMP overlay support for IPv4 and IPv6
2. BFD offloading / 4K scale

SAI fixes
1. Reduce verbosity of print in case packet ingress on invalid port
2. Added support for Host table entry removal API to remove registration of a trap to a channel

- How I did it
Updated SAI & SDK submodules along with the relevant Makefiles

- How to verify it
Build an image and run tests from "sonic-mgmt".

Signed-off-by: Volodymyr Samotiy <volodymyrs@nvidia.com>
2022-05-22 09:51:48 +03:00
Stephen Sun
2aa1c2f437
Fix issue: error message from system-health daemon is observed during system starting (#10843)
- Why I did it
Error message: "ERR healthd: Failed to read from file /var/run/hw-management/led/led_status_capability" is observed during system starting
The system-health daemon will wait for 5 minutes before it starts to run.
During this time, the only thing it does is to set the LED even before it starts.
However, the corresponding sysfs has not been ready at the time it is being read, which causes the error message.

- How I did it
Defer system-health daemon until hw-management service starts

- How to verify it
Run regression test

Signed-off-by: Stephen Sun <stephens@nvidia.com>
2022-05-21 10:13:49 +03:00
ganglv
f58fec5d95
[sonic-cfggen]: Fix broken UT (#10863)
Why I did it
UT for sonic-config-engine is broken.

How I did it
Remove yang validation.

How to verify it
Run UT for sonic-config-engine.

Signed-off-by: Gang Lv ganglv@microsoft.com
2022-05-18 16:44:50 +08:00
Judy Joseph
9eadb98341 Update the submodules
sonic-utilities
0225195 Accept 0 for queue and dscp (#2162)
282faf0 [show][vrf]Fixing show vrf to include vlan subinterface (#2158)
f3f1b11 Validate destination port is not LAG (#2053)

sonic-platform-common
0f6cccd [sonic_ssd] Nokia-7215: "show platform ssdhealth" not showing health percent (#279)
2022-05-15 23:26:43 -07:00
ganglv
6a9ef8c1de [sonic-cfggen]: Update UT to run yang validation (#9700)
Why I did it
Config db schema generated by minigraph should run yang validation.

How I did it
Modify run_script to add yang validation.

How to verify it
Run sonic-config-engine unit test.

Signed-off-by: Gang Lv ganglv@microsoft.com
2022-05-15 23:13:12 -07:00
xumia
77ae1e8198 [Ci] Support to trigger a pipeline to download and publish artifacts to storage (#10820)
Why I did it
Support to trigger a pipeline to download and publish artifacts to storage and container registry.
Support to specify the patterns which docker images to upload.

How I did it
Pass the pipeline information and the artifact information by pipeline parameters to the pipeline which will be triggered a new build. It is to decouple the artifacts generation and the publish logic, how and where the artifacts/docker images will be published, depends on the triggered pipeline.

How to verify it
2022-05-15 23:13:01 -07:00
Vivek R
d628430148 Removed platform specific reboot files for mellanox simx platforms (#10806)
- Why I did it
Platform_reboot files for simx doesn't do aything different apart from calling /sbin/reboot. which is anyway done in the /usr/local/bin/reboot script i.e. the parent script which calls the platform specific reboot scripts if present.

Moreover, /sbin/reboot invoked in the platform specific reboot script is a non-blocking call and thus it returns back to the original script (although /sbin/reboot does it job in the background) and we see messages like this.

Signed-off-by: Vivek Reddy Karri <vkarri@nvidia.com>
2022-05-15 23:12:55 -07:00
kellyyeh
d03ede7ba0 [dhcp_relay] Remove dhcp6mon (#10467) 2022-05-15 23:12:48 -07:00
Saikrishna Arcot
bad29e535b Fix calculation of $(1)_DEP_PKGS_SHA in Makefile.cache (#10764)
In Makefile.cache, for $(1)_DEP_PKGS_SHA, the intention is to include
the DEP_MOD_SHA and MOD_HASH of each of the current package's
dependencies. However, there's a level of dereferencing missing; instead
of grabbing the value of $(dfile)_DEP_MOD_SHA, it is literally using the
variable name $(dfile)_DEP_MOD_SHA. This means that the value of this
variable will not change when some dependency changes.

The impact of this is in transitive dependencies. For a specific
example, if there is some change in sairedis, then sairedis will be
rebuilt (because there's a change within that component), and swss will
be rebuilt (because it's a direct dependency), but
docker-swss-layer-buster will not get rebuilt, because only the direct
dependencies are effectively being checked, and those aren't changing.

Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>
2022-05-15 23:11:05 -07:00
Marty Y. Lok
b1c3ab73ca [VoQ][config] Multiasic Supervisor card fails to load config_db#.json in chassis when system is reboot (#10106)
Supervisor card fails to load config_db#.json in chassis when system reboot. 
This is an intermittent issue, fixes #10105
2022-05-15 23:11:01 -07:00
Sudharsan Dhamal Gopalarathnam
df660f20be [caclmgrd]Added logic to allow BFD port numbers (#10735)
* [caclmgrd]Added logic to allow BFD port numbers
2022-05-15 23:10:57 -07:00
Dror Prital
51d675c604
[202111][submodule] Update sonic-utilities submodule (#10816)
Revert "[scripts/fast-reboot] Shutdown remaining containers through systemd (Azure/sonic-utilities#2133)" (Azure/sonic-utilities#2166)
2022-05-15 14:01:41 +03:00
Shilong Liu
339e68e1dd
[ci] Support multi tags when pushing docker image (#10771) (#10789) 2022-05-11 14:08:50 +08:00
Junchao-Mellanox
d0e7d9a01d
[YANG] Fix issue: Non compliant leaf list in config_db schema (#10291) (#10768)
Fix issue: Non compliant leaf list in config_db schema: https://github.com/Azure/sonic-buildimage/issues/9801

The basic flow of DPB is like:
1.	Transfer config db json value to YANG json value, name it “yangIn”
2.	Validate “yangIn” by libyang
3.	Generate a YANG json value to represent the target configuration, name it “yangTarget”
4.	Do diff between “yangIn” and “yangTarget”
5.	Apply the diff to CONFIG DB json and save it back to DB

The fix:
•	For step #1, If value of a leaf-list field string type, transfer it to a list by splitting it with “,” the purpose here is to make step#2 happy. We also need to save <table_name>.<key>.<field_name> to a set named “leaf_list_with_string_value_set”.
•	For step#5, loop “leaf_list_with_string_value_set” and change those fields back to a string.

1. Manual test
2. Changed sample config DB and unit test passed

Conflicts:
	src/sonic-yang-mgmt/sonic_yang_ext.py
2022-05-09 07:41:27 -07:00
mssonicbld
a4283019cd
[ci/build]: Upgrade SONiC package versions (#10724) 2022-05-08 23:13:04 +00:00
Judy Joseph
375e20338a Update submodules sonic-snmpagent, sonic-swss
079f80a (HEAD -> 202111, origin/202111) Fix: if routestr does not exist, skip (#257)
8fd0fe1 Fix: not to use blocking get_all() after keys() (#255)
981107a Add VoQ Recirc interface (i.e., Ethernet-Rec) to interface maps for S… (#244)

f4ecfb6 (HEAD -> 202111, origin/202111) Removing Vnet with scope default (#2239)
2022-05-07 23:28:17 -07:00
Junchao-Mellanox
4dabc46d82 Fix race condition between networking service and interface-config service (#10573)
Why I did it
The PR is aimed to fix a bug that mgmt port eth0 may loss IP even if user configured static IP of eth0. This is not a always reproduceable issue, the reproducing flow is like:

Systemd starts networking service, which runs a dhcp based configuration and assigned an ip from dhcp.
Systemd starts interface-config service who depends on networking service
Interface-config service runs command “ifdown –force eth0”, check line. but networking service is still running so that this line failed with error: “error: Another instance of this program is already running.”. This error is printed by ifupdown2 lib who is the main process of networking service. So, ifdown actually does not work here, the ip of eth0 is not down.
Interface-config service updates /etc/networking/interface to static configuration.
Interface-config service runs command “systemctl restart networking”. This command kills the previous networking related processes (log: networking.service: Main process exited, code=killed, status=15/TERM), and try to reconfigure the ip address with static configuration. But it detects that the configured IP and the existing IP are the same, and it does not really configure the ip to kernel. Hence, the ip is still getting from dhcp. (this could be a bug of ifupdown2: previous ip is from dhcp, new ip is a static ip, it treats them as same instead of re-configuring the IP)
When the lease of the ip expires, the ip of eth0 is removed by kernel and the issue reproduces.
The issue is not always reproduceable because networking service usually runs fast so that it won't hit step#3.

How I did it
Check networking service state before running "ifdown –force eth0", wait for it done if it is activating.

How to verify it
Manual test.
2022-05-07 23:17:07 -07:00
kellyyeh
0a3217004c [dhcp6relay] Add dhcpv6 option check (#10486) 2022-05-07 23:17:03 -07:00
Aravind Mani
f907f19064 DellEMC: S6000,S6100 SFP refactor (#9016)
* DellEMC: S6000,S6100 SFP refactor
2022-05-07 23:16:57 -07:00
Lior Avramov
3d3eb1fb53 [LLDP] Enhance lldmgrd Redis events handling (#10593)
Why I did it
When lldpmgrd handled events of other tables besides PORT_TABLE, error message was printed to log.

How I did it
Handle event according to its file descriptor instead of looping all registered selectables for each coming event.

How to verify it
I verified same events are being handled by printing events key and operation, before and after the change.
Also, before the change, in init flow after config reload, when lldpmgrd handled events of other tables besides PORT_TABLE, error messages were printed to log, this issue is solved now.
2022-05-07 23:16:48 -07:00
kellyyeh
78031eb863 [dhcp6relay] Add retry mechanism for binding socket to interface ipv6 addresses (#10712) 2022-05-07 23:16:44 -07:00
shlomibitton
d3d6d0fb52 [Fastboot] Delay PMON service for better fastboot performance (#10567)
- Why I did it
Profiling the system state on init after fast-reboot during create_switch function execution, it is possible to see few python scripts running at the same time.
This parallel execution consume CPU time and the duration of create_switch is longer than it should be.
Following this finding, and the motivation to ensure these services will not interfere in the future, PMON is delayed in 90 seconds until the system finish the init flow after fastboot.

- How I did it
Add a timer for PMON service.
Exclude for MLNX platform the start trigger of PMON when SYNCD starts in case of fastboot.
Copy the timer file to the host bin image.

- How to verify it
Run fast-reboot on MLNX platform and observe faster create_switch execution time.
2022-05-07 23:16:41 -07:00
Andriy Yurkiv
1334c0447f [yang] add yang options for Context object (#10359)
#### Why I did it
Need to pass LY_CTX_DISABLE_SEARCHDIR_CWD to Context in order to disable automatically searching for schemas in current working directory (which is by default searched automatically)

#### How I did it
add additional attribute into YANG context

#### How to verify it
Create some invalid link on switch :
1) **ln -s /usr/abc xxx**
2) run **spm list**
--> There should not be these messages:
```
libyang[1]: Unable to get information about "xxx" file in "/tmp" when searching for (sub)modules (No such file or directory)
libyang[1]: Unable to get information about "xxx" file in "/tmp" when searching for (sub)modules (No such file or directory)
libyang[1]: Unable to get information about "xxx" file in "/tmp" when searching for (sub)modules (No such file or directory)
libyang[1]: Unable to get information about "xxx" file in "/tmp" when searching for (sub)modules (No such file or directory)
```
2022-05-07 23:16:37 -07:00
Judy Joseph
b3f42feeaf Update sonic-swss, sonic-utilities submodules
swss
f71c57e [ACL]Avoid incrementing crm count when ACL rule create fails (#2238)

utilities
8a93fde Allow fw update for other boot type against on the previous "none" boot fw update (#2040)
5837559 [show] fix get routing stack routine (#2137)
c888f29 [techsupport] improve robustness (#2117)
2022-05-01 23:20:41 -07:00
Shilong Liu
482f45e28f [build] docker-sonic-mgmt replace USER by whoami (#9702) 2022-05-01 23:16:26 -07:00
xumia
a55ba095db Fix the build target error when building sonic-rest-api (#10693)
Why I did it
Fix target target/debs/bullseye/sonic-rest-api_1.0.1_arm64.deb not existing issue, the correct target is target/debs/bullseye/sonic-rest-api_1.0.1_armhf.deb.
Fix issue: #9896

[ FAIL LOG START ] [ target/debs/stretch/sonic-rest-api_1.0.1_amd64.deb ]
[ REASON ] :      target/debs/stretch/sonic-rest-api_1.0.1_amd64.deb does not exist   NON-EXISTENT PREREQUISITES: 
[ FLAGS  FILE    ] : []
2022-05-01 23:16:22 -07:00
shlomibitton
94f271c667 [Fastboot] Delay LLDP service for better fastboot performance (#10568)
- Why I did it
Profiling the system state on init after fast-reboot during create_switch function execution, it is possible to see few python scripts running at the same time.
This parallel execution consume CPU time and the duration of create_switch is longer than it should be.
Following this finding, and the motivation to ensure these services will not interfere in the future, LLDP is delayed in 90 seconds until the system finish the init flow after fastboot.

- How I did it
Add a timer for LLDP service.
Copy the timer file to the host bin image.

- How to verify it
Run fast-reboot on MLNX platform and observe faster create_switch execution time.
This PR is dependent on PR: #10567
2022-05-01 23:16:18 -07:00
Saikrishna Arcot
f1ec7107cb Remove SSH host keys after installing the custom version of sshd (#10633)
* Remove SSH host keys after installing the custom version of sshd

Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>

* Use an override for for sshd instead of overwriting the service file

Don't overwrite upstream's .service file, and instead use an override
file for making sure the host key(s) are generated.

Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>
2022-05-01 23:16:14 -07:00
mssonicbld
e2a2b30676
[ci/build]: Upgrade SONiC package versions (#10722) 2022-05-01 22:40:25 +00:00
mssonicbld
ff48ad4e9b
[ci/build]: Upgrade SONiC package versions (#10658) 2022-04-29 22:37:55 +00:00