Commit Graph

5505 Commits

Author SHA1 Message Date
xumia
3445104d72
[202012] Fix vs check install login timeout issue (#11727) (#12011)
Why I did it
Fix an unstable build issue: #11620
The VS VM started successfully, but the check timed out waiting for the message "sonic login:".

This issue caused 55 build failures in the last 30 days.

AzurePipelineBuildLogs
| where startTime > ago(30d)
| where type =~ "task"
| where result =~ "failed"
| where name =~ "Build sonic image"
| where content contains "Timeout exceeded"
| where content contains "re.compile('sonic login:')"
| project-away content
| extend branchName=case(reason=~"pullRequest", tostring(todynamic(parameters)['system.pullRequest.targetBranch']),
              replace("refs/heads/", "", sourceBranch))
| summarize FailedCount=dcount(buildId) by branchName

branchName	FailedCount
master	37
202012	9
202106	4
202111	2
202205	1
201911	1
The failure occurs when the login prompt is interleaved with the output of /etc/rc.local. One example is shown below (see the line rc.local[307]: sonic+ onie_disco_subnet=255.255.255.0 login: ).
check_install.py was waiting for the message "sonic login:", while the Linux console was already waiting for the username input (the login prompt had already been printed to the console).
https://dev.azure.com/mssonic/build/_build/results?buildId=123294&view=logs&j=cef3d8a9-152e-5193-620b-567dc18af272&t=359769c4-8b5e-5976-a793-85da132e0a6f

2022-07-17T15:00:58.9198877Z [   25.493855] rc.local[307]: + onie_disco_opt53=05
2022-07-17T15:00:58.9199330Z [   25.595054] rc.local[307]: + onie_disco_router=10.0.2.2
2022-07-17T15:00:58.9199781Z [   25.699409] rc.local[307]: + onie_disco_serverid=10.0.2.2
2022-07-17T15:00:58.9200252Z [   25.789891] rc.local[307]: + onie_disco_siaddr=10.0.2.2
2022-07-17T15:00:58.9200622Z [   25.880920]
2022-07-17T15:00:58.9200745Z
2022-07-17T15:00:58.9201019Z Debian GNU/Linux 10 sonic ttyS0
2022-07-17T15:00:58.9201201Z
2022-07-17T15:00:58.9201542Z rc.local[307]: sonic+ onie_disco_subnet=255.255.255.0 login:
2022-07-17T15:00:58.9202309Z [   26.079767] rc.local[307]: + onie_exec_url=file://dev/vdb/onie-installer.bin

How I did it
Send a newline after the script /etc/rc.local has finished running.
When a newline is entered, the "sonic login:" prompt is printed again.
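
The failure mode and the effect of the fix can be illustrated with a minimal sketch. It assumes only that the check waits for the pattern `sonic login:`, as the build-log query above indicates. The sample strings are adapted from the log excerpt, the output after the newline is an assumption, and `saw_login_prompt` is a hypothetical helper, not code from check_install.py.

```python
import re

# Pattern the install check waits for (taken from the build-log query above).
LOGIN_PROMPT = re.compile("sonic login:")


def saw_login_prompt(console_output: str) -> bool:
    """Return True if the expected login prompt appears in the console output."""
    return LOGIN_PROMPT.search(console_output) is not None


# Console output in which an rc.local trace line splits the prompt.
interleaved = (
    "Debian GNU/Linux 10 sonic ttyS0\n"
    "\n"
    "rc.local[307]: sonic+ onie_disco_subnet=255.255.255.0 login: \n"
)

# Assumed console output once a newline has been sent: a fresh, unbroken prompt.
after_newline = interleaved + "\nsonic login: "

print(saw_login_prompt(interleaved))    # False
print(saw_login_prompt(after_newline))  # True
```

The first call returns False because the words "sonic" and "login:" are separated by rc.local output, which is why the check timed out even though the console was ready.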
2022-09-08 14:23:50 -07:00
Qi Luo
d892214209
Revert "[202012] Change dhcp6relay to be a submodule (#11922)" (#12028)
This reverts commit 9db2e21c69.
2022-09-08 13:59:49 -07:00
arlakshm
a668ccf6df
[FRR] add patch in zebra to ignore route update on RT_TABLE_DEFAULT (#12023)
Why I did it
Fixes #11995 and #9188

How I did it
Add a patch for zebra to ignore sending netlink messages for routes in the table RT_TABLE_DEFAULT

How to verify it
Run the tests described in GitHub issues #11995 and #9188

Signed-off-by: Arvindsrinivasan Lakshmi Narasimhan <arlakshm@microsoft.com>
2022-09-08 11:03:19 -07:00
kellyyeh
9db2e21c69
[202012] Change dhcp6relay to be a submodule (#11922)
* Add dhcprelay submodule
2022-09-08 08:00:40 -07:00
Liu Shilong
f3443cbb97
[submodules] Update submodule sonic-gnmi HEAD pointer (#11933)
2022-09-08 15:54:24 +08:00
jhli-cisco
d8c405bf89
Update cisco-8000 submodule to v0.112 (#11983)
Update cisco-8000 submodule to v0.112
2022-09-08 09:11:29 +08:00
Aravind Mani
37d34ddd34
DellEMC Z9332f: Fix SFP issue (#11819)
* Update sfp

* Update sfp

* Update sfp.py
2022-09-07 09:35:59 -07:00
Dror Prital
edc4485d30
[202012][Mellanox] Update SDK/FW to version 4.5.2320/2010_2320 (#11975)
Update SDK/FW version - 4.5.2320/2010_2320 in order to have the following fixes:
• Spectrum-3 | PCI calibration changes from a static to a dynamic mechanism.
• [VxLAN] TTL was set to 0 for non IP traffic (such as ARP)
2022-09-07 08:33:18 +03:00
Lawrence Lee
dfd1727c3d
[sonic-utilities]: Submodule update: (#11957)
Include following commits:
```
839625d Replace cmp in acl_loader with operator.eq (#2328)
af245ad [route_check]: Ignore standalone tunnel routes (#2332)
c9aa65c Revert "[DHCPv6] [202012] Update the dhcpv6_relay config/show cli (#2271)" (#2336)
6b9cdc9 Made changes to only allow a valid IPv4 or Ipv6 string for dhcp cli (#2296)
```
2022-09-06 14:03:29 -07:00
mssonicbld
143af80061
[ci/build]: Upgrade SONiC package versions (#11976)
2022-09-07 00:40:48 +08:00
Liu Shilong
141120549b
[actions] Remove approve step in label action. (#11968)
Why I did it
The approve step needs special permission settings.
We have already added a permission setting that enables bypass merging of PRs,
so the approve step is not necessary.
2022-09-05 14:56:22 +08:00
Zhaohui Sun
5357bdb643
Support Restapi/gnmi control plane acls in 202012 branch (#11903)
For the Restapi/gnmi use cases, SONiC has to support a new table, EXTERNAL_CLIENT, of type CTRLPLANE and stage ingress.

This table shall match on 'src ip prefix' and dst port '8080'. Caclmgrd must parse this from acl.json and install a rule as in the example below:

iptables -A INPUT -s 20.20.20.20/27 -p tcp --dport 8080 -j ACCEPT

or ip6tables if the 'src ip prefix' is IPv6.
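
The choice between iptables and ip6tables can be sketched as follows. This is an illustration only, not the caclmgrd implementation: `build_acl_rule` is a hypothetical helper, and the IPv6 prefix in the example is an assumed documentation address.

```python
import ipaddress


def build_acl_rule(src_prefix: str, dst_port: int = 8080) -> str:
    """Build an ACCEPT rule string for one source prefix (hypothetical helper)."""
    # strict=False tolerates host bits in the prefix, e.g. 20.20.20.20/27.
    network = ipaddress.ip_network(src_prefix, strict=False)
    # IPv6 prefixes need ip6tables; IPv4 prefixes use iptables.
    tool = "ip6tables" if network.version == 6 else "iptables"
    return f"{tool} -A INPUT -s {src_prefix} -p tcp --dport {dst_port} -j ACCEPT"


print(build_acl_rule("20.20.20.20/27"))
print(build_acl_rule("2001:db8::/64"))
```

The first call reproduces the rule shown above; the second yields the same rule with `ip6tables` as the command.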

This change for master branch is in PR sonic-net/sonic-host-services#9

Signed-off-by: Zhaohui Sun <zhaohuisun@microsoft.com>
2022-09-05 09:13:45 +08:00
mssonicbld
586a623422
[ci/build]: Upgrade SONiC package versions (#11911)
2022-09-04 19:29:06 +08:00
Vivek
484402ff08
[202012] [Mellanox] [SKU] Mellanox-SN4700-C128 SKU added (11574) (#11878)
- Why I did it
New SKU for MSN-4700 Platform i.e. Mellanox-SN4700-C128

Requirements:
* Breakout: Port 1-32: 4x100G
* Downlinks: 120 (1-30)
* Uplinks: 8 (31-32)
* Shared Headroom: Enabled
* Over Subscribe Ratio: 1:8
* Default Topology: T2
* Default Cable Length for T2: 1500m
* QoS params: The default ones defined in qos_config.j2 will be applied
* Small Packet Percentage: 50% is used for the traditional buffer model. Note: for the dynamic model, the value defined in LOSSLESS_TRAFFIC_PATTERN|AZURE|small_packet_percentage is used

Additional Details:
Switch Type has to be programmed as SpineRouter through config_db.json in DEVICE_METADATA|localhost|type field for the buffer values & cable lengths defined in the buffers_defaults_t2.j2 to apply on the device
Cable Lengths Used for generating buffer_defaults_{t0,t1,t2}.j2 values

Signed-off-by: Vivek Reddy <vkarri@nvidia.com>
2022-09-04 11:05:22 +03:00
Lawrence Lee
3d81b61fa1
[202012][kernel][swss]: Linux kernel and SWSS submodule update (#11929)
Include following new kernel commits:
- 815cb34 [patch]: Add accept_untracked_na kernel param (#291)
- fbb76cc Remove kernel config SERIAL_8250_RUNTIME_UARTS (#286)

Introduce following SWSS commits:
- eedf59d [intfmgr]: Enable `accept_untracked_na` kernel param (#2439)
- bbd4eb7 [BFD]Clean up state_db BFD entries on swss restart (#2434) (#2435)
- aa7b546 [FDB] [202012] Fix fbdorch to properly handle syncd FDB FLUSH Notif (#2401)
- 4190c13 Support for platforms based on Clounix Networks' device (#2399)

Signed-off-by: Lawrence Lee <lawlee@microsoft.com>
2022-09-02 21:09:06 -07:00
Lawrence Lee
e821dd8551
[arp_update]: Set failed IPv6 neighbors to incomplete (#11919)
After pinging any failed IPv6 neighbor entries, set the remaining failed/incomplete entries to a permanent INCOMPLETE state. This manual setting to INCOMPLETE prevents these entries from automatically transitioning to the FAILED state, and since they are now incomplete, any subsequent NA messages for these neighbors are able to resolve the entry in the cache.

Signed-off-by: Lawrence Lee <lawlee@microsoft.com>
2022-09-02 21:57:47 +00:00
Liu Shilong
f4e50597e2
[actions] Add github context env in label action. (#11926)
2022-09-02 21:57:41 +00:00
Ying Xie
4ab83170a5
[write_standby] update write_standby.py script (#11650)
Why I did it
The initial value has to be present for the state machines to work. In the active-standby dual-tor scenario, or any hardware mux scenario, the value will eventually be updated after a delay.

However, in active-active dual-tor scenario, there is no other mechanism to initialize the value and get state machines started.
So this script will have to write something at start up time.

For active-active dual-tor, 'active' is the preferred initial value; the state machine will soon switch the state to standby if the link prober finds that the link is not in a good state.

How I did it
Update the script to always provide initial values.

How to verify it
Tested on active-active dual-tor testbed.

Signed-off-by: Ying Xie ying.xie@microsoft.com
2022-09-01 23:57:23 +00:00
Jing Zhang
9d3194c77a
Avoid write_standby in warm restart context (#11283)
Avoid write_standby in warm restart context.

sign-off: Jing Zhang zhangjing@microsoft.com

Why I did it
In warm restart context, we should avoid mux state change.

How I did it
Check warm restart flag before applying changes to app db.

How to verify it
Ran write_standby in table missing, key missing, field missing scenarios.
Did a warm restart, app db changes were skipped. Saw this in syslog:
WARNING write_standby: Taking no action due to ongoing warmrestart.
2022-09-01 23:57:17 +00:00
Dror Prital
0bf0133ecc
[202012][submodule] Advance sonic-swss-common pointer (#11888)
#### Why I did it

Update sonic-swss-common submodule pointer to include the following:
* [202012] Add docker-mux related table names  ([#669](https://github.com/sonic-net/sonic-swss-common/pull/669))
* Transfer organization from Azure to sonic-net ([#658](https://github.com/sonic-net/sonic-swss-common/pull/658))
2022-08-31 23:53:28 -07:00
Dev Ojha
8c57f0521f
[Arista7050cx3] TD3 SKU changes for pg headroom value after interop testing with cisco 8102 (#11901)
Why I did it
After PFC interop testing between 8102 and 7050cx3, data packet losses were observed on the Rx ports of the 7050cx3 (inflow from 8102) during testing. This was primarily due to the slower response times to react to PFC pause packets for the 8102, when receiving such frames from neighboring devices. To solve for the packet drops, the 7050cx3 pg headroom size has to be increased to 160kB.

How I did it
Modified the xoff threshold value to 160kB in the pg_profile file to allow for the buffer manager to read that value when building the image, and configuring the device

How to verify it
Run "mmuconfig -l" once the image is built


Signed-off-by: dojha <devojha@microsoft.com>
2022-08-31 11:10:22 -07:00
Arun Saravanan Balachandran
c1712b8c9a
[202012] DellEMC: S6000, S6100, Z9332f - Add capabilities fields in platform.json (#11772)
2022-08-31 09:06:47 -07:00
Liu Shilong
932af450ed
[actions] Update github actions label and automerge. (#11736) (#11909)
1. Add auto approve step when adding label to version upgrading PR.
2. Use mssonicbld TOKEN to merge version upgrading PR instead of 'github actions'
2022-08-31 16:31:48 +08:00
mssonicbld
ed68e4c97c
[ci/build]: Upgrade SONiC package versions (#11896)
2022-08-30 22:44:47 +08:00
Liu Shilong
8e0393927f
[submodules] Upgrade HEAD pointer of sonic-mgmt-framework to fix CG alert (#11871)
2022-08-30 16:28:11 +08:00
mssonicbld
347b2dddcd
[ci/build]: Upgrade SONiC package versions (#11757)
2022-08-29 14:08:14 +08:00
jhli-cisco
62c6fb2eab
Update cisco-8000 submodule to v0.111 (#11835)
Update cisco-8000 submodule to v0.111 drop
2022-08-26 08:14:54 +08:00
kellyyeh
973fb9e494
[dhcp_relay] Add "vlan missing ip helper" dhcp relay unittest (#10654) (#11794)
2022-08-24 19:53:11 -07:00
Sujin Kang
42e29246b9
Add the hw reboot cause if it happened during a software reboot (#11792)
* Add the hw reboot cause if it happened during a software reboot

* fix the unknown software reboot cause
2022-08-22 10:52:01 -07:00
Mai Bui
32bee5df2d
Replace strtok in systemd-sonic-generator (#11710)
Signed-off-by: maipbui <maibui@microsoft.com>

#### Why I did it
Replace unsafe functions with safe ones
#### How I did it
Replace `strtok()` with `strtok_r()`
2022-08-22 07:24:10 +00:00
Saikrishna Arcot
9ee0219029
[sairedis]: Update submodule (#11762)
This brings in the following changes:

```
21f3d7c Add SAI_OBJECT_TYPE_TUNNEL object to break-before-make list (#1075)
8ac3aa7 [sairedis] Fix reopen recoding file (#1087)
0bbbb02 Transfer organization from Azure to sonic-net (#1096)
3406c57 [sairedis] Perform log rotate on request (#1083)
```
2022-08-18 00:22:01 -07:00
mssonicbld
07082bb5f5
[ci/build]: Upgrade SONiC package versions (#11676)
2022-08-16 13:07:32 +00:00
Jing Zhang
088d9923d9
[202012][sonic-linkmgrd] Submodule Update (#11726)
fc5d424 Jing Zhang Fri Aug 12 14:39:59 2022 -0700 [202012] Cherry-pick flaky unit test fixes (#115)
faceb93 Jing Zhang Thu Aug 11 10:03:05 2022 -0700 Backoff mux probing for server down scenario (#106)
86ddd95 Jing Zhang Fri Aug 12 14:21:37 2022 -0700 Fix race condition caused by strand wrap method (#104) (#110)
f68a03e Jing Zhang Thu Aug 11 15:31:22 2022 -0700 [lgtm]: add uuid-dev to lgtm prepare (#112)

sign-off: Jing Zhang zhangjing@microsoft.com
2022-08-15 09:14:44 -07:00
vdahiya12
fb7c962fbc
[202012][sonic-platorm-common] submodule update (#11712)
Signed-off-by: vaibhav-dahiya vdahiya@microsoft.com
The following commit is added in sonic-platform-common:

fd8cf3a (HEAD -> 202012, origin/202012) [Credo][Ycable] fix incorrect uart statistics (#296)

Signed-off-by: vaibhav-dahiya <vdahiya@microsoft.com>
2022-08-12 14:07:33 -07:00
Sumukha Tumkur Vani
5d1b052e4f
[RESTAPI] Update submodule (#11686)
2022-08-11 17:17:35 -07:00
StormLiangMS
00d512577e
[bgp-cfgd] BGP allow list enhancement (#11586)
Why I did it
Two things are missing in the current allow-prefix list implementation:

* In some use cases, the BGP neighbor type must be distinguished so that different neighbors can have different allow-prefix lists; this is not supported.
* The prefix list does not support flexible le and ge lengths.
How I did it
Enhance the BGP allow-prefix list feature to:

* Include the neighbor type info in the allow-prefix list.
* Support flexible le and ge lengths in the allow-prefix list.
How to verify it
4 new unit test cases are added in this PR to cover changes.
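
As an illustration of what flexible le and ge bounds mean for a prefix list, here is a minimal sketch. It assumes the conventional prefix-list semantics (ge is the minimum and le the maximum accepted prefix length, and an entry without either matches only the exact prefix). `prefix_allowed` is a hypothetical helper and does not reflect how bgpcfgd implements the feature.

```python
import ipaddress
from typing import Optional


def prefix_allowed(candidate: str, base: str,
                   ge: Optional[int] = None, le: Optional[int] = None) -> bool:
    """Check a route prefix against one allow entry with optional ge/le bounds."""
    cand = ipaddress.ip_network(candidate)
    net = ipaddress.ip_network(base)
    # The candidate must be the same address family and lie inside the base prefix.
    if cand.version != net.version or not cand.subnet_of(net):
        return False
    if ge is None and le is None:
        # Without ge/le, only the exact prefix length matches.
        return cand.prefixlen == net.prefixlen
    low = ge if ge is not None else net.prefixlen
    high = le if le is not None else cand.max_prefixlen
    return low <= cand.prefixlen <= high


print(prefix_allowed("10.1.2.0/24", "10.0.0.0/8"))           # False
print(prefix_allowed("10.1.2.0/24", "10.0.0.0/8", le=24))    # True
print(prefix_allowed("10.1.2.128/25", "10.0.0.0/8", ge=25))  # True
```

The example addresses are arbitrary; only the relationship between the prefix lengths and the bounds matters.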
2022-08-11 20:38:35 +00:00
Stepan Blyshchak
8ab448a852
[swss.sh/syncd.sh] Trap only on EXIT (#11590)
When a script traps SIGTERM, it does not react to a SIGTERM signal sent while a child process is executing.
For example, the following script does not react to a SIGTERM sent to it while it is
waiting for sleep to finish:

```

trap "echo Handled SIGTERM" 0 2 3 15

echo "Before sleep"
sleep inf
echo "After sleep"
```

Instead, trap only on EXIT, which also covers the scenario of exiting on
SIGINT or SIGTERM.

Signed-off-by: Stepan Blyschak <stepanb@nvidia.com>
2022-08-11 20:38:20 +00:00
Sumukha Tumkur Vani
cd480810d7
[RESTAPI] Update submodule (#11674)
2022-08-11 20:38:15 +00:00
Jing Zhang
ff4f160570
[202012][sonic-linkmgrd] Submodule Update (#11514)
[202012][sonic-linkmgrd] Submodule Update

d3f5de1 Jing Zhang Tue Aug 9 10:14:48 2022 -0700 Fix build failure introduced by commit 847c1e8 (#105)
847c1e8 Jing Zhang Mon Aug 8 10:36:18 2022 -0700 [Active-Standby]Remove unnecessary handleMuxWaitTimeout logs (#100)
1fdf0e6 Jing Zhang Thu Jul 21 10:33:45 2022 -0700 Add support for reconciliation after warm restart (#76) (#97)

sign-off: Jing Zhang zhangjing@microsoft.com
2022-08-11 10:37:34 -07:00
vdahiya12
feab02df59
[202012][sonic-utilities][sonic-platform-daemons] submodule update (#11684)
The following commit is added in sonic-utilities
b034f0c (HEAD -> 202012, origin/202012) [config][muxcable] add support
to enable/disable ycable telemetry (#2… (#2304)

The following commit is added in sonic-platform-daemons
978667c (HEAD -> 202012, origin/202012) [ycabled] add capability to
enable/disable telemetry (#279) (#280

Signed-off-by: vaibhav-dahiya <vdahiya@microsoft.com>

2022-08-11 09:55:15 -07:00
zitingguo-ms
5b5bd5e818
[202012 BRCM SAI 4.3.7.0] Pick up fixes and make up BRCM SAI version to 4.3.7.0 (#11681)
Pick up the following fixes and update BRCM SAI to 4.3.7.0:

CS00012208537: Add back previous commit 54c5bc4848eb748
CS00012253061,SONIC-63280: WB from 3.5 to 4.3, followed by WB to 4.3
CS00012207978: SDK-296517, time spent for SAI operations
CS00012245601,SONIC-62898: Egress ACL Counted as Interface TX drops
Update pcbb with Fixes for CS00012243699
Upgrade on pcbb with Fixes for KB0025353, CS00012221689, CS00012221688, KB0025391, CS00012230519
commit of "CS00012221688:PFC frames egressing, PFC storm happens simultaneously on 2 ports" is purposely skipped to be picked up later due to SWSS dependency not ready.
Why I did it
How I did it
How to verify it
Tested build target, successful

Manually ran these tests after installing the SAI binary in image 20201231.73 on a 7050CX3 (TD3) T0 DUT; all passed.

vxlan/test_vxlan_decap.py
fdb/test_fdb.py
pfcwd/test_pfcwd_all_port_storm.py
acl/null_route/test_null_route_helper.py
acl/test_acl.py
vlan/test_vlan.py
platform_tests/test_reboot.py


Signed-off-by: zitingguo-ms <zitingguo@microsoft.com>
2022-08-10 15:02:47 -07:00
Liu Shilong
9b952945ca
[actions] Fix automerge actions don't work issue. (#11671)
2022-08-10 11:53:58 +08:00
Jing Zhang
ffd9e190e1
Update WARM START FINALIZER to wait for linkmgrd to reconcile (#11477)
Stemming from sonic-net/sonic-linkmgrd#76, this PR updates the warm restart finalizer to wait for linkmgrd to be reconciled.

sign-off: Jing Zhang zhangjing@microsoft.com

Why I did it
To make sure the finalizer saves the config after linkmgrd's reconciliation.

How I did it
Add linkmgrd to the reconciliation wait list of warmboot finalizer.

How to verify it
Verified on lab device, linkmgrd reconciled as expected.
2022-08-09 21:05:12 +00:00
Ikki Zhu
cf12aa549a
[hlx/sfp] fix hlx platform sfp+ tx disable issue (#11532)
Why I did it:
To fix hlx platform sfp+ module tx disable issue

How I did it:
Fix the sfp+ tx disable function according to the SFF-8472 specification

Co-authored-by: Eric Zhu <erzhu@celestica.com>
2022-08-09 21:05:08 +00:00
xumia
1c8c1a6010
[202012] Fix CVE-2017-1000487 security alert (#11635)
Why I did it
Fix CVE-2017-1000487 alert in thrift 0.14.1.
See https://nvd.nist.gov/vuln/detail/CVE-2017-1000487

How I did it
Change the version of org.codehaus.plexus:plexus-utils from 3.0.14 to 3.0.16.
2022-08-08 12:48:30 +08:00
mssonicbld
14f93e15c6
[ci/build]: Upgrade SONiC package versions (#11629)
Why I did it
Upgrade SONiC Versions
2022-08-07 11:27:16 +08:00
Lawrence Lee
04ba6da1ab
[202012][arp_update]: Resolve failed neighbors on dualtor (#11641)
In arp_update, check for FAILED or INCOMPLETE kernel neighbor entries and manually ping them to try and resolve the neighbor
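
The selection step described above can be sketched roughly as follows. This is not the arp_update script itself: `unresolved_neighbors` is a hypothetical helper, and the sample text is assumed output modeled on the format of `ip neigh show`.

```python
from typing import List, Tuple


def unresolved_neighbors(neigh_output: str) -> List[Tuple[str, str]]:
    """Return (address, device) pairs whose state is FAILED or INCOMPLETE."""
    result = []
    for line in neigh_output.splitlines():
        fields = line.split()
        # Expected shape: <address> dev <device> [lladdr <mac>] ... <STATE>
        if (len(fields) >= 4 and fields[1] == "dev"
                and fields[-1] in ("FAILED", "INCOMPLETE")):
            result.append((fields[0], fields[2]))
    return result


# Assumed sample of kernel neighbor entries (addresses are illustrative).
sample = """\
192.168.0.2 dev Vlan1000 lladdr 00:11:22:33:44:55 REACHABLE
192.168.0.3 dev Vlan1000  FAILED
fc02:1000::4 dev Vlan1000  INCOMPLETE
"""

print(unresolved_neighbors(sample))
```

Each returned pair identifies a neighbor that would then be pinged on its device in an attempt to resolve it.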

Signed-off-by: Lawrence Lee <lawlee@microsoft.com>
2022-08-05 23:30:04 -07:00
tjchadaga
6d66d9b8fc
Revert "Add load_minigraph option to include traffic-shift-away during config migration (#11403)" (#11625)
This reverts commit 6c2f99a327.
2022-08-06 10:05:45 +05:30
Lawrence Lee
663bf00c22
[swss]: Run tunnel_pkt_handler on dualtor only (#11626)
At SWSS docker init time, check the device subtype and enable tunnel packet handler only if it is dualtor

Signed-off-by: Lawrence Lee <lawlee@microsoft.com>
2022-08-05 11:33:37 -07:00
Kevin Wang
8bb38639e5
[202012][sonic-sairedis]Update sonic-sairedis submodule (#11486)
Remove duplicate cisco-8000 functions and call to those functions (#1077)
Support For CISCO debug-shell in syncd-container (#1070)
Fix submodule SAI branch name (#1031)

Signed-off-by: Kevin Wang <shengkaiwang@microsoft.com>
2022-08-03 13:22:38 +08:00