Commit Graph

3792 Commits

Author SHA1 Message Date
haowei1122
acb9bbafcc
Update sonic-fanthrml-monitor (#8636)
* Thermal mapping is wrong with BMC return value
2021-09-09 09:44:56 -07:00
Shi Su
09a1e90990
[201911] [FRR] Upgrade FRR to frr-7.2.1-s4 tag (#8572)
Why I did it
Update FRR 7.2.1 head. The following is a list of new commits.

5ae667a1f Merge pull request #9335 from FRRouting/mergify/bp/stable/7.2/pr-9214
eb679e8a1 zebra: bugfix of error quit of zebra, due to no nexthop ACTIVE
80d2eaa98 Merge pull request #8886 from FRRouting/mergify/bp/stable/7.2/pr-8876
1eeab2c1e lib: remove pure attribute from functions that modify memory
eb00dc4ec Merge pull request #6944 from LabNConsulting/working/lb/7.2/valgrind-supp-libyang
b9d6d05bf bgpd: suppress new libyang_1.0 related loss reports
8c26a71eb Merge pull request #6562 from ton31337/fix/configuration_for_labeled_unicast_in_place_7.2
386a1719c bgpd: Make sure network/aggregate-address commands lay down under labeled safi
b01c8bf28 Merge pull request #6526 from ton31337/fix/set_ipv6_ll_if_global_zero_7.2
c382833e8 bgpd: Use IPv6 LL address as nexthop if global was set to ::/LL
99509b835 Merge pull request #6395 from opensourcerouting/7.2/init-config-perms
7eef8f7b1 build: use configfile mode in init script
4cbe07705 Merge pull request #6360 from opensourcerouting/7.2/fix-warnings
84bb11785 nhrpd: clean up SA warning
aac726476 nhrpd: be more careful with linked lists
3a4b6d654 debian: Fix spelling error
756c67c6c Merge pull request #6284 from opensourcerouting/7.2/gcc-10
65a116a64 Merge pull request #6354 from ton31337/fix/communities_bgpd_crash_7.2
f7a00fd67 bgpd: Check to ensure community attributes exist before freeing them
a960f99c2 vrrpd: fix build on Fedora Rawhide
d4caff99f babeld: GCC complaining about no return in non-void function
a014c27ae babeld: fix build on Fedora Rawhide
79ff55b5b bgpd: remove unused variable
ff343e588 pimd: Make frr able to be built by gcc 10
9a3cf1ba2 ldpd: remove multiple definitions of thread_master
a19515bfe ldpd: fix another linking issue with GCC-10
b4c8de38c tests: fix build with GCC 10
4f27e8c85 ldpd: Fix linking error on Fedora Rawhide with GCC 10

How I did it
Update FRR 7.2 pointer and create a tag frr-7.2.1-s4.
2021-08-25 09:18:27 -07:00
shlomibitton
55f86768a6
[Mellanox] Update SDK\FW to version 4.4.3326\2008.3326 (#8568)
- Why I did it
Update SDK\FW version to 4.4.3326\2008.3326. This version contains:

New Features:
1. Add support for Fast Boot for SN3800

Bug Fixing:
1. In some cases, when the total number of allocations exceeds the resource limit, an error can occur due to an incorrect resource release procedure. This issue is most likely to affect the following resources: flow counters, ACL actions, PBS, WJH filter, Tunnels, ECMP containers, MC (L2 & L3).
2. On Spectrum systems, when using the Async Router API with IPv6, an error message about failing to remove an ECMP container may show up in the log. This error has no functional impact and can be safely ignored.
3. On Spectrum-2 systems and above, when using warm boot, setting max_bridge_num to a value greater than 1968 will cause an error and potential crash.
4. Some Molex cables do not support speed after reboot

- How I did it

- How to verify it
Verified by running regression tests, including the complete set of supported sonic-mgmt tests.
2021-08-25 16:34:42 +03:00
Junchao-Mellanox
c647c7ce2b
[Mellanox] Upgrade hw-mgmt to 7.0100.2344 (#8378)
Why I did it
To support the new PSU fan on Mellanox platforms.

How I did it
Upgrade hw-mgmt to 7.0100.2344
2021-08-19 18:07:58 -07:00
Aravind Mani
c53822c9e8
[201911] Dell S6100:Add serial-getty service to monit (#8409)
Why I did it
The serial-getty service exited randomly on the Dell S6100 device.

How I did it
Added serial-getty to monit services.

How to verify it
Stop serial-getty in an SSH session and check whether the service restarts.
2021-08-19 10:13:34 -07:00
abdosi
de3d30f36d
Updated Broadcom SAI Debian package to 3.7.6.1 (#8365)
Updated Broadcom SAI Debian package to 3.7.6.1. The major changes are:

- CS00011651922/CS00012192502 SID:Parity error in TDM Calendar memories causes traffic drop after SER correction
- CS00011222060 soc_mem_alpm_delete: unit 0: ALPM delete operation[L3_DEFIP_ALPM_IPV6_128] encountered parity error
- Cesto Phy Recovery enhancement.
- SDK compile with flag -DBCM_MONOTONIC_TIME and -DBCM_MONOTONIC_MUTEXES
2021-08-06 17:55:41 -07:00
Abhishek Dosi
624ddf6dda
[submodule update] sonic-utilities
97d971372fac773b98d46bb8f800df7b845e518e (HEAD -> 201911, origin/201911) [sfpshow] Gracefully handle improper 'specification_compliance' field (#1594) (#1729)
2099c73cea81ff4524e680b6f9335c0b0f13b94e [CLI] Implement null_route_helper script (#1740)
b56659175986fe0e5b82c6bd6b3dde163164777b [minigraph][port_config] Consume port_config.json while reloading minigraph (#1725)
e840c42da2a40db2bf993672271f6b75c51c426f Change the method name to align with master, reduce diverge issue (#1703)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-08-06 11:02:19 -07:00
Renuka Manavalan
8cd6714ef4
hostcfgd: Handle missed tacacs updates between load & listen (#8223)
Why I did it
The time gap between the last config load and db-listen seems to have increased.
Any config updates that occur in this gap are missed by db-listen.
As a result, /etc/pam.d/common-auth-sonic may not get updated.

How I did it
Add a one-shot timer just before db-listen. The timer fires after the subscribe is done.
When the timer fires, reload TACACS and AAA.
2021-08-06 10:38:37 -07:00
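The one-shot timer approach described in the hostcfgd change above (#8223) can be sketched roughly as follows. This is a minimal illustration and not the actual hostcfgd code: the `ConfigListener` class, the dict standing in for the config database, and the 50 ms delay are all assumptions made for the example.

```python
import threading


class ConfigListener:
    """Hypothetical stand-in for a daemon's load / subscribe / listen flow."""

    def __init__(self, db):
        self.db = db          # plain dict acting as the config database
        self.applied = {}     # configuration the daemon has applied so far
        self.subscribed = False

    def load(self):
        # Initial load: snapshot the current configuration.
        self.applied = dict(self.db)

    def reload(self):
        # Re-read everything, picking up updates made after load().
        self.applied = dict(self.db)

    def start(self, delay=0.05):
        # Arm a one-shot timer just before listening. It fires once,
        # after the subscription is in place, and triggers a reload.
        timer = threading.Timer(delay, self.reload)
        timer.start()
        self.subscribed = True
        return timer


db = {"tacacs": "server-a"}
listener = ConfigListener(db)
listener.load()
db["tacacs"] = "server-b"   # update lands in the gap before subscribing
timer = listener.start()
timer.join()                # wait for the one-shot timer to fire
print(listener.applied["tacacs"])
```

Without the timer, the update made between `load()` and `start()` would never be applied, because no notification is delivered for changes that predate the subscription.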
Arun Saravanan Balachandran
d573cd141d
[201911] DellEMC S6100: Update SSD upgrade status checker (#8225)
Why I did it
To handle newer SSD firmware version in DellEMC S6100 platform (S210506G - 3IE devices).

How I did it
Update s6100_ssd_upgrade_status.sh to handle newer SSD firmware version.

How to verify it
Logs: UT_logs.txt
2021-08-05 22:43:53 -07:00
Dror Prital
2a34e8aca5
[mellanox]: Update SDK\FW to version 4.4.3228\2008_3224 (#8352)
Fix the following issue: Resource KVD hash Table tries to deallocate more resources than allocated.

Signed-off-by: Dror Prital <drorp@nvidia.com>
2021-08-05 19:05:26 -07:00
Dror Prital
949fcd21a8
Update SDK\FW to version 4.4.3222\2008.3224 (#8248)
* Update SDK\FW version to 4.4.3222\2008.3224.
Signed-off-by: Dror Prital <drorp@nvidia.com>
2021-07-22 18:17:05 -07:00
shihjeff
940aaa0cbe
[201911] [Innovium] Update Cameo & Wistron Drivers (#7855)
Fix #8068

Update Innovium configs on Cameo and Wistron platforms
2021-07-21 09:09:36 -07:00
Vivek Reddy
fcc7d3102a
[201911][Mellanox] Update SDK\FW ver. 4.4.3216\2008.3218 (#8145)
Signed-off-by: Dror Prital <drorp@nvidia.com>
* [Mellanox] Update FW version to 2008.3218 (#8079)
Update FW version to 2008.3218, fixing the following issues:
- 50G/100G links that are operationally down before warm-reboot are not coming up after warm-reboot
- 50G/100G links with admin shut / no shut commands are not coming up after warm-reboot
2021-07-09 17:54:25 -07:00
Vivek Reddy
d958b6c664
Update SAI Commit (#8141)
[ba669c3] Fix saisdkdump
Co-authored-by: Vivek Reddy Karri <vkarri@r-build-sonic06.mtr.labs.mlnx>
2021-07-09 15:28:00 -07:00
noaOrMlnx
c5ace6d1ea
[201911] Change RIF counters to be enabled by default (#8053)
* Change RIF counters to be enabled by default for MLNX platform
2021-07-08 18:58:41 -07:00
madhanmellanox
2bec3004ef
[201911] Adding SKU Mellanox-SN3800-D100C12S2 (#7972)
* [201911]: Adding SKU Mellanox-SN3800-D100C12S2
Co-authored-by: Madhan Babu <madhan@l-csi-0241l.mtl.labs.mlnx>
2021-06-30 09:04:52 -07:00
xumia
e4a4cfed98
Fix vtysh shell-ingestion security issue (#8022)
Why I did it
Fix a vtysh shell-injection security issue.
Only expose a limited set of parameters to the vtysh show command.
2021-06-30 19:34:55 +08:00
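As a rough illustration of the idea behind the vtysh fix above (#8022), the sketch below validates each user-supplied token against an allow-list and builds an argument list instead of a shell string. It is not the code from the PR: the `SAFE_TOKEN` pattern and the `build_show_command` helper are hypothetical, and the only vtysh feature assumed is its `-c` option for running a single command.

```python
import re

# Hypothetical allow-list: a token may contain only these characters.
SAFE_TOKEN = re.compile(r"[A-Za-z0-9_.:/-]+")


def build_show_command(args):
    """Return an argv list for a vtysh 'show' command, or raise ValueError."""
    for arg in args:
        # fullmatch rejects anything with shell metacharacters or whitespace.
        if not SAFE_TOKEN.fullmatch(arg):
            raise ValueError(f"rejected argument: {arg!r}")
    # An argv list is passed to the program directly, so no shell parses it.
    return ["vtysh", "-c", "show " + " ".join(args)]


print(build_show_command(["ip", "bgp", "summary"]))

try:
    build_show_command(["ip", "route; reboot"])
except ValueError as err:
    print(err)
```

The resulting list is meant to be handed to something like `subprocess.run(argv)` without `shell=True`, so that a rejected or unexpected token can never be interpreted as a second command.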
Shilong Liu
77831f6d4b
[CI] Fix azp build rpc image targets. (#7976)
2021-06-25 14:31:09 +08:00
Shilong Liu
3bdd72163f
[CI] Fix azp targets for innovium platform (#7948)
There is no rule to build swi target. The correct target is *.bin.
2021-06-23 18:50:27 -07:00
xumia
2c22ab6735
[ci]: build rpc image for mellanox (#7905) (#7946)
Rename the sync_rpc_image to docker_syncd_rpc_image
Add a new parameter syncd_rpc_image to build sonic-mellanox-rpc.bin
2021-06-23 10:19:42 -07:00
Stepan Blyshchak
78f2b8c0e5
[sonic-utilities] updated submodule (#7938)
[201911][db_migrator] fix old 1911 feature config migration to a new schema (#1655)
Signed-off-by: Stepan Blyschak <stepanb@nvidia.com>
2021-06-22 16:32:15 +03:00
Shilong Liu
84f6c5d3b2
fix
2021-06-17 21:38:03 +08:00
Shilong Liu
28e0a61c02
fix
2021-06-17 21:38:03 +08:00
Shilong Liu
0f3353190d
fix
2021-06-17 21:38:03 +08:00
Shilong Liu
a3ee4a79cc
enable 201911 pr check
2021-06-17 21:38:03 +08:00
Qi Luo
78ffa9a39a
[sonic-swss-common] update submodule (#7888)
Includes below commits:
```
faaa72a 2020-04-07 | Fix test config (#337) [Qi Luo]
```
2021-06-15 23:24:48 -07:00
abdosi
0f56f8b4f4
[201911] Updated to Broadcom SAI debian package to 3.7.5.2-3 (#7887)
Updated the Broadcom SAI Debian package to 3.7.5.2-3
2021-06-15 16:03:23 -07:00
arlakshm
2d6b531b3e
internal loopback ipv6 fix (#7885)
On multi-ASIC platforms, all the ASICs advertise the same IPv6 /64 network from Loopback4096.
As a result, the IPv6 loopback address of a backend ASIC is not learned on the frontend ASIC.
Change this to advertise the Loopback4096 address as a /128.
2021-06-15 15:53:58 -07:00
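The prefix-length reasoning in the loopback fix above (#7885) can be checked with Python's standard `ipaddress` module. The addresses below are hypothetical and use the IPv6 documentation prefix; they are not taken from any SONiC configuration.

```python
import ipaddress

# Hypothetical Loopback4096 addresses on two ASICs (documentation prefix).
frontend = ipaddress.ip_interface("2001:db8::1/64")
backend = ipaddress.ip_interface("2001:db8::2/64")

# With /64, both ASICs advertise the same network, so the routes collide.
same_as_64 = frontend.network == backend.network

# With /128, each ASIC advertises its own host route.
frontend_host = ipaddress.ip_interface(f"{frontend.ip}/128")
backend_host = ipaddress.ip_interface(f"{backend.ip}/128")
same_as_128 = frontend_host.network == backend_host.network

print(same_as_64, same_as_128)  # True False
```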
Neetha John
27fe654388
Add support to parse bandwidth for DeviceMgmtLinks (#7883)
Add support to parse bandwidth for DeviceMgmtLinks
2021-06-15 15:02:16 -07:00
Neetha John
86c2f4248c
Update PG profile settings for Arista-7050QX-32S-S4Q31 (#7674)
PG profile settings need to be aligned with Arista-7050-QX-32S
2021-06-15 15:01:40 -07:00
abdosi
86a11e5a1e
Updated 7260 MMU Profile based on latest MSFT Tier 1 (#7882)
Updated 7260 MMU Profile based on latest MSFT Tier 1 Tomahawk2_MMU_Setting_48x100G_40m_16x100G_300m_v1.0 and
TH2_PGHdrm_MSFT

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-06-15 13:08:47 -07:00
Renuka Manavalan
3ea38a9788
Add service to restore TACACS from old config (#7560) (#7865)
In upgrade scenarios where config_db.json is not carried forward to the new image, the device could be left without TACACS credentials.
Added a service that triggers 5 minutes after boot and restores TACACS if /etc/sonic/old_config/tacacs.json is present.

How I did it
By adding a service that fires 5 minutes after boot.
This service applies the TACACS configuration if it is available.

How to verify it
Upgrade and watch status of tacacs.timer & tacacs.service
You may create /etc/sonic/old_config/tacacs.json with updated credentials
(within 5 minutes after boot) and check that they appear in the config and are persisted too.
2021-06-15 10:52:31 -07:00
Ying Xie
0e82381335
[7050] updating 7050 MMU configurations (#7801)
Why I did it
The 7050 S4Q31 MMU configuration is missing ALPM configurations, so not enough memory is reserved for routes. Orchagent crashes on a nightly testbed with 6400 route entries.

How I did it
Add the missing ALPM configurations.

How to verify it
Loaded the configuration on a testbed and verified that the new configuration exists and the crash no longer occurs.

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2021-06-14 13:56:01 -07:00
Abhishek Dosi
9d35ea5236
[submodule update] sonic-swss
33f370f890c525512668583a7f4e231415b28738 (HEAD -> 201911, origin/201911)
    [vnet/vxlan] Add support of multiple mappers for the VxLAN tunnel
    (#1753)
Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-06-14 13:55:33 -07:00
Abhishek Dosi
522c0a170b
[submodule update] sonic-utilities
da3bbef916cd31267640dd4dc5823d82ee637b1e (HEAD -> 201911)
    [neighbor-advertiser] delete the tunnel maps appropriately (#1663)
    0fa589372ed391dd91f733d8a92b4f604118e8f7 (origin/201911)
    [201911][db_migrator][Mellanox] Update Mellanox buffer migrator with
    2km-cable supported (#1575)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-06-14 13:55:33 -07:00
zzhiyuan
45a8208501
Add panic_on_unrecovered_nmi to kernel settings (#7837)
Watchdog does not trigger a kernel panic on Arista 7170 platform because this parameter is missing on 201911 but not on master.
2021-06-14 13:52:51 -07:00
Dror Prital
e2eb4e49ab
[Mellanox][201911] Update FW version to 2008_3110 (#7806)
- Why I did it
Update FW version to 2008_3110, fixing an SN3800-specific warm boot scenario:

1. Disable interface
2. Warm Boot
3. Enable Interface --> link will remain down.

- How I did it
Use new FW that contains the fix for the problem mentioned above

- How to verify it
Run the scenario mentioned above and make sure that the link is up after warm boot

Signed-off-by: Dror Prital <drorp@nvidia.com>
2021-06-08 14:06:43 +03:00
Abhishek Dosi
261eac28a5
[submodule update] sonic-utilities
Revert "[201911][db_migrator] fix old 1911 feature config migration to
  a new … (#1637)"

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-06-07 09:10:32 -07:00
Volodymyr Samotiy
1523e9adf3
[sonic-utilities] Update submodule pointer (#7790)
* 42354e6 [201911][db_migrator] fix old 1911 feature config migration to a new … (#1637)

Signed-off-by: Volodymyr Samotiy <volodymyrs@nvidia.com>
2021-06-06 17:21:15 +03:00
Kebo Liu
7fe493aca1
[201911][Mellanox] Align PSU name convention returned from psu.get_name platform API (#7793)
Make PSU name returned from platform API aligned with the convention "PSU {X}" instead of "PSU{X}".

This PR is to backport https://github.com/Azure/sonic-buildimage/pull/7783
2021-06-04 10:38:16 -07:00
Volodymyr Samotiy
8405d2deef
[Mellanox] Update SDK to 4.4.3106 and FW to xx.2008.3106 (#7787)
Signed-off-by: Volodymyr Samotiy <volodymyrs@nvidia.com>
2021-06-03 10:02:23 -07:00
Kuanyu Chen
c4f8cf9371
[config-setup]: Fix a bug in checking if updategraph is enabled (#7093)
An error is encountered during "config-setup boot" if updategraph is enabled.

How I did it
Correct the code inside the config-setup script.
Remove the spaces around the assignment operator.

How to verify it
Remove /etc/sonic/config_db.json and reboot the device.
Before the fix, the following error is returned after boot:
rv: command not found
After the fix, the script correctly parses the status of updategraph without error.
2021-05-31 08:11:08 -07:00
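The "rv: command not found" symptom in the config-setup fix above (#7093) is what a POSIX shell does when an assignment is written with spaces around `=`. The sketch below reproduces that behavior by driving the system `sh` from Python. The `run_sh` helper and the value `1` are assumptions for illustration; only the variable name `rv` comes from the error message in the commit.

```python
import subprocess


def run_sh(script):
    """Run a snippet with the system's sh; return (exit code, stdout, stderr)."""
    result = subprocess.run(["sh", "-c", script], capture_output=True, text=True)
    return result.returncode, result.stdout.strip(), result.stderr.strip()


# Spaces around '=': the shell treats 'rv' as a command name to execute.
bad = run_sh("rv = 1")

# No spaces: a plain variable assignment, which can then be read back.
good = run_sh("rv=1; echo $rv")

print(bad[0], bad[2])
print(good[0], good[1])
```

An exit status of 127 is the shell's conventional "command not found" code, which matches the failure described in the commit.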
Joe LeVeque
b6acac4e6a
[brcm] Fix and simplify start_led.sh (#7548)
LED_PROC_INIT_SOC variable was incorrectly referenced as LED_SOC_INIT_SOC. Introduced in #5483

Rather than fixing the typo, I decided to simplify the script, removing the need for the conditional altogether by moving the bcmcmd call inside the conditional which checks for the presence of LED_SOC_INIT_SOC.
2021-05-31 08:09:19 -07:00
Neetha John
27985784eb
Rename AristaQX-32S skus (#7751)
This PR contains the following changes:
- Original Arista-7050-QX-32S sku (32x40G ports) has been renamed to Arista-7050QX32S-Q32
- Arista-7050-QX-32S is symlinked to Arista-7050QX-32S-S4Q31 (4x10G, 31x40G ports)

Signed-off-by: Neetha John <nejo@microsoft.com>
2021-05-31 08:02:06 -07:00
Neetha John
df61f462d5
Update MMU and QOS settings for Arista-7050QX-32S-S4Q31 (#7672)
Signed-off-by: Neetha John <nejo@microsoft.com>

Why I did it
Need proper MMU and QoS settings for Arista-7050QX-32S-S4Q31

How I did it
Updated the settings based on Arista-7050-QX-32S
2021-05-31 08:01:26 -07:00
Abhishek Dosi
35a9e93f72
[submodule update] sonic-swss
09457e524797bd988e011e24250a5e5c942bfdf2 (HEAD -> 201911, origin/201911) [tunneldecaporch] Set default MTU for the overlay loopback interface (#1756)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-05-31 07:51:34 -07:00
Ying Xie
897b4d2b32
[Arista] add MMU configuration for Arista 7260 C64 (#7027)
Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2021-05-31 07:46:38 -07:00
Abhishek Dosi
24e9c334d1
[submodule update] sonic-utilities
5f62262e5f9b5a18d94764b7f0883b34f094a0c3 (HEAD -> 201911, origin/201911) Fix unicode issue in ipaddress for python2 (#1627)

Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>
2021-05-31 07:44:05 -07:00
Stepan Blyshchak
e1f122cff8
[sonic-swss-common] update submodule (#7765)
Changes:

```
36bd637 Lower the log level for outdated key for Subscribe
```

Signed-off-by: Stepan Blyschak <stepanb@nvidia.com>
2021-05-31 07:17:24 -07:00
Stephen Sun
d44936fa59
[Mellanox][201911] Support buffer configuration for 2km cable (#7338)
#### Why I did it
1. Update pg_profile_lookup.ini with 2000m cable supported
2. Update buffer configuration for t1 with uplink cable 2000m
  - For SN3800 platform:
    - C64:
      - t0: 32 100G down links and 32 100G up links.
      - t1: 56 100G down links and 8 100G up links with 2 km cable.
    - D112C8: 112 50G down links and 8 100G up links.
    - D24C52: 24 50G down links, 20 100G down links, and 32 100G up links.
    - D28C50: 28 50G down links, 18 100G down links, and 32 100G up links.
  - For SN2700 platform:
    - D48C8: 48 50G down links and 8 100G up links
    - C32:
      - t0: 16 100G down links and 16 100G up links.
      - t1: 24 100G down links and 8 100G up links with 2 km cable.

Signed-off-by: Stephen Sun <stephens@nvidia.com>

#### How I did it

#### How to verify it
Run QoS regression test
2021-05-30 20:03:16 -07:00