Commit Graph

3150 Commits

Author SHA1 Message Date
Wenda Ni
abc02a702e [sairedis]: Advance sairedis reference pointer (#3530)
Signed-off-by: Wenda Ni <wenni@microsoft.com>
2019-10-01 09:52:03 -07:00
Stephen Sun
f54792d8a7 [mellanox]Integrate sai-1.15, sdk 4.3.2104 and firmware 2162 to sonic. (#3538) 2019-10-01 09:54:27 +03:00
pavel-shirshov
fb666d23f1
[FRR]: Use stg instead of patch (#3480)
* Use stg instead of patch
2019-09-29 10:57:08 -07:00
pavel-shirshov
ff7ef851d5
Regular quit needs to clean the ports two (#3531) 2019-09-27 14:49:44 -07:00
Wirut Getbamrung
ecdd866713 [device/celestica]: Update Component APIs (#3510)
* [platform/cel]: add bios upgrade tool
* [device/celestica]: update Seastone/E1031 component api to support BIOS upgrade
* [device/celestica]: add error handler for eeprom api
* [device/celestica]: add component description
2019-09-27 12:44:16 -07:00
wangshengjun
59febed528 [docker-fpm-frr]:filter out the file of 'Dockerfile.j2' from the 'docker-fpm-frr' image. (#3507)
Signed-off-by: wangshengjun <wangshengjun@asterfusion.com>
2019-09-27 12:43:44 -07:00
hans-tseng
500c20f3cd [devics]: update the installer.conf and config.bcm for delta devices (#3482)
1. add the console port in install.conf 
2. update the ag9032v2a configuration file
3. add the default_sku in ag9032v2a

Signed-off-by: hans-tseng <hans.tseng@deltaww.com>
2019-09-27 10:26:09 +02:00
Andriy Kokhan
d3d04dcc58 [bfn] Add missing port 65 for Mavericks board (#3461)
Signed-off-by: Andriy Kokhan <akokhan@barefootnetworks.com>
2019-09-27 10:23:07 +02:00
Aravind Mani
00ab25d4e3 [devices]: DellEMC-Z9264f: 10G_Port_addition (#3475)
Added 10G ports for all the HWSKU(100G/40G/50G)
2019-09-27 10:20:07 +02:00
Stephen Sun
7308d2eb97 [Mellanox] Stop pmon ahead of syncd (#3505)
Issue Overview
shutdown flow

For any shutdown flow, which means all dockers are stopped in order, pmon docker stops after syncd docker has stopped, causing pmon docker fail to release sx_core resources and leaving sx_core in a bad state. The related logs are like the following:

INFO syncd.sh[23597]: modprobe: FATAL: Module sx_core is in use.
INFO syncd.sh[23597]: Unloading sx_core[FAILED]
INFO syncd.sh[23597]: rmmod: ERROR: Module sx_core is in use
config reload & service swss.restart
In the flows like "config reload" and "service swss restart", the failure cause further consequences:

sx_core initialization error with error message like "sx_core: create EMAD sdq 0 failed. err: -16"
syncd fails to execute the create switch api with error message "syncd_main: Runtime error: :- processEvent: failed to execute api: create, key: SAI_OBJECT_TYPE_SWITCH:oid:0x21000000000000, status: SAI_STATUS_FAILURE"
swss fails to call SAI API "SAI_SWITCH_ATTR_INIT_SWITCH", which causes orchagent to restart. This will introduce an extra 1 or 2 minutes for the system to be available, failing related test cases.
reboot, warm-reboot & fast-reboot
In the reboot flows including "reboot", "fast-reboot" and "warm-reboot" this failure doesn't have further negative effects since the system has already rebooted. In addition, "warm-reboot" requires the system to be shutdown as soon as possible to meet the GR time restriction of both BGP and LACP. "fast-reboot" also requires to meet the GR time restriction of BGP which is longer than LACP. In this sense, any unnecessary steps should be avoided. It's better to keep those flows untouched.

summary
To summarize, we have to come up with a way to ensure:

shutdown pmon docker ahead of syncd for "config reload" or "service swss restart" flow;
don't shutdown pmon docker ahead of syncd for "fast-reboot" or "warm-reboot" flow in order to save time.
for "reboot" flow, either order is acceptable.
Solution
To solve the issue, pmon shoud be stopped ahead of syncd stopped for all flows except for the warm-reboot.

- How I did it

To stop pmon ahead of syncd stopped. This is done in /usr/local/bin/syncd.sh::stop() and for all shutdown sequence.
Now pmon stops ahead of syncd so there must be a way in which pmon can start after syncd started. Another point that should be taken consideration is that pmon starting should be deferred so that services which have the logic of graceful restart in fast-reboot and warm-reboot have sufficient CPU cycles to meet their deadline.
This is done by add "syncd.service" as "After" to pmon.service and startin /usr/local/bin/syncd.sh::wait()
To start pmon automatically after syncd started.
2019-09-27 10:15:46 +02:00
Qi Luo
8c4678718e
[snmp] snmpd ipv6 supports multiple community strings (#3523) 2019-09-26 18:23:12 -07:00
pavel-shirshov
504cf62bfc [libteam]: Use last fixes from libteam master repo (#3513) 2019-09-26 16:39:48 -07:00
Stephen Sun
e9785d507d [sonic-utilities] advance head to cbf19f6 (#3519) 2019-09-26 15:25:35 -07:00
Joe LeVeque
a8f10c7b64
[sonic-slave] Update linux-compiler-gcc package version to fix build (#3514) 2019-09-26 10:57:16 -07:00
pavel-shirshov
a577a80b05 Flush port in both worm-reboot and fast-reboot mode on exit. Otherwise teamd will stuck (#3512) 2019-09-25 16:08:44 -07:00
Wenda Ni
7b0a5ba6ae Remove the divide by 4 operation to the under the hood SAI (#1532)
* Remove the divide by 4 operation to the under the hood SAI

This is to avoid the need and thus the confusion for application program to know
the mmu internal architecture

This change must have support from SAI change to reach the correct
config

Signed-off-by: Wenda <wenni@microsoft.com>

* Relegate the divide by 4 operation to the under the hood SAI for egress
lossless pool

Extend to 7060 and 6100

Signed-off-by: Wenda <wenni@microsoft.com>

* Add more TH/TH2 hwskus

Signed-off-by: Wenda Ni <wenni@microsoft.com>

* Update config test

Signed-off-by: Wenda Ni <wenni@microsoft.com>

* Add TH2 ingress lossy profile

Signed-off-by: Wenda Ni <wenni@microsoft.com>

* Move the divide by 4 operation to SAI internal

Signed-off-by: Wenda Ni <wenni@microsoft.com>

* [bcm SAI] Upgrade Broadcom SAI to version 3.5.3.1-15

- Broadcom SAI 3.5 GA release 20190924.

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2019-09-25 15:57:07 -07:00
Kebo Liu
d059bda21c Update sonic-platform-daemons submodule to pick up recent fix: (#3509)
[xcvrd] state machine enhancement
[xcvrd] Fix transceiver tuning issue
2019-09-25 12:07:28 -07:00
Stephen Sun
362a6855ec [Mellanox] enhance the initialization flow of sfp part of new platform api (#3319)
* [sonic_platform.sfp_event]enhance the initialization flow of sfp_event

* [sonic_platform.sfp_event] replace "retry = retry + 1" with "retry += 1"

* [sonic_platform] fix typo in sfp_event

* [sfp_event] remove unused variables

* [sonic_platform/sfp_event.py]remove unnecessary statements
2019-09-25 11:41:07 -07:00
Stephen Sun
c34a4783e0 [build] install new platform api on host (#3282)
slave.mk: add SONIC_PLATFORM_API_PY2 as dependency of host
sonic_debian_extension.j2: install sonic_daemon_base and Mellanox-specific sonic_platform on host
mlnx-platform-api.mk: export mlnx_platform_api_py2_wheel_path for sonic_debian_extension.j2
sonic-daemon-base.mk: export daemon_base_py2_wheel_path for sonic_debian_extension.j2
daemon_base.py: hind unnecessary dependency of swss_common on host
2019-09-25 11:00:24 -07:00
Mykola F
dc0868e24f [submodule update] update sonic-snmpagent pointer (#3495)
[sonic-snmpagent]
 - Fix issue #104: lldpLocManAddrTable supports multiple IP addresses (#106)
 - Add mgmt port speed test (#107)
 - [rfc2737] fix typo in xcvr removal flow (#109)
 - Add signal handler to change debug level. (#96) 
 - [LLDP MIB] fix lldp loc mgmt ip logic (#113)
2019-09-25 09:29:44 -07:00
Long Ou
b6a09999de [hostcfgd] hostcfgd will exit when set hostname in DEVICE_METADATA (#3394)
Signed-off-by: ouxiaolong <ouxiaolong@asterfusion.com>
2019-09-24 17:36:02 -07:00
John Cheung
905d429022 [barefoot]: updated SDK to 9.0.0 released debian package (#3471) 2019-09-24 16:59:25 -07:00
Sudharsan D.G
0e74e6d845 Fixing orchagent crash in S6100,Z9100 (#3439) 2019-09-24 15:16:21 -07:00
Qi Luo
75907f10aa
[snmp] Build snmpd with ipv6 support and enable it (#3500) 2019-09-24 15:13:18 -07:00
kannankvs
bccf18620a [snmpd] mvrf snmp ipv6 prototype mismatch issue resolved (#3504) 2019-09-24 12:04:17 -07:00
byu343
2f63e90b8b [device/arista] Add hwSku Arista-7280CR3-C40 for 7280 (#3477)
* Add hwSku Arista-7280CR3-C40

* Clean empty lines

* Fix violation to config_checker

* Fix no new line at end
2019-09-23 18:07:20 -07:00
brandonchuang
a97b15e0ec [device/as5835-54t] Implement get_transceiver_change_event() in sfputil.py (#3485)
Signed-off-by: brandon_chuang <brandon_chuang@edge-core.com>
2019-09-23 14:27:18 -07:00
habeebmohammed
622f3f5d1a [broadcom]: add led_fw_path to permitted_list (#3498)
* add led_fw_dir to permitted_list
2019-09-22 04:03:24 -07:00
pavel-shirshov
c60278de58
[libteam]: Add Fast-Reboot mode for teamd (#3490)
* [libteam]: add special Fast-Reboot teamd stop mode

* Fix last packet sending

* Update sonic-utilities module
2019-09-20 15:33:40 -07:00
Dong Zhang
dd4a50dd61 [database-docker]: update multiBD config file (#3487)
adding 'hostname' field and rename 'socket'
2019-09-20 09:34:32 -07:00
ciju-juniper
766b15d11d [Juniper][QFX5210] Workaround for orchagent crash (#3458)
Orchagent is crashing on the latest SONiC images. Issue #458
is raised to track the problem. Dynamic port breakout
commit 6f40933d3d introduced
this regression. One of the recommendation that we received
was to disable the loopback and mgmt ports in bcm config.
It was helpful in fixing the issue but not the ideal solution.
It works on qfx5210 as the loopback and management configurations
are not applicable.

While the orchagent crash is being debugged, we need this commit
to fix the issue in qfx5210 platform. This patch can be reversed
once the correct fix for the orchagent is identified.

Signed-off-by: Ciju Rajan K <crajank@juniper.net>
2019-09-20 09:30:01 -07:00
bbinxie
d1fe216d05 Add new device data for dx010 (Celestica-DX010-C32,Celestica-DX010-D48C8) (#3492)
* add config.bcm for hlx

* modify config.bcm path for hlx

* Delete hx4-cel-hbtn-48x1G+4x10G.config.bcm

* add config.bcm and path

* update led for cxp

* Add new device data for dx010
2019-09-20 07:26:40 -07:00
Harish Venkatraman
9d2d617264 [SNMP] management VRF SNMP support (#2608)
* [SNMP] management VRF SNMP support

This commit adds SNMP support for Management VRF using l3mdev.
The patch included provides VRF support, there is no single
"listendevice" configuration, rather multiple agentaddress
config options can each have their own "interface" to bind to
using "ip%interface". The snmpd.conf file is accordingly
generated using the snmp.yml file and redis database info.

Adding below the comments of SNMP patch 1376
--------------------------------------------
Since the Linux kernel added support for Virtual Routing
and Forwarding (VRF) in version 4.3
(Note: these won't compile on non-linux platforms)

https://www.kernel.org/doc/Documentation/networking/vrf.txt

Linux users could not use snmpd in its current form to
bind specific listening IP addresses to specific VRF
devices. A simplified description of a VRF inteface
is an interface that is a master (a container of sorts)
that collects a set of physicalinterfaces to form a
routing table.

This set of two patches (one for V5-7-patches and one
for V5-8-patches branches) is almost identical to patch
single "listendevice" configuration. Rather, multiple
agentAddress config options can each have their own
"interface" to bind to using the <ip>%<interface>
syntax.</interface></ip>
-------------------------------------------

Signed-off-by: Harish Venkatraman <harish_venkatraman@dell.com>
2019-09-18 17:26:45 -07:00
Arun Saravanan Balachandran
5c9348b093 DellEMC Z9100 : Platform2.0 API implementation [PSU, Thermal] (#3361)
* DellEMC Z9100 : Platform2.0 API implementation [PSU, Thermal]

* DellEMC Z9100 : Platform2.0 API [ Thermal ]

* DellEMC Z9100 : Platform2.0 API [PSU, Thermal]
2019-09-18 15:09:43 -07:00
Arun Saravanan Balachandran
6ca0d7b197 DellEMC S6000 : Platform2.0 API implementation [PSU, Thermal] (#3357)
* DellEMC S6000 : Platform2.0 API implementation [PSU, Thermal]

* Fix invalid number of arguments error

* DellEMC S6000 : Platform2.0 [ Thermal ]

* DellEMC S6000 : Platform2.0 API [PSU, Thermal]

* Updated merge changes
2019-09-18 15:06:42 -07:00
Nazarii Hnydyn
19ffb027d2 [submodule update]: Advance sonic-utilities submodule. (#3466)
Commits on Sep 10, 2019
[config] Add commands to remove BGP neighbor configuration (#625) da5cc8c

Commits on Sep 12, 2019
[config] Reset failed status of all SONiC services, whether or not th… b943d67

Commits on Sep 13, 2019
[warm-reboot]: Do not clean up mirror session state database (#639) cbcdc82

Commits on Sep 17, 2019
Add SSD Health CLI utility (#587) 065414a
Fixed config Asym PFC CLI. (#632) c1c53f5

Signed-off-by: Nazarii Hnydyn <nazariig@mellanox.com>
2019-09-18 11:44:16 -07:00
Nazarii Hnydyn
bbba3c7c57 [submodule update]: Advance sonic-platform-common submodule. (#3468)
Commits on Aug 15, 2019
[sonic_sfp] Fix global name about sfp module is not defined (#53) 04014b6

Commits on Aug 20, 2019
[xcvrd] Fix crashing due to missing fields of some module types (#54) c011c73
Remove duplicated sonic_eeprom and create symlink (#55) 087593e

Commits on Sep 10, 2019
[sfp_base] Raise exceptions (previously was incorrectly returning the… c21507e
Added new API to interact with the platform components (#60) 33b037d

Signed-off-by: Nazarii Hnydyn <nazariig@mellanox.com>
2019-09-18 11:43:11 -07:00
yangshiping
503d87b901 [lldp]: expose lldpcli command of lldp docker to host (#3473)
Lldp can not be configured in host environment, so we expose lldpcli command of lldp docker to host for configuring more convenient.
2019-09-18 11:39:31 -07:00
Prince Sunny
8ca1eb289e
Install Iptables rules to set TCPMSS for 'lo' interface (#3452)
* Install Iptables rules to set TCPMSS for lo interface
* Moved implementation to hostcfgd to maintain at one place
2019-09-18 10:12:28 -07:00
sridhar-ravindran
3c0b56a709 [DELL] S6100 Support PowerCycle in Last Reboot Reason (#3403)
* [DELL] S6100 Support PowerCycle in Last Reboot Reason

* handle first time boot properly

* S6000 Last Reboot Reason Fix
2019-09-17 16:51:46 -07:00
Mykola F
124b26d72f [Mellanox] platform_reboot - sync & umount fs before power cycle (#3430)
Signed-off-by: Mykola Faryma <mykolaf@mellanox.com>
2019-09-17 09:38:30 -07:00
Sujin Kang
a62ab7890e
Merge pull request #3381 from ArunSaravananBalachandran/platform2.0
DellEMC : Platform2.0 API Implementation for Chassis [S6000, S6100, Z9100], Fan [S6000]
2019-09-16 12:25:01 -07:00
Harish Venkatraman
31d1a76197 [baseimage]: Management vrf ntp support (#3204)
This commit adds NTP support for management VRF using L3mdev. Config vrf add
mgmt will enable management VRF, enslave the eth0 device to the master device
mgmt, stop ntp service in default, restart interfaces-configs and restart ntp
service in mgmt-vrf context. Requirement and design are covered in mgmt vrf
design document.

Signed-off-by: Harish Venkatraman <harish_venkatraman@dell.com>
2019-09-16 10:21:06 -07:00
Arun Saravanan Balachandran
1cbc99d99e
Merge branch 'master' into platform2.0 2019-09-16 10:33:13 +05:30
padmanarayana
75104bb35d [sflow]: Build infrastructure changes to support sflow docker and utilities (#3251)
Introduce a new "sflow" container (if ENABLE_SFLOW is set). The new docker will include:
hsflowd : host-sflow based daemon is the sFlow agent
psample : Built from libpsample repository. Useful in debugging sampled packets/groups.
sflowtool : Locally dump sflow samples (e.g. with a in-unit collector)

In case of SONiC-VS, enable psample & act_sample kernel modules.

VS' syncd needs iproute2=4.20.0-2~bpo9+1 & libcap2-bin=1:2.25-1 to support tc-sample

tc-syncd is provided as a convenience tool for debugging (e.g. tc-syncd filter show ...)
2019-09-14 20:27:09 -07:00
Aravind Mani
ac1d2c1023 [devices]: DellEMC Z9264f_50gProfile: Added support for 50G T0 profile (#3418)
* Z9264f_50gProfile: Added support for 50g profile
2019-09-14 20:23:48 -07:00
Greg Paussa
a2cd26a2e1 [build] Avoid race condition in sonic-platform-common (#3457)
Both python2 and python3 wheels being built out of the same
source directory can interfere with each other when
SONIC_BUILD_JOBS > 1.

Signed-off-by: Greg Paussa <greg.paussa@broadcom.com>
2019-09-13 22:09:34 -07:00
Renuka Manavalan
72e879ebcb [snmpd]: Fix possible snmpd crash when sub agen timeout. (#3455)
Upon snmpd closes a netsnmp_agent_session due to snmp_timeout
there is a possibility of crash due to stale memory access.

This is a patch from source-forge:net-snmp. commit-id #793d59
2019-09-13 18:29:05 -07:00
Sujin Kang
faa754fe3c
Merge pull request #3363 from ArunSaravananBalachandran/platform2.0_s6100
DellEMC S6100 : Platform2.0 API implementation [Module, Thermal]
2019-09-13 17:18:20 -07:00
Wenda Ni
81aef6b64c [Qos] use dot1p to tc mapping for backend switches (#3422)
* Use dot1p to tc mapping for backend switches

Signed-off-by: Wenda Ni <wenni@microsoft.com>

* Do not write DSCP to TC mapping into CONFIG_DB or config_db.json for
storage switches

Signed-off-by: Wenda Ni <wenni@microsoft.com>
2019-09-13 11:28:25 -07:00