Commit Graph

5099 Commits

Author SHA1 Message Date
Renuka Manavalan
76bf5a0bc4 [build]: Added debug symbols to many debug dockers. (#3098)
* Added debug symbols to many debug dockers.

* For debug images *only*:
1) Archive source files into debug image
2) Archived source is copied into /src
3) Created an empty dir /debug
4) Mount both /src as ro & /debug as rw into every docker
5) Login banner will give some details on /src & /debug
6) Devs can copy core file into /debug and view it from inside a container.
7) Dev may create all gdb logs and other data directly into /debug.

* Dropped redundant REDIS_TOOLS per review comments.

* Added debug symbols to frr package and hence FRR based BGP docker.

* 1) Moved dbg_files.sh to scripts/
2) Src directories to archive are now collected from individual Makefiles.
3) Added few more debug symbols
4) Added few more debug dockers.

Here after no more changes except per review comments.

To debug:
Install required version of debug image in Switch or VM.
Copy core file into /debug of host
Get into Docker
gdb /usr/bin/<daemon> -c /debug/<your core file>
set directory /src/... <-- inside gdb to get the source

For non-in-depth debugging:

Download corresponding debug Docker image (docker-...-dbg.gz) to your VM
Load the image
Run image with entrypoint as 'bash' with dir containing core mapped in.
Run gdb on the core.
2019-07-03 22:13:55 -07:00
Joe LeVeque
f14354f003
[monit] Restart rsyslog service if rsyslogd consumes > 800 MB memory (#3117) 2019-07-03 18:21:05 -07:00
Joe LeVeque
e5a2beb13b [reboot-cause]: Move reboot cause processing to its own service, 'process-reboot-cause' (#3102) 2019-07-03 10:38:20 -07:00
Stepan Blyshchak
211c8cd536 [syncd-rpc] fix syncd rpc build and add cleanup (#3113)
Signed-off-by: Stepan Blyschak <stepanb@mellanox.com>
2019-07-03 10:37:54 -07:00
sridhar-ravindran
56608bf06b [devices]: DELL Platform 2.0 API Infra and Reboot Reason support in Z9100 & S6100 (#3063) 2019-07-03 06:52:35 -07:00
brandonchuang
206ca1b092 [devices]: Add lpmode in sfputil.py for Accton AS7716-32X (#3039)
Signed-off-by: brandon_chuang <brandon_chuang@edge-core.com>
2019-07-03 06:49:38 -07:00
Steven LU
810bc0d2ed [devices]: Fix Accton as7326 port breakout (#3103) 2019-07-03 06:48:42 -07:00
Dante (Kuo-Jung) Su
ce5279196b sonic-device-data: update SAI config checker for Broadcom TD3 and TH3 (#3107)
* sonic-device-data: update SAI config checker for Broadcom TD3 and TH3

The following properties have been approved by the Broadcom chip arch team:

l3_alpm_ipv6_128b_bkt_rsvd
ifp_inports_support_enable
pll_bypass
dpr_clock_frequency
device_clock_frequency
port_flex_enable
mmu_port_num_mc_queue
serdes_core_rx_polarity_flip_physical{<PORT>}
serdes_core_tx_polarity_flip_physical{<PORT>}

Signed-off-by: Dante (Kuo-Jung) Su <dante.su@broadcom.com>
Change-Id: I1c6239cddfb0582a9298e671d792a32f79e4f006
2019-07-03 06:47:58 -07:00
brandonchuang
d0981c4d58 [devices]: Add lpmode in sfputil.py for Accton AS7312-54X (#3006)
Signed-off-by: brandon_chuang <brandon_chuang@edge-core.com>
2019-07-03 06:46:14 -07:00
Ashok Daparthi-Dell
469b2bbd8e [devices]: DellEMC S5232f > /64 prefix route support bug fix. (#3111)
* Added > /64 ip6 route support for 1024 entries.

Signed-off-by: Ashok Daparthi <Ashok_Daparthi@Dell.com>
2019-07-03 06:45:28 -07:00
Stephen Sun
86495a15a2 [Mellanox] Support new platform api sfp part (#3101)
Implement new platform api sfp part, including the following APIs;
- get_reset_status
- get_tx_disable_channel
- get_lpmode
- get_power_override
- reset
- set_lpmode
- tx_disable
- tx_disable_channel
- set_power_override
2019-07-02 14:50:20 -07:00
Kebo Liu
dc7854c327 [pmon]: update sonic-platform-daemons sub module pointer (#3106) 2019-07-02 13:09:57 -07:00
Michel Moriniaux
dc747247d1 [ARISTA] adding 7060_cs32s to eMMC exclusions (#2982)
* [ARISTA] adding 7060_cs32s to eMMC exclusions

Following PR 2774 we added the 7060-cx32s according to the guidelines of
PR 2780

This adds the 7060-cx32s to the list f devices that mount /var/log as a
tmpfs to mitigate eMMC wearout

Signed-off-by: Michel Moriniaux <m.moriniaux@criteo.com>

* [ARISTA] adding 7060_cs32s to eMMC exclusions

Following PR 2774 we added the 7060-cx32s according to the guidelines of
PR 2780

This adds the 7060-cx32s to the list f devices that mount /var/log as a
tmpfs to mitigate eMMC wearout

Signed-off-by: Michel Moriniaux <m.moriniaux@criteo.com>
2019-07-02 11:52:43 -07:00
brandonchuang
2ec473cdd1 [devices]: Add lpmode in sfputil.py for Accton AS7312-54XS (#3007)
Signed-off-by: brandon_chuang <brandon_chuang@edge-core.com>
2019-07-02 11:39:59 -07:00
txj36
22c0f4d877 [devices]: fix SFP initialization in the Chassis for mlnx-platform-api (#3012) 2019-07-02 11:39:24 -07:00
Wirut Getbamrung
1616f482e7 [device/celestica]: Add firmware management api based on the new platform API (#3013) 2019-07-02 11:05:18 -07:00
paavaanan
4f737f8b65 [devices]: DellEMC S6100 Watchdog support (#2835) 2019-06-28 23:47:36 -07:00
jostar-yang
9b0d1b74f9 [devices]: Add support as9716 platform (#2993) 2019-06-28 21:53:06 -07:00
brandonchuang
8b225d3ca1 [devices]: Add lpmode in sfputil.py for Accton AS7512 (#3005)
Signed-off-by: brandon_chuang <brandon_chuang@edge-core.com>
2019-06-28 21:50:30 -07:00
sridhar-ravindran
fe45a0d1ea [devices]: DELLEMC: Z9100,S6100 Exporting Device Last PowerOn Reason (#3059) 2019-06-28 21:49:12 -07:00
srideepDell
0695a6449f [devices]: Add a new supported device DellEMC s5232f (#3090)
* Switch Vendor: DellEMC
    * Switch SKU: s5232F
    * ASIC Vendor: Broadcom
    * Swich ASIC: Trident3
    * Port Configuration: 32x100G
    * SONiC Image: sonic-broadcom.bin

    * LED support for s5232f
    * Changes Include ipmitool implementation for platform_sensors script is inclued in pmon startup
    * Added 100G,25G,10G configruation ( 100G is default).
2019-06-28 21:40:50 -07:00
Samuel Angebault
a80baaf8d1 [device/Arista] Update drivers submodules (#3092)
- Add reboot cause support for the platform API
 - Add LED support for the platform API xcvrs and fans
 - Misc fixes
2019-06-28 21:37:36 -07:00
brandonchuang
5ec23dfd09 [devices]: Add lpmode in sfputil.py for Accton AS5712-54X (#3094)
Signed-off-by: brandon_chuang <brandon_chuang@edge-core.com>
2019-06-28 21:35:58 -07:00
brandonchuang
0b8c1a10e8 [devices]: Add lpmode in sfputil.py for Accton AS6712-32X (#3095)
Signed-off-by: brandon_chuang <brandon_chuang@edge-core.com>
2019-06-28 21:35:27 -07:00
Karthik Gengan
8c3fdfd079 DellEmc(Z9264f): Bug fix in show platform psustatus cli (#3033) 2019-06-28 16:45:21 -07:00
Wenda Ni
dbdc166f6f
Add new hwsku Arista-7050QX32S-Q32 (#3091)
Signed-off-by: Wenda Ni <wenni@microsoft.com>
2019-06-27 17:22:44 -07:00
Stepan Blyshchak
6961816dec fix fast reboot compatibility (#3083)
* fix fast reboot compatibility

We should handle both cases for backward-compatible with 201803:
 - fast-reboot
 - SONIC_BOOT_TYPE=fast-reboot

* handle review comments
* add a comment that getBootType code snippet is shared between two files
2019-06-26 12:46:58 -07:00
Karthik Gengan
b92d980769 Remapping S6000 platform files under Vendor (DellEmc) specific directory (#3079) 2019-06-25 12:23:24 -07:00
Wirut Getbamrung
94b3c362f8 [device/celestica] Implement PSU APIs based on the new platform API (#2874) 2019-06-25 12:22:13 -07:00
Kebo Liu
331866dbe3 [sonic-cfggen] Add Mellanox platform specific code to read base mac from machine.conf (#2991)
* add code to read base mac from machine.conf

* rewording the comments

* add mac validation with re

* fix review comments

* remove empty line
2019-06-25 08:56:35 +03:00
Jipan Yang
9a1bebe496 [telemetry]: change the service dependency from swss to database (#3072)
Signed-off-by: Jipan Yang <jipan.yang@alibaba-inc.com>
2019-06-24 12:36:16 -07:00
simonJi2018
3f54b58a67 [nephos]: update to support linux-kernel-4.9.0-9-2 (#3058) 2019-06-24 08:14:27 -07:00
Sudharsan D.G
2e40fa5dbe [devices]: Poller to detect Intel Rangely LPC failure for dell z9100/s6100 (#3065)
- What I did
Added Daemon to Log LPC bus degradation in Intel C2000 processor. Intel Rangeley C2000 processors with revision less than or equal to 2 have issue where LPC bus degrades over time in some processors. To identify the problem and to notify the issue, a daemon has been added which will log on encountering the issue.

- How I did it
Added a daemon which validates the CPLD scratch(0x102) and SMF scratch(0x202) registers by writing and reading values on regular polling intervals (300 seconds). If there is a discrepancy between read and write, a critical log will be thrown.

- How to verify it
The infra is verify by simulating the issue where between write and read, the value in register is modified and the log appearance is checked.

- Description for the changelog

Added Daemon to identify LPC bus degradation issue and notify using syslog in Dell S6100 and Z9100 platforms. This daemon will only run on processors with revision less than or equal to 2.
2019-06-24 08:13:40 -07:00
pavel-shirshov
dd0f005b8a
[FRR]: Port some patches from sonic-quagga repo (#3017)
* Update sonic-quagga submodule

* Port some patches from sonic-quagga

* Fix Makefile

* Another patch

* Uncomment bgp test

* Downport Nikos's patch

* Add a patch to alleviate the vendor issue

* use patch instead of stg
2019-06-23 15:26:02 -07:00
brandonchuang
93582c7af9 [devices]: Add lpmode in sfputil.py for Accton AS7726-32X (#3040)
Signed-off-by: brandon_chuang <brandon_chuang@edge-core.com>
2019-06-23 11:54:27 -07:00
Stepan Blyshchak
34e790bfce [swss] update swss pointer (#3055)
to include update in mellanox PFCWD lua script
matching new SAI

sonic-swss:
    407d048 [mellanox] convert logic to use quanta in pfc_detect_mellanox.lua (#930)
    67c0940 [test]: Skip test_clear in test_watermark (#937)
    c72c34f Enable Vnet/Vxlan VS test (#935)
    4c771d0 add incCrmAclUsedCounter and decCrmAclUsedCounter for SAI_ACL_BIND_POINT_TYPE_SWITCH case. (#899)
    825c0cb [vs]: Fix bitmap VNET virtual switch test (#936)
    4577b40 Add buffer pool watermark support (#853)
    4a67378 Add support of VXLAN tunnel removal (#931)

Signed-off-by: Stepan Blyschak <stepanb@mellanox.com>
2019-06-22 17:40:26 -07:00
paavaanan
622c426a84 [devices]: DellEMC Z9264f remove 115200 baud-ratesupport (#3046) 2019-06-22 17:15:22 -07:00
Andriy Kokhan
4f35cce27b [bfn] added TNA config for all devices (#3029)
Signed-off-by: Andriy Kokhan <akokhan@barefootnetworks.com>
2019-06-22 14:50:14 -07:00
sridhar-ravindran
9cf34ec464 Revert "[Dell] [Z9100] Supporting PreEmphasis Settings for Optics (#2501)" (#3060)
This reverts commit ed2d87a710.
2019-06-22 14:45:43 -07:00
lguohan
40330bcbdf
[build]: wait up to 60 seconds for docker engine to start (#3067)
* [build]: wait 60 seconds for docker engine to start

On some platforms, it can take more than 1 second for docker
engine to start.

Signed-off-by: Guohan Lu <gulv@microsoft.com>
2019-06-22 14:40:05 -07:00
Stepan Blyshchak
81cf33231f [build]: Improve dockerfile instructions (#3048)
- create a dockerfile-marcros.j2 file with all common operations
  written as j2 macro
- use single dockerfile instruction for COPY and RUN commands
  when possible to improve build time
- reorganize dockerfile instructions to make more cache friendly
  (in case someday we will remove --no-cache to build docker images)

Signed-off-by: Stepan Blyschak <stepanb@mellanox.com>
2019-06-22 11:26:23 -07:00
Pavlo Yadvichuk
817c6370bf [frr.conf] Advertise /64 prefix for ipv6 lo addresses in case 'unified' config mode (#3045)
Signed-off-by: Pavlo Yadvychuk <pyadvychuk@barefootnetworks.com>
2019-06-22 10:29:41 -07:00
Myron Sosyak
f4d07dc003 fixed BFN target build (#3061)
Signed-off-by: Myron Sosyak <msosyak@barefootnetworks.com>
2019-06-21 13:29:35 -07:00
Samuel Angebault
e680892f39 [build] Improve overlay check in Makefile (#3050)
Some kernels are built with overlayfs as a builtin and not a module.
For these the check via lsmod currently fails.
This improvement now checks the kernel configuration for the
CONFIG_OVERLAY_FS entry. Depending on the OS and kernel version the
build configuration can be in multiple places.
2019-06-20 09:20:34 -07:00
Stepan Blyshchak
1bcd54d38c [build]: Upgrade docker to 18.09 in stretch slave (#3049)
Also add an option to use docker buildkit for image build

Signed-off-by: Stepan Blyschak <stepanb@mellanox.com>
2019-06-20 09:19:33 -07:00
Qi Luo
69b8ad7be0 [submodule] update sonic-linux-kernel to 4.9.168-1+deb9u3 (#3034) 2019-06-19 14:20:38 -07:00
RAMA CHANDRA REDDY GADDAM
6370d64b3d [broadcom]: Check for broadcom platform sdk init ready during syncd start. (#2931)
- What I did
During boot/reload time, wait in a loop to check for bcm initialization.
Break the loop, once sdk is ready to process the 'bcmcmd' request (or) loop count reached the maximum value.

- How I did it
In the existing implementation during syncd start process will sleep for a fixed time (3 secs)
for sdk initialization to happen. But the time taken for sdk initialization is varying for different platforms.
To fix this issue, the syncd start process wait in a loop and check whether sdk is ready to process 'bcmcmd' command.

- How to verify it
Check for syncd process status and interface status.
Check for syslogs and no failures related to syncd should be present.
2019-06-19 01:25:37 -07:00
Michel Moriniaux
18544530d3 [FRR] Enable SNMP support (#2981)
This is a follow-up of sonic-snmpagent PR 92
Now that licensing issues have been solved FRR is distributed with SNMP
support compiled-in. This PR adds the last bits of configuration to get
the frr-snmp debian packages added to the docker container and the
config bits to enable the snmp module in FRR

This PR brings the functionality of being able to poll bgpd for routes
and peer status.

Signed-off-by: Michel Moriniaux <m.moriniaux@criteo.com>
2019-06-19 01:24:42 -07:00
Greg Paussa
c67c29f7ae [build]: Add missing 'rm -rf' to component Makefiles for clean rebuild (#3028)
* src/iproute2/Makefile
* src/python3/Makefile

These Makefiles do not properly clean out the src build subdirectory
prior to downloading the source code contents. This causes an error
during a rebuild following a 'make clean'.

Signed-off-by: Greg Paussa <greg.paussa@broadcom.com>
2019-06-19 00:18:12 -07:00
neethajohn
2e121f3329 [submodule]: Update submodule sonic-quagga (#3030)
Reduce the retry time for IPV6 connections to avoid reprogramming IPv6 routes after warm-reboot (#37)
2019-06-19 00:17:06 -07:00