sonic-buildimage

Author	SHA1	Message	Date
Joe LeVeque	51292330e9	[enable_counters.py] Convert to Python 3 (#5789 ) - Why I did it As part of moving all SONiC code from Python 2 (no longer supported) to Python 3 - How I did it - Convert enable_counters.py script to Python 3 - Reorganize imports per PEP8 standard - Two blank lines precede functions per PEP8 standard	2020-11-06 09:00:19 -08:00
pavel-shirshov	13f8e9ce5e	[bgpcfgd]: Convert bgpcfgd and bgpmon to python3 (#5746 ) * Convert bgpcfgd to python3 Convert bgpmon to python3 Fix some issues in bgpmon * Add python3-swsscommon as depends * Install dependencies * reorder deps Co-authored-by: Pavel Shirshov <pavel.contrib@gmail.com>	2020-11-05 10:01:43 -08:00
Joe LeVeque	ba7fda7fd1	[docker-platform-monitor] Install Python 2 'enum34' package to fix Arista platforms (#5779 ) Recent changes to dependencies caused the 'enum34' package to cease being installed for Python 2 in the PMon container. This broke Arista platforms, where the Arista sonic_platform package imports 'enum'. This is because on Arista devices, the sonic_platform wheel is not installed in the container. Instead, the installation directory is mounted from the host OS. However, this method doesn't ensure all dependencies are installed in the container.	2020-11-04 11:23:03 -08:00
dflynn-Nokia	ac3a605c75	[build]: ARM build: Download redis-tools and redis-server from sonicstorage (#5797 ) Prevent intermittent build failures when building Sonic for the ARM platform architecture due to version upgrades of the redis-tools and redis-server packages. Modify select Dockerfile templates to download the redis-tools and redis-server packages from sonicstorage rather than from debian.org. This PR has been made possible by the inclusion of ARM versions of redis-tools and redis-server into sonicstorage as described in Issue# 5701	2020-11-04 09:31:06 -08:00
Joe LeVeque	e3164d5fb4	[lldpmgrd] Convert to Python 3 (#5785 ) - Convert lldpmgrd to Python 3 - Install Python 3 swsscommon package in docker-lldp	2020-11-03 12:50:11 -08:00
Lawrence Lee	10ab46f7a0	Revert "[docker-base]: Rate limit priority INFO and lower in syslog" (#5763 ) * This was a temporary fix for orchagent spamming log messages and causing rate limiting, leading to critical messages being dropped for the syslog. No longer needed since Azure/sonic-sairedis#680 was merged.	2020-11-02 08:49:40 -08:00
Joe LeVeque	f2a258aca9	[docker-platform-monitor] Check if sonic_platform is available before installed (#5764 ) On Arista platforms, sonic_platform packages are not installed in the PMon container, but are rather mounted into the container from the host OS. Therefore, pip show sonic_platform will fail in the PMon container. This change will first check if we can import sonic_platform. If this fails, it will then fall back to checking if the package is installed. If both fail, it will attempt to install the package.	2020-11-01 01:48:07 -08:00
abdosi	dddf96933c	[monit] Adding patch to enhance syslog error message generation for monit alert action when status is failed. (#5720 ) Why/How I did: Make sure first error syslog is triggered based on FAULT TOLERANCE condition. Added support of repeat clause with alert action. This is used as trigger for generation of periodic syslog error messages if error is persistent Updated the monit conf files with repeat every x cycles for the alert action	2020-10-31 17:29:49 -07:00
Junchao-Mellanox	781188f549	[thermalctld] Enlarge startretries value to avoid thermalctld not able to restart during regression test (#5633 ) Increase startretries value from default of 10 to 50 to prevent supervisor from placing thermalctld in FATAL state during regression testing. Also ensures supervisord tries hard to get thermalctld running in production, as thermalctld is critical to prevent device from overheating.	2020-10-30 12:01:17 -07:00
Joe LeVeque	6333bb73b0	Explicitly call `pip2` rather than `pip` in locations where both pip2 and pip3 are installed (#5747 ) As part of the transition from Python 2 to Python 3, we are installing both pip2 and pip3 in the slave and config-engine containers. This PR replaces calls to `pip` in these containers with an explicit call to `pip2` to ensure the proper version of pip is executed, no matter which version of pip is aliased to `pip`, as we no longer rely on that alias. Also some other pip-related cleanup	2020-10-30 09:43:14 -07:00
Joe LeVeque	b132ca0980	[build]: Upgrade pip3 before pip2 (#5743 ) Upgrading pip3 after pip2 caused the pip command to be aliased to the pip3 command. However, since we are still transitioning from Python 2 to Python 3, most pip commands in the codebase are expecting pip to alias to pip2. The proper solution here is to explicitly call pip2 and pip3, and no longer call pip, however this will require extensive changes and testing, so to quickly fix this issue, we upgraded pip2 after pip3 to ensure that pip2 is installed after pip3.	2020-10-29 19:01:17 -07:00
judyjoseph	6088bd59de	[multi-ASIC] BGP internal neighbor table support (#5520 ) * Initial commit for BGP internal neighbor table support. > Add new template named "internal" for the internal BGP sessions > Add a new table in database "BGP_INTERNAL_NEIGHBOR" > The internal BGP sessions will be stored in this new table "BGP_INTERNAL_NEIGHBOR" * Changes in template generation tests with the introduction of internal neighbor template files.	2020-10-28 16:41:27 -07:00
Joe LeVeque	9e34003136	[sonic-config-engine] Clean up dependencies, pin versions; install Python 3 package in Buster container (#5656 ) To clean up the image build procedure, and let setuptools/pip[3] implicitly install Python dependencies. Also use ipaddress package instead of ipaddr.	2020-10-26 13:48:50 -07:00
shlomibitton	e66d49a57c	[LLDP] Fix for LLDP advertisements being sent with wrong information. (#5493 ) * Fix for LLDP advertisements being sent with wrong information. Since lldpd starts before lldpmgrd, some advertisement packets might be sent with a default value (the MAC address) as Port ID. This fix holds the packets from being sent by lldpd until all interfaces are fully configured by lldpmgrd. Signed-off-by: Shlomi Bitton <shlomibi@nvidia.com> * Fix comments * Fix unit-test output caused a failure during build * Add 'run_cmd' function and use it * Resume lldpd even if port init timeout reached	2020-10-26 19:38:09 +02:00
lguohan	7d4ab4237a	[docker-base]: swss/syncd support use zmq as rpc channel (#5715 ) install libzmq5 in docker-base Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-10-25 20:39:38 -07:00
Shi Su	67408c85aa	[synchronous-mode] Add template file for synchronous mode (#5644 ) The orchagent and syncd need to have the same default synchronous mode configuration. This PR adds a template file to translate the default value in CONFIG_DB (empty field) to an explicit mode so that the orchagent and syncd could have the same default mode.	2020-10-23 13:08:35 -07:00
pavel-shirshov	c94f93f046	[bgpcfgd]: Dynamic BBR support (#5626 ) - Why I did it To introduce dynamic support of BBR functionality into bgpcfgd. BBR adds `neighbor PEER_GROUP allowas-in 1` for all BGP peer-groups that point to T0s. Now we can add and remove this configuration based on a CONFIG_DB entry. - How I did it I introduced a new CONFIG_DB entry: - table name: "BGP_BBR" - key value: "all". Currently only "all" is supported, which means that all peer-groups that point to T0s will be updated - data value: a dictionary: {"status": "status_value"}, where status_value could be either "enabled" or "disabled" Initially, when bgpcfgd starts, it reads initial BBR status values from the [constants.yml](https://github.com/Azure/sonic-buildimage/pull/5626/files#diff-e6f2fe13a6c276dc2f3b27a5bef79886f9c103194be4fcb28ce57375edf2c23cR34). Then you can control BBR status by changing the "BGP_BBR" table in the CONFIG_DB (see examples below). bgpcfgd knows which peer-groups to change from [constants.yml](https://github.com/Azure/sonic-buildimage/pull/5626/files#diff-e6f2fe13a6c276dc2f3b27a5bef79886f9c103194be4fcb28ce57375edf2c23cR39). The dictionary contains peer-group names as keys, and a list of address-families as values. So when bgpcfgd gets a request to change the BBR state, it changes the state only for peer-groups listed in the constants.yml dictionary (and only for address families from the peer-group value). - How to verify it Initially, when we start SONiC, FRR has BBR enabled for PEER_V4 and PEER_V6: ``` admin@str-s6100-acs-1:~$ vtysh -c 'show run' | egrep 'PEER_V.? allowas' neighbor PEER_V4 allowas-in 1 neighbor PEER_V6 allowas-in 1 ``` Then we apply the following configuration to the db: ``` admin@str-s6100-acs-1:~$ cat disable.json { "BGP_BBR": { "all": { "status": "disabled" } } } admin@str-s6100-acs-1:~$ sonic-cfggen -j disable.json -w ``` The log output is: ``` Oct 14 18:40:22.450322 str-s6100-acs-1 DEBUG bgp#bgpcfgd: Received message : '('all', 'SET', (('status', 'disabled'),))' Oct 14 18:40:22.450620 str-s6100-acs-1 DEBUG bgp#bgpcfgd: execute command '['vtysh', '-f', '/tmp/tmpmWTiuq']'. Oct 14 18:40:22.681084 str-s6100-acs-1 DEBUG bgp#bgpcfgd: execute command '['vtysh', '-c', 'clear bgp peer-group PEER_V4 soft in']'. Oct 14 18:40:22.904626 str-s6100-acs-1 DEBUG bgp#bgpcfgd: execute command '['vtysh', '-c', 'clear bgp peer-group PEER_V6 soft in']'. ``` Check the FRR configuration and see that no allowas parameters are there: ``` admin@str-s6100-acs-1:~$ vtysh -c 'show run' | egrep 'PEER_V.? allowas' admin@str-s6100-acs-1:~$ ``` Then we apply the enabling configuration back: ``` admin@str-s6100-acs-1:~$ cat enable.json { "BGP_BBR": { "all": { "status": "enabled" } } } admin@str-s6100-acs-1:~$ sonic-cfggen -j enable.json -w ``` The log output: ``` Oct 14 18:40:41.074720 str-s6100-acs-1 DEBUG bgp#bgpcfgd: Received message : '('all', 'SET', (('status', 'enabled'),))' Oct 14 18:40:41.074720 str-s6100-acs-1 DEBUG bgp#bgpcfgd: execute command '['vtysh', '-f', '/tmp/tmpDD6SKv']'. Oct 14 18:40:41.587257 str-s6100-acs-1 DEBUG bgp#bgpcfgd: execute command '['vtysh', '-c', 'clear bgp peer-group PEER_V4 soft in']'. Oct 14 18:40:42.042967 str-s6100-acs-1 DEBUG bgp#bgpcfgd: execute command '['vtysh', '-c', 'clear bgp peer-group PEER_V6 soft in']'. ``` Check the FRR configuration and see that the BBR configuration is back: ``` admin@str-s6100-acs-1:~$ vtysh -c 'show run' | egrep 'PEER_V.? allowas' neighbor PEER_V4 allowas-in 1 neighbor PEER_V6 allowas-in 1 ``` * The test coverage * Below is the test coverage ``` ---------- coverage: platform linux2, python 2.7.12-final-0 ---------- Name Stmts Miss Cover ---------------------------------------------------- bgpcfgd/__init__.py 0 0 100% bgpcfgd/__main__.py 3 3 0% bgpcfgd/config.py 78 41 47% bgpcfgd/directory.py 63 34 46% bgpcfgd/log.py 15 3 80% bgpcfgd/main.py 51 51 0% bgpcfgd/manager.py 41 23 44% bgpcfgd/managers_allow_list.py 385 21 95% bgpcfgd/managers_bbr.py 76 0 100% bgpcfgd/managers_bgp.py 193 193 0% bgpcfgd/managers_db.py 9 9 0% bgpcfgd/managers_intf.py 33 33 0% bgpcfgd/managers_setsrc.py 45 45 0% bgpcfgd/runner.py 39 39 0% bgpcfgd/template.py 64 11 83% bgpcfgd/utils.py 32 24 25% bgpcfgd/vars.py 1 0 100% ---------------------------------------------------- TOTAL 1128 530 53% ``` - Which release branch to backport (provide reason below if selected) - [ ] 201811 - [x] 201911 - [x] 202006	2020-10-22 11:04:21 -07:00
BrynXu	29928c93a1	[chassis]: Use correct path for chassisdb.conf file (#5632 ) Use the correct chassisdb.conf path while bringing up the chassis_db service on a VoQ modular switch. Resolves #5631 Signed-off-by: Honggang Xu <hxu@arista.com>	2020-10-21 01:40:04 -07:00
Lawrence Lee	207587d97c	[docker-base]: Rate limit priority INFO and lower in syslog (#5666 ) There is currently a bug where messages from swss with priority lower than the current log level are still being counted against the syslog rate limiting threshold. This leads to rate-limiting in syslog when the rate-limiting conditions have not been met, which causes several sonic-mgmt tests to fail since they are dependent on LogAnalyzer. It also omits potentially useful information from the syslog. Only rate-limiting messages of level INFO and lower allows these tests to pass successfully. Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2020-10-20 11:52:46 -07:00
BrynXu	a2e3d2fcea	[ChassisDB]: bring up ChassisDB service (#5283 ) bring up chassisdb service on sonic switch according to the design in Distributed Forwarding in VoQ Arch HLD Signed-off-by: Honggang Xu <hxu@arista.com> - Why I did it To bring up new ChassisDB service in sonic as designed in ['Distributed forwarding in a VOQ architecture HLD' ](`90c1289eaf/doc/chassis/architecture.md`). - How I did it Implement the section 2.3.1 Global DB Organization of the VOQ architecture HLD. - How to verify it ChassisDB service won't start without chassisdb.conf file on the existing platforms. ChassisDB service is accessible with global.conf file in the distributed architecture. Signed-off-by: Honggang Xu <hxu@arista.com>	2020-10-14 15:15:24 -07:00
Joe LeVeque	88c1d66c27	[python-click] No longer build our own package, let pip/setuptools install vanilla (#5549 ) We were building our own python-click package because we needed features/bug fixes available as of version 7.0.0, but the most recent version available from Debian was in the 6.x range. "Click" is needed for building/testing and installing sonic-utilities. Now that we are building sonic-utilities as a wheel, with Click specified as a dependency in the setup.py file, setuptools will install a more recent version of Click in the sonic-slave-buster container when building the package, and pip will install a more recent version of Click in the host OS of SONiC when installing the sonic-utilities package. Also, we don't need to worry about installing the Python 2 or 3 version of the package, as the proper one will be installed as necessary.	2020-10-14 10:16:35 -07:00
pavel-shirshov	812e1a3489	[bgp]: Enable next-hop-tracking through default (#5600 ) - Why I did it FRR introduced [next hop tracking](http://docs.frrouting.org/projects/dev-guide/en/latest/next-hop-tracking.html) functionality. That functionality requires resolving BGP neighbors before setting BGP connection (or explicit ebgp-multihop command). Sometimes (BGP MONITORS) our neighbors are not directly connected and sessions are IBGP. In this case current configuration prevents FRR to establish BGP connections. Reason would be "waiting for NHT". To fix that we need either add static routes for each not-directly connected ibgp neighbor, or enable command `ip nht resolve-via-default` - How I did it Put `ip nht resolve-via-default` into the config - How to verify it Build an image. Enable BGP_MONITOR entry and check that entry is Established or Connecting in FRR Co-authored-by: Pavel Shirshov <pavel.contrib@gmail.com>	2020-10-13 22:21:28 -07:00
Mahesh Maddikayala	744612d269	[ECMP][Multi-ASIC] Have different ECMP seed value on each ASIC (#5357 ) * Calculate ECMP hash seed based on ASIC ID on multi ASIC platform. Each ASIC will have a unique ECMP hash seed value.	2020-10-08 09:05:37 -07:00
abdosi	70528f7460	[Multi-asic] Fixed Default Route to be BGP (#5548 ) Learned and not docker default route for multi-asic platforms. Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>	2020-10-05 22:54:47 -07:00
Lawrence Lee	8c344095a8	[docker-orchagent]: Add NDP Proxy Daemon (#5517 ) * Install ndppd during image build, and copy config files to image * Configure proxy settings based on config DB at container start * Pipe ndppd output to logger inside container to log output in syslog	2020-10-05 08:48:13 -07:00
pavel-shirshov	ffae82f8be	[bgp] Add 'allow list' manager feature (#5513 ) implements a new feature: "BGP Allow list." This feature allows us to control which IP prefixes are going to be advertised via ebgp from the routes received from EBGP neighbors.	2020-10-02 10:06:04 -07:00
Tamer Ahmed	6754635010	[cfggen] Make Jinja2 Template Python 3 Compatible Jinja2 templates rendered using the Python 3 interpreter are required to conform to Python 3 semantics. Signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2020-09-30 07:07:43 -07:00
Nazarii Hnydyn	79bda7d0d6	[monit]: Fix process checker. (#5480 ) Signed-off-by: Nazarii Hnydyn <nazariig@nvidia.com>	2020-09-29 17:23:09 -07:00
Volodymyr Boiko	d71a4efe3b	[sonic-platform-common] Install Python 3 package in host OS and PMon container (#5461 ) Signed-off-by: Volodymyr Boyko <volodymyrx.boiko@intel.com>	2020-09-29 13:57:54 -07:00
arlakshm	e3a0feaa47	Vtysh support for multi asic (#5479 ) Signed-off-by: Arvindsrinivasan Lakshmi Narasimhan <arlakshm@microsoft.com>	2020-09-29 12:39:53 -07:00
Guohan Lu	e412338743	Revert "[bgp] Add 'allow list' manager feature (#5309 )" This reverts commit `6eed0820c8`.	2020-09-28 22:00:29 -07:00
pavel-shirshov	6eed0820c8	[bgp] Add 'allow list' manager feature (#5309 ) implements a new feature: "BGP Allow list." This feature allows us to control which IP prefixes are going to be advertised via ebgp from the routes received from EBGP neighbors.	2020-09-27 10:47:43 -07:00
gechiang	43a8368874	make bgpmon autorestart enabled by supervisord (#5460 )	2020-09-25 10:25:11 -07:00
Sumukha Tumkur Vani	b5bcfef013	Update conf DB with CA cert & rename ca_crt field (#5448 )	2020-09-25 09:20:09 -07:00
Syd Logan	0311a4a037	Add gearbox phy device files and a new physyncd docker to support VS gearbox phy feature (#4851 ) * buildimage: Add gearbox phy device files and a new physyncd docker to support VS gearbox phy feature * scripts and configuration needed to support a second syncd docker (physyncd) * physyncd supports gearbox device and phy SAI APIs and runs multiple instances of syncd, one per phy in the device * support for VS target (sonic-sairedis vslib has been extended to support a virtual BCM81724 gearbox PHY). HLD is located at `b817a12fd8/doc/gearbox/gearbox_mgr_design.md` - Why I did it This work is part of the gearbox phy joint effort between Microsoft and Broadcom, and is based on multi-switch support in sonic-sairedis. - How I did it Overall feature was implemented across several projects. The collective pull requests (some in late stages of review at this point): https://github.com/Azure/sonic-utilities/pull/931 - CLI (merged) https://github.com/Azure/sonic-swss-common/pull/347 - Minor changes (merged) https://github.com/Azure/sonic-swss/pull/1321 - gearsyncd, config parsers, changes to orchagent to create gearbox phy on supported systems https://github.com/Azure/sonic-sairedis/pull/624 - physyncd, virtual BCM81724 gearbox phy added to vslib - How to verify it In a vslib build: root@sonic:/home/admin# show gearbox interfaces status PHY Id Interface MAC Lanes MAC Lane Speed PHY Lanes PHY Lane Speed Line Lanes Line Lane Speed Oper Admin -------- ----------- --------------- ---------------- --------------- ---------------- ------------ ----------------- ------ ------- 1 Ethernet48 121,122,123,124 25G 200,201,202,203 25G 204,205 50G down down 1 Ethernet49 125,126,127,128 25G 206,207,208,209 25G 210,211 50G down down 1 Ethernet50 69,70,71,72 25G 212,213,214,215 25G 216 100G down down In addition, docker ps | grep phy should show a physyncd docker running. Signed-off-by: syd.logan@broadcom.com	2020-09-25 08:32:44 -07:00
yozhao101	13cec4c486	[Monit] Unmonitor the processes in containers which are disabled. (#5153 ) We want to let Monit to unmonitor the processes in containers which are disabled in `FEATURE` table such that Monit will not generate false alerting messages into the syslog. Signed-off-by: Yong Zhao <yozhao@microsoft.com>	2020-09-25 00:28:28 -07:00
Tamer Ahmed	b43f1129b4	[swss] Start Restore Neighbor After SWSS Config (#5451 ) SWSS config script restore ARP/FDB/Routes. Restore neighbor script uses config DB ARP information to restore ARP entries and so needs to be started after swssconfig exits. signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2020-09-24 14:57:42 -07:00
Danny Allen	a56ad41b9e	[mgmt] Install dhclient in sonic-mgmt docker (#5447 ) Signed-off-by: Danny Allen <daall@microsoft.com>	2020-09-23 20:01:58 -07:00
Danny Allen	7eda531ffd	[mgmt] Install ip command in mgmt docker (#5430 ) Signed-off-by: Danny Allen <daall@microsoft.com>	2020-09-22 18:39:14 -07:00
Danny Allen	9feba88455	[mgmt] Fix Azure CLI install behind proxy (#5407 ) Signed-off-by: Danny Allen <daall@microsoft.com>	2020-09-22 09:37:03 -07:00
lguohan	0ed44db7b7	[docker-base-stretch]: install rsyslog from stretch-backports (#5410 ) Install a newer version of rsyslog from stretch-backports to support -iNONE Previous backport from master use -iNONE option which is only available after v8.32.0 Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-09-21 02:09:50 -07:00
Joe LeVeque	3987cbd80a	[sonic-utilities] Build and install as a Python wheel package (#5409 ) We are moving toward building all Python packages for SONiC as wheel packages rather than Debian packages. This will also allow us to more easily transition to Python 3. Python files are now packaged in "sonic-utilities" Python wheel. Data files are now packaged in "sonic-utilities-data" Debian package. - How I did it - Build and install sonic-utilities as a Python package - Remove explicit installation of wheel dependencies, as these will now get installed implicitly by pip when installing sonic-utilities as a wheel - Build and install new sonic-utilities-data package to install data files required by sonic-utilities applications - Update all references to sonic-utilities scripts/entrypoints to either reference the new /usr/local/bin/ location or remove absolute path entirely where applicable Submodule updates: * src/sonic-utilities aa27dd9...2244d7b (5): > Support building sonic-utilities as a Python wheel package instead of a Debian package (#1122) > [consutil] Display remote device name in show command (#1120) > [vrf] fix check state_db error when vrf moving (#1119) > [consutil] Fix issue where the ConfigDBConnector's reference is missing (#1117) > Update to make config load/reload backward compatible. (#1115) * src/sonic-ztp dd025bc...911d622 (1): > Update paths to reflect new sonic-utilities install location, /usr/local/bin/ (#19)	2020-09-20 20:16:42 -07:00
gechiang	128def6969	Add bgpmon to be started as a new daemon under BGP docker (#5329 ) * Add bgpmon under sonic-bgpcfgd to be started as a new daemon under BGP docker * Added bgpmon to be monitored by Monit so that if it crashed, it gets alerted * use console_scripts entry point to package bgpmon	2020-09-20 14:32:09 -07:00
Tamer Ahmed	2de3afaf35	[swss] Enhance ARP Update to Call Sonic Cfggen Once (#5398 ) This PR limited the number of calls to sonic-cfggen to one call per iteration instead of current 3 calls per iteration. The PR also installs jq on host for future scripts if needed. signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2020-09-18 18:44:23 -07:00
Danny Allen	2868a27935	[mgmt] Upgrade sonic-mgmt container to stretch (#5397 ) - Bump sonic-mgmt version to 18.04 - Update installation methods - Add virtualenv for python3 Signed-off-by: Danny Allen <daall@microsoft.com>	2020-09-17 22:00:32 -07:00
Tamer Ahmed	7515aea9db	[swss] Start Arp Update Process (#5391 ) Arp update process was not being started due to an issue with the directory name having an extra 'd' in supervisor as in '/etc/supervisord/conf.d/arp_update.conf'. signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2020-09-17 18:33:10 -07:00
Prince Sunny	ae9a86fac4	Add new DB for Restapi to database config (#5350 )	2020-09-16 19:02:47 -07:00
judyjoseph	642479f75d	[multi-Asic] Add support for multi-asic to swssloglevel (#5316 ) * Support for multi-asic platform for swssloglevel command admin@str-acs-1:~$ swssloglevel Usage: /usr/bin/swssloglevel -n [0 to 3] [OPTION]... * Update to use the env file to get the PLATFORM string.	2020-09-16 11:29:05 -07:00
Tamer Ahmed	15f5d47338	[dhcpmon] Print Both Snapshot And Current Counters (#5374 ) Printing both snapshot and current counter sets will make it easier to pinpoint which message type(s) is/are not being relayed. This PR prints both counter sets. Also, this PR defines gnu11 as a C standard to compile with in order to avoid making changes when porting to 201811 branch. Signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2020-09-15 15:27:36 -07:00
Joe LeVeque	12c94a7431	[lldpmgrd] Inherit DaemonBase class from sonic-py-common package (#5370 ) Eliminate duplicate logging and signal handling code by inheriting from DaemonBase class in sonic-py-common package.	2020-09-15 10:55:55 -07:00
Tamer Ahmed	1bf6fdc6d2	[dhcpmon] Monitor Mgmt Interface For DHCP Packets (#5317 ) When BGP routes are missing, DHCP packets get relayed over the mgmt interface. This results in dhcpmon alerting that DHCP packets are not being relayed. This PR includes the mgmt interface as an uplink device, so if a DHCP packet gets relayed over the mgmt interface, the regular dhcpmon alert will not be issued. Instead, dhcpmon will check the mgmt interface counts and issue a separate alert regarding packets travelling through the mgmt network. In addition, this PR includes the following enhancements: 1. Add SIGUSR1 handler that prints out current packet counts 2. Increase alert grace window from 2 minutes to 3 minutes 3. Time is now computed more accurately 4. Print vlan name before counters signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2020-09-09 18:37:01 -07:00
Joe LeVeque	5b3b4804ad	[dockers][supervisor] Increase event buffer size for dependent-startup (#5247 ) When stopping the swss, pmon or bgp containers, log messages like the following can be seen: ``` Aug 23 22:50:43.789760 sonic-dut INFO swss#supervisord 2020-08-23 22:50:10,061 ERRO pool dependent-startup event buffer overflowed, discarding event 34 Aug 23 22:50:43.789760 sonic-dut INFO swss#supervisord 2020-08-23 22:50:10,063 ERRO pool dependent-startup event buffer overflowed, discarding event 35 Aug 23 22:50:43.789760 sonic-dut INFO swss#supervisord 2020-08-23 22:50:10,064 ERRO pool dependent-startup event buffer overflowed, discarding event 36 Aug 23 22:50:43.789760 sonic-dut INFO swss#supervisord 2020-08-23 22:50:10,066 ERRO pool dependent-startup event buffer overflowed, discarding event 37 ``` This is due to the number of programs in the container managed by supervisor, all generating events at the same time. The default event queue buffer size in supervisor is 10. This patch increases that value in all containers in order to eliminate these errors. As more programs are added to the containers, we may need to further adjust these values. I increased all buffer sizes to 25 except for containers with more programs or templated supervisor.conf files which allow for a variable number of programs. In these cases I increased the buffer size to 50. One final exception is the swss container, where the buffer fills up to ~50, so I increased this buffer to 100. Resolves https://github.com/Azure/sonic-buildimage/issues/5241	2020-09-08 23:36:38 -07:00
Qi Luo	d4fc8e5b22	[redis] Use redis-server and redis-tools in blob storage to prevent upstream link broken (#5340 ) * [redis] Use redis-server and redis-tools in blob storage to prevent upstream link broken * Use curl instead of wget * Explicitly install dependencies	2020-09-08 19:30:14 -07:00
abdosi	eff745fbb1	Fix the issue as reported in (#5315 ) https://github.com/Azure/sonic-buildimage/issues/5255 Root Cause: Waiting on Restore count != 0 can lead to race condition between orchagent process and swssconfig.sh. Ideally check of Restore count != 0 is not needed as the State DB cannot be flushed as if it was flushed then Warm Restart or swss-restart should not be true also.	2020-09-04 09:34:26 -07:00
Joe LeVeque	fb8f09a116	[radvd] No longer build from source; Install vanilla Debian package once again (#5242 ) Remove radvd Makefile and patch, change docker-router-advertiser Dockerfile template to simply install the vanilla radvd package using apt-get. - In PR https://github.com/Azure/sonic-buildimage/pull/2795, we started building radvd from source and patching it to prevent it from erroring out when advertising an MTU of 9100 which was greater than the MTU size configured on the bridge interface (1500), which was due to a limitation in the 4.9 Linux kernel. - Master branch is now using Linux kernel 4.19. As of 4.18, the kernel supports setting a bridge MTU to a value > 1500. - PR https://github.com/Azure/sonic-swss/pull/1393 modified vlanmgrd to take advantage of this and now configures the MTU of bridge interfaces in SONiC to the proper size of 9100. Therefore, we no longer need to patch radvd. Since we no longer need to patch radvd, we no longer need to build it from source, so we can save build time by going back to simply installing the vanilla radvd Debian package in the router-advertiser container.	2020-09-01 13:53:36 -07:00
arlakshm	17e78715ae	[Multi-ASIC]:Update the template to add ipinip entry for Loopback4096 (#5235 ) Signed-off-by: Arvindsrinivasan Lakshmi Narasimhan <arlakshm@microsoft.com> The following changes are done. - Multi-ASIC platforms have 2 Loopback interfaces, Loopback0 and Loopback4096. IPinIP decap entries need to be added for both of them. Update the ipinip.json.j2 template to add decap entries for Loopback4096. - Add corresponding unit test	2020-08-31 17:35:48 -07:00
Prince Sunny	4338d8293f	Skip vnet-vxlan interfaces from generating networks (#5251 ) * Skip Vnet interface from generating networks	2020-08-27 14:14:04 -07:00
Mykola F	834a29cb66	[enable counters] Enable port buffer drops by default and update MLNX SAI submodule (#5059 ) * Enable port buffer drops by default * [Mellanox] Update SAI_Implementation Signed-off-by: Mykola Faryma <mykolaf@mellanox.com>	2020-08-25 08:48:56 -07:00
shi-su	f3feb56c8a	Add switch for synchronous mode (#5237 ) Add a master switch so that the sync/async mode can be configured. Example usage of the switch: 1. Configure mode while building an image `make ENABLE_SYNCHRONOUS_MODE=y <target>` 2. Configure when the device is running Change CONFIG_DB with `sonic-cfggen -a '{"DEVICE_METADATA":{"localhost": {"synchronous_mode": "enable"}}}' --write-to-db` Restart swss with `systemctl restart swss`	2020-08-24 14:04:10 -07:00
Joe LeVeque	97d44214cf	[docker-radv] Fix startup issues (#5230 ) - Why I did it PR https://github.com/Azure/sonic-buildimage/pull/4599 introduced two bugs in the startup of the router advertiser container: 1. References to the `wait_for_intf.sh` script were changed to `wait_for_link.sh`, but the actual script was not renamed 2. The `ipv6_found` Jinja2 variable added to the supervisor config file goes out of scope before it is read. - How I did it 1. Rename the `wait_for_intf.sh` script to `wait_for_link.sh` 2. Use the Jinja2 "namespace" construct to fix the scope issue - How to verify it Ensure all processes in the radv container start properly under the correct conditions (i.e., whether or not there is at least one VLAN with an IPv6 address assigned).	2020-08-21 13:12:01 -07:00
Tamer Ahmed	adcca53b8d	[radv] Reduce Calls to SONiC Cfggen (#5178 ) Calls to sonic-cfggen are CPU expensive. This PR reduces calls to sonic-cfggen to one call during startup when starting radv service. Signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2020-08-17 15:48:04 -07:00
Tamer Ahmed	89f3206a3f	[swss] Reduce Calls to SONiC Cfggen (#5177 ) Calls to sonic-cfggen are CPU expensive. This PR reduces calls to sonic-cfggen to one call during startup when starting swss service. Signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2020-08-17 15:47:52 -07:00
Tamer Ahmed	a10c5bfd02	[frr] Reduce Calls to SONiC Cfggen (#5176 ) Calls to sonic-cfggen are CPU expensive. This PR reduces calls to sonic-cfggen to two calls during startup when starting frr service. Signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2020-08-17 15:47:42 -07:00
Tamer Ahmed	3a10e9c6fa	[dhcp-relay] Reduce Calls to SONiC Cfggen (#5175 ) Calls to sonic-cfggen are CPU expensive. This PR reduces calls to sonic-cfggen to one call during startup when starting dhcp-relay service. Signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2020-08-17 15:47:14 -07:00
Tamer Ahmed	cc970627d0	[snmp]: Reduce Calls to SONiC Cfggen (#5166 ) Calls to sonic-cfggen are CPU expensive. This PR reduces calls to sonic-cfggen to one call during snmp startup. Signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2020-08-17 15:46:29 -07:00
Joe LeVeque	6132ae34fe	[build] Build/install remaining platform daemons as Python wheel packages (#5188 ) As part of migrating all Python-based package installers to wheel format rather than Debian packages. Also to allow for easily building a Python 3 version of the package in the near future. ledd and psud were converted in earlier PRs. This PR converts the remainder: - pcied - syseepromd - thermalctld - xcvrd	2020-08-15 08:42:11 -07:00
Joe LeVeque	c3202d8982	[build] Build/install sonic-psud as a Python wheel package (#5182 ) As part of migrating all Python-based package installers to wheel format rather than Debian packages. Also to allow for easily building a Python 3 version of the package in the near future.	2020-08-14 11:11:45 -07:00
Joe LeVeque	fc9e97fc3d	[build] Build/install sonic-ledd as a Python wheel package (#5168 ) As part of migrating all Python-based package installers to wheel format rather than Debian packages. Also to allow for easily building a Python 3 version of the package in the near future. - Also remove some references to sonic-daemon-base which I previously missed and add missing sonic-py-common dependency for sonic-pcied.	2020-08-13 11:26:43 -07:00
Tamer Ahmed	7872b4e196	[platform] Add Support For Environment Variable File (#5010 ) * [platform] Add Support For Environment Variable This PR adds the ability to read an environment file from /etc/sonic. The file contains immutable SONiC config attributes such as platform, hwsku, version, device_type. The aim is to minimize calls being made into sonic-cfggen during boot time. signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2020-07-31 17:59:09 -07:00
abdosi	ec435b955c	Changes to add template support for copp.json. (#5053 ) * Changes to add template support for copp.json. This is needed so that we can install different types of traps based on Device Role (Tor/Leaf/Mgmt/etc...). Initial use case is to install the DHCP/DHCPv6 trap only for tor router. Signed-off-by: Abhishek Dosi <abdosi@microsoft.com> * Fixed based on review comments. Signed-off-by: Abhishek Dosi <abdosi@microsoft.com> * Fixed based on review comment.	2020-07-31 14:14:21 -07:00
joyas-joseph	f0dfe36953	[docker-fpm-frr]: Upgrade docker-fpm-frr to buster (#4920 ) Verify that /etc/apt/sources.list points to buster using docker exec bgp cat /etc/apt/sources.list BGP neighborship is established. root@sonic:~# show ip bgp summary IPv4 Unicast Summary: BGP router identifier 10.1.0.1, local AS number 65100 vrf-id 0 BGP table version 1 RIB entries 1, using 184 bytes of memory Peers 1, using 20 KiB of memory Neighbor V AS MsgRcvd MsgSent TblVer InQ OutQ Up/Down State/PfxRcd 6.1.1.1 4 100 96 96 0 0 0 01:32:04 0 Total number of neighbors 1 root@sonic:~# Signed-off-by: Joyas Joseph <joyas_joseph@dell.com>	2020-07-29 14:19:03 -07:00
Sujin Kang	02a98add92	Add pcied to PMON docker to monitor the PCIe device status (#5000 ) * Add pcied to PMON container * remove tailing spaces * update pmon submodule * review comments * rebase to the latest	2020-07-29 11:27:49 -07:00
Qi Luo	48b5792b07	[redis] Upgrade redis version (#5060 ) buster-backports updated and the old version disappeared	2020-07-28 20:50:31 -07:00
Joe LeVeque	2600747f0e	[docker-pmon] Fix copy of fancontrol config file (#5037 ) Copy proper fancontrol config file to the proper destination. Also some minor refactoring for code reuse to help prevent issues like this in the future. Fixes a bug introduced by #4599	2020-07-28 00:23:21 -07:00
pavel-shirshov	89184038fd	[docker-fpm-frr]: Start bgpd after zebra was started (#5038 ) fixes https://github.com/Azure/sonic-buildimage/issues/5026 Explanation: In the log from the issue I found: ``` I see following in the log Jul 22 21:13:06.574831 vlab-01 WARNING bgp#bgpd[49]: [EC 33554499] sendmsg_nexthop: zclient_send_message() failed ``` Analyzing source code I found that the error message could be issued only when `zclient_send_rnh()` returns less than 0. ``` ret = zclient_send_rnh(zclient, command, p, exact_match, bnc->bgp->vrf_id); /* TBD: handle the failure */ if (ret < 0) flog_warn(EC_BGP_ZEBRA_SEND, "sendmsg_nexthop: zclient_send_message() failed"); ``` I checked [zclient_send_rnh()](`88351c8f6d/lib/zclient.c (L654)`) and found that this function will return the exit code which the function gets from [zclient_send_message()](`88351c8f6d/lib/zclient.c (L266)`) But the latter function could return non-zero in two cases: 1. bgpd didn’t connect to the zclient socket yet [code](`88351c8f6d/lib/zclient.c (L269)`) 2. The socket was closed. But in this case we would receive the error message in the log. (And I can find the message in the log when we reboot sonic) [code](`88351c8f6d/lib/zclient.c (L277)`) Also I see from the logs that the client connection was set up later than we had the issue in bgpd. Bgpd.log ``` Jul 22 21:13:06.574831 vlab-01 WARNING bgp#bgpd[49]: [EC 33554499] sendmsg_nexthop: zclient_send_message() failed ``` Vs Zebra.log ``` Jul 22 21:13:12.713249 vlab-01 NOTICE bgp#zebra[48]: client 25 says hello and bids fair to announce only static routes vrf=0 Jul 22 21:13:12.820352 vlab-01 NOTICE bgp#zebra[48]: client 30 says hello and bids fair to announce only bgp routes vrf=0 Jul 22 21:13:12.820352 vlab-01 NOTICE bgp#zebra[48]: client 33 says hello and bids fair to announce only vnc routes vrf=0 ``` So in our case we should start zebra first. Wait until it is started and then start bgpd and other daemons. - How I did it I changed the graph to start daemons in the following order: 1. First start zebra 2. Then start staticd and bgpd 3. Then start vtysh -b and bgpeoi after bgpd is started.	2020-07-25 03:48:47 -07:00
anish-n	da017f4ec9	[bgpcfgd]: Add Vlan prefix list to the FRR templates (#5005 ) add the Vlan prefix list to the FRR templates	2020-07-21 19:26:19 -07:00
Stepan Blyshchak	16a37d8c17	[dockers] update mellanox syncd and pmon to buster (#4818 ) Upgrade to libsensors5 Updated sonic-sairedis pointer: d54bfb4 [SAI] update pointer (#636) 1885a8c [syncd] Fix notification on shutdown request (#635) 9e57ba2 Fixing hostif For Genetlink host interfaces (#633) 449a092 sonic-sairedis: Add support to sonic-sairedis for gearbox phys (#632) Signed-off-by: Stepan Blyschak <stepanb@mellanox.com>	2020-07-18 03:46:15 -07:00
joyas-joseph	78945766fc	[docker-iccpd]: Upgrade docker-iccpd to buster (#4984 ) Signed-off-by: Joyas Joseph <joyas_joseph@dell.com>	2020-07-18 00:12:59 -07:00
Nazarii Hnydyn	a3f4c31193	[orchagent]: Fix platform string export. (#4993 ) Signed-off-by: Nazarii Hnydyn <nazariig@mellanox.com>	2020-07-18 00:05:10 -07:00
joyas-joseph	18bfa6df08	[docker-nat]: upgrade docker-nat to buster (#4943 ) move iptables to 1.8.2-4 (version in buster) Signed-off-by: Joyas Joseph <joyas_joseph@dell.com>	2020-07-15 22:48:09 -07:00
taochengyi	1ca47da40d	[build][arm] Adding a backport source to arm to resolve docker-base-stretch install redis-tools=5:5.0.3-3~bpo9+2 failure issue (#4950 )	2020-07-15 12:23:20 -07:00
Tamer Ahmed	5f3c4fac4b	[docker-orchagent] Call sonic-cfggen Once (#4936 ) Optimizing number of calls made to sonic-cfggen during service start up as it adds to total system boot up time. signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com> - Why I did it sonic-cfggen call is slow and it adds to system start up time - How I did it places all required variable into single template and called into sonic-cfggen using this template - How to verify it *-Test 1* there is an average saving of .5 to 1 sec between old script and new script ``` root@str-s6000-acs-14:/# time ./orchagent_old.sh /usr/bin/orchagent -d /var/log/swss -b 8192 -m f4:8e:38:16:bc:8d real 0m3.546s user 0m2.365s sys 0m0.585s root@str-s6000-acs-14:/# time ./orchagent_new.sh /usr/bin/orchagent -d /var/log/swss -b 8192 -m f4:8e:38:16:bc:8d real 0m2.058s user 0m1.650s sys 0m0.363s ``` *-Test 2* Built an image with this change and orchagent is running with intended params: ``` admin@str-s6000-acs-14:~$ ps -ef \| grep orchagent root 2988 1901 1 02:09 pts/0 00:00:02 /usr/bin/orchagent -d /var/log/swss -b 8192 -m f4:8e:38:16:bc:8d ``` signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2020-07-14 15:36:47 -07:00
joyas-joseph	71e93d921c	[docker-team]: upgrade docker-teamd to buster (#4914 ) Signed-off-by: Joyas Joseph <joyas_joseph@dell.com>	2020-07-12 18:08:52 +00:00
Stephen Sun	0db05e35b0	[watermark] Fix error: BUFFER_POOL_WATERMARK isn't enabled by default (#4882 ) * Fix error: watermarkstat -t buffer_pool doesn't work Signed-off-by: Stephen Sun <stephens@mellanox.com>	2020-07-12 18:08:52 +00:00
Tamer Ahmed	3197c2dfac	[mgmt-framework] Call sonic-cfggen Once (#4937 ) Optimizing number of calls made to sonic-cfggen during service start up as it adds to total system boot up time. *-Test 1* there is an average saving of 1 to 1.5 sec between old script and new script ``` root@str-s6000-acs-14:/# time /usr/bin/rest-server-old.sh Generating temporary TLS server certificate ... 2020/07/09 19:03:33 wrote cert.pem 2020/07/09 19:03:33 wrote key.pem REST_SERVER_ARGS = -ui /rest_ui -logtostderr -cert /tmp/cert.pem -key /tmp/key.pem /usr/sbin/rest_server -ui /rest_ui -logtostderr -cert /tmp/cert.pem -key /tmp/key.pem real 0m8.790s user 0m7.993s sys 0m0.584s root@str-s6000-acs-14:/# time /usr/bin/rest-server-new.sh Generating temporary TLS server certificate ... 2020/07/09 19:03:45 wrote cert.pem 2020/07/09 19:03:45 wrote key.pem REST_SERVER_ARGS = -ui /rest_ui -logtostderr -cert /tmp/cert.pem -key /tmp/key.pem /usr/sbin/rest_server -ui /rest_ui -logtostderr -cert /tmp/cert.pem -key /tmp/key.pem real 0m6.940s user 0m5.670s sys 0m0.386s ``` *-Test 2* Built an image with this change and rest server is running with params as described in test 1 above ``` admin@str-s6000-acs-14:~$ ps -ef \| grep rest_server root 3301 2866 2 02:09 pts/0 00:00:10 /usr/sbin/rest_server -ui /rest_ui -logtostderr -cert /tmp/cert.pem -key /tmp/key.pem ``` signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2020-07-12 18:08:52 +00:00
pra-moh	24f684da68	[docker-ptf] add gnmi python client (#4928 ) For telemetry regression test we need gnmi client to be present on ptfdocker. Gnmi-server will be present on SONiC DuT. Further, we can access gnmi_get from ptfdocker inside pytest to verify gnmi server streaming data successfully or not.	2020-07-12 18:08:52 +00:00
Tamer Ahmed	ceace4b605	[telemetry] Fix telemetry vars template path (#4938 ) The template is referenced relative to the script path and this could result in errors in case the script is run from root. Add explicit path to the template file name. Also, moving telemetry_var template to template dir. And remove double quotes from around json dict. signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2020-07-12 18:08:52 +00:00
Ying Xie	d499a266c0	[mgmt docker] move pycryptodome installation to the end of the docker building (#4917 ) * [mgmt docker] move pycryptodome installation to the end of the docker building Signed-off-by: Ying Xie <ying.xie@microsoft.com> * pin down the version to current: 3.9.8 * comment	2020-07-12 18:08:52 +00:00
Akhilesh Samineni	525029e3d8	[NAT]: Update the conntrack entries timeout to Max value after warmboot (#4596 ) Signed-off-by: Akhilesh Samineni <akhilesh.samineni@broadcom.com> All new NAT conntrack entries are added to kernel with max entry timeout of 432000 and setting the same timeout during system warm reboot also	2020-07-12 18:08:52 +00:00
joyas-joseph	7a6fca2f98	[docker-sflow]: upgrade docker-sflow on buster (#4904 )	2020-07-12 18:08:52 +00:00
Tamer Ahmed	f4eae5dabd	[telemetry] Call sonic-cfggen Once (#4901 ) sonic-cfggen call is slow and this is taking place in the SONiC boot up process. The change uses templates to assemble all required vars into single template file. With this change, telemetry now calls once into sonic-cfggen. signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2020-07-12 18:08:52 +00:00
Ying Xie	6f11833ffa	Revert "[sonic mgmt docker] lock pycryptodome version to 3.9.7 (#4913 )" (#4915 ) This reverts commit `f427d2eecf`.	2020-07-12 18:08:51 +00:00
Ying Xie	caa3323e9d	[sonic mgmt docker] lock pycryptodome version to 3.9.7 (#4913 ) Signed-off-by: Ying Xie <ying.xie@microsoft.com>	2020-07-12 18:08:51 +00:00
arlakshm	97fa2c087b	[config]: Multi ASIC loopback changes (#4895 ) Resubmitting the changes for (#4825) with fixes for sonic-bgpcfgd test failures Signed-off-by: Arvindsrinivasan Lakshmi Narasimhan <arlakshm@microsoft.com>	2020-07-12 18:08:51 +00:00
lguohan	e2e57d32d6	[docker-orchagent]: upgrade docker-orchagent to buster (#4889 ) also update submodule * 01f810f 2020-07-02 \| fix compiling issue for gcc8.3 (#1339) [lguohan] * 9b13120 2020-07-03 \| Fix in script to avoid orchagent crash when port down followed by fdb delete (#1340) [rupesh-k] * 9b01844 2020-07-01 \| [qosorch] Update QoS scheduler params for shaping features (#1296) [Michael Li] * 86b5e99 2020-07-02 \| [mirrororch] Port Mirroring implementation (#1314) [rupesh-k] * c05601c 2020-06-24 \| [portsyncd]: add debug message if a port cannot be found in port able (#1328) [lguohan] * a0b6412 2020-06-23 \| COPP_DEL_fix: DEL for one trap group from SONIC is resetting all the trap IDs (#1273) [SinghMinu] Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-07-12 18:08:51 +00:00
Guohan Lu	f8da3e4c69	Revert "[config]: Loopback Interface changes for multi ASIC devices (#4825 )" This reverts commit `cae65a451c`.	2020-07-12 18:08:51 +00:00
arlakshm	002335a3d5	[config]: Loopback Interface changes for multi ASIC devices (#4825 ) * Loopback IP changes for multi ASIC devices multi ASIC will have 2 Loopback Interfaces - Loopback0 has a globally unique IP address, which is advertised by the multi ASIC device to its peers. This way all the external devices will see this device as a single device. - Loopback4096 is assigned an IP address whose scope is within the device. Each ASIC has a different IP address for Loopback4096. This IP address will be used as Router-Id by the bgp instance on multi ASIC devices. This PR implements this change for multi ASIC devices Signed-off-by: Arvindsrinivasan Lakshmi Narasimhan <arlakshm@microsoft.com>	2020-07-12 18:08:51 +00:00
pavel-shirshov	2b137fb540	Tests of FRR templates which rendered by sonic-cfggen (#4875 ) * Tests of FRR templates which rendered by sonic-cfggen	2020-07-12 18:08:51 +00:00
abdosi	fc6bcff52b	[sonic-buildimage] Changes to make network specific sysctl common for both host and docker namespace (#4838 ) * [sonic-buildimage] Changes to make network specific sysctl common for both host and docker namespace (in multi-npu). This change is triggered by an issue found in multi-npu platforms where in docker namespace net.ipv6.conf.all.forwarding was 0 (should be 1), because of which RS/RA messages were triggered and link-local routers were learnt. Besides this there were some other sysctl.net.ipv6* params whose value in docker namespace is not the same as in host namespace. So to make sure we are always in sync in host and docker namespace, created a common file that lists all sysctl.net.* params and is used by both host and docker namespace. Any change will get applied to both namespaces. Signed-off-by: Abhishek Dosi <abdosi@microsoft.com> * Address Review Comments and made sure to invoke augtool only once and do string concatenation of all set commands * Address Review Comments.	2020-07-12 18:08:51 +00:00
judyjoseph	1af68b3aa6	Support for connecting to DB in namespace via TCP port in multi-asic platform. (#4779 ) * Support for connecting to DB in namespace via IP:port ( using docker bridge network ) for applications in multi-asic platform. * Added the default IP as 127.0.0.1 if the IPaddress derivation from interface fails. Moved the localhost loopback IP binding logic into the supervisor.j2 file.	2020-07-12 18:08:51 +00:00
abdosi	15440b6e43	Changes to make default route programming correct in multi-npu platforms (#4774 ) * Changes to make default route programming correct in multi-asic platform where frr is not running in host namespace. Change is to set correct administrative distance. Also make NAMESPACE* environment variable available for all dockers so that it can be used when needed. Signed-off-by: Abhishek Dosi <abdosi@microsoft.com> * Fix review comments * Review comment to check to add default route only if default route exists and delete is successful.	2020-06-29 11:38:46 -07:00
yozhao101	1c32933c7d	[docker] Correct the lldp-syncd program name in critical_process file. (#4862 ) The program name in critical_processes file must match the program name defined in supervisord.conf file. Signed-off-by: Yong Zhao <yozhao@microsoft.com>	2020-06-28 11:08:30 -07:00
pavel-shirshov	1eb3dfe541	[docker-teamd]: Introducing tlm_teamd: telemetry for teamd (#4824 ) - What I did 1. Updated submodule sonic-swss to bring tlm_teamd to the buildimage. 2. Updated supervisord for the teamd 3. Updated critical process list (not sure that tlm_teamd is critical for now) - How to verify it Build an image and run. Check that tlm_teamd is running and STATE_DB has information in the LAG_INTERFACE, and :LAG_MEMBER_INTERFACE ``` admin@sonic:~$ redis-cli -n 6 hgetall 'LAG_TABLE\|PortChannel16' 1) "state" 2) "ok" 3) "team_device.ifinfo.dev_addr" 4) "4c:76:25:f5:48:80" 5) "setup.kernel_team_mode_name" 6) "loadbalance" 7) "team_device.ifinfo.ifindex" 8) "6" 9) "runner.fast_rate" 10) "false" 11) "runner.active" 12) "true" 13) "setup.pid" 14) "35" 15) "runner.fallback" 16) "false" ``` ``` admin@sonic:~$ redis-cli -n 6 hgetall 'LAG_MEMBER_TABLE\|PortChannel16\|Ethernet16' 1) "runner.selected" 2) "true" 3) "runner.aggregator.selected" 4) "true" 5) "runner.aggregator.id" 6) "26" 7) "runner.actor_lacpdu_info.state" 8) "61" 9) "runner.state" 10) "current" 11) "runner.actor_lacpdu_info.system" 12) "4c:76:25:f5:48:80" 13) "runner.partner_lacpdu_info.state" 14) "61" 15) "link.up" 16) "true" 17) "ifinfo.dev_addr" 18) "4c:76:25:f5:48:80" 19) "ifinfo.ifindex" 20) "26" 21) "link_watches.list.link_watch_0.up" 22) "true" 23) "runner.actor_lacpdu_info.port" 24) "17" 25) "runner.partner_lacpdu_info.port" 26) "1" 27) "runner.partner_lacpdu_info.system" 28) "52:54:00:ff:34:1b" ```	2020-06-27 01:22:23 -07:00
Qi Luo	6849a0351c	[redis] Install vanilla redis packages for Buster and Stretch; upgrade Buster to 6.0.5 (#4732 ) upgrade redis server to 5:6.0.5-1~bpo10+1	2020-06-27 01:17:20 -07:00
yozhao101	4fa81b4f8d	[dockers] Update critical_processes file syntax (#4831 ) - Why I did it Initially, the critical_processes file contains either the name of a critical process or the name of a group. For example, the critical_processes file in the dhcp_relay container contains a single group name `isc-dhcp-relay`. When testing the autorestart feature of each container, we need to get all the critical processes and test whether a container can be restarted correctly if one of its critical processes is killed. However, it will be difficult to differentiate whether the names in the critical_processes file are critical processes or group names. At the same time, changing the syntax in this file will separate the individual processes from the groups and also make it clear to the user. Right now the critical_processes file contains two different kinds of entries. One is "program:xxx" which indicates a critical process. Another is "group:xxx" which indicates a group of critical processes managed by supervisord using the name "xxx". At the same time, I also updated the logic to parse the file critical_processes in the supervisor-proc-event-listener script. - How to verify it We can first enable the autorestart feature of a specified container, for example `dhcp_relay`, by running the command `sudo config container feature autorestart dhcp_relay enabled` on DUT. Then we can select a critical process from the command `docker top dhcp_relay` and use the command `sudo kill -SIGKILL <pid>` to kill that critical process. Final step is to check whether the container is restarted correctly or not.	2020-06-25 21:18:21 -07:00
Shuba Viswanathan	921d132a32	[sonic-mgmt]: Support for pytest-html to control logs better (#4791 ) The current stdout file which also includes the dut logs are very verbose and noisy. We have manually installed it in the sonic-mgmt docker in our organization and tuned the pytest settings to produce very helpful and concise logs. pytest-html plugins can be used to post-process the output in various ways based on our different and unique organizational needs. Hence proposing to add this pkt to the docker file	2020-06-25 17:45:16 -07:00
pavel-shirshov	d592e9b0f8	Tests for bgpcfgd templates (#4841 ) * Tests for bgpcfgd templates	2020-06-25 14:54:02 -07:00
lguohan	cebb85b161	[docker-orchagent]: start portsyncd before orchagent (#4845 ) when portsyncd starts, it first enumerates all front panel ports and marks them as old interfaces. Then, for new front panel ports it checks if their indexes exist in previous sets. If yes, it will treat them as old interfaces and ignore them. The reason we have this check is because broadcom SAI only removes front panel ports after sai switch init. So, if portsyncd starts after orchagent, new interfaces could be created before portsyncd starts and be treated as old interfaces. Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-06-24 22:48:37 -07:00
padmanarayana	5cacc2004c	Add a port index mapper service for sFlow (#4794 ) * Add a port index mapper service for sFlow	2020-06-23 22:23:08 -07:00
joyas-joseph	b48d274f69	[docker-dhcp-relay]: convert dhcp-relay docker to buster (#4671 ) Upgrade isc-dhcp to 4.4.1-2 (buster version) Update libevent dependency for dhcpmon to 2.1-6 Signed-off-by: Joyas Joseph <joyas_joseph@dell.com>	2020-06-22 15:34:21 -07:00
pavel-shirshov	0d863c39ac	[bgpcfgd]: make a package for bgpcfgd (#4813 )	2020-06-20 21:01:24 -07:00
Tamer Ahmed	211d1e7e2e	[fast-reboot] Back up FDB/ARP/Default routes (#4795 ) FDB/ARP/Default routes files are deleted after swssconfig. This makes debugging/validation of device conversion hard. This PR saves those files in order to facilitate debugging of device conversion. signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>	2020-06-20 15:30:53 -07:00
Sachin Holla	3dc7992a6e	[mgmt-framework]: REST server cert configurations (#4799 ) REST and telemetry servers were using "DEVICE_METADATA\|x509" table for server certificate configurations. This table has been deprecated now. Enhanced REST server startup script to read server certificate file path configurations from REST_SERVER table. Three more attributes - server_crt, server_key and ca_crt are introduced as described in https://github.com/Azure/SONiC/pull/550. For backward compatibility, certificate configurations are read from old "DEVICE_METADATA\|x509" table if they (server_crt, server_key and ca_crt) are not present in REST_SERVER table. Fixes bug https://github.com/Azure/sonic-buildimage/issues/4291 Signed-off-by: Sachin Holla <sachin.holla@broadcom.com>	2020-06-19 00:46:03 -07:00
Danny Allen	364511aa36	[mgmt docker] Clean up docker-sonic-mgmt dockerfile (#4759 ) - Alphabetize dependencies to prevent duplicates - Remove unnecessary git clone Signed-off-by: Danny Allen <daall@microsoft.com>	2020-06-16 11:32:25 -07:00
arlakshm	80298fae1f	[bgp]:Add redistribution connected for ipv6 also for Frontend ASICs (#4767 ) * fix redistribution connected for ipv6 also Signed-off-by: Arvindsrinivasan Lakshmi Narasimhan <arlakshm@microsoft.com>	2020-06-16 06:45:45 -07:00
Arun Saravanan Balachandran	a748daeaed	[docker-sonic-mgmt]: import patch to support 'become' and 'become_user' arguments in pytest-ansible (#4681 )	2020-06-12 16:21:10 -07:00
joyas-joseph	1714e621be	[docker-radv]: Convert radv docker to buster (#4727 ) * Set radvd version to match buster version(2.17-2) Signed-off-by: Joyas Joseph <joyas_joseph@dell.com>	2020-06-12 16:10:23 -07:00
Ying Xie	4b39193a13	[sonic-mgmt] upgrade ansible to 2.7.12 (#4751 ) Signed-off-by: Ying Xie <ying.xie@microsoft.com>	2020-06-11 23:23:27 -07:00
Joe LeVeque	5d5d5739c2	[dockers] Rename 'docker-snmp-sv2' to 'docker-snmp' (#4699 ) The `-sv2` suffix was used to differentiate SNMP Dockers when we transitioned from "SONiCv1" to "SONiCv2", about four years ago. The old Docker materials were removed long ago; there is no need to keep this suffix. Removing it aligns the name with all the other Dockers. Also edit Monit configuration to detect proper snmp-subagent command line in Buster, and make snmpd command line matching more robust.	2020-06-11 16:04:23 -07:00
Ying Xie	2e93a92bd9	[sonic-mgmt] upgrade paramiko to version 2.7.1 (#4750 ) spytest requires higher paramiko version. Fix it to 2.7.1. Signed-off-by: Ying Xie <ying.xie@microsoft.com>	2020-06-11 14:04:51 -07:00
Joe LeVeque	9b27efdcc2	[dockers] Rename 'docker-lldp-sv2' to 'docker-lldp' (#4700 ) The -sv2 suffix was used to differentiate SNMP Dockers when we transitioned from "SONiCv1" to "SONiCv2", about four years ago. The old Docker materials were removed long ago; there is no need to keep this suffix. Removing it aligns the name with all the other Dockers.	2020-06-09 09:09:56 -07:00
Mykola F	49a93743a4	[enable counters] enable RIF flex counter by default (#4655 ) - Why I did it We need RIF counters to be enabled by default. Flex Counter does probe for supported counters. If a platform does not support RIF counters, SAI will return NOT_SUPPORTED and Flex Counter will stop polling the counter. - How to verify it After fresh install rif counter group is enabled by default: $ counterpoll show Type Interval (in ms) Status -------------------- ------------------ -------- QUEUE_STAT default (10000) enable PORT_STAT default (1000) enable RIF_STAT default (1000) enable QUEUE_WATERMARK_STAT default (10000) enable PG_WATERMARK_STAT default (10000) enable Signed-off-by: Mykola Faryma <mykolaf@mellanox.com>	2020-06-04 09:52:43 -07:00
taochengyi	ccd08f10dd	[build]: fix mgmt-framework build failure on ARM64 (#4674 ) PIP installs grpcio-tools via source code Co-authored-by: taocy <taocy2@centecnetworks.com>	2020-05-31 03:09:39 -07:00
joyas-joseph	cae67728f5	[docker-database]: Upgrade docker-database to buster (#4665 ) Signed-off-by: Joyas Joseph <joyas_joseph@dell.com>	2020-05-29 03:29:49 -07:00
Guohan Lu	767152f09b	[docker-sonic-mgmt]: fix pip version to 20.1.1 Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-05-26 10:29:13 +00:00
taocy	00e7e14927	Install the libraries that j2cli relies on from source, for arm arch.	2020-05-25 13:15:19 +00:00
Guohan Lu	ddd6368e64	[docker-database]: do not generate pidfile for rsyslogd Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-05-22 11:01:28 -07:00
Guohan Lu	6cd8d5bb68	[docker-snmp-sv2]: use service dependency in supervisord to start services Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-05-22 11:01:28 -07:00
Guohan Lu	2e42a4ba0f	[docker-dhcp-relay]: use service dependency in supervisord to start services	2020-05-22 11:01:28 -07:00
Guohan Lu	448d1cdd49	[docker-teamd]: use service dependency in supervisord to start services	2020-05-22 11:01:28 -07:00
Guohan Lu	38499ab0f8	[docker-mgmt-framework]: use service dependency in supervisord to start services Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-05-22 11:01:28 -07:00
Guohan Lu	1cf417ed1b	[docker-telemetry]: use service dependency in supervisord to start services Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-05-22 11:01:28 -07:00
Guohan Lu	15c6282eac	[docker-restapi]: use service dependency in supervisord to start services	2020-05-22 11:01:28 -07:00
Guohan Lu	8da46d26c3	[docker-pmon]: use service dependency in supervisord to start services	2020-05-22 11:01:28 -07:00
Guohan Lu	1636be4b68	[docker-sflow]: use service dependency in supervisord to start services	2020-05-22 11:01:28 -07:00
Guohan Lu	b8da6c3588	[docker-orchagent]: use service dependency in supervisord to start services	2020-05-22 11:01:28 -07:00
Guohan Lu	7ea6d9dc8f	[docker-radvd]: use service dependency in supervisord to start services	2020-05-22 11:01:28 -07:00
Guohan Lu	1f7602b275	[docker-lldp-sv2]: use service dependency in supervisord to start services	2020-05-22 11:01:28 -07:00
Guohan Lu	267b0b7aa8	[docker-iccpd]: use service dependency in supervisord to start services Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-05-22 11:01:28 -07:00
Guohan Lu	0c005fdb1c	[docker-nat]: use service dependency in supervisord to start services Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-05-22 11:01:28 -07:00
Guohan Lu	2c7e55ae98	[docker-frr]: use service dependency in supervisord to start services Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-05-22 11:01:28 -07:00
Guohan Lu	c915c3cbd6	[docker-base]: remove dummy password for supervisord control Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-05-22 11:01:28 -07:00
Guohan Lu	75fe8888d2	[docker-base]: add supervisord-dependent-startup plugin for supervisord	2020-05-22 11:01:28 -07:00
Tyler Li	2398992d52	[iccpd] build iccpd deb by auto tools (#4540 ) * [iccpd] build iccpd deb by auto tools	2020-05-21 09:12:51 -07:00
judyjoseph	4ba2f608c1	Adding new BGP peer groups PEER_V4_INT and PEER_V6_INT. (#4620 ) * Adding new BGP peer groups PEER_V4_INT and PEER_V6_INT. The internal BGP sessions will be added to this peer group while the external BGP sessions will be added to the existing PEER_V4 and PEER_V6 peer group. * Check for "ASIC" keyword in the hostname to identify the internal neighbors.	2020-05-20 20:52:11 -07:00
joyas-joseph	9084ac50fb	[docker-telemetry]: upgrade telemetry docker to buster (#4515 ) Signed-off-by: Joyas Joseph <joyas_joseph@dell.com>	2020-05-19 03:12:50 -07:00
Kelly Chen	e0f8aa71d6	[lldpmgrd] only log error in is_port_up() after port init done (#4606 ) lldpmgrd listens for changes to the PORT table in the CONFIG_DB and APP_DB in order to handle alias/description config changes. It checks if a port is up or down by looking into the oper-status in the APP_DB PORT table. If it cannot find it in the APP_DB, it will log an error. During initialization, it is possible that there is a port change in CONFIG_DB but the port is not yet ready in APP_DB. The change here is to only log an error in is_port_up() after port init is done.	2020-05-19 02:52:48 -07:00
Qi Luo	af95d57fa9	[docker-base-buster]: Install python 3.7 into docker-base-buster (#4603 ) * Install python3 in docker-base-buster, so all derived image will benefit Signed-off-by: Qi Luo <qiluo-msft@users.noreply.github.com>	2020-05-17 04:26:50 -07:00
joyas-joseph	9814da1e21	[docker-lldp-sv2]: upgrade docker-lldp-sv2 to buster (#4598 ) Signed-off-by: Joyas Joseph <joyas_joseph@dell.com>	2020-05-15 14:51:53 -07:00
joyas-joseph	9dea816532	Convert docker-snmp-sv2 to buster (#4529 ) * Fix libsnmp-base compilation failure * Convert docker-snmp-sv2 to buster * Define install_python3_wheels * Address review comments * Address review comments * Advance snmpagent submodule * Bump net-snmp to the Buster version * Revert "Fix libsnmp-base compilation failure" * use azure storage url	2020-05-14 10:23:37 -07:00

803 Commits
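
Commit 4fa81b4f8d above describes the updated critical_processes syntax, in which each entry is either "program:xxx" (a single critical process) or "group:xxx" (a supervisord group). The following is a minimal sketch of parsing such entries, assuming one entry per line and that blank lines are skipped. The function name `parse_critical_processes` is hypothetical, and this is not the actual logic in the supervisor-proc-event-listener script.

```python
def parse_critical_processes(text):
    """Split critical_processes content into (programs, groups).

    Assumption for illustration: one "program:<name>" or "group:<name>"
    entry per line; blank lines are ignored.
    """
    programs, groups = [], []
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            continue
        # partition() splits on the first ':' only
        kind, sep, name = line.partition(":")
        kind, name = kind.strip(), name.strip()
        if not sep or not name:
            raise ValueError("malformed entry: %r" % line)
        if kind == "program":
            programs.append(name)
        elif kind == "group":
            groups.append(name)
        else:
            raise ValueError("unknown entry type: %r" % line)
    return programs, groups


# Example input; the entry names are illustrative only.
sample = "program:orchagent\ngroup:isc-dhcp-relay\n"
programs, groups = parse_critical_processes(sample)
```

Rejecting unprefixed names with an error, rather than guessing their type, mirrors the motivation given in the commit message: a bare name cannot be told apart as a process or a group.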