sonic-buildimage

Author	SHA1	Message	Date
Saikrishna Arcot	5f3269a61b	Create a docker-swss-layer that holds the swss package. This is to save about 40MB of disk space, since 5 containers individually install this package. Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com> (cherry picked from commit `bd479cad29`)	2022-01-27 23:53:09 -08:00
bingwang-ms	c1b380df73	[docker-teamd]: Increase teammgrd timeout to allow graceful shutdown. (#7662 ) (#7842 ) The PR is a cherry-pick of #7662. Signed-off-by: Nazarii Hnydyn <nazariig@nvidia.com>	2021-06-10 12:49:18 -07:00
yozhao101	fb2c995f53	[202012][Monit] Deprecate the feature of monitoring the critical processes by Monit (#7823 ) Signed-off-by: Yong Zhao yozhao@microsoft.com Why I did it Currently we leveraged the Supervisor to monitor the running status of critical processes in each container and it is more reliable and flexible than doing the monitoring by Monit. So we removed the functionality of monitoring the critical processes by Monit. How I did it I removed the script process_checker and corresponding Monit configuration entries of critical processes. How to verify it I verified this on the device str-7260cx3-acs-1.	2021-06-09 09:04:22 -07:00
Joe LeVeque	dd9be59cd1	[202012][dockers][supervisor] Increase event buffer size for process exit listener; Set all event buffer sizes to 1024 (#7203 ) #### Why I did it Backport of https://github.com/Azure/sonic-buildimage/pull/7083 to the 202012 branch. To prevent error [messages](https://dev.azure.com/mssonic/build/_build/results?buildId=2254&view=logs&j=9a13fbcd-e92d-583c-2f89-d81f90cac1fd&t=739db6ba-1b35-5485-5697-de102068d650&l=802) like the following from being logged: ``` Mar 17 02:33:48.523153 vlab-01 INFO swss#supervisord 2021-03-17 02:33:48,518 ERRO pool supervisor-proc-exit-listener event buffer overflowed, discarding event 46 ``` This is basically an addendum to https://github.com/Azure/sonic-buildimage/pull/5247, which increased the event buffer size for dependent-startup. While supervisor-proc-exit-listener doesn't subscribe to as many events as dependent-startup, there is still a chance some containers (like swss, as in the example above) have enough processes running to cause an overflow of the default buffer size of 10. This is especially important for preventing erroneous log_analyzer failures in the sonic-mgmt repo regression tests, which have started occasionally causing PR check builds to fail. Example [here](https://dev.azure.com/mssonic/build/_build/results?buildId=2254&view=logs&j=9a13fbcd-e92d-583c-2f89-d81f90cac1fd&t=739db6ba-1b35-5485-5697-de102068d650&l=802). I set all supervisor-proc-exit-listener event buffer sizes to 1024, and also updated all dependent-startup event buffer sizes to 1024, as well, to keep things simple, unified, and allow headroom so that we will not need to adjust these values frequently, if at all.	2021-04-01 12:52:19 -07:00
yozhao101	cc9c3f567e	[supervisord] Monitoring the critical processes with supervisord. (#6242 ) - Why I did it Initially, we used Monit to monitor critical processes in each container. If one of critical processes was not running or crashed due to some reasons, then Monit will write an alerting message into syslog periodically. If we add a new process in a container, the corresponding Monti configuration file will also need to update. It is a little hard for maintenance. Currently we employed event listener of Supervisod to do this monitoring. Since processes in each container are managed by Supervisord, we can only focus on the logic of monitoring. - How I did it We borrowed the event listener of Supervisord to monitor critical processes in containers. The event listener will take following steps if it was notified one of critical processes exited unexpectedly: The event listener will first check whether the auto-restart mechanism was enabled for this container or not. If auto-restart mechanism was enabled, event listener will kill the Supervisord process, which should cause the container to exit and subsequently get restarted. If auto-restart mechanism was not enabled for this contianer, the event listener will enter a loop which will first sleep 1 minute and then check whether the process is running. If yes, the event listener exits. If no, an alerting message will be written into syslog. - How to verify it First, we need checked whether the auto-restart mechanism of a container was enabled or not by running the command show feature status. If enabled, one critical process should be selected and killed manually, then we need check whether the container will be restarted or not. Second, we can disable the auto-restart mechanism if it was enabled at step 1 by running the commnad sudo config feature autorestart <container_name> disabled. Then one critical process should be selected and killed. After that, we will see the alerting message which will appear in the syslog every 1 minute. - Which release branch to backport (provide reason below if selected) 201811 201911 [x ] 202006	2021-01-28 09:28:27 -08:00
lguohan	4d3eb18ca7	[supervisord]: use abspath as supervisord entrypoint (#5995 ) use abspath makes the entrypoint not affected by PATH env. Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-11-22 21:18:44 -08:00
Joe LeVeque	7bf05f7f4f	[supervisor] Install vanilla package once again, install Python 3 version in Buster container (#5546 ) - Why I did it We were building a custom version of Supervisor because I had added patches to prevent hangs and crashes if the system clock ever rolled backward. Those changes were merged into the upstream Supervisor repo as of version 3.4.0 (http://supervisord.org/changes.html#id9), therefore, we should be able to simply install the vanilla package via pip. This will also allow us to easily move to Python 3, as Python 3 support was added in version 4.0.0. - How I did it - Remove Makefiles and patches for building supervisor package from source - Install Python 3 supervisor package version 4.2.1 in Buster base container - Also install Python 3 version of supervisord-dependent-startup in Buster base container - Debian package installed binary in `/usr/bin/`, but pip package installs in `/usr/local/bin/`, so rather than update all absolute paths, I changed all references to simply call `supervisord` and let the system PATH find the executable to prevent future need for changes just in case we ever need to switch back to build a Debian package, then we won't need to modify these again. - Install Python 2 supervisor package >= 3.4.0 in Stretch and Jessie base containers	2020-11-19 23:41:32 -08:00
abdosi	84aa99d04b	[multi-asic] teamdctl support for multi-asic (#5851 ) Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>	2020-11-09 12:31:33 -08:00
abdosi	dddf96933c	[monit] Adding patch to enhance syslog error message generation for monit alert action when status is failed. (#5720 ) Why/How I did: Make sure first error syslog is triggered based on FAULT TOLERANCE condition. Added support of repeat clause with alert action. This is used as trigger for generation of periodic syslog error messages if error is persistent Updated the monit conf files with repeat every x cycles for the alert action	2020-10-31 17:29:49 -07:00
yozhao101	13cec4c486	[Monit] Unmonitor the processes in containers which are disabled. (#5153 ) We want to let Monit to unmonitor the processes in containers which are disabled in `FEATURE` table such that Monit will not generate false alerting messages into the syslog. Signed-off-by: Yong Zhao <yozhao@microsoft.com>	2020-09-25 00:28:28 -07:00
Joe LeVeque	5b3b4804ad	[dockers][supervisor] Increase event buffer size for dependent-startup (#5247 ) When stopping the swss, pmon or bgp containers, log messages like the following can be seen: ``` Aug 23 22:50:43.789760 sonic-dut INFO swss#supervisord 2020-08-23 22:50:10,061 ERRO pool dependent-startup event buffer overflowed, discarding event 34 Aug 23 22:50:43.789760 sonic-dut INFO swss#supervisord 2020-08-23 22:50:10,063 ERRO pool dependent-startup event buffer overflowed, discarding event 35 Aug 23 22:50:43.789760 sonic-dut INFO swss#supervisord 2020-08-23 22:50:10,064 ERRO pool dependent-startup event buffer overflowed, discarding event 36 Aug 23 22:50:43.789760 sonic-dut INFO swss#supervisord 2020-08-23 22:50:10,066 ERRO pool dependent-startup event buffer overflowed, discarding event 37 ``` This is due to the number of programs in the container managed by supervisor, all generating events at the same time. The default event queue buffer size in supervisor is 10. This patch increases that value in all containers in order to eliminate these errors. As more programs are added to the containers, we may need to further adjust these values. I increased all buffer sizes to 25 except for containers with more programs or templated supervisor.conf files which allow for a variable number of programs. In these cases I increased the buffer size to 50. One final exception is the swss container, where the buffer fills up to ~50, so I increased this buffer to 100. Resolves https://github.com/Azure/sonic-buildimage/issues/5241	2020-09-08 23:36:38 -07:00
joyas-joseph	71e93d921c	[docker-team]: upgrade docker-teamd to buster (#4914 ) Signed-off-by: Joyas Joseph <joyas_joseph@dell.com>	2020-07-12 18:08:52 +00:00
pavel-shirshov	1eb3dfe541	[docker-teamd]: Introducing tlm_teamd: telemetry for teamd (#4824 ) - What I did 1. Updated submodule sonic-swss to bring tlm_teamd to the buildimage. 2. Updated supervisord for the teamd 3. Updated critical process list (not sure that tlm_teamd is critical for now) - How to verify it Build an image and run. Check that tlm_teamd is running and STATE_DB has information in the LAG_INTERFACE, and :LAG_MEMBER_INTERFACE ``` admin@sonic:~$ redis-cli -n 6 hgetall 'LAG_TABLE\|PortChannel16' 1) "state" 2) "ok" 3) "team_device.ifinfo.dev_addr" 4) "4c:76:25:f5:48:80" 5) "setup.kernel_team_mode_name" 6) "loadbalance" 7) "team_device.ifinfo.ifindex" 8) "6" 9) "runner.fast_rate" 10) "false" 11) "runner.active" 12) "true" 13) "setup.pid" 14) "35" 15) "runner.fallback" 16) "false" ``` ``` admin@sonic:~$ redis-cli -n 6 hgetall 'LAG_MEMBER_TABLE\|PortChannel16\|Ethernet16' 1) "runner.selected" 2) "true" 3) "runner.aggregator.selected" 4) "true" 5) "runner.aggregator.id" 6) "26" 7) "runner.actor_lacpdu_info.state" 8) "61" 9) "runner.state" 10) "current" 11) "runner.actor_lacpdu_info.system" 12) "4c:76:25:f5:48:80" 13) "runner.partner_lacpdu_info.state" 14) "61" 15) "link.up" 16) "true" 17) "ifinfo.dev_addr" 18) "4c:76:25:f5:48:80" 19) "ifinfo.ifindex" 20) "26" 21) "link_watches.list.link_watch_0.up" 22) "true" 23) "runner.actor_lacpdu_info.port" 24) "17" 25) "runner.partner_lacpdu_info.port" 26) "1" 27) "runner.partner_lacpdu_info.system" 28) "52:54:00:ff:34:1b" ```	2020-06-27 01:22:23 -07:00
yozhao101	4fa81b4f8d	[dockers] Update critical_processes file syntax (#4831 ) - Why I did it Initially, the critical_processes file contains either the name of critical process or the name of group. For example, the critical_processes file in the dhcp_relay container contains a single group name `isc-dhcp-relay`. When testing the autorestart feature of each container, we need get all the critical processes and test whether a container can be restarted correctly if one of its critical processes is killed. However, it will be difficult to differentiate whether the names in the critical_processes file are the critical processes or group names. At the same time, changing the syntax in this file will separate the individual process from the groups and also makes it clear to the user. Right now the critical_processes file contains two different kind of entries. One is "program:xxx" which indicates a critical process. Another is "group:xxx" which indicates a group of critical processes managed by supervisord using the name "xxx". At the same time, I also updated the logic to parse the file critical_processes in supervisor-proc-event-listener script. - How to verify it We can first enable the autorestart feature of a specified container for example `dhcp_relay` by running the comman `sudo config container feature autorestart dhcp_relay enabled` on DUT. Then we can select a critical process from the command `docker top dhcp_relay` and use the command `sudo kill -SIGKILL <pid>` to kill that critical process. Final step is to check whether the container is restarted correctly or not.	2020-06-25 21:18:21 -07:00
Guohan Lu	448d1cdd49	[docker-teamd]: use service dependency in supervisord to start services	2020-05-22 11:01:28 -07:00
lguohan	60b16495cc	[docker-base-stretch]: move common packages into docker-base-stretch (#4371 ) libpython2.7, libdaemon0, libdbus-1-3, libjansson4 are common across different containers. move them into docker-base-stretch Signed-off-by: Guohan Lu <lguohan@gmail.com>	2020-04-05 13:29:34 -07:00
yozhao101	91e5fb5602	[Service] Enable/disable container auto-restart based on configuration. (#4073 )	2020-02-07 12:34:07 -08:00
judyjoseph	c95d8e4d47	[teamd]: increase startsecs to 5 seconds for teamsyncd (#4083 ) Updating the startsecs=5sec for teamsyncd to make the time for which the process needs to stay up before declaring the startup successfull.	2020-01-30 00:46:55 -08:00
pavel-shirshov	1848fb262b	[fast-reboot]: Save fast-reboot state into the db (#3741 ) Put a flag for fast-reboot to the db using EXPIRE feature. Using this flag in other part of SONiC to start in Fast-reboot mode. If we reload a config, the state in the db will be removed.	2019-12-04 14:10:19 -08:00
yozhao101	4c31ef3cd2	[Services] Restart Teamd service upon unexpected critical process exit. (#3703 ) Signed-off-by: Yong Zhao <yozhao@microsoft.com>	2019-11-04 17:45:41 -08:00
Jipan Yang	9a8202a39d	[database]: Update redis to 5.0.3 (#3066 ) Signed-off-by: Jipan Yang <jipan.yang@alibaba-inc.com>	2019-07-03 22:16:09 -07:00
Stepan Blyshchak	81cf33231f	[build]: Improve dockerfile instructions (#3048 ) - create a dockerfile-marcros.j2 file with all common operations written as j2 macro - use single dockerfile instruction for COPY and RUN commands when possible to improve build time - reorganize dockerfile instructions to make more cache friendly (in case someday we will remove --no-cache to build docker images) Signed-off-by: Stepan Blyschak <stepanb@mellanox.com>	2019-06-22 11:26:23 -07:00
Haiyang Zheng	0af5f0b6b5	[docker-team]: update teamd docker to stretch (#2734 ) Signed-off-by: Haiyang Zheng <haiyang.z@alibaba-inc.com>	2019-04-12 10:14:51 -07:00
zhenggen-xu	51a76614a3	Restore neighbor table to kernel during system warm-reboot (#2213 ) * Restore neighbor table to kernel during system warm-reboot Added a service: "restore_neighbors" to restore neighbor table into kernel during system warm reboot. The service is started by supervisord in swss docker when the docker is started. In case system warm reboot is enabled, it will try to restore the neighbor table from appDB into kernel through netlink API calls and update the neighbor table by sending arp/ns requests to all neighbor entries, then it sets the stateDB flag for neighsyncd to continue the reconciliation process. -- Added tcpdump python-scapy debian package into orchagent and vs dockers. -- Added python module: pyroute2 netifaces into orchagent and vc dockers. -- Workarounded tcpdump issue in the vs docker Signed-off-by: Zhenggen Xu <zxu@linkedin.com> * Move the restore_neighbors.py to sonic-swss submodule Made changes to makefiles accordingly Make dockerfile.j2 changes and supervisord config changes Add python monotonic lib for time access Signed-off-by: Zhenggen Xu <zxu@linkedin.com> * Added PYTHON_SWSSCOMMON as swss runtime dependency Signed-off-by: Zhenggen Xu <zxu@linkedin.com>	2018-11-09 17:06:09 -08:00
pavel-shirshov	61fe8fd2b5	Add /host filesystem into teamd docker (#2235 )	2018-11-09 16:14:03 -08:00
Shuotian Cheng	7313e7d9bc	[teamd]: Add teammgrd in docker-teamd (#2064 ) Remove the teamd.j2 templates used for starting the teamd. Add teammgrd instead to manage all port channel related configuration changes. Remove front panel port related configurations in interfaces.j2 templates as well. Remove teamd.sh script and use teammgrd to start all the teamd processes. Remove all the logics in the start.sh script as well. Update the sonic-swss submodule. Signed-off-by: Shu0T1an ChenG <shuche@microsoft.com>	2018-10-19 03:41:53 -07:00
pavel-shirshov	3e1b9e17b4	[teamd] Force team device recreation in case it already exists (#2168 ) * Force team device recreation in case it already exists	2018-10-18 15:22:26 -07:00
lguohan	f3ca7c422f	[rsyslog]: use # to separate container name and program name in syslog message (#1918 ) Previously use / to separate container name and program name. However, in rsyslogd: Precisely, the programname is terminated by either (whichever occurs first): end of tag nonprintable character ‘:’ ‘[‘ ‘/’ The above definition has been taken from the FreeBSD syslogd sources. Signed-off-by: Guohan Lu <gulv@microsoft.com>	2018-08-12 22:23:58 -07:00
Andriy Moroz	dadc17d9e6	[Mellanox] Use MAC from EEPROM for PortChannels and VLAN Interfaces (#1793 ) * Use MAC from EEPROM for PortChannels Signed-off-by: Andriy Moroz <c_andriym@mellanox.com> * Use MAC from EEPROM in DEVICE_METADATA Will affect MAC for VLAN interfaces Signed-off-by: Andriy Moroz <c_andriym@mellanox.com> * Get MAC via decode-syseeprom Signed-off-by: Andriy Moroz <c_andriym@mellanox.com> * hw-management is now a service Signed-off-by: Andriy Moroz <c_andriym@mellanox.com> * Add error handling for MAC fetch process Signed-off-by: Andriy Moroz <c_andriym@mellanox.com>	2018-07-23 15:51:03 -07:00
Qi Luo	7ba08e5bf6	Prefix docker container name to syslog syslogtag (program name) (#1810 )	2018-06-25 10:48:42 -07:00
Joe LeVeque	e1cb2ace36	[base image files] All 'docker exec' wrapper scripts now dynamically adjust their flags depending on whether or not they are run on a terminal (#1507 )	2018-03-17 00:43:29 -07:00
Haiyang Zheng	2abdf8dc58	[libteam] Add fallback support for single-member-port LAG (#1118 ) * [libteam] Add fallback support for single-member-port LAG * Allow the port to be selected if the LAG is configured with fallback and port is in defaulted state due to missing LACP PDUs from remote end * Only enable port if LAG is admin up and the member port is link up * [team] Add lacp fallback config to teamd.j2 template * [teamd] Resolve config conflict between fallback and minlink * Remove min_link config if fallback is configured * Add support for fallback config in minigraph * [teamd] Only enable fallback if it is single-member-port LAG Signed-off-by: Haiyang Zheng <haiyang.z@alibaba-inc.com> * [teamd] Removing the admin status check in lacp_port_link_update Will submit another pull request to fix this issue. Signed-off-by: Haiyang Zheng <haiyang.z@alibaba-inc.com>	2017-12-16 11:28:18 -08:00
Shuotian Cheng	c93c008bae	Revert "[docker-teamd]: Manage teamd and teamsyncd processes with supervisor (#1137 )" (#1156 ) This reverts commit `a6edef2fa5`. The reason to revert this commit is that it breaks the current nightly test as no port channel interfaces are get created after boot. teamd failed to start and complained about 'Cannot allocate memory' possibly due to nlmsg_alloc function failure. Will revert this commit to investigate it further before moving to supervisor. Signed-off-by: Shu0T1an ChenG <shuche@microsoft.com>	2017-11-16 10:30:41 -08:00
Joe LeVeque	a6edef2fa5	[docker-teamd]: Manage teamd and teamsyncd processes with supervisor (#1137 )	2017-11-13 14:16:19 -08:00
Shuotian Cheng	d1156ca3b2	[teamd]: Bring down all member interfaces before starting teamd (#1081 ) teamd requires all members to be set down before adding as a team port Signed-off-by: Shu0T1an ChenG <shuche@microsoft.com>	2017-10-26 14:20:44 -07:00
Shuotian Cheng	0f6c8c14e8	[teamd]: Remove deprecated blocking logic before starting teamd (#976 ) With the fixes in /etc/network/interfaces file, host interfaces could be added into the corresponding LAGs automatically. Thus, the logic of checking if port initialization is ready is no longer needed. Signed-off-by: Shu0T1an ChenG <shuche@microsoft.com>	2017-09-21 14:56:16 -07:00
Taoyu Li	c9cc7aea41	[configdb] Migrate minigraph configurations to DB (#942 ) Modify minigraph parser output format so it fit DB schema Modify configuration templates to fit new schema Systemd services dependencies are modified so database starts before any configuration consumer	2017-09-12 14:13:27 -07:00
Joe LeVeque	f49cac086f	Remove extra trailing newlines at EOF (#804 ) Files now end with a single newline	2017-07-12 20:54:37 -07:00
Shuotian Cheng	a74b3a1eb7	[teamd]: Fix Jinja2 template for calculating min_ports (#791 ) In Jinja2, '\|' cannot be treated directly as piping operator. The operator precedence of '\|' is higher than '*'. The filter only applies to the value just before it. Group the expression to make sure that the filter is applied to the outcome of the expression. Update the unit test to add such case.	2017-07-06 16:33:24 -07:00
Marian Pritsak	3903b45d41	[teamd.sh]: Remove LAG interfaces on exit (#643 ) Use -k option for teamd to properly remove LAG interfaces when docker is exiting. Signed-off-by: marian-pritsak <marianp@mellanox.com>	2017-06-08 01:53:51 -07:00
Marian Pritsak	820e7aafd0	[docker-teamd]: Explicitly set LAG hwaddr (#664 ) * [docker-teamd]: Explicitly set LAG hwaddr Team device is initially created without any members and has a random HW address, which is later changed to port's address. This configuration sets team device's address explicitly to base MAC to avoid reassignment. Signed-off-by: marian-pritsak <marianp@mellanox.com> * Update teamd config tests with hwaddr Signed-off-by: marian-pritsak <marianp@mellanox.com> * Align HW addr byte for Centec and Mellanox Signed-off-by: marian-pritsak <marianp@mellanox.com> * Change HW addr to unicast in config tests Signed-off-by: marian-pritsak <marianp@mellanox.com>	2017-06-06 16:13:38 -07:00
Joe LeVeque	d5c13c0a83	[dockers]: Disable autorestart on all supervisor processes inside containers (#580 )	2017-05-09 17:37:08 -07:00
Joe LeVeque	8f348399f5	[Dockers]: Manage all Docker containers with Supervisord (#573 ) - Consolidate config.sh and start.sh scripts into one script (start.sh) - Solve issue #435 - All dockers now run supervisord as their ENTRYPOINT - All stdout/stderr output from processes managed by supervisord is now sent to syslog instead of their own files - Supervisord log messages are now also sent to syslog - Removed unused smartmontools package from docker-platform-monitor	2017-05-08 15:43:31 -07:00
pavel-shirshov	814fd87e63	Remove /var/run/rsyslogd.pid bofore starting rsyslog (#453 )	2017-03-29 18:07:25 -07:00
Shuotian Cheng	27dae90726	[docker-teamd]: Clean /etc/teamd/ folder before adding new configurations (#451 ) Signed-off-by: Shuotian Cheng <shuche@microsoft.com>	2017-03-28 15:49:48 -07:00
Shuotian Cheng	7b7a61693a	[sonic-cfggen]: Add -p option and add teamd.j2 test (#414 ) - Add -p --port-config option to feed sonic-cfggen with port_config.ini file when necessary. - Update minigraph.py file to accept the -p option - Add test_j2files.py test to test config.sh and all .j2 templates * Currently test_teamd is added to test both the config.sh and teamd.j2 file works well with the t0 sample minigraph and sample port config file * The sample output is added to the folder sample_output for comparison Signed-off-by: Shuotian Cheng <shuche@microsoft.com>	2017-03-17 21:38:20 -07:00
Shuotian Cheng	783be14a5d	[minigraph]: Add portchannels/vlans dictionary and update teamd templates (#408 ) - minigraph_portchannel_interfaces and minigraph_vlan_interfaces are lists of interfaces and the name could duplicate due to multiple IPs - Add minigraph_portchannels and minigraph_vlans dictionaries to support querying port channels and vlans via the name - Update teamd.j2 template and config.sh file in docker-teamd - Update zebra.conf.j2 template to add port channel interfaces Signed-off-by: Shuotian Cheng <shuche@microsoft.com>	2017-03-17 16:48:13 -07:00
Joe LeVeque	5dafa907b2	[dockers]: Add base image files to syncd-brcm, database and teamd (#380 )	2017-03-06 12:22:42 -08:00
Shuotian Cheng	f9d8137975	[teamd]: Add redis-tools as the dependency of docker-teamd and fix bugs (#363 ) - Fix the if condition bug to wait till PORT_TABLE:ConfigDone before start Signed-off-by: Shuotian Cheng <shuche@microsoft.com>	2017-03-02 18:07:43 -08:00
Shuotian Cheng	f06dc5d3f9	[teamd]: Update the start.sh script to clean up the docker state (#351 ) This change should be temporary because the current teamd cannot re-create net devices acrosss restart. Basically, it will fail when there're files in /var/run/teamd/ folder or the previously created net devices are still there. Thus, the current workaround is to remove the obsolete files to restart the docker-teamd. This workaround cannot resolve the swss restart issue. Before restarting swss, docker teamd needs to be stopped manually. After swss starts, docker teamd needs to be restarted manually. This change will only make sure that rebooting the switch will make the switch at the correct state. Signed-off-by: Shuotian Cheng <shuche@microsoft.com>	2017-03-02 15:59:16 -08:00

1 2

63 Commits