sonic-buildimage

Author	SHA1	Message	Date
yozhao101	0dfa79d3c4	[pft_docker] Checks out the update of gNMI client script. (#11450 ) Signed-off-by: Yong Zhao yozhao@microsoft.com Why I did it This PR checks out commit f2b11e4, which was introduced by the update to the gNMI Python client. How I did it I changed Dockerfile.j2 so that the updated gNMI client script is checked out when the PTF docker image is built. How to verify it A PTF container image was built and then loaded on a testbed. I confirmed that the updated gNMI client script was present.	2022-07-18 14:39:47 -07:00
tjchadaga	077a537b14	Log message fix for TSB (#11441 )	2022-07-14 12:26:58 -07:00
賓少鈺	f92aca837d	PDE migration to bullseye (#10836 ) #### Why I did it Upgrade docker-pde to bullseye #### How to verify it Check Azp status	2022-07-13 11:58:47 -07:00
tjchadaga	849eb4bf32	Changes to persist TSA/B state across reloads (#11257 )	2022-07-12 00:22:48 -07:00
Hua Liu	a9b7a1facd	Replace swsssdk with swsscommon (#11215 ) #### Why I did it Update scripts in sonic-buildimage from py-swsssdk to swsscommon #### How I did it Change code to use swsscommon. #### How to verify it Pass all E2E test cases #### Description for the changelog Update scripts in sonic-buildimage from py-swsssdk to swsscommon	2022-07-11 10:01:10 +08:00
Ye Jianquan	27d53cbba7	FIX the build error introduced by textfsm 1.1.3(Published on 2022/7/6) (#11394 ) Why I did it The sonic-mgmt docker image build fails because of the new textfsm version (1.1.3). https://dev.azure.com/mssonic/build/_build/results?buildId=119147&view=logs&j=3dc8fd7e-4368-5a92-293e-d53cefc8c4b3&t=44e6c678-cb87-52d9-8547-bcdbd0ad6ae4&l=43043 How I did it Pin the textfsm version to 1.1.2. How to verify it I built the image in my local environment and reproduced the issue with 1.1.3; it is fixed after changing the version to 1.1.2. Signed-off-by: jianquanye@microsoft.com	2022-07-09 19:02:10 +08:00
Ye Jianquan	5873e8ef9c	Upgrade snappi version to 0.7.44 (#11335 ) Why I did it Keysight provides a new version with fixes related to the snappi API source code: snappi[ixnetwork,convergence]==0.7.44 How I did it Upgrade snappi version to 0.7.44 How to verify it Check that it is installed in the sonic-mgmt docker container	2022-07-06 16:59:19 +08:00
yozhao101	1720fa21d9	[tunnel_packet_handler] Add a whitespace in the warning syslog message. (#11232 ) This PR adds a whitespace in the warning syslog message of the tunnel_packet_handler process. Signed-off-by: Yong Zhao <yozhao@microsoft.com>	2022-06-28 17:29:02 -07:00
Yakiv Huryk	0ced7081c7	[asan] add print_suppressions=0 to ASAN configs (#11252 ) - Why I did it To make it possible to suppress ASAN false positives and get a clean ASAN report for the docker-sonic-vs/mlnx-syncd/orchagent dockers - How I did it Added "print_suppressions=0" to the ASAN configs. - How to verify it Add a suppression to some ASAN-enabled component (the suppression should catch some leak), build with ENABLE_ASAN=y, run a test, and see that the ASAN report is empty instead of containing the suppression summary. Signed-off-by: Yakiv Huryk <yhuryk@nvidia.com>	2022-06-28 18:45:52 +03:00
Prince George	11ff80b786	Skip CMIS manager (#10907 ) * Removed unwanted changes * Fix j2 compilation error * Address review comment * Add newline	2022-06-22 10:14:06 -07:00
Junhua Zhai	04ea32b0c2	[macsec] CLI Supports display of gearbox macsec counter (#11113 ) Why I did it To support gearbox macsec counter display, following Azure/sonic-swss-common#622. How I did it Use swsscommon CounterTable API	2022-06-18 19:17:05 +08:00
Saikrishna Arcot	ead106eda8	Add ping to swss-layer docker (#11093 ) Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>	2022-06-10 07:40:37 -07:00
Sudharsan Dhamal Gopalarathnam	14f6f70ca3	[BGP]Adding configuration knob to allow advertise Loopback ipv6 /128 prefix (#10958 ) * [BGP]Adding configuration knob to allow advertise Loopback ipv6 /128 prefix By default, when an IPv6 address is configured with a /128 subnet mask on the Loopback0 interface, it is advertised as a prefix with a /64 subnet. To control this behavior, a new field 'bgp_adv_lo_prefix_as_128' is introduced in the DEVICE_METADATA table; when set to true, the prefix is advertised with the /128 subnet as is.	2022-06-06 08:51:04 -07:00
kellyyeh	6bdbe14975	[dhcp_relay] Add "vlan missing ip helper" dhcp relay unittest (#10654 )	2022-06-04 11:37:04 -07:00
abdosi	0285bfe42e	[chassis] Fix issues regarding database service failure handling and mid-plane connectivity for namespace. (#10500 ) What/Why I did: Issue 1: Setting up the ipvlan interface in interface-config.sh is not tolerant of failures, because interface-config.service is one-shot and has no restart capability. Scenario: if the database service goes into a failed state, the interface service also fails because of the dependency check; when the database service is later restarted, the interface service remains stuck and the ipvlan interface never gets created. Solution: Moved all the logic from the interface-config service into the database service, which is also more logical since the namespace is created there and all the network settings (sysctl) happen there. With this, whenever the database starts we recreate the interface. Issue 2: Use of IPVLAN vs MACVLAN. Currently we are using ipvlan mode. However, the failure scenario above is not handled correctly by ipvlan mode. Once the ipvlan interface is created and an IP address is assigned to it, restarting the interface-config or database (new PR) service makes the Linux kernel return the error "Error: Address already assigned to an ipvlan device." based on this: https://github.com/torvalds/linux/blob/master/drivers/net/ipvlan/ipvlan_main.c#L978 The reason is that if we do not clean up the IP address assignment (which needs to be unique for IPVLAN), it remains in the kernel database and never returns to the free pool, even though the namespace is deleted. Solution: Considering this hard dependency on a unique IP, macvlan mode is better for us, since everything is managed by the Linux kernel and there is no dependency on the user-configured IP address. Issue 3: The namespace database service does not check reachability to the Supervisor Redis Chassis Server. Currently there is no explicit check, as we never do a Redis PING from the namespace to the Supervisor Redis Chassis Server. Without this check it is possible that we start the database and all other dockers even though there is no connectivity, and hit the error/failure late in the cycle. Solution: Added an explicit PING from the namespace that checks this reachability. Issue 4: flushdb gives an exception when trying to access the Chassis Server DB over a Unix socket. Solution: Handle it gracefully via try..except and log the message.	2022-05-24 16:54:12 -07:00
kellyyeh	2ead3aaefc	[dhcp6relay] Fix option parsing and add dhcpv6 client messages (#10819 )	2022-05-24 14:37:16 -07:00
Ze Gan	0156c21eff	[macsec-cli]: Fixing to config MACsec on the port will clear port attributes in config db (#10903 ) Why I did it There is a bug where the port attributes in CONFIG_DB are cleared when using sudo config macsec port add Ethernet0 or sudo config macsec port del Ethernet0. How I did it Fetch the port attributes before setting/removing the MACsec field in the port table. Signed-off-by: Ze Gan <ganze718@gmail.com>	2022-05-24 18:42:54 +08:00
Ze Gan	910e1c6eb4	[docker-macsec]: MACsec CLI Plugin (#9390 ) #### Why I did it To provide MACsec config and show CLI for manipulating MACsec #### How I did it Add `config macsec` and `show macsec`. #### How to verify it This PR includes unittest for MACsec CLI, check Azp status. - Add MACsec profile ``` admin@sonic:~$ sudo config macsec profile add --help Usage: config macsec profile add [OPTIONS] <profile_name> Add MACsec profile Options: --priority <priority> For Key server election. In 0-255 range with 0 being the highest priority. [default: 255] --cipher_suite <cipher_suite> The cipher suite for MACsec. [default: GCM- AES-128] --primary_cak <primary_cak> Primary Connectivity Association Key. [required] --primary_ckn <primary_cak> Primary CAK Name. [required] --policy <policy> MACsec policy. INTEGRITY_ONLY: All traffic, except EAPOL, will be converted to MACsec packets without encryption. SECURITY: All traffic, except EAPOL, will be encrypted by SecY. [default: security] --enable_replay_protect / --disable_replay_protect Whether enable replay protect. [default: False] --replay_window <enable_replay_protect> Replay window size that is the number of packets that could be out of order. This field works only if ENABLE_REPLAY_PROTECT is true. [default: 0] --send_sci / --no_send_sci Send SCI in SecTAG field of MACsec header. [default: True] --rekey_period <rekey_period> The period of proactively refresh (Unit second). [default: 0] -?, -h, --help Show this message and exit. ``` - Delete MACsec profile ``` admin@sonic:~$ sudo config macsec profile del --help Usage: config macsec profile del [OPTIONS] <profile_name> Delete MACsec profile Options: -?, -h, --help Show this message and exit. ``` - Enable MACsec on the port ``` admin@sonic:~$ sudo config macsec port add --help Usage: config macsec port add [OPTIONS] <port_name> <profile_name> Add MACsec port Options: -?, -h, --help Show this message and exit. 
``` - Disable MACsec on the port ``` admin@sonic:~$ sudo config macsec port del --help Usage: config macsec port del [OPTIONS] <port_name> Delete MACsec port Options: -?, -h, --help Show this message and exit. ``` Show MACsec ``` MACsec port(Ethernet0) --------------------- ----------- cipher_suite GCM-AES-256 enable true enable_encrypt true enable_protect true enable_replay_protect false replay_window 0 send_sci true --------------------- ----------- MACsec Egress SC (5254008f4f1c0001) ----------- - encoding_an 2 ----------- - MACsec Egress SA (1) ------------------------------------- ---------------------------------------------------------------- auth_key 849B69D363E2B0AA154BEBBD7C1D9487 next_pn 1 sak AE8C9BB36EA44B60375E84BC8E778596289E79240FDFA6D7BA33D3518E705A5E salt 000000000000000000000000 ssci 0 SAI_MACSEC_SA_ATTR_CURRENT_XPN 179 SAI_MACSEC_SA_STAT_OCTETS_ENCRYPTED 0 SAI_MACSEC_SA_STAT_OCTETS_PROTECTED 0 SAI_MACSEC_SA_STAT_OUT_PKTS_ENCRYPTED 0 SAI_MACSEC_SA_STAT_OUT_PKTS_PROTECTED 0 ------------------------------------- ---------------------------------------------------------------- MACsec Egress SA (2) ------------------------------------- ---------------------------------------------------------------- auth_key 5A8B8912139551D3678B43DD0F10FFA5 next_pn 1 sak 7F2651140F12C434F782EF9AD7791EE2CFE2BF315A568A48785E35FC803C9DB6 salt 000000000000000000000000 ssci 0 SAI_MACSEC_SA_ATTR_CURRENT_XPN 87185 SAI_MACSEC_SA_STAT_OCTETS_ENCRYPTED 0 SAI_MACSEC_SA_STAT_OCTETS_PROTECTED 0 SAI_MACSEC_SA_STAT_OUT_PKTS_ENCRYPTED 0 SAI_MACSEC_SA_STAT_OUT_PKTS_PROTECTED 0 ------------------------------------- ---------------------------------------------------------------- MACsec Ingress SC (525400edac5b0001) MACsec Ingress SA (1) --------------------------------------- ---------------------------------------------------------------- active true auth_key 849B69D363E2B0AA154BEBBD7C1D9487 lowest_acceptable_pn 1 sak AE8C9BB36EA44B60375E84BC8E778596289E79240FDFA6D7BA33D3518E705A5E 
salt 000000000000000000000000 ssci 0 SAI_MACSEC_SA_ATTR_CURRENT_XPN 103 SAI_MACSEC_SA_STAT_IN_PKTS_DELAYED 0 SAI_MACSEC_SA_STAT_IN_PKTS_INVALID 0 SAI_MACSEC_SA_STAT_IN_PKTS_LATE 0 SAI_MACSEC_SA_STAT_IN_PKTS_NOT_USING_SA 0 SAI_MACSEC_SA_STAT_IN_PKTS_NOT_VALID 0 SAI_MACSEC_SA_STAT_IN_PKTS_OK 0 SAI_MACSEC_SA_STAT_IN_PKTS_UNCHECKED 0 SAI_MACSEC_SA_STAT_IN_PKTS_UNUSED_SA 0 SAI_MACSEC_SA_STAT_OCTETS_ENCRYPTED 0 SAI_MACSEC_SA_STAT_OCTETS_PROTECTED 0 --------------------------------------- ---------------------------------------------------------------- MACsec Ingress SA (2) --------------------------------------- ---------------------------------------------------------------- active true auth_key 5A8B8912139551D3678B43DD0F10FFA5 lowest_acceptable_pn 1 sak 7F2651140F12C434F782EF9AD7791EE2CFE2BF315A568A48785E35FC803C9DB6 salt 000000000000000000000000 ssci 0 SAI_MACSEC_SA_ATTR_CURRENT_XPN 91824 SAI_MACSEC_SA_STAT_IN_PKTS_DELAYED 0 SAI_MACSEC_SA_STAT_IN_PKTS_INVALID 0 SAI_MACSEC_SA_STAT_IN_PKTS_LATE 0 SAI_MACSEC_SA_STAT_IN_PKTS_NOT_USING_SA 0 SAI_MACSEC_SA_STAT_IN_PKTS_NOT_VALID 0 SAI_MACSEC_SA_STAT_IN_PKTS_OK 0 SAI_MACSEC_SA_STAT_IN_PKTS_UNCHECKED 0 SAI_MACSEC_SA_STAT_IN_PKTS_UNUSED_SA 0 SAI_MACSEC_SA_STAT_OCTETS_ENCRYPTED 0 SAI_MACSEC_SA_STAT_OCTETS_PROTECTED 0 --------------------------------------- ---------------------------------------------------------------- MACsec port(Ethernet1) --------------------- ----------- cipher_suite GCM-AES-256 enable true enable_encrypt true enable_protect true enable_replay_protect false replay_window 0 send_sci true --------------------- ----------- MACsec Egress SC (5254008f4f1c0001) ----------- - encoding_an 1 ----------- - MACsec Egress SA (1) ------------------------------------- ---------------------------------------------------------------- auth_key 35FC8F2C81BCA28A95845A4D2A1EE6EF next_pn 1 sak 1EC8572B75A840BA6B3833DC550C620D2C65BBDDAD372D27A1DFEB0CD786671B salt 000000000000000000000000 ssci 0 SAI_MACSEC_SA_ATTR_CURRENT_XPN 
4809 SAI_MACSEC_SA_STAT_OCTETS_ENCRYPTED 0 SAI_MACSEC_SA_STAT_OCTETS_PROTECTED 0 SAI_MACSEC_SA_STAT_OUT_PKTS_ENCRYPTED 0 SAI_MACSEC_SA_STAT_OUT_PKTS_PROTECTED 0 ------------------------------------- ---------------------------------------------------------------- MACsec Ingress SC (525400edac5b0001) MACsec Ingress SA (1) --------------------------------------- ---------------------------------------------------------------- active true auth_key 35FC8F2C81BCA28A95845A4D2A1EE6EF lowest_acceptable_pn 1 sak 1EC8572B75A840BA6B3833DC550C620D2C65BBDDAD372D27A1DFEB0CD786671B salt 000000000000000000000000 ssci 0 SAI_MACSEC_SA_ATTR_CURRENT_XPN 5033 SAI_MACSEC_SA_STAT_IN_PKTS_DELAYED 0 SAI_MACSEC_SA_STAT_IN_PKTS_INVALID 0 SAI_MACSEC_SA_STAT_IN_PKTS_LATE 0 SAI_MACSEC_SA_STAT_IN_PKTS_NOT_USING_SA 0 SAI_MACSEC_SA_STAT_IN_PKTS_NOT_VALID 0 SAI_MACSEC_SA_STAT_IN_PKTS_OK 0 SAI_MACSEC_SA_STAT_IN_PKTS_UNCHECKED 0 SAI_MACSEC_SA_STAT_IN_PKTS_UNUSED_SA 0 SAI_MACSEC_SA_STAT_OCTETS_ENCRYPTED 0 SAI_MACSEC_SA_STAT_OCTETS_PROTECTED 0 --------------------------------------- ---------------------------------------------------------------- ```	2022-05-19 21:59:37 +08:00
Lawrence Lee	0eeb249fd8	[swss]: Convert swss docker to bullseye (#10484 ) * [swss]: Convert swss docker to bullseye Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2022-05-17 13:55:59 -07:00
Alexander Allen	d202bf26d7	Upgrade mellanox platform containers (syncd / saiserver / syncd-rpc) and pmon to bullseye (#10580 ) Fixes #9279 - Why I did it Part of larger effort to move all SONiC systems to bullseye - How I did it 1. Update container makefiles with correct dependencies 2. Update container Dockerfile with correct base image 3. Update container Dockerfile with correct apt dependencies 4. Update any other makefiles with dependencies to remove python2 support 5. Minor changes to support bullseye / python3 - How to verify it Run regression on the switch: 1. Verify PTF community tests work 2. Verify syncd runs and all ports come up / pass traffic 3. Verify all platform tests succeed	2022-05-10 12:45:28 +03:00
Jing Zhang	322363b9ab	[master][sonic-linkmgrd] submodule updates (#10763 ) [master][sonic-linkmgrd] submodule updates df51322 Longxiang Lyu Fri May 6 10:01:46 2022 +0800 Add `ActiveActiveStateMachine` implementation (#64) e721ceb Jing Zhang Wed May 4 10:07:14 2022 -0700 Add doc for default route related changes (#63) 7bb06fb Jing Zhang Tue May 3 09:48:28 2022 -0700 Add Cli support to enable or disable default route related feature (#68) e4b02cb Jing Zhang Mon May 2 13:27:54 2022 -0700 Reset WaitActiveUp count before switching to active (#70) 212d960 Jing Zhang Wed Apr 27 10:35:05 2022 -0700 lower log level to warning (#69) 48abc9e Jing Zhang Thu Apr 14 16:50:04 2022 -0700 Add support to enable switchover time measurement (with link prober interval decreased to 10ms) feature (#61) c4858a6 Jing Zhang Thu Apr 14 11:27:55 2022 -0700 Avoid proactively switching to `active` if default route is missing (#62) sign-off: Jing Zhang zhangjing@microsoft.com	2022-05-06 13:42:23 -07:00
kellyyeh	cfdb8431df	[dhcp6relay] Add dhcpv6 option check (#10486 )	2022-05-05 18:04:14 -07:00
xumia	8ec8900d31	Support SONiC OpenSSL FIPS 140-3 based on SymCrypt engine (#9573 ) Why I did it Support OpenSSL FIPS 140-3, see design doc: https://github.com/Azure/SONiC/blob/master/doc/fips/SONiC-OpenSSL-FIPS-140-3.md. How I did it Install the fips packages. To build the fips packages, see https://github.com/Azure/sonic-fips Azure pipelines: https://dev.azure.com/mssonic/build/_build?definitionId=412 How to verify it Validate the SymCrypt engine: admin@sonic:~$ dpkg-query -W | grep openssl openssl 1.1.1k-1+deb11u1+fips symcrypt-openssl 0.1 admin@sonic:~$ openssl engine -v | grep -i symcrypt (symcrypt) SCOSSL (SymCrypt engine for OpenSSL) admin@sonic:~$	2022-05-06 07:21:30 +08:00
Lior Avramov	6284d136ec	[LLDP] Enhance lldpmgrd Redis events handling (#10593 ) Why I did it When lldpmgrd handled events of other tables besides PORT_TABLE, an error message was printed to the log. How I did it Handle each event according to its file descriptor instead of looping over all registered selectables for each incoming event. How to verify it I verified that the same events are handled by printing the event key and operation before and after the change. Also, before the change, in the init flow after config reload, error messages were printed to the log when lldpmgrd handled events of other tables besides PORT_TABLE; this issue is now solved.	2022-05-04 08:21:02 -07:00
Kalimuthu-Velappan	bc30528341	Parallel building of sonic dockers using native dockerd(dood). (#10352 ) Currently, the build dockers are created as user dockers (docker-base-stretch-<user>, etc.) that are specific to each user. But the sonic dockers (docker-database, docker-swss, etc.) are created with a fixed docker name that is common to all users. docker-database:latest docker-swss:latest When multiple builds are triggered on the same build server, this causes a parallel build issue because all the build jobs try to create the same docker with the latest tag. This happens only when sonic dockers are built using the native host dockerd for sonic docker image creation. This patch creates all sonic dockers as user sonic dockers and then, while saving and loading the user sonic dockers, renames them to the correct sonic dockers with the tag latest. docker-database:latest <== SAVE/LOAD ==> docker-database-<user>:tag The user sonic docker names are derived from the 'DOCKER_USERNAME' and 'DOCKER_USERTAG' make env variables; using a Jinja template, the FROM docker name is replaced with the correct user sonic docker name for loading and saving the docker image.	2022-04-28 08:39:37 +08:00
Zhaohui Sun	cc30771f6b	Add python3 virtual environment for docker-ptf (#10599 ) Why I did it To migrate the ptftests scripts to python3 incrementally, first add a python3 virtual environment and install all required python packages in it. Then migrate the ptftests scripts from python2 to python3 one by one to avoid impacting unchanged scripts. Signed-off-by: Zhaohui Sun zhaohuisun@microsoft.com How I did it Add python3 virtual environment for docker-ptf. Add submodule ptf-py3 and install patched ptf 0.9.3 into the virtual environment as well; two ptf issues were reported here: p4lang/ptf#173 p4lang/ptf#174 Signed-off-by: Zhaohui Sun <zhaohuisun@microsoft.com>	2022-04-26 09:13:26 +08:00
Song Yuan	aa62e33339	[chassis] Do not configure LLDP on recirc ports (#7909 ) Why I did it Recirc port is used to only forward traffic from one asic to another asic. So it's not required to configure LLDP on it. How I did it Add interface prefix helper for recirc port. Similar to skip configuring LLDP on inband port, add check in lldpmgrd to skip recirc port by checking interface prefix.	2022-04-25 13:14:17 -07:00
Maxime Lorrillere	0606add017	[chassis] Get asic PCI ID from CHASSIS_STATE_DB and update asic_id in CONFIG_DB (#9681 ) Asic PCI ID (PCI address) is collected by chassisd (inside pmon - Azure/sonic-platform-daemons#175) and saved in CHASSIS_STATE_DB (in redis_chassis). CHASSIS_STATE_DB is accessible by swss containers. At docker-init.sh (script is called after swss container is created and before anything that could run in swss like orchagent...), we wait until asic PCI ID of the corresponding asic is populated by chassisd. We then update asic_id in CONFIG_DB of asic's database. A system supporting dynamic asic PCI ID identification requires to have a file (empty) use_pci_id_chassis in its platform dir. When orchagent runs, it has correct asic PCI ID in its CONFIG_DB. Together with this PR: Azure/sonic-platform-daemons#175 Azure/sonic-platform-common#185 Signed-off-by: Maxime Lorrillere <mlorrillere@arista.com> Co-authored-by: Maxime Lorrillere <mlorrillere@arista.com>	2022-04-25 13:09:42 -07:00
yozhao101	e24fe9bc60	[Monit] Fix the issue which shows Monit can not reset its counter. (#10288 ) Signed-off-by: Yong Zhao <yozhao@microsoft.com> Why I did it This PR aims to fix the Monit issue which shows Monit can't reset its counter when monitoring memory usage of telemetry container. Specifically the Monit configuration file related to monitoring memory usage of telemetry container is as following: check program container_memory_telemetry with path "/usr/bin/memory_checker telemetry 419430400" if status == 3 for 10 times within 20 cycles then exec "/usr/bin/restart_service telemetry" If memory usage of telemetry container is larger than 400MB for 10 times within 20 cycles (minutes), then it will be restarted. Recently we observed, after telemetry container was restarted, its memory usage continuously increased from 400MB to 11GB within 1 hour, but it was not restarted anymore during this 1 hour sliding window. The reason is Monit can't reset its counter to count again and Monit can reset its counter if and only if the status of monitored service was changed from Status failed to Status ok. However, during this 1 hour sliding window, the status of monitored service was not changed from Status failed to Status ok. Currently for each service monitored by Monit, there will be an entry showing the monitoring status, monitoring mode etc. For example, the following output from command sudo monit status shows the status of monitored service to monitor memory usage of telemetry: Program 'container_memory_telemetry' status Status ok monitoring status Monitored monitoring mode active on reboot start last exit value 0 last output - data collected Sat, 19 Mar 2022 19:56:26 Every 1 minute, Monit will run the script to check the memory usage of telemetry and update the counter if memory usage is larger than 400MB. If Monit checked the counter and found memory usage of telemetry is larger than 400MB for 10 times within 20 minutes, then telemetry container was restarted. 
Following is an example status of monitored service: Program 'container_memory_telemetry' status Status failed monitoring status Monitored monitoring mode active on reboot start last exit value 0 last output - data collected Tue, 01 Feb 2022 22:52:55 After the telemetry container was restarted, we found that its memory usage increased rapidly from around 100MB to more than 400MB within 1 minute, so the status of the monitored service did not have a chance to change from Status failed to Status ok. How I did it To provide a workaround for this issue, Monit recently introduced another syntax format, repeat every <n> cycles, related to exec. This syntax makes Monit repeat executing the background script if the error persists for a given number of cycles. How to verify it I verified this change on lab device str-s6000-acs-12. Another pytest PR (Azure/sonic-mgmt#5492) is submitted in sonic-mgmt repo for review.	2022-04-20 18:08:06 -07:00
Jing Zhang	8b5d908c92	Upgrade mux container to Bullseye (#10498 ) sign-off: Jing Zhang zhangjing@microsoft.com #### Why I did it As part of the process moving containers from buster to bullseye. #### How I did it 1. change base image from buster to bullseye. 2. remove unused addition to orchagent run options #### How to verify it Tested building locally.	2022-04-19 09:27:45 -07:00
Ze Gan	87036c34ec	[macsec]: Upgrade docker-macsec to bullseye (#10574 ) Following the patch from : https://packages.debian.org/bullseye/wpasupplicant, to upgrade sonic-wpa-supplicant for supporting bullseye and upgrade docker-macsec.mk as a bullseye component.	2022-04-17 20:32:51 +08:00
Zhaohui Sun	44cf773a96	Revert "[docker-ptf]: Upgrade scapy to 2.4.5 in docker-ptf (#10507 )" (#10537 ) #10507 upgraded scapy to 2.4.5 in the docker-ptf container. After this upgrade, all scripts under ansible/roles/test/files/ptftests import scapy 2.4.5, and some test cases fail because they have not been updated accordingly. Reverts #10507 to avoid breaking regression test. This reverts commit `92efc01270`.	2022-04-13 08:57:10 +08:00
kellyyeh	396a92cb2e	[dhcp_relay] Remove dhcp6mon (#10467 )	2022-04-12 10:44:17 -07:00
Ashwin Srinivasan	b4f8f1dd22	Removed python2 dependency for sonic-pcied in sonic-platform-daemons (#10421 ) Removed python2 support for sonic-platform-daemons that was causing unit test errors in sonic_pcied. * Removed config from docker supervisord jinja templates per VD review comment * Removed space and python3 per QL comments	2022-04-09 13:16:50 -07:00
Ze Gan	92efc01270	[docker-ptf]: Upgrade scapy to 2.4.5 in docker-ptf (#10507 ) Why I did it Existing dataplane tests cannot be run in a MACsec environment because the traffic on a MACsec link is encrypted. So, I will override the dp_poll of ptf with a MACsec dp_poll that decrypts the MACsec packets on injected ports (PR: Azure/sonic-mgmt#5490). The MACsec decryption library depends on scapy 2.4.5. How I did it Upgrade scapy library to 2.4.5 by pip. How to verify it Check the scapy version in docker-ptf by python -c "import scapy; print(scapy.__version__)" 2.4.5 Signed-off-by: Ze Gan <ganze718@gmail.com>	2022-04-08 22:26:40 -07:00
kellyyeh	330d11a128	Add EPMS and MgmtTsToR (#10478 )	2022-04-07 21:49:42 -07:00
Stepan Blyshchak	4426f7715f	[scapy] update scapy to 2.4.5 and patch it (#10457 ) Why I did it Running warm-reboot in a loop for 500 times leads to this error on 318-th iteration: Apr 2 15:56:27.346747 sonic INFO swss#/supervisord: restore_neighbors Traceback (most recent call last): Apr 2 15:56:27.346747 sonic INFO swss#/supervisord: restore_neighbors File "/usr/bin/restore_neighbors.py", line 24, in <module> Apr 2 15:56:27.346747 sonic INFO swss#/supervisord: restore_neighbors from scapy.all import conf, in6_getnsma, inet_pton, inet_ntop, in6_getnsmac, get_if_hwaddr, Ether, ARP, IPv6, ICMPv6ND_NS, ICMPv6NDOptSrcLLAddr Apr 2 15:56:27.346795 sonic INFO swss#/supervisord: restore_neighbors File "/usr/local/lib/python3.7/dist-packages/scapy/all.py", line 25, in <module> Apr 2 15:56:27.346956 sonic INFO swss#/supervisord: restore_neighbors from scapy.route import * Apr 2 15:56:27.346995 sonic INFO swss#/supervisord: restore_neighbors File "/usr/local/lib/python3.7/dist-packages/scapy/route.py", line 205, in <module> Apr 2 15:56:27.347089 sonic INFO swss#/supervisord: restore_neighbors conf.iface = get_working_if() Apr 2 15:56:27.347129 sonic INFO swss#/supervisord: restore_neighbors File "/usr/local/lib/python3.7/dist-packages/scapy/arch/linux.py", line 128, in get_working_if Apr 2 15:56:27.347213 sonic INFO swss#/supervisord: restore_neighbors ifflags = struct.unpack("16xH14x", get_if(i, SIOCGIFFLAGS))[0] Apr 2 15:56:27.347250 sonic INFO swss#/supervisord: restore_neighbors File "/usr/local/lib/python3.7/dist-packages/scapy/arch/common.py", line 31, in get_if Apr 2 15:56:27.347345 sonic INFO swss#/supervisord: restore_neighbors return ioctl(sck, cmd, struct.pack("16s16x", iff.encode("utf8"))) Apr 2 15:56:27.347365 sonic INFO swss#/supervisord: restore_neighbors OSError: [Errno 19] No such device The issue was reported to scapy devs secdev/scapy#3369, the fix is secdev/scapy#3371, however there is no released scapy version with this fix right now, thus decided to build 
scapy v2.4.5 from sources and apply the fix in the form of a patch. Signed-off-by: Stepan Blyschak <stepanb@nvidia.com>	2022-04-07 14:23:35 +03:00
kellyyeh	8cd346d80b	Update docker-router-advertiser.supervisord.conf.j2 (#10375 )	2022-04-06 09:44:21 -07:00
Saikrishna Arcot	588ed0b760	Upgrade router-advertiser container to Bullseye (#10374 ) Change the base image from `docker-config-engine-buster` to `docker-config-engine-bullseye`, and remove the hardcoded `radvd` version from the Dockerfile. Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>	2022-04-01 16:12:43 -07:00
Junchao-Mellanox	106fac5f09	[counter] Fix issue: non default counters will be delayed forever after fastboot (#10413 ) - Why I did it Fastboot will delay all counters in CONFIG DB, it relies on enable_counters.py to recover the delayed counters. However, enable_counters.py does not recover those non-default counters. - How I did it For non-default counters, if it is in CONFIG DB, put delay status to false after the waiting. - How to verify it Manual test	2022-03-31 15:23:57 +03:00
Lawrence Lee	b31df59c7c	[tun_pkt]: Wait for AsyncSniffer to init fully (#10346 ) Fixes a crash of the tunnel packet handler that can occur at system startup. Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2022-03-30 14:03:29 -07:00
judyjoseph	8e642848c2	Introduce the asic_subtype field for adding the sub platform variants. (#10235 ) * Introduce the asic_subtype field for adding the sub platform variants. It uses the value of TARGET_MACHINE variable in slave.mk.	2022-03-28 11:22:32 -07:00
tomer-israel	cc938e73a3	Dynamic port configuration - solve lldp issues when adding/removing ports (#9386 ) #### Why I did it When adding and removing ports after the init stage, we saw two issues. First: in several cases, after removing a port, lldpmgr keeps trying to add the port to lldp with the lldpcli command. The command keeps failing because the port no longer exists. Second: after adding a port, we sometimes see this warning message: "Command failed 'lldpcli configure ports Ethernet18 lldp portidsubtype local etp5b': 2021-07-27T14:16:54 [WARN/lldpctl] cannot find port Ethernet18" We added these changes to solve both issues. #### How I did it Port create events are taken from app db only. The lldpcli command is executed only when the Linux port is up. When a delete port event is received, we remove the command from the pending_cmds dictionary. #### How to verify it Manual tests and running lldp tests #### Description for the changelog Dynamic port configuration - solve lldp issues when adding/removing ports	2022-03-25 17:47:24 -07:00
Robert J. Halstead	147d631065	[PINS] update sonic-p4rt docker to bullseye (#10182 ) #### Why I did it SONiC is migrating to bullseye. This will update the sonic-pins container to bullseye. #### How I did it The [sonic-pins code](https://github.com/Azure/sonic-buildimage/blob/master/rules/p4rt.mk) isn't dependent on any architecture so it will already build successfully for bullseye. This PR updates the docker to use bullseye. #### How to verify it Today we cannot build the docker-sonic-p4rt.gz target (e.g. Issue #9885). With this change the docker will build successfully. The P4RT executable will not run, because of a missing runtime library, libgmpxx, which I'll address in a followup PR. #### Description for the changelog Update docker-sonic-p4rt.gz target to build with Bullseye instead of Buster.	2022-03-23 17:21:36 -07:00
Saikrishna Arcot	4a5e75e45e	[restapi]: Don't use python/python2 for restapi start scripts (#10285 ) Python 2 isn't installed by default in Buster and Bullseye containers, and the scripts/modules can be used with Python 3, so make sure Python 3 is used. Why I did it After the Buster and Bullseye upgrade for the restapi container, processes will no longer start because supervisord is trying to call python and python2, both of which are unavailable. Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>	2022-03-22 18:34:42 -07:00
Oleksandr Kozodoi	c5849c9650	Add scapy support for python3 virtual environment in the sonic-mgmt docker container (#10234 ) Why I did it Migration of sonic-mgmt codebase from Python 2 to Python 3 How I did it Added scapy dependencies to the env-python3 virtual environment. How to verify it Run test case: py.test --testbed=testbed-t0 --inventory=../ansible/lab --testbed_file=../ansible/testbed.csv --host-pattern=testbed-t0 --module-path=../ansible/library lldp Signed-off-by: Oleksandr Kozodoi <oleksandrx.kozodoi@intel.com>	2022-03-16 12:00:51 +08:00
Saikrishna Arcot	5617b1ae3e	Image disk space reduction (#10172 ) # Why I did it Reduce the disk space taken up during bootup and runtime. # How I did it 1. Remove python package cache from the base image and from the containers. 2. During bootup, if logs are to be stored in memory, then don't create the `var-log.ext4` file just to delete it later during bootup. 3. For the partition containing `/host`, don't reserve any blocks for just the root user. This just makes sure all disk space is available for all users, if needed during upgrades (for example). * Remove pip2 and pip3 caches from some containers Only containers which appeared to have a significant pip cache size are included here. Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com> * Don't create var-log.ext4 if we're storing logs in memory Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com> * Run tune2fs on the device containing /host to not reserve any blocks for just the root user Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>	2022-03-15 18:12:49 -07:00
Shilong Liu	3fa627f290	Add a config variable to override default container registry instead of dockerhub. (#10166 ) * Add variable to reset default docker registry * fix bug in docker version control	2022-03-14 18:09:20 +08:00
xwjiang2021	b73da484c4	Install the allure-pytest package globally in sonic-mgmt docker (#10216 ) Why I did it This fix is to address issue: Azure/sonic-mgmt#5280 In the sonic-mgmt Dockerfile, python package allure-pytest is installed after ENV USER $user. Consequently the package is installed to path /home/$user/.local and is only available to the $user account. If we use root account in sonic-mgmt docker container to run tests, any script importing the allure package will fail with ImportError. We need to install the allure-pytest package to global directory instead of user local directory. How I did it Update the sonic-mgmt Dockerfile to ensure that the allure-pytest package is installed to global directory How to verify it Build a new sonic-mgmt docker image based on the changes. Use sonic-mgmt docker container of the newly built image to run test scripts that depend on the allure-pytest package. No ImportError is raised.	2022-03-12 20:18:12 +08:00
Oleksandr Kozodoi	3fa18d18d4	Add necessary changes for python3 virtual environment of sonic-mgmt docker container (#9277 ) This PR includes necessary changes for the setup of the Python3 virtual environment in the sonic-mgmt docker container. How to activate Python3 virtual environment? Connect to the sonic-mgmt container $ docker exec -ti sonic-mgmt bash Activate the virtual environment $ source /var/user/env-python3/bin/activate Why I did it Migration of sonic-mgmt codebase from Python 2 to Python 3 How I did it Added all necessary dependencies to the env-python3 virtual environment. Signed-off-by: Oleksandr Kozodoi <oleksandrx.kozodoi@intel.com>	2022-03-09 12:28:01 +08:00
Alexander Allen	d0ff8b5f48	[pmon] Clean up supervisord chassis_db_init entry and fix startsecs (#10071 ) Why I did it Code review was still in progress when #9858 was merged and upon further testing I have arrived at a better solution. How I did it Modified supervisord configuration j2 template for pmon to require no minimum uptime for chassisd_db_init and to remove the redundant exit_codes directive How to verify it Boot switch and verify in syslog that there are no errors related to chassis_db_init	2022-03-03 17:10:15 -08:00
Lawrence Lee	4d2a55d373	[swss]: Wait for vlan intf to start ndppd (#10119 ) - Use the `wait_for_link.sh` script to delay ndppd start until after the VLAN interface is ready - Avoids issue where ndppd tries to change interface attributes before the interface is ready	2022-03-02 16:23:56 -08:00
Lawrence Lee	47d9b26063	Revert "[swss]: Wait for vlan intf to start ndppd (#10036 )" (#10085 ) This reverts commit `91204879df`. #10036 breaks ndppd functionality	2022-02-28 15:42:02 -08:00
Yang Wang	b8fa5e0d8d	install xmlrunner python3 version (#10086 )	2022-02-28 11:21:04 +08:00
Lawrence Lee	91204879df	[swss]: Wait for vlan intf to start ndppd (#10036 ) - Use the `wait_for_link.sh` script to delay ndppd start until after the VLAN interface is ready - Avoids issue where ndppd tries to change interface attributes before the interface is ready Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2022-02-24 17:54:45 -08:00
xumia	b101b023d3	[Security]: Upgrade urllib3 to fix CVE-2021-33503 See https://security.archlinux.org/CVE-2021-33503	2022-02-25 08:59:57 +08:00
arlakshm	fd22635de0	[chassis][bgp] create v4 and v6 peer group for VoQ internal neighbors (#9693 ) Why I did it In the recent minigraph changes we add separate BGP session configuration for V4 and V6 internal VoQ neighbors. This PR is adding different Peer groups for V4 and V6 neighbors How I did it Add VOQ_CHASSIS_V4_PEER and VOQ_CHASSIS_V6_PEER groups Add extra Unit tests How to verify it Signed-off-by: Arvindsrinivasan Lakshmi Narasimhan <arlakshm@microsoft.com>	2022-02-24 11:21:26 -08:00
Richard.Yu	2210c82ef8	[PTF-SAIv2]Add ptf docker for sai-ptf (saiv2) (#9729 ) * [PTF-SAIv2]Add ptf docker for sai-ptf (saiv2) Based on the current ptf docker, create a new docker for sai-ptf (saiv2). Upgrade related packages. Use the latest ptf and install it. Test done: NOJESSIE=1 NOSTRETCH=1 NOBULLSEYE=1 ENABLE_SYNCD_RPC=y make target/docker-ptf-sai.gz BLDENV=buster make -f Makefile.work target/docker-ptf-sai.gz * upgrade the thrift to 014	2022-02-18 01:48:50 -08:00
kellyyeh	f136c53d19	[radv] Support multiple ipv6 prefixes per vlan interface (#9934 ) Why I did it Radvd.conf.j2 template creates two copies of the vlan interface when there are more than one ipv6 address assigned to a single vlan interface. Changed the format to add prefixes under the same vlan interface block. How I did it Modifies radvd.conf.j2 and added unit tests How to verify it Configure multiple ipv6 address to the same vlan, start radvd Unit test will check if radvd.conf with multiple ipv6 addresses is formed correctly	2022-02-16 14:17:26 -08:00
Jason Lyu	b023c29a1e	[redis] Upgrade redis version (#9757 ) #### Why I did it The current redis version of SONiC is `6.0.6`, which contains many high-risk security issues (CVEs) that are fixed in the latest version. The Redis release notes also strongly recommend upgrading, with SECURITY urgency. ``` ================================================================================ Redis 6.0.16 Released Mon Oct 4 12:00:00 IDT 2021 ================================================================================ Upgrade urgency: SECURITY, contains fixes to security issues. Security Fixes: * (CVE-2021-41099) Integer to heap buffer overflow handling certain string commands and network payloads, when proto-max-bulk-len is manually configured to a non-default, very large value [reported by yiyuaner]. * (CVE-2021-32762) Integer to heap buffer overflow issue in redis-cli and redis-sentinel parsing large multi-bulk replies on some older and less common platforms [reported by Microsoft Vulnerability Research]. * (CVE-2021-32687) Integer to heap buffer overflow with intsets, when set-max-intset-entries is manually configured to a non-default, very large value [reported by Pawel Wieczorkiewicz, AWS]. * (CVE-2021-32675) Denial Of Service when processing RESP request payloads with a large number of elements on many connections. * (CVE-2021-32672) Random heap reading issue with Lua Debugger [reported by Meir Shpilraien]. * (CVE-2021-32628) Integer to heap buffer overflow handling ziplist-encoded data types, when configuring a large, non-default value for hash-max-ziplist-entries, hash-max-ziplist-value, zset-max-ziplist-entries or zset-max-ziplist-value [reported by sundb]. * (CVE-2021-32627) Integer to heap buffer overflow issue with streams, when configuring a non-default, large value for proto-max-bulk-len and client-query-buffer-limit [reported by sundb]. * (CVE-2021-32626) Specially crafted Lua scripts may result with Heap buffer overflow [reported by Meir Shpilraien]. 
Other bug fixes: * Fix appendfsync to always guarantee fsync before reply, on MacOS and FreeBSD (kqueue) (#9416) * Fix the wrong mis-detection of sync_file_range system call, affecting performance (#9371) * Fix replication issues when repl-diskless-load is used (#9280) ``` #### How I did it Edit `Dockerfile.j2` file #### How to verify it Check redis version #### Description for the changelog This PR will upgrade redis-server version to `6.0.16`.	2022-02-15 16:43:01 -08:00
Alexander Allen	9677401f4a	[pmon] Fix chassis_db_init exit not being expected (#9858 ) - Why I did it Error log was shown on switches during boot pmon#supervisord 2021-12-22 04:27:16,709 INFO exited: chassis_db_init (exit status 0; not expected) - How I did it Add exit code zero as an expected exit code and also disable autorestart. - How to verify it Boot the switch and ensure the above log line does not appear.	2022-02-15 10:51:24 +02:00
Travis Van Duyn	62934ad4c4	updated jinja template for snmp contact python2 vs python3 issue (#9949 )	2022-02-10 09:01:46 -08:00
Oleksandr Ivantsiv	25a0ce5eb1	[asan] Add address sanitizer support. (#9857 ) Implement infrastructure that allows enabling address sanitizer for docker containers. Enable address sanitizer for SWSS container. - Why I did it To add a possibility to compile SONiC applications with address sanitizer (ASAN). ASAN is a memory error detector for C/C++. It finds: 1. Use after free (dangling pointer dereference) 2. Heap buffer overflow 3. Stack buffer overflow 4. Global buffer overflow 5. Use after return 6. Use after the scope 7. Initialization order bugs 8. Memory leaks - How I did it By adding new ENABLE_ASAN configuration option. - How to verify it By default ASAN is disabled and the SONiC image is not affected. When ASAN is enabled it inspects all allocation, deallocation, and memory usage that the application does in run time. To verify whether the application has memory errors tests that trigger memory usage of the application should be run. Ideally, the whole regression tests should be run. Memory leaks reports will be placed in /var/log/asan/ directory of SONiC host OS. Signed-off-by: Oleksandr Ivantsiv <oivantsiv@nvidia.com>	2022-02-09 13:29:18 +02:00
abdosi	e44a40cc3b	Updated Internal BGP Templates for chassis packet (#9674 ) Fixes: https://github.com/Azure/sonic-buildimage/issues/9610	2022-02-08 09:36:32 -08:00
Lawrence Lee	eff80f750f	[swss]: Reduce tunnel_packet_handler memory usage (#9762 ) * Configure scapy to not store sniffed packets Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2022-02-07 11:55:48 -08:00
Christian Svensson	660c0cbe7b	docker-dhcp-relay: Fix test call to MockConfigDb (#9903 ) * docker-dhcp-relay: Fix test call to MockConfigDb Signed-off-by: Christian Svensson <blue@cmd.nu>	2022-02-01 18:52:52 -08:00
Andriy Yurkiv	cb3b9416a6	[Mellanox][VXLAN] add params to vxlan.json file in order to configure VXLAN src port range feature (#9658 ) - Why I did it Remove the obsolete parameter that enables a static VXLAN src port range. Provide functionality to generate a json config file according to the appropriate parameter in config_db. Done for SN3800: • Mellanox-SN3800-D28C50 • Mellanox-SN3800-C64 • Mellanox-SN3800-D28C49S1 (New 10G SKU) SN2700: • Mellanox-SN2700-D48C8 - How I did it Removed SAI_VXLAN_SRCPORT_RANGE_ENABLE=1 from the appropriate sai.profile files. Created the vxlan.json file and added a few params that depend on DEVICE_METADATA.localhost.vxlan_port_range - How to verify it The file /etc/swss/config.d/vxlan.json should be generated inside the swss docker when it restarts: [ { "SWITCH_TABLE:switch": { "vxlan_src": "0xFF00", "vxlan_mask": "8" }, "OP": "SET" } ] Signed-off-by: Andriy Yurkiv <ayurkiv@nvidia.com>	2022-01-31 15:57:30 +02:00
vdahiya12	61e9a7683c	[y_cable] Support for initialization of new daemon ycable to support ycables (#9125 ) * [y_cable] Support for initialization of new Daemon ycable to support ycables This PR also adds the commit in sonic-platform-daemons 94fa239 [y_cable] refactor y_cable to a seperate logic and new daemon from xcvrd (#219) Why I did it This PR separates the logic of Y-Cable from xcvrd. Before this change we were using the xcvrd daemon to control all aspects of Y-Cable, from initialization to processing requests from other entities like orch and linkmgr. Now another daemon, ycabled, serves this purpose. Logically everything remains the same from the perspective of other daemons. It also takes care of aspects like init/delete of the daemon from the Y-Cable perspective. How I did it We build a new wheel, sonic_ycabled-1.0-py3-none-any.whl, and install it inside pmon. We also initialize the ycabled daemon inside pmon, which completes the refactor. How to verify it Ran the changes with an image for dualtor tests on a 7050cx3 platform Signed-off-by: vaibhav-dahiya <vdahiya@microsoft.com>	2022-01-25 11:10:25 -08:00
Longxiang Lyu	49a036e90c	Add dualtor TSA/B/C support (#9726 ) Why I did it Add TSA/B/C dualtor support Signed-off-by: Longxiang Lyu lolv@microsoft.com How I did it For TSA, toggle all the mux to standby if the device type is dualtor and there are active mux ports. For TSC, add mux status output. How to verify it Run TSA/B/C on a dualtor setup	2022-01-25 10:50:29 +08:00
DINESH KUMAR SELLAPPAN	d9b1577675	Support for Statistics Python Module in sonic-mgmt docker image (#9682 ) This PR includes the support for statistics module in sonic-mgmt docker image	2022-01-25 10:32:22 +08:00
xumia	7a226ffd0d	Support bullseye for docker-sonic-restapi docker-sonic-telemetry (#9791 ) Support bullseye for docker-sonic-restapi docker-sonic-telemetry Upgrade to bullseye and Golang-1.15 to support FIPS.	2022-01-21 08:41:39 +08:00
kellyyeh	3e263fa6a8	[dhcp_relay] Remove dhcpv6 servers from VlanBrief (#9718 )	2022-01-19 07:47:08 -08:00
Saikrishna Arcot	bb3362760d	[docker-dhcprelay]: Update to Bullseye (#9736 ) As part of this, update the isc-dhcp package to match the Bullseye version (this fixes some compile errors related to BIND), clean up some of the build dependencies and runtime dependencies for debian packaging, and use the default Boost version to compile against instead of explicitly saying using 1.74. Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>	2022-01-18 15:11:36 -08:00
shlomibitton	eaa888d948	Fix import error for DHCP relay CLI (#9691 ) Signed-off-by: Shlomi Bitton <shlomibi@nvidia.com>	2022-01-16 08:08:01 +02:00
SuvarnaMeenakshi	945278f6d8	[docker-snmp]: Modify log level of snmpd (#9734 ) #### Why I did it resolves https://github.com/Azure/sonic-buildimage/issues/8779 snmpd writes the below error message in syslog: snmp#snmpd[27]: truncating integer value > 32 bits This message is written to syslog when hrSystemUptime (1.3.6.1.2.1.25.1.1.0 / system uptime) or sysUpTime (1.3.6.1.2.1.1.3 / network management portion or snmpd uptime) is queried after either of these counters has overflowed a 32-bit value. This happens when the device uptime or snmpd uptime is more than 497 days. #### How I did it Reference: https://access.redhat.com/solutions/367093 and https://linux.die.net/man/1/snmpcmd To avoid seeing this message when the counter grows, the snmpd error log level is changed to display LOG_EMERG, LOG_ALERT, LOG_CRIT, and LOG_DEBUG. Without this change, LOG_ERR and LOG_WARNING would also be logged in syslog. #### How to verify it On a device which has been up for more than 497 days, modify supervisord.conf with the change and restart snmp. Query 1.3.6.1.2.1.1.3 and verify that the log message is not seen.	2022-01-12 14:40:01 -08:00
Saikrishna Arcot	fee2441717	Create docker-base-bullseye and docker-config-engine-bullseye (#9666 ) * [slave-bullseye]: Remove Python 2 It shouldn't be needed anymore. Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com> * [dockers]: Add docker-base-bullseye and docker-config-engine-bullseye Also upgrade socat from 1.7.3.1 to 1.7.4.1 Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>	2022-01-11 09:23:42 -08:00
abdosi	6c507329b7	Enable/Disable Order ECMP feature. (#9651 ) Updated Jinja2 Template in switch.json.j2 for enabling/disabling Order ECMP feature based on device role. Changes as per design: Azure/SONiC#896	2022-01-06 16:40:50 -08:00
Saikrishna Arcot	bd479cad29	Create a docker-swss-layer that holds the swss package. This is to save about 50MB of disk space, since 6 containers individually install this package. Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>	2022-01-06 09:26:55 -08:00
Saikrishna Arcot	b09b845225	[docker-platform-monitor]: Remove Python 2 Python 2 doesn't appear to be required any more.	2022-01-06 09:26:55 -08:00
Shilong Liu	36d866002a	[build] Fix docker-sonic-mgmt pylint dependency lazy-object-proxy version (#9596 )	2021-12-24 10:42:37 +08:00
zzhiyuan	a6d0a27a18	[Arista] Increase switch PCIe timeout for 7060-cx32s (#9248 ) Co-authored-by: Zhi Yuan (Carl) Zhao <zyzhao@arista.com> Why I did it The Arista 7060 platform has a rare and unreproducible PCIe timeout that could possibly be solved by increasing the switch PCIe timeout value. To do this we call a script for this platform to increase the PCIe timeout on boot-up. No issues are expected from the setpci command. From the PCIe spec: "Software is permitted to change the value in this field at any time. For Requests already pending when the Completion Timeout Value is changed, hardware is permitted to use either the new or the old value for the outstanding Requests, and is permitted to base the start time for each Request either on when this value was changed or on when each request was issued." How I did it Add "platform-init" support in the swss docker, similar to how "hwsku-init" is called, except that it applies to any device belonging to a platform. The script resides in the device data folder. Additionally, add the pciutils dependency to docker-orchagent so it can run the setpci commands. How to verify it On bootup of an Arista 7060, execute: lspci -vv -s 01:00.0 | grep -i "devctl2" to check that the timeout has changed.	2021-12-17 08:43:25 -08:00
Lawrence Lee	7bd0a2ad11	[swss]: Listen for undeliverable tunnel packets (#9348 ) - Create a script in the orchagent docker container which listens for these encapsulated packets which are trapped to CPU (indicating that they cannot be routed/no neighbor info exists for the inner packet). When such a packet is received, the script will issue a ping command to the packet's inner destination IP to start the neighbor learning process. - This script is also resilient to portchannel status changes (i.e. interface going up or down). An interface going down does not affect traffic sniffing on interfaces which are still up. When an interface comes back up, we restart the sniffer to start capturing traffic on that interface again.	2021-12-14 14:45:23 -08:00
Shi Su	f2774b635d	Add openbfdd to ptf docker (#9488 ) Why I did it To enable test support for BFD-related features, the PTF docker needs to have the proper support for BFD. This PR aims to add BFD support in ptf docker. How I did it Clone and build OpenBFDD for PTF docker. How to verify it Build locally and verify BFD is supported.	2021-12-14 11:46:48 -08:00
abdosi	6c0da4bcf0	[bgp] Enable BGP Graceful Restart based on device role (#9486 ) What I did: Updated the Jinja template to enable BGP Graceful Restart based on device role. By default it is enabled only if the device role type is TorRouter. Why I did it: By default FRR is configured in Graceful Helper mode. Graceful Restart is needed on T0/TorRouter only, since that device can go through warm-reboot. A T1/LeafRouter needs to be in Helper mode only	2021-12-13 10:14:50 -08:00
novikauanton	969cea07aa	add platform to iccpd's env (#8945 )	2021-12-08 09:21:44 -08:00
Brian O'Connor	46bcda359c	[PINS] Build P4RT container for PINS (#9083 ) - Add INCLUDE_PINS to config to enable/disable container - Add Docker files and supporting resources - Add sonic-pins submodule and associated make files Submission containing materials of a third party: Copyright Google LLC; Licensed under Apache 2.0 #### Why I did it Adds P4RT container to SONiC for PINS The P4RT app is covered by this HLD: https://github.com/pins/SONiC/blob/master/doc/pins/p4rt_app_hld.md #### How I did it Followed the pattern and templates used for other SONiC applications #### How to verify it Build SONiC with INCLUDE_P4RT set to "y". Verify that the resulting build has a container called "p4rt" running. You can verify that the service is up by running the following command on the SONiC switch: ```bash sudo netstat -lpnt | grep p4rt ``` You should see the service listening on TCP port 9559. #### Which release branch to backport (provide reason below if selected) None #### Description for the changelog Build P4RT container for PINS	2021-12-07 11:11:25 -08:00
abdosi	f501311f11	Updated BGP Template for Chassis/Multi-asic (#9291 ) Updated BGP Template for the case: 1. For Packet Chassis do not advertise Loopback4096 address into BGP as there is Static Route for same. Having this route in BGP causes two level of recursion in Zebra and cause assert in Zebra when there are many nexthop involved 2. Advertise only P2P Connected IP's into BGP (External Peers). For Packet chassis we have backend IP Interface subnet and if they get advertised into BGP then it also causes recursion	2021-12-06 09:36:24 -08:00
kellyyeh	d11207d4f4	[radv] Run radv on MgmtToRRouter (#9424 ) * Allow radv to run on mgmt tor and EPMS	2021-12-03 09:45:06 -08:00
kellyyeh	f2ee94d201	[dhcp_relay] Update DHCPv6 counter on relayed messages (#9283 )	2021-11-30 20:15:30 -08:00
vganesan-nokia	78de10713c	[voq-chassis][bgpcfg] VOQ_BGP_CHASSIS_NEIGHBORS timers default (#8455 ) The BGP_VOQ_CHASSIS_NEIGHBOR keepalive and holdtime timers are configured similar to general neighbors. Changes are done to configure BGP_VOQ_CHASSIS_NEIGHBOR timers similar to BGP_INTENAL_NEIGBOR since voq chassis bgp neighbors are similar to bgp internal neighbors in multi-asic. As it is done for bgp internal neighbors, the keepalive and holdtime timers are set to 3 and 10 seconds respectively. Also similar to bgp internal neighbors, connection retry timer is also configured for voq chassis bgp neighbors. Signed-off-by: vedganes <vedavinayagam.ganesan@nokia.com>	2021-11-30 12:10:27 -08:00
Saikrishna Arcot	fdd8236864	[docker-mgmt-framework]: Don't overwrite /etc/passwd and /etc/group with symlinks (#9375 ) Fixes #9376 Because /etc/passwd and /etc/group have been overwritten with symlinks to /host_etc/passwd and /host_etc/group, the debug container build fails. This is because the debug container is built without /etc being mounted at /host_etc in the container (which does happen at runtime). Because of that, /etc/passwd and /etc/group don't exist, which causes some package installation errors when openssh-client tries to create a group. This is a partial revert of `1347f29178`. Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>	2021-11-24 23:51:59 -08:00
arlakshm	5830852832	remove staticd.conf.j2 (#9182 ) Why I did it resolves #8979 and #9055 How I did it Remove the file static.conf.j2,which adds the default route on eth0 from bgp docker Signed-off-by: Arvindsrinivasan Lakshmi Narasimhan <arlakshm@microsoft.com>	2021-11-24 15:32:16 -08:00
Junchao-Mellanox	554b04f312	Add trap flow counter support (#8940 ) * Add trap flow counter support	2021-11-24 15:26:52 -08:00
Brian O'Connor	002827f08e	[PINS] Add APPL_STATE_DB and response path log (#9082 ) - Add APPL_STATE_DB to database_config.json - Clear APPL_STATE_DB during SwSS container restarts - Add response path log file to logrotate config: responsepublisher.rec Co-authored-by: PINS Working Group <sonic-pins-subgroup@googlegroups.com>	2021-11-24 10:31:06 -08:00
Stephen Sun	b3ccef9c08	[Reclaim buffer] Common infrastructure update for reclaiming buffer (#9133 ) - Why I did it This is to update the common sonic-buildimage infra for reclaiming buffer. - How I did it Render zero_profiles.j2 to zero_profiles.json for vendors that support reclaiming buffer The zero profiles will be referenced in PR [Reclaim buffer] Reclaim unused buffers by applying zero buffer profiles #8768 on Mellanox platforms and there will be test cases to verify the behavior there. Rendering is done here for passing azure pipeline. Load zero_profiles.json when the dynamic buffer manager starts Generate inactive port list to reclaim buffer Signed-off-by: Stephen Sun <stephens@nvidia.com>	2021-11-24 15:00:23 +02:00
Ze Gan	79b8ff52b0	[sonic-mgmt]: Upgrade scapy (#8554 ) * Upgrade scapy Signed-off-by: Ze Gan <ganze718@gmail.com> * Add scapy version Signed-off-by: Ze Gan <ganze718@gmail.com>	2021-11-24 10:07:50 +08:00
Stepan Blyshchak	a2c2d67098	[ACL] enable ACL FC when generating config from minigraph but disable by default (#8908 ) * [ACL] enable ACL FC when generating config from minigraph but disable by default Why I did it To support ACL counters on Flex Counter Infrastructure. How I did it Enable ACL FC in init_cfg and minigraph. Disable when generating configuration from preset. How to verify it Together with dependent PRs. Run ACL/Everflow test suite. Signed-off-by: Stepan Blyshchak <stepanb@nvidia.com>	2021-11-11 09:07:54 +08:00
tjchadaga	8544147a70	Fix for additional intf flap during fast-reboot (#9166 )	2021-11-08 15:21:11 -08:00
Lawrence Lee	7c0507b6db	[swss]: Start ndppd after vlanmgrd (#9155 ) Why I did it During swss container startup, if ndppd starts up before/with vlanmgrd, ndppd will be pinned at nearly 100% CPU usage. How I did it Only start ndppd after vlanmgrd is running. Also, call ndppd directly instead of through bash for improved logging and to prevent orphaned processes. Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2021-11-03 11:03:01 -07:00
Sudharsan Dhamal Gopalarathnam	fcff3f3d09	VxLAN Tunnel Counters and Rates implementation (#8369 ) * Enable flex counters for Vxlan tunnel	2021-11-01 10:42:21 -07:00
abdosi	919b3e5cdf	[chassis-packet] Fixed BGP Internal Peer template (#9106 ) What I did: Fix the typo in Internal Peer Group template for Packet-based Chassis. Address Review comments of PR: [chassis-packet] minigraph parsing and BGP template changes #8966 - Static Route Parsing for Host - Formatting of chassis port_config.ini	2021-10-29 11:02:38 -07:00
Junhua Zhai	7de673cb5b	[gearbox] Use separator ':' for GB_ASIC_DB, GB_COUNTERS_DB and GB_FLEX_COUNTER_DB (#9100 ) Keep GB_ASIC_DB, etc consistent with the ones in sonic-swss-common/common/database_config.json	2021-10-28 10:27:52 -07:00
Marty Y. Lok	b91190d82d	[Nokia] Add protobuf and grpc C++ and python lib to support Nokia IXR7250E platform (#8366 ) #### Why I did it The Nokia IXR7250E platform requires the grpcio and grpcio-tools python libraries, and the libprotobuf-dev and libgrpc++ libraries #### How I did it Modified build_debian.sh to install libprotobuf-dev and libgrpc++ to support the nokia ndk. Modified sonic_debian_extension.j2 to install grpcio and grpcio-tools in the host. Modified docker-platform-monitor/Dockerfile.j2 to install grpcio and grpcio-tools for the pmon container. #### How to verify it The image runs successfully.	2021-10-26 18:09:32 -07:00
Kebo Liu	9c4a7c2fed	[PMON] Skip chassis_db_init task on Mellanox simx platform (#9017 ) Why I did it The "chassis_db_init" task of PMON should be skipped on the Mellanox simx platform, since the hardware info which this task tries to access is not available on simx platforms, and running it produces error logs. How I did it Add the capability in the template for "chassis_db_init" to be skipped through configuration in "pmon_daemon_control.json". Add the "skip_chassis_db_init" configuration for simx platforms. Use a symbolic link for "pmon_daemon_control.json" since all the simx platforms share the same configuration How to verify it Build an image and install it on a simx platform to check whether the "chassis_db_init" task is skipped. Signed-off-by: Kebo Liu <kebol@nvidia.com>	2021-10-24 09:10:41 -07:00
Saikrishna Arcot	c1d5e0682f	docker-dhcp-relay: Fix waiting for interfaces to get set up (#9034 ) Fix the check used to wait for interfaces to come up. The group name in the supervisor config files has changed from isc-dhcp-relay to dhcp-relay. Also, in the wait script, wait 10 additional seconds after the vlans, port channels, and any interfaces are up. This is because dhcrelay listens on all interfaces (in addition to port channels and vlans), and to ensure that it stays in a clean state during runtime, wait some extra time to make sure that those interfaces are created as well. Signed-off-by: Saikrishna Arcot <sarcot@microsoft.com>	2021-10-21 18:45:00 -07:00
shlomibitton	546340bf7b	[dhcp_relay] Fix import for dhcp_counters on clear_dhcp6relay_counter.py (#8991 ) #### Why I did it Import issue will cause: root@sonic:/# sudo sonic-clear arp failed to import plugin clear.plugins.dhcprelay: No module named 'show_dhcp_relay' #### How I did it Fix the import. #### How to verify it run sudo sonic-clear arp	2021-10-19 03:10:36 -07:00
abdosi	3bb248bd67	[chassis-packet] minigraph parsing and BGP template changes (#8966 ) 1. Changes for generation of LC-Graph for packet-based chassis. 2. Added support for IPv6 peering on Loopback4096 for voq also 3. Updated asic topology yml files to be offset of slot 4. Made slot_num take the string slot<number> instead of a number 5. Consolidated template_dpg_voq_asic.j2 into dpg_asic.j2 6. Removed Loopback4096 from asic topology and parse it as dut inventory for multi-asic 7. Updated topo_facts parsing for asic topology 8. Renamed Internal BGP Session from <VoqChassisInternal> to <ChassisInternal>, taking switch_type as value. Signed-off-by: Abhishek Dosi <abdosi@microsoft.com>	2021-10-18 18:44:24 -07:00
Lawrence Lee	fad5ec47b4	[mux]: Call write_standby from host only Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2021-10-15 09:59:59 -07:00
Lawrence Lee	5232647b33	[mux]: Make write_standby available on host Signed-off-by: Lawrence Lee <lawlee@microsoft.com> [write_standby]: Cleanup and fix build Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2021-10-15 09:59:59 -07:00
Lawrence Lee	14403c61d2	[mux]: Initialize all mux ports as standby Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2021-10-15 09:59:59 -07:00
Tamer Ahmed	c9c2826520	Merged PR 3845699: [linkmgrd]: Introduce MUX cable linkmgrd Linkmgrd monitors link status, mux status, and link state. If the link becomes unhealthy, linkmgrd will trigger a mux switchover on a standby ToR, ensuring uninterrupted service to servers/blades. This PR is the initial implementation of linkmgrd. Also, the docker-mux container holds packages related to maintaining and managing the mux cable. It currently runs the linkmgrd binary, which monitors and switches the mux if needed. This PR also introduces mux-container and starts linkmgrd at startup when the build is configured with INCLUDE_MUX=y Edit: linkmgrd PR will follow. signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com> Related work items: #2315, #3146150	2021-10-15 09:59:59 -07:00
kellyyeh	df6361f50c	Change radv interval to 3min (#8882 )	2021-10-01 15:00:16 -07:00
Ye Jianquan	38500fa92e	Add gdb and pyrasite to ptf image (#8816 )	2021-09-24 17:10:48 +08:00
kellyyeh	62a1f5eb19	Add CLI Support for IPv6 Helpers and DHCPv6 Relay Counters (#8593 )	2021-09-23 22:01:26 -07:00
DINESH KUMAR SELLAPPAN	31a647a72d	[docker-sonic-mgmt]: Snappi version to 0.5.11 (#8790 )	2021-09-23 02:12:12 -07:00
kellyyeh	bc06c6fcb5	Incorporate DHCPv6 Relay Agent into dhcp-relay docker (#8321 )	2021-09-22 16:05:03 -07:00
shlomibitton	112fda7877	[Flex Counters] Reset flex counters delay flag on config DB when enable_counters script is called (#8500 ) #### Why I did it Reset flex counters delay flag on config DB when enable_counters script is called to allow enablement of flex counters in orchagent. #### How I did it Push to config DB 'false' value for delay indication when enable_counters script is called before enabling the counters. #### How to verify it Observe counters are created when enable_counters script is called.	2021-09-01 21:17:36 -07:00
richardyu	479f61404b	Add thrift in the docker-sonic-mgmt (#8623 ) Co-authored-by: richardyu-ms <richard.yu@microsoft.com>	2021-08-31 19:20:06 -07:00
shlomibitton	56533ceb9e	[dhcp_relay] Adapt config/show CLI commands to support DHCPv6 relay (#8211 ) #### Why I did it - Adapt config/show CLI commands to support DHCPv6 relay - Support multiple dhcp servers assignment in one command - Fix IP validation - Adapt UT and add new UT cases #### How I did it - Modify config/show dhcp relay files - Modify config/show UT files #### How to verify it This PR has a dependency on PR https://github.com/Azure/sonic-utilities/pull/1717 Build an image with the dependent PR and this PR Use config/show DHCPv6 relay commands.	2021-08-25 00:48:39 -07:00
Christian Svensson	f7de685be2	[mgmt-framework]: Fix typo in mgmt_vars.j2 (#8475 ) Signed-off-by: Christian Svensson <blue@cmd.nu>	2021-08-24 10:54:13 -07:00
Kostiantyn Yarovyi	6530f93881	[Pcied] run by python 3 Why I did it Pcied was running under python 2. How I did it Dropped python2 support and added python3 support for pcied in file docker-pmon.supervisord.conf.j2 How to verify it docker exec pmon supervisorctl status	2021-08-23 03:30:12 +00:00
Myron Sosyak	4d03526311	[docker-ptf] Upgrade to buster (#8254 ) Co-authored-by: Your Name <you@example.com>	2021-08-18 10:42:03 -07:00
xumia	a4405f09ed	Support to build armhf/arm64 platforms on arm based system (#7731 ) Why I did it Support to build armhf/arm64 platforms on arm based system without qemu simulator. When building the armhf/arm64 on arm based system, it is not necessary to use qemu simulator. How I did it Build armhf on armhf system, or build arm64 on arm64 system, by default, qemu simulator will not be used. When building armhf on arm64, and you have enabled armhf docker, then it will build images without simulator automatically. It is based on how the docker service is run. Docker base image change: For amd64, change from debian:to amd64/debian: For arm64, change from multiarch/debian-debootstrap:arm64- to arm64v8/debian: For armhf, change from multiarch/debian-debootstrap:armhf- to arm32v7/debian: See https://github.com/docker-library/official-images#architectures-other-than-amd64 The mapping relations: arm32v6 --- armel arm32v7 --- armhf arm64v8 --- arm64 Docker image armhf deprecated info: https://hub.docker.com/r/armhf/debian, using arm32v7 instead.	2021-08-12 22:24:37 +08:00
richardyu	9417fe9303	PTF adds unittest-xml-reporting (#8417 ) Co-authored-by: richardyu-ms <richard.yu@microsoft.com>	2021-08-11 20:55:21 -07:00
Blueve	aa01315f60	[ARM] Fix issue where the ping6 tool is missing from orchagent docker (#8345 ) Signed-off-by: Jing Kan jika@microsoft.com	2021-08-05 22:00:50 +08:00
Sujin Kang	447f0c64da	[pmon]: Enable Autorestart of the daemons in PMON for unexpected exit cases (#8326 ) Remove the daemon list from the critical_process which prevent the PMON from restarting when the individual daemon crashes.	2021-08-04 09:57:54 -07:00
VenkatCisco	0803f7bf34	[pmon]: add python3-jsonschema pmon (#8018 ) jsonschema is an implementation of JSON Schema for Python . Signed-off-by: Venkat Garigipati <venkatg@cisco.com>	2021-08-03 18:08:09 -07:00
vganesan-nokia	f9231723f9	[multiasic][voq][bgpconf] Fix for the issue of same BGP router id in all asics (#8049 ) For multiasic, the back end asics use the ip address of Loopback4096 for BGP router id. In VOQ multi-asic chassis there are no back end asics. All the asics are front end and the iBGP connections are established via Ethernet-IB of asics. Since these asics are not designated as BackEnd, the ip address of interface Loopback0 is used as BGP router id. Since the ip address of Loopback0 is the same for all the asics in the line card, the same router id is used for voq iBGP configurations and hence the iBGP connections are not established. Changes are done to fix this	2021-07-26 12:54:52 -07:00
Shi Su	8a48be9b74	Reduce route selection deferral timer for bgp graceful restart (#7533 ) Why I did it There are scenarios in which End-of-RIB from some of the peers arrives after reconciliation. In such scenarios, if the route selection deferral timer has the default value of 360 seconds, FRR would not set up routes and all routes would be removed after reconciliation. This PR reduces the route selection deferral timer so that at least routes to some of the peers get restored at the point of reconciliation. Fix #7488 How I did it Reduce route selection deferral timer for bgp graceful restart to 15 seconds.	2021-07-26 10:16:19 -07:00
賓少鈺	aa59bfeab7	[PDE]: introduce the SONiC Platform Development Env (#7510 ) The PDE silicon test harness and platform test harness can be found in src/sonic-platform-pdk-pde	2021-07-24 16:24:43 -07:00
slutati1536	de43c6a163	Added retry to sonic-mgmt docker container (#7997 ) Why I did it The motivation for this PR is to add retry_call to several test cases in the community. For example, test_show_platform_fanstatus_mocked and test_show_platform_temperature_mocked execute a command once and compare the output to the expected mock data; differences between the mock and the actual output sometimes cause the tests to fail. Retry will make these tests more stable. Retry is also more efficient than sleep, which makes the tests run longer because it is not always necessary to sleep the full time; retry runs a function only until it passes. How I did it Added retry to the docker file. How to verify it I ran the tests with retry on the docker after installing the retry package. Signed-off-by: Sharon Lutati <slutati@nvidia.com>	2021-07-20 09:28:10 -07:00
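The retry-versus-sleep argument in the commit above can be sketched in a few lines. This is a minimal illustration, not the `retry` package added by the PR: `retry_until_pass` and `flaky_check` are hypothetical names, and the helper is written with the standard library only so that it stays self-contained.

```python
import time


def retry_until_pass(func, tries=5, delay=0.01):
    """Hypothetical helper: call func until it stops raising AssertionError.

    Returns (result, attempts_used). Re-raises the last error if every
    attempt fails.
    """
    last_error = None
    for attempt in range(1, tries + 1):
        try:
            return func(), attempt
        except AssertionError as err:
            last_error = err
            if attempt < tries:
                # Wait only between attempts, never after the final one.
                time.sleep(delay)
    raise last_error


# Simulated flaky check: the expected output only appears on the third call.
calls = {"n": 0}


def flaky_check():
    calls["n"] += 1
    assert calls["n"] >= 3, "output not ready yet"
    return "ok"


result, attempts = retry_until_pass(flaky_check)
```

The check stops as soon as it passes, so a test pays only for the attempts it needs, whereas a fixed sleep always costs the full worst-case wait.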
shlomibitton	604becdd5c	[dhcp_relay] DHCP relay support for IPv6 (#7772 ) Why I did it Currently SONiC use the 'isc-dhcp-relay' package to allow DHCP relay functionality on IPv4 networks only. This will allow the IPv6 functionality along the IPv4 type. How I did it Edit supervisord template to start DHCPv6 instances when configured to do so on Config DB. Align cfg unit test to the new change. Add DHCPv6 relay minigraph parsing support and a suitable t0 topology xml file for UT. How to verify it Configure DHCPv6 agents as described on the feature HLD: Azure/SONiC#765 Test it with real client/server with IPv6 or use the dedicated automatic test: Azure/sonic-mgmt#3565 Signed-off-by: Shlomi Bitton <shlomibi@nvidia.com> * Split docker-dhcp-relay.supervisord.conf.j2 template into several files for easier code maintenance	2021-07-16 07:31:05 -07:00
Stepan Blyshchak	b3b6938fda	[dhcp-relay] make DHCP relay an extension (#6531 ) - Why I did it Make DHCP relay docker an extension. DHCP relay now carries dhcp relay commands CLI plugin and has a complete manifest. It is installed as extension if INCLUDE_DHCP_RELAY is set to y. DEPENDS on #5939 - How I did it Modify DHCP relay docker makefile and dockerfile. Make changes to sonic_debian_extension.j2 to install sonic packages. I moved DHCP related CLI tests from sonic-utilities to DHCP relay docker. This PR introduces a way to write a plugin as part of docker image and run the tests from cli-plugin-tests directory under docker directory. The test result is available in target/docker-dhcp-relay.gz.log: [ REASON ] : target/docker-dhcp-relay.gz does not exist NON-EXISTENT PREREQUISITES: docker-start target/docker-config-engine-buster.gz-load target/python-wheels/sonic_utilities-1.2-py3-none-any.whl-install target/debs/buster/python3-swsscommon_1.0.0_amd64.deb-install [ FLAGS FILE ] : [] [ FLAGS DEPENDS ] : [] [ FLAGS DIFF ] : [] ============================= test session starts ============================== platform linux -- Python 3.7.3, pytest-3.10.1, py-1.7.0, pluggy-0.8.0 -- /usr/bin/python3 cachedir: .pytest_cache rootdir: /sonic/dockers/docker-dhcp-relay/cli-plugin-tests, inifile: plugins: cov-2.6.0 collecting ... 
collected 10 items test_config_dhcp_relay.py::TestConfigVlanDhcpRelay::test_plugin_registration PASSED [ 10%] test_config_dhcp_relay.py::TestConfigVlanDhcpRelay::test_config_vlan_add_dhcp_relay_with_nonexist_vlanid PASSED [ 20%] test_config_dhcp_relay.py::TestConfigVlanDhcpRelay::test_config_vlan_add_dhcp_relay_with_invalid_vlanid PASSED [ 30%] test_config_dhcp_relay.py::TestConfigVlanDhcpRelay::test_config_vlan_add_dhcp_relay_with_invalid_ip PASSED [ 40%] test_config_dhcp_relay.py::TestConfigVlanDhcpRelay::test_config_vlan_add_dhcp_relay_with_exist_ip PASSED [ 50%] test_config_dhcp_relay.py::TestConfigVlanDhcpRelay::test_config_vlan_add_del_dhcp_relay_dest PASSED [ 60%] test_config_dhcp_relay.py::TestConfigVlanDhcpRelay::test_config_vlan_remove_nonexist_dhcp_relay_dest PASSED [ 70%] test_config_dhcp_relay.py::TestConfigVlanDhcpRelay::test_config_vlan_remove_dhcp_relay_dest_with_nonexist_vlanid PASSED [ 80%] test_show_dhcp_relay.py::TestVlanDhcpRelay::test_plugin_registration PASSED [ 90%] test_show_dhcp_relay.py::TestVlanDhcpRelay::test_dhcp_relay_column_output PASSED [100%] =============================== warnings summary =============================== /usr/local/lib/python3.7/dist-packages/tabulate.py:7 /usr/local/lib/python3.7/dist-packages/tabulate.py:7: DeprecationWarning: Using or importing the ABCs from 'collections' instead of from 'collections.abc' is deprecated, and in 3.8 it will stop working from collections import namedtuple, Iterable -- Docs: https://docs.pytest.org/en/latest/warnings.html ==================== 10 passed, 1 warnings in 0.35 seconds =====================	2021-07-15 10:35:56 -07:00
Vivek Reddy	e439676455	autorestart inside restapi docker is disabled (#8006 ) Fix issue with critical process in the restapi docker restarting immediately after getting killed Signed-off-by: Vivek Reddy Karri <vkarri@nvidia.com>	2021-07-14 11:37:00 -07:00
Guohan Lu	4f2bc1fbed	Revert "Add ethtool to docker-platform-monitor (#8017 )" This reverts commit `1618aec370`.	2021-07-07 23:36:44 -07:00
Stepan Blyshchak	9dd05bb1f6	[docker-teamd]: Increase teammgrd timeout to allow graceful shutdown. (#7662 ) (#8045 ) NOTE: This is cherry-pick from 1911/2012 to master. - Why I did it To fix LAG IP configuration race - How I did it Extended timeout for teammgrd - How to verify it Add >80 router LAGs. Do config reload Signed-off-by: Nazarii Hnydyn <nazariig@nvidia.com>	2021-07-07 14:48:29 +03:00
novikauanton	1da8c145a6	[iccpd][docker] fix initial startup configuration (#7982 ) #### Why I did it The process of config generation (sonic-cfggen) fails, but the services continue to run with invalid config #### How I did it * add exit with error on errors in start.sh script (because supervisord relies on start.sh return code). * fix jinja template. Jinja uses common python expressions under the hood and the `has_key` method was removed from dict in py3, so check with the `in` operator, which is supported by both py2 and py3. #### How to verify it * compile sonic with iccp enabled. * add mclag config to CONFIG_DB. ``` 'MC_LAG\|1' => { "local_ip": "10.0.0.2", "peer_ip": "10.0.0.3", "peer_link": "Ethernet8", "mclag_interface": "Ethernet12" } * unmask, enable and start swss and iccpd services in sonic. * log into the iccpd container and check the config file `/etc/iccpd/iccpd.conf` * expected config: ``` mclag_id:1 local_ip:10.0.0.2 peer_ip:10.0.0.3 peer_link:Ethernet8 mclag_interface:Ethernet12 system_mac:YOUR_SYSTEM_MAC #### Description for the changelog Fixed initial iccpd startup configuration.	2021-07-01 00:47:26 -07:00
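The `has_key` point in the commit above is plain Python behavior and can be checked outside Jinja. The sketch below reuses the sample MC_LAG values from the commit message; the variable names are illustrative only and are not taken from the iccpd template.

```python
# Sample values copied from the MC_LAG entry in the commit message.
mclag = {
    "local_ip": "10.0.0.2",
    "peer_ip": "10.0.0.3",
    "peer_link": "Ethernet8",
    "mclag_interface": "Ethernet12",
}

# dict.has_key() existed in Python 2 but was removed in Python 3,
# so a call such as mclag.has_key("peer_ip") raises AttributeError there.
dict_has_key_method = hasattr(mclag, "has_key")

# The `in` operator works on both Python 2 and Python 3.
peer_ip_present = "peer_ip" in mclag
system_mac_present = "system_mac" in mclag
```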
VenkatCisco	1618aec370	Add ethtool to docker-platform-monitor (#8017 ) #### Why I did it ethtool can be used to query and change settings such as speed, auto- negotiation and checksum offload on many network devices, especially Ethernet devices. #### How I did it add package extension to docker-platform-monitor/Dockerfile.j2	2021-06-30 09:36:47 -07:00
VenkatCisco	c5855eba08	Add libpci3 pkg to docker-platform-monitor (#8016 ) #### Why I did it The libpci library provides portable access to configuration registers of devices connected to the PCI bus. #### How I did it update dockers/docker-platform-monitor/Dockerfile.j2	2021-06-30 09:35:16 -07:00
thomas.cappleman@metaswitch.com	101b1fa08b	[build]: Fix sonic-cfggen contextlib err (#7996 ) A recent version of contextlib2 (https://pypi.org/project/contextlib2/21.6.0/#history) has broken Python2 compatibility, so the version picked up by netaddr when using Python2 must be specified, or else builds fail Co-authored-by: Tom Zhu <tom.zhu@metaswitch.com>	2021-06-28 17:15:03 -07:00
arlakshm	ef67ba5f6e	[multi-asic] fix network command for internal loopback (#7878 ) Signed-off-by: Arvindsrinivasan Lakshmi Narasimhan <arlakshm@microsoft.com> In the multi asic platforms all the ASIC are advertising the same IPv6 /64 network from Loopback4096. Therefore, the IPv6 loopback address of backend asic is not learnt on the frontend asic. Change the bgpd.conf.main.conf.j2 template file to advertise the Loopback4096 ipv6 address as /128	2021-06-24 12:02:01 -07:00
Shi Su	f52ba3b496	Remove quagga-related code (#7898 ) Why I did it Quagga is no longer being used. Remove quagga-related code (e.g., docker-fpm-quagga, sonic-quagga, etc.). How I did it Remove quagga-related code.	2021-06-23 09:15:56 -07:00
Qi Luo	658ed4fd37	Revert "Remove quagga related code (#7476 )" (#7831 ) Reverts Azure/sonic-buildimage#7476 It remove bgpd.conf.j2 and zebra.conf.j2, which is still used by sonic-config-engine unit test.	2021-06-09 18:52:45 -07:00
ngoc-do	710563f83d	[fabric] Disable unnecessary processes in swss and the orchagent-portsyncd dependency for fabric asic (#5569 ) * Disable unnecessary processes in swss for fabric asic Signed-off-by: ngocdo <ngocdo@arista.com>	2021-06-09 10:53:47 -07:00
Andriy Yurkiv	0c2521b936	Set default values only on the first start (#7735 )	2021-06-09 18:39:22 +08:00
Shi Su	62a4603eef	Remove quagga related code (#7476 ) Why I did it Quagga is no longer being used. Remove quagga-related code (e.g., docker-fpm-quagga, sonic-quagga, etc.). How I did it Remove quagga-related code.	2021-06-07 16:44:54 -07:00
yozhao101	1a3cab43ac	[Monit] Deprecate the feature of monitoring the critical processes by Monit (#7676 ) Signed-off-by: Yong Zhao yozhao@microsoft.com Why I did it Currently we leveraged the Supervisor to monitor the running status of critical processes in each container and it is more reliable and flexible than doing the monitoring by Monit. So we removed the functionality of monitoring the critical processes by Monit. How I did it I removed the script process_checker and corresponding Monit configuration entries of critical processes. How to verify it I verified this on the device str-7260cx3-acs-1.	2021-06-04 10:16:53 -07:00
Kwan	1347f29178	[docker-mgmt-framework]: update mgmt framework docker to support sonic-cli cmd (#6148 ) - Why I did it migrate to python3 support add dependent packages for Klish allow login as non-root user - How I did it update sonic-cli script to start Klish with user name, system name and timeout update the Dockerfile.j2 to resolve dependent packages add python3-dev for Klish use - How to verify it Incremental buster build with Azure/sonic-mgmt-framework#76 and verify the sonic-cli - Description for the changelog Migrate to python3.7 support, update sonic-cli script and resolve package dependencies	2021-06-02 19:38:21 -07:00
ppikh	3ad4f79fea	[sonic-mgmt docker]: Added allure-pytest library to sonic-mgmt docker container (#7665 ) * Modified Dockerfile.j2 - added allure-pytest library Signed-off-by: Petro Pikh <petrop@nvidia.com>	2021-06-02 08:42:30 -07:00
Myron Sosyak	3bf60b3db2	[docker-database] Fix Python3 issue (#7700 ) #### Why I did it To avoid the following error ``` Traceback (most recent call last): File "/usr/local/bin/flush_unused_database", line 10, in <module> if 'PONG' in output: TypeError: a bytes-like object is required, not 'str' ``` The `communicate` method returns strings if the streams were opened in text mode; otherwise, it returns bytes. In our case the `text` arg in Popen is not true, which means that `communicate` returns bytes #### How I did it Set `text=True` to get strings instead of bytes #### How to verify it run `/usr/local/bin/flush_unused_database` inside database container	2021-05-31 05:36:24 -07:00
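The bytes-versus-str behavior described in the commit above can be reproduced with a short sketch. It is not the `flush_unused_database` script itself: as an assumption for portability, the child process is the current Python interpreter printing `PONG`, standing in for the real command, and `run_ping` is a hypothetical helper name.

```python
import subprocess
import sys


def run_ping(text_mode):
    """Run a stand-in command and return whatever communicate() gives back."""
    proc = subprocess.Popen(
        [sys.executable, "-c", "print('PONG')"],
        stdout=subprocess.PIPE,
        text=text_mode,  # text=True decodes the stream to str (Python 3.7+)
    )
    output, _ = proc.communicate()
    return output


bytes_out = run_ping(False)  # bytes: `'PONG' in bytes_out` raises TypeError
str_out = run_ping(True)     # str: `'PONG' in str_out` works as intended
```

Without `text=True`, the original check would have to compare against `b'PONG'` instead.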

1065 Commits