sonic-buildimage

Author	SHA1	Message	Date
Richard.Yu	a096363b48	[broadcom]: Set default SYNCD_SHM_SIZE for Broadcom XGS devices (#13297 ) After upgrade to brcmsai 8.1, the sdk running environment (container) recommended with mininum memory size as below TH4/TD4(ltsw) uses 512MB TH3 used 300MB Helix4/TD2/TD3/TH/TH 256 MB Base on this requirement, adjust the default syncd share memory size and set the memory size for special ACISs in platform_env.conf file for different types of Broadcom ASICs. How I did it Add the platform_env.conf file if none of it for broadcom platform (base on platform_asic file) Add the 'SYNCD_SHM_SIZE' and set the value for ltsw(TD4/TH4) devices set to 512M at least (update the platform_env.conf) for Td2/TH2/TH devices set to 256M for TH3 set to 300M verify How to verify it verify the image with code fix Check with UT Check on lab devices On a problematic device which cannot start successfully Run with the command $ cat /proc/linux-kernel-bde Broadcom Device Enumerator (linux-kernel-bde) Module parameters: maxpayload=128 usemsi=0 dmasize=32M himem=(null) himemaddr=(null) DMA Memory (kernel): 33554432 bytes, 0 used, 33554432 free, local mmap No devices found $ docker rm -f syncd syncd $ sudo /usr/bin/syncd.sh start Cannot get Broadcom Chip Id. Skip set SYNCD_SHM_SIZE. Creating new syncd container with HWSKU Force10-S6000 a4862129a7fea04f00ed71a88715eac65a41cdae51c3158f9cdd7de3ccc3dd31 $ docker inspect syncd \| grep -i shm "ShmSize": 67108864, "Tag": "fix_8.1_shm_issue.67873427-9f7ca60a0e", On Normal device $ docker inspect syncd \| grep -i shm "ShmSize": 268435456, "Tag": "fix_8.1_shm_issue.67873427-9f7ca60a0e" change the config syncd_shm.ini to b85=128m $ docker rm -f syncd syncd $ sudo /usr/bin/syncd.sh start Creating new syncd container with HWSKU Force10-S6000 3209ffc1e5a7224b99640eb9a286c4c7aa66a2e6a322be32fb7fe2113bb9524c $ docker inspect syncd \| grep -i shm "ShmSize": 134217728, "Tag": "fix_8.1_shm_issue.67873427-9f7ca60a0e", change the config under /usr/share/sonic/device/x86_64-dell_s6000_s1220-r0/Force10-S6000/platform_env.conf and run command $ cat /usr/share/sonic/device/x86_64-dell_s6000_s1220-r0/platform_env.conf SYNCD_SHM_SIZE=300m $ sudo /usr/bin/syncd.sh start Creating new syncd container with HWSKU Force10-S6000 897f6fcde1f669ad2caab7da4326079abd7e811bf73f018c6dacc24cf24bfda5 $ docker inspect syncd \| grep -i shm "ShmSize": 314572800, "Tag": "fix_8.1_shm_issue.67873427-9f7ca60a0e", Signed-off-by: richardyu-ms <richard.yu@microsoft.com>	2023-01-30 20:23:03 -08:00
Yaqiang Zhu	bb48ee92ab	[dhcp-relay] Add support for dhcp_relay config cli (#13373 ) Why I did it Currently the config cli of dhcpv4 is may cause confusion and config of dhcpv6 is missing. How I did it Add dhcp_relay config cli and test cases. config dhcp_relay ipv4 helper (add \| del) <vlan_id> <helper_ip_list> config dhcp_relay ipv6 destination (add \| del) <vlan_id> <destination_ip_list> Updated docs for it in sonic-utilities: https://github.com/sonic-net/sonic-utilities/pull/2598/files How to verify it Build docker-dhcp-relay.gz with and without INCLUDE_DHCP_RELAY, and check target/docker-dhcp-relay.gz.log	2023-01-30 17:48:01 -08:00
kenneth-arista	8c2d8ea4af	[device/arista] Reduce SDK stat polling freq in DNX devices (#13429 ) Eariler the SDK stat polling was erroneously set to once every msec which is far more frequent than required by SWSS. The new setting, which is consistent with other vendor SKUs, is once a second. The net result is reduced CPU MHz by syncd.	2023-01-30 14:13:01 -08:00
Oleksandr Ivantsiv	c7ecd92c54	Clear DNS configuration received from DHCP during networking reconfiguration in Linux. (#13516 ) - Why I did it fixes #12907 When the management interface IP address configuration changes from dynamic to static the DNS configuration (retrieved from the DHCP server) in /etc/resolv.conf remains uncleared. This leads to a DNS configuration pointing to the wrong nameserver. To make the behavior clear DNS configuration received from DHCP should be cleared. - How I did it Use resolvconf package for managing DNS configuration. It is capable of tracking the source of DNS configuration and puts the configuration retrieved from the DHCP servers into a separate file. This allows the implementation of DNS configuration cleanup retrieved from DHCP during networking reconfiguration. - How to verify it Ensure that the management interface has no static configuration. Check that /etc/resolv.conf has DNS configuration. Configure a static IP address on the management interface. Verify that /etc/resolv.conf has no DNS configuration. Remove the static IP address from the management interface. Verify that /etc/resolv.conf has DNS configuration retrieved form DHCP server.	2023-01-30 22:13:10 +02:00
Liu Shilong	cabaebb4b0	[action] Update github actions on trigger and label. (#13542 ) Why I did it github action will report error on forked repos. It is not by design. keep 'Approved for xxx branch' label in auto cherry pick workflow. How I did it Disable github action on folked repos. Keep 'approved for xxx' label in auto cherry pick workflow. How to verify it Which release bra	2023-01-30 16:57:39 +08:00
Junchao-Mellanox	b59f3888ff	[sonic-acl.yang] Add new ACL key BTH_OPCODE and AETH_SYNDROME (#13340 ) - Why I did it Add new ACL key BTH_OPCODE and AETH_SYNDROME - How I did it Add new ACL key BTH_OPCODE and AETH_SYNDROME - How to verify it manual test unit test	2023-01-29 13:44:35 +02:00
jingwenxie	fdfb35973f	[submodule] updater sonic-utilities (#13501 ) Includes below commits ``` 0d5e68f5a [GCU] Ignore bgpraw table in GCU operation (#2628) 22757b1f3 Add interface link-training command into the CLI doc (#2257) f4f857e10 [GCU] Ignore bgpraw in GCU applier (#2623) b5ac60036 [muxcable][config] Add support to enable/disable ceasing to be an advertisement interface when `radv` service is stopped (#2622) 981f9531e [chassis][voq] Add "show fabric reachability" command. (#2528) fba87f43f Revert (#2599) d6d7ab37f [warm-reboot] Use kexec_file_load instead of kexec_load when available (#2608) db4683d40 fix show techsupport error (#2597) 3d8e9c62d [GCU] Prohibit removal of PFC_WD POLL_INTERVAL field (#2545) 163e766cc [techsupport] include APPL_STATE_DB dump (#2607) 8703773eb YANG Validation for ConfigDB Updates: RADIUS_SERVER (#2604) c2d746d4f Remove TODO comment which is no longer relevant (#2600) f09da9983 [show] Add bgpraw to show run all (#2537) 39ac5641b Extend fast-reboot STATE_DB entry timer (#2577) ```	2023-01-27 11:48:14 -08:00
Devesh Pathak	c93716a142	rsyslog to start after interfaces-config (#13503 ) Fixes #12408 Why I did it We are running into #12408 very frequently. This results in no syslogs from any containers as rsyslog server could not start. some of the sonic-mgmt scripts look for log statements and error out if log is not present. Interfaces-config service configures the loopback interface along with other interfaces. rsyslog-config reads ip address of loopback interface and generates /etc/rsyslog.conf. When this race condition happens, lo interface ip is not yet programmed and rsyslog-config ends up writing UDP server as null in /etc/rsyslog.conf. How I did it rsyslog-config service is started after interfaces-config service. How to verify it Did multiple reboots and verified that $UDPServerAddress is valid.	2023-01-26 20:39:13 -08:00
Jing Zhang	dabb31c5f6	[sudoers] add `/usr/local/bin/storyteller` to `READ_ONLY_CMDS` (#13422 ) Adding /usr/local/bin/storyteller to READ_ONLY_CMDS. So no write access or prompt for password is needed to run storyteller. Tested on 202205 clusters, user who didn't request write access was able to grep log using storyteller. sign-off: Jing Zhang zhangjing@microsoft.com	2023-01-26 20:38:29 -08:00
xumia	77745f55cc	[FIPS] Upgrade Open-SymCrypt version to 0.6 (#13461 ) Why I did it [FIPS] Upgrade Open-SymCrypt version to 0.6 Improve the SymCrypt performance Support to download the debug packages from storage account in version 0.6. How I did it Upgrade to symcrypt-openssl from version 0.4 to version 0.6 Changes in https://github.com/sonic-net/sonic-fips: 0c29b23 Upgrade the submodules: SymCrypt and SymCrypt-OpenSSL #40 80022f3 Fix the ARM64 build failure 2e76a3d Disable the unsupported tests Other changes will be added as well: 55b8e0a Merge pull request #35 from xumia/change-license 120c1a7 Upgrade SymCrypt and SymCrypt-OpenSSL 2f9c084 Merge pull request #39 from liuh-80/dev/liuh/update-openssh-version a3be6c5 Revert openssh version e02fa1e Update fips version How to verify it	2023-01-27 11:54:44 +08:00
mihirpat1	24bdfc1bb2	[platform-common] Advance submodule head (#13515 ) Update sonic-platform-common submodule head to include: 38a7a65 mihirpat1 Wed Jan 25 09:49:05 2023 -0800 Change get_tx_bias return type to list (sonic-net/sonic-platform-common#342) ecb7dde qinchuanares Sat Jan 21 11:24:37 2023 -0800 add SOP ROC in bulk status (sonic-net/sonic-platform-common#341) Signed-off-by: Mihir Patel <patelmi@microsoft.com>	2023-01-26 11:36:10 -08:00
Volodymyr Samotiy	fd8d678927	[Mellanox] Update SDK/FW to 4.5.4150/2010.4150 (#13480 ) - Why I did it To include latest fixes and new functionality SDK/FW 1. Fixed bug in recovery mechanism in case of I2C error when trying to access the XSFP module. 2. On the NVIDIA Spectrum-2 switch, when receiving a packet with Symbol Errors on ports that are configured to cut-thought mode, a pipeline might get stuck. 3. On the Spectrum-2 and Spectrum-3 switch, if you enable ECN marking and the port is in split mode, traffic sent to the port under congestion (for example, when connecting two ports with a total speed of 50GbE to a single 25GbE port) is not marked. 4. Modifying existing entry/Adding new one when switch is at its maximum capacity (full by maximum allowed entries from any type such as routes, FDB, and so forth), will fail with an error. 5. When many ports are active (e.g., 70 ports up), and the configuration of shared buffer is applied on the fly, occasionally, the firmware might get stuck. 6. When a system has more than 256 ACL rules, on rare occasion, removing/adding rules may cause some ACL rules not to work. 7. On SN2201 system, on RJ45 port, the link might appear in 'down' state even if it operations properly. 8. Layer 4 port information is not initialized for BFD packet event. To address the issue, remote peer UDP port information was added in BFD packet event. 9. When setting LAG as a SPAN analyzer, the distributor mode of the LAG members was not taken into account. It may happen that the LAG member with distributor mode disabled will be set as a SPAN analyzer port. - How I did it Updated SDK/SAI submodule and relevant makefiles with the required versions. - How to verify it Build an image and run tests from "sonic-mgmt". Signed-off-by: Volodymyr Samotiy <volodymyrs@nvidia.com>	2023-01-26 12:41:22 +02:00
Mai Bui	2f2702f705	Revert "[system-health] Remove subprocess with shell=True (#12572 )" (#13505 ) This reverts commit `b3a8167968`. Due to issue https://github.com/sonic-net/sonic-buildimage/issues/13432	2023-01-25 13:41:08 -08:00
DavidZagury	4cc84c68dc	[Mellanox] Improve FW upgrade logging (#13465 ) - Why I did it To improve ASIC FW upgrade logging and have information about the cause of FW update failure in the log. - How I did it Added syslog logger support In case the FW update has failed the update tool will give the cause of the failure in the output in the last line, starting with "Fail". When running the tool, in case of a failed update, we will parse the output to retrieve the cause and log it. Device #1: ---------- Device Type: ConnectX6DX Part Number: MCX623106AN-CDA_Ax Description: ConnectX-6 Dx EN adapter card; 100GbE; Dual-port QSFP56; PCIe 4.0/3.0 x16; PSID: MT_0000000359 PCI Device Name: /dev/mst/mt4125_pciconf0 Base GUID: 0c42a103007d22d4 Base MAC: 0c42a17d22d4 Versions: Current Available FW 22.32.0498 22.32.0498 PXE 3.6.0500 3.6.0500 UEFI 14.25.0015 14.25.0015 Status: Forced update required --------- Found 1 device(s) requiring firmware update... Device #1: Updating FW ... FSMST_INITIALIZE - OK Writing Boot image component - OK Fail : The Digest in the signature is wrong - How to verify it mlnx-fw-upgrade.sh --upgrade	2023-01-25 20:53:39 +02:00
Lior Avramov	9a49aec570	[Mellanox] [ECMP calculator] Add script usage and more information to script description in help option (#13493 ) Add script usage and more information to script description being printed in help option. - Why I did it Missing information in script description in help option. - How I did it Expand script description and add script usage. - How to verify it Run the script with -h option.	2023-01-25 20:50:38 +02:00
Guohan Lu	d84deafdea	Revert "[build] Migrate libyang2 sources download from wget to dget (#13394 )" This reverts commit `9a0bf56a15`.	2023-01-25 02:17:40 -08:00
Sudharsan Dhamal Gopalarathnam	03348c44ac	[yang] Added Tunnel flex counter group (#13483 ) - Why I did it Fixes https://github.com/sonic-net/sonic-buildimage/issues/13457 Added Tunnel flex counter group - How I did it Added relevant container in sonic-flex_counter yang model - How to verify it Added UT to verify	2023-01-25 08:56:13 +02:00
Jing Zhang	78f249be38	change default to be on (#13495 ) Changing the default config knob value to be True for killing radv, due to the reasons below: Killing RADV is to prevent sending the "cease to be advertising interface" protocol packet. RFC 4861 says this ceasing packet as "should" instead of "must", considering that it's fatal to not do this. In active-active scenario, host side might have difficulty distinguish if the "cease to be advertising interface" is for the last interface leaving. 6.2.5. Ceasing To Be an Advertising Interface shutting down the system. In such cases, the router SHOULD transmit one or more (but not more than MAX_FINAL_RTR_ADVERTISEMENTS) final multicast Router Advertisements on the interface with a Router Lifetime field of zero. In the case of a router becoming a host, the system SHOULD also depart from the all-routers IP multicast group on all interfaces on which the router supports IP multicast (whether or not they had been advertising interfaces). In addition, the host MUST ensure that subsequent Neighbor Advertisement messages sent from the interface have the Router flag set to zero. sign-off: Jing Zhang zhangjing@microsoft.com	2023-01-24 23:59:54 +00:00
Zain Budhwani	2068a2697a	Change bgp notification leaf name and mem_usage leaf type (#13012 ) #### Why I did it Improve naming convention for bgp notification events and change type of leaf for sonic-events-host mem usage from uint64 to decimal64 #### How I did it Replace "-" with "_" Replace uint64 with decimal64 #### How to verify it Run yang model unit tests #### Description for the changelog Change YANG model leaf naming convention for bgp notification	2023-01-24 15:47:32 -08:00
Zain Budhwani	c9a33cb00e	Fix segfault issue inside memory_checker (#13066 ) #### Why I did it Segfault was occuring when running memory_checker #### How I did it Deinit publisher immediately after publishing #### How to verify it Manual testing	2023-01-24 15:30:41 -08:00
Marty Y. Lok	fd3966a0b8	[Nokia][sonic-platform] Update sonic-platform submodule for Nokia IXR7250E platform (#13437 ) Why I did it Update Nokia sonic-platform submodule 81a9c77 [Supervisor] Modifed the get_description to fix the name for Nokia-IXR7250E-SUP-10 card. e49ddfb Fix the LedContorlCommon to get the physical index from port mapping dd143f1 [module] modify the chassis.py and module.py to allow supervisor to retrieve the line card eemprom info How I did it Update Nokia sonic-platform submodule 81a9c77 [Supervisor] Modifed the get_description to fix the name for Nokia-IXR7250E-SUP-10 card. e49ddfb Fix the LedContorlCommon to get the physical index from port mapping dd143f1 [module] modify the chassis.py and module.py to allow supervisor to retrieve the line card eemprom info How to verify it On supervisor, "show chassis module status" should show Nokia-IXR7250E-SUP-10 instead of Nokia-IXR7250-SUP-10 Signed-off-by: mlok <marty.lok@nokia.com>	2023-01-24 11:40:59 -08:00
Dror Prital	940e2cd9bf	[Mellanox] Add ASIC simulation version tag to fw.mk (#13470 ) Signed-off-by: dprital <drorp@nvidia.com>	2023-01-23 13:30:02 +02:00
Jing Zhang	260a2ec3e7	[dualtor][active-active]Killing radv instead of stopping on `active-active` dualtor if config knob is on (#13408 ) How I did it radv sends a good-bye packet when the service is stopped, which causes a IPv6 route update on SoC side. And this update leads to an interface bouncing and causes traffic disruption even though the ToR device might already be isolated. This PR is to mitigate the traffic disruption issue during planned maintenance, by killing radv instead of stopping. So the cease packet won't be sent. How to verify it Verified on dev clusters: Traffic disruption was no longer reproducible. radv took the killing path if knob was off, radv would take the stopping path sign-off: Jing Zhang zhangjing@microsoft.com	2023-01-20 15:34:34 -08:00
abdosi	439d4eab98	[chassis] Fixed critical process not correct for database-chassis docker (#13445 ) *Critical process for database-chassis is redis-chassis but critical_process contains hard-coded to `redis` program always. Instead using jinja2 template to render critical process list based on database docker type. redis-chassis for database-chassis docker and redis for regular database docker.	2023-01-20 10:21:48 -08:00
bingwang-ms	b03a65f331	Support both port name and alias in ACL table `AttachTo` attribute (#13444 ) Why I did it This PR is an enhancement of PR #13105 Because the input string of AttachTo for ACL table can appear in both port name group and port alias group, I added a logic to determine whether the string should be port name or port alias If all the input strings belong to port name group, then we treat all of them as port name If all the input strings belong to port alias, then we treat all of them as port alias If all the input string belongs to both port alias group and port name group, we prefer port alias. The behavior is as before. How I did it Walk through all port names/alias in the input to make a decision. How to verify it Verified by adding UT.	2023-01-20 10:11:39 -08:00
mihirpat1	568e966ff1	[platform-daemon] Advance submodule head (#13428 ) a931d6c Prince George Wed Jan 18 19:10:55 2023 -0800 [Xcvrd]: Fix optics insertion/removal not detected (#333) 2211b7e mihirpat1 Wed Jan 18 16:00:22 2023 -0800 Xcvrd should restart if any child thread crashes (#326) 753b550 judyjoseph Tue Jan 17 13:10:09 2023 -0800 Chassisd do an explicit stop of the config_manager (#328) 879d630 Tal Berlowitz Fri Jan 6 01:57:42 2023 +0200 Fix bug where transceiver info is missing after port breakout change (#329) e119b69 Junchao-Mellanox Tue Dec 13 19:54:49 2022 +0800 Remove TODO comments which are no longer needed (#325) Signed-off-by: Mihir Patel <patelmi@microsoft.com>	2023-01-20 09:46:35 -08:00
judyjoseph	96cecc385a	Add explicit dependency on sonic_platform_common (#13446 ) Why I did it Add explicit dependency on sonic_platform_common in sonic-chassisd mk. This was needed because sonic-chassisd depends on sonic-platform-base which is present in sonic-platform-common wheel package. How I did it Add explicit dependency on sonic_platform_common in sonic-chassisd mk. How to verify it Verified by building all platforms broadcom, mellanox, marvel_arm	2023-01-19 22:42:28 -08:00
Jing Zhang	d3812621cf	[linkmgrd] submodule update (#12859 ) ac24ad1 Liu Shilong Wed Nov 30 18:04:15 2022 +0800 Use github code scanning instead of LGTM (#157) 1c755c4 Jing Zhang Fri Nov 4 17:12:51 2022 -0700 [active-active] Incrementing BOOST_ASIO_STRAND_IMPLEMENTATIONS (#154) sign-off: Jing Zhang zhangjing@microsoft.com	2023-01-19 11:17:12 -08:00
Ying Xie	e0ed5f968f	[Arista] add support for hardware sku Arista-7260CX3-D92C16 (#13438 ) Signed-off-by: Ying Xie <ying.xie@microsoft.com> Signed-off-by: Ying Xie <ying.xie@microsoft.com>	2023-01-19 11:08:21 -08:00
Guilt	9a0bf56a15	[build] Migrate libyang2 sources download from wget to dget (#13394 ) According to its manual page, "[dget in its] first form, [..] fetches the requested URLs. If this is a .dsc or .changes file, then dget acts as a source-package aware form of wget: it also fetches any files referenced in the .dsc/.changes file. The downloaded source is then checked with dscverify and, if successful, unpacked by dpkg-source." Thus, when possible, dget use is preferable to wget so that sources authenticity can be performed automatically by dscverify" Signed-off-by: Guillaume Lambert <guillaume.lambert@orange.com>	2023-01-19 09:19:54 -08:00
Guilt	e08914769b	[build] Fix SONIC_USERFACL_DOCKERD_FOR_MULTIARCH typo in Makefile.work (#13390 ) The variable name SONIC_USERFACL_DOCKERD_FOR_MULTIARCH is mispelled in Makefile.work Signed-off-by: Guillaume Lambert <guillaume.lambert@orange.com>	2023-01-19 09:18:57 -08:00
Ikki Zhu	eba30ff26f	[Celestica Seastone] fix multi sonic platform issues (#13356 ) Why I did it Fix the following issues for Seastone platform: - system-health issue: show system-health detail will not complete #9530, Celestica Seastone DX010-C32: show system-health detail fails with 'Chassis' object has no attribute 'initizalize_system_led' #11322 - show platform firmware updates issue: Celestica Seastone DX010-C32: show platform firmware updates #11317 - other platform optimization How I did it Modify and optimize the platform implememtation. How to verify it Manual run the test commands described in these issues.	2023-01-18 16:27:48 -08:00
Marty Y. Lok	e1f0d7650e	[Nokia][sonic-platform] Update sonic-platform submodule for Nokia IXR7250E (#13145 ) fcb45b5 Add MDIPC channel cleanup code at signal-based termination time and don't precache in get_presence unless required 8984b3d Properly synchronize transceiver module presence globally Signed-off-by: mlok <marty.lok@nokia.com> Signed-off-by: mlok <marty.lok@nokia.com>	2023-01-18 15:47:02 -08:00
Samuel Angebault	dfaf379e27	[Arista] Update platform library submodules (#13398 ) - add module reboot APIs for chassis - add supervisor module on linecard (fixes show chassis module midplane-status) - improve RTC update mechanism and sync every 10 mins - fix sbtsi temp sensor presence/thresholds - fix Mineral status leds - remove thermal object on xcvrs - misc fixes	2023-01-18 10:03:48 -08:00
Jemston Fernando	892f26556c	[platform]: Fix Belgite platform issues (#13389 ) As part of platform hardening this commit fixes several platform issues in various components like PSU, FAN, Temperature, LED.	2023-01-18 10:00:07 -08:00
Yoush	63f2ab2cc3	[BugFix] Fix the bug that it gets error system-mac of centec platform (#12721 ) Why I did it When getting system mac of centec platform, it would increase by 1 the last byte of mac, but it could not consider the case of carry. How I did it Firstly, I would replace the ":" with "" of mac to a string. And then, I would convert the mac from string to int and increase by 1, at last convert it to string with inserting ":".	2023-01-18 09:24:28 -08:00
Liu Shilong	d55913a679	[build] Check if patches are applied before applying patches (#13386 ) Why I did it If make fails, we can't rerun the make process, because existing patches can't apply again. How I did it Check if patches are applied. if yes, don't apply patches again. How to verify it	2023-01-18 13:35:11 +08:00
Lawrence Lee	5bb8c1a485	[PTF] Patch PTF library to use correct VXLAN module (#13155 ) Why I did it The current PTF library contains a typo - when building a VxLAN packet, it uses the VxLAN module directly from the scapy library which will cause test failures. How I did it Patch simple_vxlan_packet to use the VxLAN module wrapped/defined in packet.py from the PTF library. Signed-off-by: Lawrence Lee <lawlee@microsoft.com>	2023-01-17 15:03:13 -08:00
Tomer Shalvi	2d2d9433b3	Moving multiprocessing.Manager to the correct sub-process (#13377 ) Why I did it There is a queue in sysmonitor.py that is created based on an object of multiprocessing.Manager. After performing fast-reboot, system health monitor is being shut down, what causes this Manager to be shut down as well, since it is a child-process of healthd. That's why I moved the creation of this Manager from the top of the file to the function Sysmonitor.system_service() (The only place it is used), to make Manager a child-process of Sysmonitor, instead of Healthd. This way both the queue (the Manager) and the processes that uses this queue will be child-processes of the same process, and the problematic scenario of sysmonitor sending messages to a dead queue will not be possible. How I did it Removed the definition of manager as global and moved it to system_service() function How to verify it Perform a fast reboot and verify the traceback issue is fixed	2023-01-17 08:43:49 -08:00
xumia	5e4a866e33	[Build] Support Debian snapshot mirror to improve build stability (#13097 ) Why I did it [Build] Support Debian snapshot mirror to improve build stability It is to enhance the reproducible build, supports the Debian snapshot mirror. It guarantees all the docker images using the same Debian mirror snapshot and fixes the temporary build failure which is caused by remote Debain mirror indexes changed during the build. It is also to fix the version conflict issue caused by no fixed versions of some of the Debian packages. How I did it Add a new feature to support the Debian snapshot mirror. How to verify it	2023-01-13 16:16:35 +08:00
bingwang-ms	22fcc760c4	[minigraph]: Support port name in ACL table AttachTo attribute (#13105 ) Why I did it This PR is to update minigraph.py to support both port alias and port name as input of AttachTo attribute of ACL table. Before this change, only port alias is supported. How I did it Add a global variable to store port names Search both port names and port alias wheh parsing the value of AttachTo. How to verify it Verified by a new unit test case test_minigraph_acl_attach_to_ports Verified by copying the new minigraph.py to a testbed and run conflg load_minigraph.	2023-01-12 23:54:25 -08:00
Graham Hayes	e077b5362c	[Arista] Rely on automatic flash size detection for Raven (#13277 ) Many of these switches have had flash upgraded beyond 2G however, in boot0 both were assigned 2GB for legacy reasons. Remove the hardcoding of the flash size and let boot0 autodetect the available space. Signed-off-by: Graham Hayes <gr@ham.ie> Signed-off-by: Graham Hayes <gr@ham.ie>	2023-01-12 23:52:40 -08:00
Ikki Zhu	4539035e90	[Seastone] Enhancement fix for PR12200 syseeprom issue (#13344 ) Why I did it [Seastone] Enhancement fix for PR12200 syseeprom issue. How I did it Enhance the fix through replace the hardcoded devnum to bash variable How to verify it show platform syseeprom or decode-syseeprom	2023-01-12 23:51:33 -08:00
kenneth-arista	06d55b8027	[device/arista] Disabled polled_irq_mode for DNX SKUs (#13349 ) Disabled polled_irq_mode for all Arista DNX devices as this mode leads to excessive use of the CPU via an unneeded interrupt polling thread.	2023-01-12 23:48:37 -08:00
pettershao-ragilenetworks	bce4aa1412	[ragile] adapter for kernel 5.x (#10762 ) Why I did it Ragile adapter ra-b6510-32c ra-b6510-48v8c ra-b6910-64c ra-b6920-4s to kernel 5.x Signed-off-by: “pettershao” pettershao@ragilenetworks.com	2023-01-12 18:01:47 -08:00
Dror Prital	d9c75b3fa2	[submodule] Advance sonic-utilities pointer (#13333 ) Update sonic-utilities submodule pointer to include the following: * fb8f98b Preserve copp tables through DB migration ([#2524](https://github.com/sonic-net/sonic-utilities/pull/2524)) * 4aa512c [sfputil] Firmware download/upgrade CLI support for QSFP-DD (#1947) ([#2349](https://github.com/sonic-net/sonic-utilities/pull/2349)) * f63ef9a Revert sonic-utilities: Update config reload() to verify formatting of an input file (#2529) ([#2586](https://github.com/sonic-net/sonic-utilities/pull/2586)) * 3a09ecb [masic] 'show interfaces counters' reminds to use '-d all' option to check for internal links ([#2466](https://github.com/sonic-net/sonic-utilities/pull/2466)) * 65cf00a [storyteller] add link prober state change to story teller ([#2585](https://github.com/sonic-net/sonic-utilities/pull/2585)) Signed-off-by: dprital <drorp@nvidia.com>	2023-01-12 10:46:34 +02:00
shdasari	97161aeadb	SONiC YANG model for RADIUS. (#12749 ) #### Why I did it Added SONiC YANG model for RADIUS. Fixes https://github.com/sonic-net/sonic-buildimage/issues/12477 #### How I did it Added the RADIUS and RADIUS_SERVER tables for global and per RADIUS server configuration. RADIUS statistics reside in COUNTERS_DB and are not part of the configuration. These are not a part of this PR. #### How to verify it Compiled sonic_yang_mgmt-1.0-py3-none-any.whl. #### Description for the changelog SONiC YANG model for RADIUS.	2023-01-11 16:42:24 -08:00
Prince Sunny	21e507e22b	[Dash] Fix a typo (#13325 ) Fix a typo in yang for Dash	2023-01-11 11:24:47 -08:00
xumia	e6a01ca5eb	[Bug] Fix SONiC installation failure caused by pip/pip3 not found (#13284 ) The main issue is the pip/pip3 command cannot be found when the package is being installed by apt-get. When using the dpkg install, the searching path is PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin When using the apt-get install, the searching path is PATH=/usr/sbin:/usr/bin:/sbin:/bin But the pip/pip3 default path is at /usr/local/bin, so dpkg works, but apt-get not work. How I did it Export the path /usr/local/bin for pip/pip3. Make the deb packages can be installed by apt-get.	2023-01-11 08:54:24 -08:00
Kebo Liu	7873a9131d	[Mellanox] Skip the leftover hardware reboot cause in case of last boot is warm/fast reboot (#13246 ) - Why I did it In case of warm/fast reboot, the hardware reboot cause will NOT be cleared because CPLD will not be touched in this flow. To not confuse the reboot cause determine logic, the leftover hardware reboot cause shall be skipped by the platform API, platform API will return the 'REBOOT_CAUSE_NON_HARDWARE' instead of the "hardware" reboot cause. - How I did it Check the proc cmdline to see whether the last reboot is a warm or fast reboot, if yes skip checking the leftover hardware reboot cause. - How to verify it a. Manual test: - Perform a power loss - Perform a warm/fast reboot - Check the reboot cause should be "warm-reboot" or "fast-reboot" instead of "power loss" b. Run reboot cause related regression test. Signed-off-by: Kebo Liu <kebol@nvidia.com>	2023-01-11 16:50:46 +02:00

1 2 3 4 5 ...

7129 Commits