Commit Graph

2475 Commits

Author SHA1 Message Date
Wirut Getbamrung
a416f49676
[platform/cel-haliburton]: add watchdog service (#6259)
Haliburton needs a watchdog daemon to monitor the basic health of the machine. If something goes wrong, such as a crashing program overloading the CPU or the system running out of free memory, the watchdog can safely reboot the machine.
2020-12-26 03:04:21 -08:00
Volodymyr Samotiy
d609b406be
[Mellanox] Update SAI to version 1.14.3 (#6156)
* [SAI] Add PFC pause duration counters in microseconds
**- Why I did it**
To add PFC pause duration counters in microseconds
**- How I did it**
Updated SAI to version 1.14.3
**- How to verify it**

**- Description for the changelog**
[Mellanox] Update SAI to version 1.14.3
2020-12-14 23:51:14 -08:00
Shi Su
0574db2760
[201811][warm-reboot] Remove warmboot file path that overrides the default path (#6201)
This PR adds the changes in #6198 to 201811 branch to support warm-reboot image upgrade for kvm images.

The sai.profile file in kvm images overrides the warmboot file path with /var/cache/sai_warmboot.bin. Because the directory /var/cache is not mounted in syncd, it is cleared during an image upgrade, so a warm-reboot image upgrade fails if the file is stored there.

Remove the path that overrides the default. The warmboot file path then falls back to the default value, /var/warmboot/sai-warmboot.bin. Because /var/warmboot/ is mounted from /host/warmboot/ on the host, the file survives an image upgrade.
2020-12-13 22:48:29 -08:00
Shi Su
40bd77c915
[201811][syncd] Fix directory mount for vs syncd docker (#6200)
Since DOCKER_SYNCD_VS is no longer being used, the mount option does not properly mount the warmboot file directory. Fix the mount option so that the directory is properly mounted.
2020-12-13 22:42:02 -08:00
Sumukha Tumkur Vani
0eb8f773f4
Potential fix for Celestica E1031 device hang
set CPU max_cstate to 0
2020-12-04 12:39:47 -08:00
Ying Xie
93302d1810
[bcm SAI] Upgrade Broadcom SAI to version 3.5.3.5-3 (#5734)
- Include change to CS00011229318.

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2020-10-29 08:53:29 -07:00
Neetha John
6b96b8e4ac
[knet] Address Tx drop No DMA resource issue (#5727)
Signed-off-by: Neetha John <nejo@microsoft.com>
2020-10-28 08:29:16 -07:00
Aravind Mani
701a304b6d
[Dell S6100] Properly release memory upon ICH driver deinit (#5561)
During platform deinitialization, dell_ich is not removed properly, so when the s6100 platform is initialized again, the ICH driver sysfs attributes are not attached. As a result, get_transceiver_change_event returns an error, which causes xcvrd to crash.
2020-10-14 18:48:45 +00:00
pavel-shirshov
6c2801b846
teamd: fix possible race in master ifname callback (#4109)
- What I did
Ported a fix from libteam master to our master.
Fixes #4070
Fixes #3649

- How I did it
Applied patch jpirko/libteam@c723737 from upstream.

- How to verify it
Build an image for your DUT and warm-reboot the DUT 10 times. Check that all PortChannels are up and that there are no error messages in teamd.log.
2020-10-09 15:59:56 +00:00
Ying Xie
9ea38c417c
[rc.local] separate configuration migration and grub installation logic (#5528)
To address issue #5525

Explicitly control whether grub installation is required.
There are scenarios where configuration migration happens but grub
installation is not required.

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2020-10-05 15:11:35 +00:00
Ying Xie
4c80d996ba
Revert "[201811][platform/cel]: Port fancontrol changes for dx010/e1031 to 201811 branch (#4867)" (#5496)
This reverts commit c9d86f0587.
2020-09-29 18:02:47 -07:00
Ying Xie
81e9ec6be6
Revert "[201811][platform-cel]: Fix dx010 FSC error (#4949)" (#5485)
This reverts commit ec07d10748.
2020-09-29 08:01:02 -07:00
Aravind Mani
7e6fa15784
Dell S6100 fix mux log issue (#5413)
IOM completion log was not seen in syslog.
2020-09-21 12:19:07 -07:00
Aravind Mani
bee516e370
Dell S6100- Fix PCA MUX attachment issue (#5401)
* Dell S6100- Fix PCA MUX attachment issue

* Update s6100_i2c_enumeration.sh

* Update s6100_i2c_enumeration.sh
2020-09-20 20:05:53 -07:00
Ying Xie
f041345e4e
[201811][bcm SAI] upgrade Broadcom SAI to 3.5.3.5-2 (#5405)
Including following Broadcom patches:
- CS00010869953, CS00010914668(KB29456), CS00010503275(KB0029315), CS00010914673(KB0029442)

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2020-09-18 14:54:54 -07:00
Wirut Getbamrung
ec07d10748
[201811][platform-cel]: Fix dx010 FSC error (#4949)
* [platform/cel-dx010]: add gpio init for fan direction

* [platform/cel-dx010]: remove invalid code on fancontrol service

* [platform/cel-dx010]: modify fancontrol service permission

* [platform/cel-dx010]: install fancontrol in pmon
2020-09-17 15:30:15 -07:00
Tamer Ahmed
b903c8e198
[dhcpmon] Print Both Snapshot And Current Counters (#5374)
Printing both the snapshot and current counter sets makes it easier to pinpoint
which message types are not being relayed. This PR prints both counter sets.
It also sets gnu11 as the C standard to compile with, to avoid
making changes when porting to the 201811 branch.

signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>
2020-09-16 09:57:36 -07:00
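As a rough illustration of why printing both counter sets helps (#5374), the sketch below compares a snapshot with the current counters and reports the message types whose count has not advanced. It is written in Python rather than dhcpmon's C, and the counter names and the `stalled_message_types` helper are hypothetical, not taken from dhcpmon.

```python
def stalled_message_types(snapshot, current):
    """Return the message types whose counter has not advanced since the snapshot."""
    return sorted(
        msg_type
        for msg_type, count in current.items()
        if count <= snapshot.get(msg_type, 0)
    )


# Hypothetical counter sets; real dhcpmon counters may be named differently.
snapshot = {"discover": 10, "offer": 10, "request": 9, "ack": 9}
current = {"discover": 14, "offer": 10, "request": 12, "ack": 9}

print(stalled_message_types(snapshot, current))
```

With both sets visible side by side, a counter that stays flat while the others grow points at the message type that is not being relayed.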
Tamer Ahmed
949bdee24e
[dhcpmon] Monitor Mgmt Interface For DHCP Packets (#5317)
When BGP routes are missing, DHCP packets get relayed over the mgmt
interface. This results in dhcpmon alerting that DHCP packets are
not being relayed. This PR includes the mgmt interface as an uplink
device, so if a DHCP packet gets relayed over the mgmt interface,
the regular dhcpmon alert will not be issued. Instead, dhcpmon will
check the mgmt interface counts and issue a separate alert regarding
packets travelling through the mgmt network.

In addition, this PR includes the following enhancements:
1. Add SIGUSR1 handler that prints out current packet counts
2. Increase alert grace window from the current 2 minutes to 3 minutes
3. Time is now computed more accurately
4. Print vlan name before counters

signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>
2020-09-16 09:57:36 -07:00
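The SIGUSR1 enhancement listed in #5317 can be pictured with the following minimal sketch. dhcpmon itself is written in C; this Python version only illustrates the idea of a signal handler that prints the current packet counts with the vlan name first. The counter names, the `Vlan1000` value, and all function names are hypothetical.

```python
import signal

# Hypothetical in-memory counters; dhcpmon keeps its own structures in C.
packet_counts = {"discover": 0, "offer": 0, "request": 0, "ack": 0}
printed_lines = []


def format_counts(vlan, counts):
    """Format the counters with the vlan name printed before them."""
    fields = ", ".join(f"{name}: {value}" for name, value in counts.items())
    return f"[{vlan}] {fields}"


def handle_sigusr1(signum, frame):
    """Print the current packet counts when SIGUSR1 is received."""
    line = format_counts("Vlan1000", packet_counts)
    printed_lines.append(line)
    print(line)


def install_sigusr1_handler():
    """Register the handler; must be called from the main thread."""
    signal.signal(signal.SIGUSR1, handle_sigusr1)


if __name__ == "__main__":
    install_sigusr1_handler()
```

Once the handler is installed, sending SIGUSR1 to the process (for example with `kill -USR1 <pid>`) prints one line of counters without interrupting monitoring.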
Ying Xie
db1ef65102
[201811][swss-common] advance swss-common sub module head (#5369)
* [201811][swss-common] advance swss-common sub module head

- Fix SubscriberStateTable::hasCachedData formula for a timing risk (#379)

Signed-off-by: Ying Xie <ying.xie@microsoft.com>

* Fix build of the unit test of SubscriberStateTable (#383)
2020-09-15 09:10:19 -07:00
Ying Xie
6c4914b62c
Revert "[dhcpmon] Monitor Mgmt Interface For DHCP Packets (#5317)"
This reverts commit 44d6e03df3.
2020-09-14 22:03:55 +00:00
Tamer Ahmed
44d6e03df3
[dhcpmon] Monitor Mgmt Interface For DHCP Packets (#5317)
When BGP routes are missing, DHCP packets get relayed over the mgmt
interface. This results in dhcpmon alerting that DHCP packets are
not being relayed. This PR includes the mgmt interface as an uplink
device, so if a DHCP packet gets relayed over the mgmt interface,
the regular dhcpmon alert will not be issued. Instead, dhcpmon will
check the mgmt interface counts and issue a separate alert regarding
packets travelling through the mgmt network.

In addition, this PR includes the following enhancements:
1. Add SIGUSR1 handler that prints out current packet counts
2. Increase alert grace window from the current 2 minutes to 3 minutes
3. Time is now computed more accurately
4. Print vlan name before counters

signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>
2020-09-14 16:56:01 +00:00
Guohan Lu
083607f4d1
[submodule]: update sonic-utilities
* 4d69425 2020-09-12 | [utilities] Define Explicit Dependency On Ipaddress Package (#1113) (HEAD, origin/201811) [Guohan Lu]

Signed-off-by: Guohan Lu <lguohan@gmail.com>
2020-09-12 23:08:39 -07:00
Ying Xie
6597bd8dea
[201811][utilities] advance utilities sub module head (#5339)
- [filter-fdb] Call Filter FDB Main From Within Test Code #1051 and #1059 (#1086)

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2020-09-08 14:35:22 -07:00
Blueve
55d2d15e4e
[conf] append nos-config-part for s6100 (#5234)
* [conf] append nos-config-part for s6100

* modify rc.local

Signed-off-by: Guohan Lu <lguohan@gmail.com>

* Update rc.local

Co-authored-by: Blueve <jika@microsoft.com>
Co-authored-by: Guohan Lu <lguohan@gmail.com>
Co-authored-by: Ying Xie <yxieca@users.noreply.github.com>
2020-09-08 19:30:33 +00:00
Ying Xie
6b75059b1d
[201811][kernel][utilities][sairedis] advance submodule heads (#5288)
- Kernel: [201811] Fix I2C ISMT DMA buffer alignment issue (#158)
- utilities: Fix pfcwd stats crash with invalid queue name (#1077)
- sairedis: [syncd] Fix notification on switch shutdown request (#638)

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2020-09-02 12:15:47 -07:00
zhenggen-xu
a99026acdc
[Build] pin down setuptools for build issues (#5280)
See: https://github.com/Azure/sonic-buildimage/issues/5279

Signed-off-by: Zhenggen Xu <zxu@linkedin.com>
2020-09-01 16:29:31 -07:00
Joe LeVeque
c909422abc
[caclmgrd] Always restart service upon process termination (#5065)
2020-08-31 20:31:13 +00:00
Joe LeVeque
4547ea022d
[caclmgrd] Improve code reuse (#4931)
Improve code reuse in `generate_block_ip2me_traffic_iptables_commands()` function.
2020-08-31 20:30:54 +00:00
Baptiste Covolato
c706a1079f
[arista/aboot]: Zero out 1st MB before repartitioning (#5220)
The first partition starting point was changed to be 1M as part of this
commit: 6ba2f97f1e. On systems that are misaligned before conversion
(partition start is the first sector), the relic of the old partition that is
left in the first MB can cause problems in Aboot and result in corruption
of the filesystem on the new aligned partition.

Zeroing this old relic makes sure that there is nothing left of the old
partition lying around. There won't be any risk of having Aboot corrupt
the new filesystem because of the old relic.

Signed-off-by: Baptiste Covolato <baptiste@arista.com>
2020-08-22 18:48:10 -07:00
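The zeroing step described in #5220 can be sketched as follows. This is not the Aboot conversion script; it is a minimal Python illustration that assumes "1st MB" means the first 1 MiB and that demonstrates the idea on a scratch file instead of a real block device. The `zero_first_mib` helper is hypothetical.

```python
import os
import tempfile

ONE_MIB = 1024 * 1024  # assumption: "1st MB" means the first 1 MiB


def zero_first_mib(path):
    """Overwrite the first 1 MiB of `path` with zeros, leaving the rest intact."""
    with open(path, "r+b") as dev:
        dev.write(b"\x00" * ONE_MIB)
        dev.flush()
        os.fsync(dev.fileno())


# Demonstrate on a 2 MiB scratch file filled with 0xff bytes.
with tempfile.NamedTemporaryFile(delete=False) as scratch:
    scratch.write(b"\xff" * (2 * ONE_MIB))
    scratch_path = scratch.name

zero_first_mib(scratch_path)

with open(scratch_path, "rb") as f:
    data = f.read()
os.remove(scratch_path)

print(data[:ONE_MIB].count(0) == ONE_MIB, len(data))
```

Opening the target in `r+b` mode writes in place without truncating, so only the region that held the old, misaligned partition start is cleared.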
Santhosh Kumar T
a2cb92056a
Dell S6100 Port I2C changes to 201811 branch (#5150)
* Dell S6100 Port I2C changes to 201811 branch

* Update s6100_i2c_enumeration.sh
2020-08-18 14:38:28 -07:00
zhenggen-xu
e1e97199e3
[201811 Monit] Enable monitoring of SWSS daemons (#5144)
Signed-off-by: Zhenggen Xu <zxu@linkedin.com>
2020-08-13 20:42:06 -07:00
Samuel Angebault
5d891d8832
[201811][Arista] Update arista driver submodules (#5149)
2020-08-12 09:31:58 -07:00
pavel-shirshov
e03ce8ba14
Clarify error message for bgpcfgd update loopback address (#5076)
2020-07-31 07:46:47 -07:00
pavel-shirshov
459c29cfaa
[bgpcfgd]: Fix bgpcfgd crash on reset Loopback0 ip addresses (#5050)
Fix an error that causes bgpcfgd to crash on an invalid ip address. Before the fix, when either the loopback ipv4 or ipv6 address was already set and bgpcfgd received another "SET" message for the already-set loopback ip address, bgpcfgd sent a syslog message about an ambiguous ip address (despite the fact that the address was good) and crashed. With this change, if bgpcfgd receives an ip address that is already set, it sends a message to the syslog and returns from the handler.
2020-07-28 12:18:07 -07:00
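The behavior change in #5050 amounts to making the "SET" handler tolerate a repeated address. The sketch below is not bgpcfgd's code; the `LoopbackState` class, its method names, and the log text are hypothetical, and replacing a different address of the same IP version is an assumption the commit message does not cover.

```python
import ipaddress


class LoopbackState:
    """Tracks one Loopback0 address per IP version (hypothetical helper)."""

    def __init__(self):
        self.addresses = {}  # IP version (4 or 6) -> address object
        self.log = []        # stands in for syslog in this sketch

    def handle_set(self, address):
        """Apply a SET message; return False if the address was already set."""
        ip = ipaddress.ip_interface(address).ip
        if self.addresses.get(ip.version) == ip:
            # Already set: log and return from the handler instead of crashing.
            self.log.append(f"Loopback0 ipv{ip.version} address {ip} is already set")
            return False
        # Assumption: a new or different address simply replaces the stored one.
        self.addresses[ip.version] = ip
        return True


state = LoopbackState()
print(state.handle_set("10.1.0.32/32"))  # first SET is applied
print(state.handle_set("10.1.0.32/32"))  # repeated SET is logged and ignored
```

Returning early on a repeated address keeps the handler idempotent, so a replayed "SET" message no longer takes the daemon down.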
Joe LeVeque
6120145bf1
[caclmgrd] remove default DROP rule on FORWARD chain (#5034)
2020-07-24 19:09:32 +00:00
zzhiyuan
59072a627b
[201811][Arista] Update 201811 branch with Arista syseeprom fix (#5016)
If a device had a master or 201911 image and then installed a 201811 image, the result could be a prefdl that the 201811 Arista code did not process properly.

This commit is already on the 201911 and master branches.

Co-authored-by: Zhi Yuan Carl Zhao <zyzhao@arista.com>
2020-07-22 10:57:18 -07:00
Joe LeVeque
cf142e7e6c
[caclmgrd] Filter DHCP packets based on dest port only (#4995)
2020-07-17 18:17:27 +00:00
Ying Xie
a37a7d3dcf
[201811][snmpagent] advance snmpagent submodule head (#4988)
- [psutil] pin psutil version to 5.7.0.

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2020-07-17 06:53:35 -07:00
Joe LeVeque
3d31ef3a0c
[201811][sonic-platform-daemons] Update submodule (#4974)
2020-07-14 19:12:41 -07:00
pavel-shirshov
b7a0669f36
[201811][quagga]: Use 201811 branch of sonic-quagga (#4966)
sonic-quagga was using a utility from the master branch of sonic-buildimage. I had to create a 201811 branch in sonic-quagga that works with the 201811 branch of sonic-buildimage.
2020-07-14 10:09:11 -07:00
pavel-shirshov
8a78ff6944
[quagga]: Update sonic-quagga (#4962)
The sonic-quagga repository has a new fix. Update the submodule to bring the fix into the image.
2020-07-13 23:14:02 -07:00
Ying Xie
0a1f043b02
[201811][utilities] advance utilities submodule head (#4947)
- [filter-fdb] Fix For Vlan Defined With No CIDR (#976)

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2020-07-11 21:11:36 -07:00
zzhiyuan
a43eec53b7
[201811][Arista] Update Arista submodules (#4939)
Fix the get_transceiver_change_event method to abide by the function description: return a True status and use a timeout in milliseconds.

Co-authored-by: Zhi Yuan Carl Zhao <zyzhao@arista.com>
2020-07-09 22:55:06 -07:00
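The contract that #4939 refers to (a True status and a timeout expressed in milliseconds) can be sketched as below. This is not the Arista implementation. The `poll` callback is a hypothetical stand-in for the platform code that reads transceiver presence, and treating `timeout=0` as "wait indefinitely" and returning an empty dict on timeout are assumptions of this sketch.

```python
import time


def get_transceiver_change_event(poll, timeout=0):
    """Wait for a transceiver change.

    poll: hypothetical callable returning a dict of changed ports (may be empty).
    timeout: milliseconds to wait; 0 is assumed to mean wait indefinitely.
    Returns (status, port_dict), with status True on both change and timeout.
    """
    deadline = None if timeout == 0 else time.monotonic() + timeout / 1000.0
    while True:
        changes = poll()
        if changes:
            return True, changes
        if deadline is not None and time.monotonic() >= deadline:
            return True, {}
        time.sleep(0.01)  # short pause between polls


# No change within 50 ms: status is still True, with an empty port dict.
print(get_transceiver_change_event(lambda: {}, timeout=50))
```

Dividing by 1000 at the boundary keeps the caller-facing unit in milliseconds while the loop works in seconds internally.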
Ying Xie
ecd93eb8ab
[201811][swss] advance swss submodule head (#4935)
[aclorch] Use IPv6 Next Header internally for protocol number on MLNX platform (#1343)
Add/Del lag_name_map item according to lag adding and removing (#1124)

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2020-07-09 15:42:58 -07:00
Wirut Getbamrung
c9d86f0587
[201811][platform/cel]: Port fancontrol changes for dx010/e1031 to 201811 branch (#4867)
Update the fancontrol service for Seastone-DX010/E1031 devices to support a hysteresis temperature threshold and a different config for each fan direction type (B2F/F2B), following the master branch.
2020-07-03 19:59:55 -07:00
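A hysteresis threshold with a separate config per fan direction, as mentioned in #4867, can be sketched like this. The temperatures, the `FAN_CONFIG` table, and the `fan_should_run_high` helper are all hypothetical; the actual fancontrol configuration for DX010/E1031 is not shown in the commit message.

```python
# Hypothetical per-direction settings, in degrees Celsius.
FAN_CONFIG = {
    "B2F": {"threshold_c": 60.0, "hysteresis_c": 5.0},
    "F2B": {"threshold_c": 55.0, "hysteresis_c": 3.0},
}


def fan_should_run_high(direction, temp_c, currently_high):
    """Decide the fan state using a hysteresis band below the threshold."""
    cfg = FAN_CONFIG[direction]
    if temp_c >= cfg["threshold_c"]:
        return True   # at or above the threshold: run high
    if temp_c <= cfg["threshold_c"] - cfg["hysteresis_c"]:
        return False  # clearly below the band: run low
    return currently_high  # inside the band: keep the current state


print(fan_should_run_high("B2F", 57.0, True))   # stays high inside the band
print(fan_should_run_high("B2F", 57.0, False))  # stays low inside the band
```

Keeping the previous state inside the band prevents the fan from toggling when the temperature hovers around the threshold.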
Guohan Lu
d04ad415b4
[docker-config-engine]: lockdown netaddr,ipaddr,jinja pip version
Signed-off-by: Guohan Lu <lguohan@gmail.com>
2020-06-25 06:58:02 +00:00
Ying Xie
6fc62208d5
[201811][utilities] advance utilities sub module head (#4844)
[filter-fdb] Check VLAN Presence When Filter FDB (#957)
[mellanox] enable watchdog before fast-reboot (#844)

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2020-06-24 10:15:23 -07:00
Tamer Ahmed
ab3400f217
[fast-reboot] Back up FDB/ARP/Default routes (#4795)
FDB/ARP/Default routes files are deleted after swssconfig. This
makes debugging/validation of device conversion hard. This PR
saves those files in order to facilitate debugging of device conversion.

signed-off-by: Tamer Ahmed <tamer.ahmed@microsoft.com>
2020-06-21 02:41:39 +00:00
padmanarayana
062fd849b3
[DELL]: FTOS to SONiC fast conversion fixes (#4807)
While migrating to SONiC 20181130, we identified a couple of issues:
1. union-mount needs /host/machine.conf parameters for vendor-specific checks. However, in case of migration, /host/machine.conf is extracted from ONIE only in https://github.com/Azure/sonic-buildimage/blob/master/files/image_config/platform/rc.local#L127.
2. Since grub.cfg is updated to have net.ifnames=0 biosdevname=0, 70-persistent-net.rules changes are no longer required.
2020-06-19 22:35:29 +00:00
Joe LeVeque
d9b8bed916
[caclmgrd] Don't limit connection tracking to TCP (#4796)
Don't limit iptables connection tracking to TCP protocol; allow connection tracking for all protocols. This allows services like NTP, which is UDP-based, to receive replies from an NTP server even if the port is blocked, as long as it is in reply to a request sent from the device itself.
2020-06-19 04:33:50 +00:00
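The effect of #4796 can be illustrated by the difference between a TCP-only connection tracking rule and a protocol-agnostic one. The sketch below builds the command strings only and does not run iptables. The exact rules caclmgrd emits are not shown in the commit message, so the chain, the match states, and the `conntrack_accept_rules` helper are assumptions.

```python
def conntrack_accept_rules(limit_to_tcp=False):
    """Build accept rules for reply traffic (hypothetical helper).

    With limit_to_tcp=True the rules match TCP only; otherwise they apply to
    all protocols, so UDP replies (for example from an NTP server) also pass.
    """
    rules = []
    for tool in ("iptables", "ip6tables"):
        rule = [tool, "-A", "INPUT"]
        if limit_to_tcp:
            rule += ["-p", "tcp"]
        rule += ["-m", "conntrack", "--ctstate", "ESTABLISHED,RELATED", "-j", "ACCEPT"]
        rules.append(" ".join(rule))
    return rules


for command in conntrack_accept_rules():
    print(command)
```

Dropping the `-p tcp` match is what lets replies to requests sent from the device itself through, even when the port is otherwise blocked.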