Commit Graph

639 Commits

Author SHA1 Message Date
Santhosh Kumar T
e481918e98 [DellEMC] Master: S6100: SSD upgrade status: Moving from smartctl to iSMART (#12784)
Why I did it
smartctl tool is available only in PMON docker. Hence, the tool may be not accessible incase PMON docker goes down.
Using iSMART_64 tool to fetch the SSD firmware version and device model information.

How I did it
Replacing smartctl with iSMART_64.
2023-01-25 21:06:57 +00:00
Arun Saravanan Balachandran
beb9eac1c7
[201811] DellEMC S6100 - Port SSD upgrade status checker (#10955)
Why I did it
To port DellEMC S6100 SSD upgrade status checker changes from master (based on #7289) to 201811 branch

Handle newer SSD firmware version (S210506G - 3IE devices)
Recover SSD upgrade state if in case ssd_fw_upgrade folder got deleted
2022-06-14 17:14:31 -07:00
Ying Xie
8e6d08a6b5
[bcmsai] upgrade Broadcom SAI to 3.5.3.8 (#9955)
Including following fixes:
CS00012225760, CS00012212820, CS00012215529, CS00012218100

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2022-02-10 17:05:26 -08:00
Samuel Angebault
70d87ccff4
[201811][Arista] Backport logrotate configuration (#9456)
Backport logrotate configuration for arista*.log files
2021-12-10 08:31:30 -08:00
Ying Xie
1823b853ff
[201811][sai bcm] upgrade Broadcom SAI to version 3.5.3.7-6 (#8902)
- Pick up Broadcom patches for CS00011222060 and CS00012204923.

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2021-10-10 20:17:31 -07:00
Aravind Mani
54e32c1f30
[201811] Dell S6100:Add serial-getty service to monit (#8410)
Why I did it
serial-getty service exited in Dell S6100 device randomly.

How I did it
Added serial-getty to monit services.

How to verify it
Stop serial-getty in ssh session and check whether the service restarts or not.
2021-08-19 10:14:03 -07:00
Ying Xie
871c78244a
[201811] upgrace broadcom SAI to version 3.5.3.7-4 (#7722)
- CS00012108770, CS00012115932: Sesto phy WB support + WB recovery + log messages

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2021-05-26 16:59:11 -07:00
Santhosh Kumar T
1952afdeed
[201811] DellEMC S6100 SSD Monitor porting changes (#7290)
Why I did it
To monitor the SSD health condition in DellEMC S6100 platform post upgrade.
A daemon is introduced to monitor the SSD every one hour.
To check for SSD status at boot time and at the time of cold-reboot.
All these changes are supported only for newer SSD firmware.
Porting changes from 201911 branch
Added a platform_reboot_pre_check script to prevent cold-reboot based on SSD status.
Depends on Azure/sonic-utilities#1557
2021-05-04 09:44:24 -07:00
Ying Xie
fa036c0ead
[201811][bcm sai] ugprade Broadcom SAI to 3.5.3.7-2 (#7385)
* 1d588e38 2021-03-25 | Pending changes for CS00011808451 (bcm_sai/REL_3.5.3, INT_3.5) [BrcmSAI]
* 53efbe91 2021-03-18 | Fix for CS00011810218 [BRCM-DevOps]
* ca0dc86c 2021-03-11 | Fix for CS00011808451 (msazure/INT_3.5, bcm_sai/REL_3.5.3, INT_3.5) [BrcmSAI]
* 208ea62e 2021-01-08 | Update changelog [BRCM-DevOps]
* 3961673c 2021-01-08 | Update version after sdk patch and egress obj leak fixes. [BRCM-DevOps]
* ce3818ad 2021-01-08 | Fix for CS00011581499 [BrcmSAI]
* 1e06157f 2021-01-08 | Merged SDk patch for CS00011651922 [BrcmSAI]
* 4d0c2c86 2021-04-06 | Merged PR 4328641: Add support for AN/LT (HEAD -> 3.5-201811, msazure/3.5-201811) [Joe LeVeque]

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2021-04-20 15:49:14 -07:00
Nazarii Hnydyn
72c506061c
[Mellanox] Improve FW upgrade logging: add syslog logger (#7221)
Basically mlnx-fw-upgrade.sh is used in two places:
1. https://github.com/Azure/sonic-buildimage/blob/201811/files/scripts/syncd.sh#L109
```bash
        /usr/bin/mst start
        /usr/bin/mlnx-fw-upgrade.sh
        /etc/init.d/sxdkernel start
        /sbin/modprobe i2c-dev
```
2. https://github.com/Azure/sonic-buildimage/blob/201811/device/mellanox/x86_64-mlnx_msn2700-r0/platform_reboot#L32
```bash
ParseArguments "$@"

${FW_UPGRADE_SCRIPT} --upgrade --verbose
EXIT_CODE="$?"
```

In first case the `stdout` is redirected to `syslog` directly by `systemd`.
Thus, the `syslog` logger is only required in second case.

#### Why I did it
* To improve ASIC/CPLD FW upgrade logging
* To improve CPLD upgrade time

#### How I did it
* Added `syslog` logger support
* Replaced `_pciconf0` -> `_pci_cr0` to reduce CPLD upgrade time

#### How to verify it
1. mlnx-fw-upgrade.sh --upgrade
2021-04-07 10:57:12 -07:00
rkdevi27
6c2fd18f51
Fixed S6000 abrupt reboot in 201811 (#6923)
Why I did it
The S6000 devices, the cold reboot is abrupt and it is likely to cause issues which will cause the device to land into EFI shell. Hence the platform reboot will happen after graceful unmount of all the filesystems as in S6100.

How I did it
Moved the platform_reboot to platform_reboot_override and hooked it to the systemd shutdown services as in S6100.
Fixed the "/host unmount failed" issue as well in 201811.

How to verify it
Issue "reboot" command to verify if the reboot is happening gracefully.
2021-03-12 11:09:54 -08:00
Wirut Getbamrung
37fc4d4afb
[platform/cel]: Fixed iSMT SMBUS conflict in Seastone-DX010. (#6889)
#### Why I did it
- The iSMT SMBUS I2c bus number conflicts in different kernel versions.

#### How I did it
- Add I2cbus number detector for iSMT bus
- Replace iSMT bus number in fancontrol config
2021-03-10 10:38:09 -08:00
Volodymyr Samotiy
7d0c9f4b3b
[Mellanox] Update SDK repo pointer for kernel package v4.3.1646 with kernel v4.9.0-14 (#6719)
To update rebuilt Mellanox SDK kernel package v4.3.1646 with kernel v4.9.0-14.

Signed-off-by: Volodymyr Samotiy <volodymyrs@nvidia.com>
2021-02-08 08:33:44 -08:00
Volodymyr Samotiy
3d7f4fc45d
[Mellanox] Update SDK kernel package v4.3.1646 with kernel v4.9.0-14 (#6647)
Updated commit hash pointer in the relevant Makefile for the repository which contains SDK packages.

Signed-off-by: Volodymyr Samotiy <volodymyrs@nvidia.com>
2021-02-02 16:35:19 -08:00
lguohan
7e282b89a2 [build]: wait for conflicts package to be uninstalled (#5039)
when parallel build is enabled, both docker-fpm-frr and docker-syncd-brcm
is built at the same time, docker-fpm-frr requires swss which requires to
install libsaivs-dev. docker-syncd-brcm requires syncd package which requires
to install libsaibcm-dev.

since libsaivs-dev and libsaibcm-dev install the sai header in the same
location, these two packages cannot be installed at the same time. Therefore,
we need to serialize the build between these two packages. Simply uninstall
the conflict package is not enough to solve this issue. The correct solution
is to have one package wait for another package to be uninstalled.

For example, if syncd is built first, then it will install libsaibcm-dev.
Meanwhile, if the swss build job starts and tries to install libsaivs-dev,
it will first try to query if libsaibcm-dev is installed or not. if it is
installed, then it will wait until libsaibcm-dev is uninstalled. After syncd
job is finished, it will uninstall libsaibcm-dev and swss build job will be
unblocked.

To solve this issue, _UNINSTALLS is introduced to uninstall a package that
is no longer needed and to allow blocked job to continue.

Signed-off-by: Guohan Lu <lguohan@gmail.com>
2021-01-20 23:06:13 -08:00
lguohan
4e08f4dcc3
[docker-orchagent]: make build depends only on sairedis package (#6467)
backport c4b5b002c3

make swss build depends only on libsairedis instead of syncd. This allows to build swss without depending
on vendor sai library.

Currently, libsairedis build also buils syncd which requires vendor SAI lib. This makes difficult to build
swss docker in buster while still keeping syncd docker in stretch, as swss requires libsairedis which also
build syncd and requires vendor to provide SAI for buster. As swss docker does not really contain syncd
binary, so it is not necessary to build syncd for swss docker.

[submodule]: update sonic-sairedis

* 9a66890 2020-06-28 | [build]: add option to build without syncd (HEAD -> 201811, origin/201811) [Guohan Lu]

Signed-off-by: Guohan Lu <lguohan@gmail.com>
2021-01-16 13:12:14 -08:00
lguohan
bcda39f394
[sonic-linux-kernel]: update kernel to 4.9.246 (#6461)
kernel ABI from 4.9.0-12 -> 4.9.0-14

Signed-off-by: Guohan Lu <lguohan@gmail.com>
Co-authored-by: Samuel Angebault <angebault.samuel@gmail.com>
2021-01-16 12:33:23 -08:00
Ying Xie
abdbda9435
[201811][bcm SAI] ugprade Broadcom SAI to version 3.5.3.6-2 (#6400)
- Rebase to Broadcom release 3.5.3.6.
- Taking fixes for: CS00011229318, CS00010775359, CS00011331832, CS00011444035, CS00011222060 and CS00010318905
- Taking CS00011581499 patch from Broadcom.

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2021-01-08 16:29:57 -08:00
Wirut Getbamrung
a416f49676
[platform/cel-haliburton]: add watchdog service (#6259)
Haliburton needed watchdog daemon to monitor the basic health of a machine. If something goes wrong, such as a crashing program overloading the CPU, or no more free memory on the system, watchdog can safely reboot the machine,
2020-12-26 03:04:21 -08:00
Volodymyr Samotiy
d609b406be
[Mellanox] Update SAI to version 1.14.3 (#6156)
* [SAI] Add PFC pause duration counters in microseconds
**- Why I did it**
To add PFC pause duration counters in microseconds
**- How I did it**
Updated SAI to version 1.14.3
**- How to verify it**

**- Description for the changelog**
[Mellanox] Update SAI to version 1.14.3
2020-12-14 23:51:14 -08:00
Shi Su
40bd77c915
201811][syncd] Fix directory mount for vs syncd docker (#6200)
Since DOCKER_SYNCD_VS is no longer being used, the mount option does not properly mount the warmboot file directory. Fix the mount option so that the directory is properly mounted.
2020-12-13 22:42:02 -08:00
Ying Xie
93302d1810
[bcm SAI] Upgrade Broadcom SAI to version 3.5.3.5-3 (#5734)
- Include change to CS00011229318.

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2020-10-29 08:53:29 -07:00
Neetha John
6b96b8e4ac
[knet] Address Tx drop No DMA resource issue (#5727)
Signed-off-by: Neetha John <nejo@microsoft.com>
2020-10-28 08:29:16 -07:00
Aravind Mani
701a304b6d [Dell S6100] Properly release memory upon ICH driver deinit (#5561)
During platform deinitialization, dell_ich is not removed properly and when we do initialize s6100 platform, ICH driver sysfs attributes are not attached. Because of this, get_transceiver_change_event returns error and this leads xcvrd to crash.
2020-10-14 18:48:45 +00:00
Ying Xie
4c80d996ba
Revert "[201811][platform/cel]: Port fancontrol changes for dx010/e1031 to 201811 branch (#4867)" (#5496)
This reverts commit c9d86f0587.
2020-09-29 18:02:47 -07:00
Ying Xie
81e9ec6be6
Revert "[201811][platform-cel]: Fix dx010 FSC error (#4949)" (#5485)
This reverts commit ec07d10748.
2020-09-29 08:01:02 -07:00
Aravind Mani
7e6fa15784
Dell S6100 fix mux log issue (#5413)
IOM completion log was not seen in syslog.
2020-09-21 12:19:07 -07:00
Aravind Mani
bee516e370
Dell S6100- Fix PCA MUX attachment issue (#5401)
* Dell S6100- Fix PCA MUX attachment t issue

* Update s6100_i2c_enumeration.sh

* Update s6100_i2c_enumeration.sh
2020-09-20 20:05:53 -07:00
Ying Xie
f041345e4e
[201811][bcm SAI] ugprade Broadcom SAI to 3.5.3.5-2 (#5405)
Including following Broadcom patches:
- CS00010869953, CS00010914668(KB29456), CS00010503275(KB0029315), CS00010914673(KB0029442)

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2020-09-18 14:54:54 -07:00
Wirut Getbamrung
ec07d10748
[201811][platform-cel]: Fix dx010 FSC error (#4949)
* [platform/cel-dx010]: add gpio init for fan direction

* [platform/cel-dx010]: remove invalid code on fancontrol service

* [platform/cel-dx010]: modify fancontrol service permission

* [platform/cel-dx010]: install fancontrol in pmon
2020-09-17 15:30:15 -07:00
Santhosh Kumar T
a2cb92056a
Dell S6100 Port I2C changes to 201811 branch (#5150)
* Dell S6100 Port I2C changes to 201811 branch

* Update s6100_i2c_enumeration.sh
2020-08-18 14:38:28 -07:00
zhenggen-xu
e1e97199e3
[201811 Monit] Enable monitoring of SWSS daemons (#5144)
Signed-off-by: Zhenggen Xu <zxu@linkedin.com>
2020-08-13 20:42:06 -07:00
Samuel Angebault
5d891d8832
[201811][Arista] Update arista driver submodules (#5149) 2020-08-12 09:31:58 -07:00
zzhiyuan
59072a627b
[201811][Arista] Update 201811 branch with Arista syseeprom fix (#5016)
If a device had a master or 201911 image then installed a 201811 image, it could result in a prefdl that was not properly processed by 201811 Arista code.

This is a commit that was on 201911 and master branch.

Co-authored-by: Zhi Yuan Carl Zhao <zyzhao@arista.com>
2020-07-22 10:57:18 -07:00
zzhiyuan
a43eec53b7
[201811][Arista] Update Arista submodules (#4939)
Fix the method get_transceiver_change_event to abide by the function description, return True status and use timeout in milliseconds.

Co-authored-by: Zhi Yuan Carl Zhao <zyzhao@arista.com>
2020-07-09 22:55:06 -07:00
Wirut Getbamrung
c9d86f0587
[201811][platform/cel]: Port fancontrol changes for dx010/e1031 to 201811 branch (#4867)
Update fancontrol service for Seastone-DX010/E1031 device to support hysteresis temperature threshold and difference config for each unit fan direction type (B2F/F2B); follow master branch
2020-07-03 19:59:55 -07:00
Ying Xie
d433e529fd
[bcm SAI] upgrade Broadcom SAI to version 3.5.3.5-1 (#4739)
- Broadcom SAI 3.5 GA code drop on 20200608.

Changes:
- CS9533198
- CS10283709
- CS00009716645
- CS00010389861
- CS00010406122
- CS00010503275
- Addressed a few memory leak issues.
- Addressed an array memory allocation issue.
- Addressed assert during SER handling.

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2020-06-10 01:29:39 -07:00
Santhosh Kumar T
e6312e72f2 [DellEMC] S6000 Disable Low power mode by default (#4592) 2020-06-09 16:06:00 +00:00
Wirut Getbamrung
9f8d691d4e
[platform/cel]: Backport reboot cause API to 201811 branch (#4619)
Add reboot cause API to support process-reboot-cause.service
Implement chassis.get_reboot_cause platform API
2020-05-26 02:27:03 -07:00
Renuka Manavalan
de05770895
Extend debug image build ability to all platforms. (#3134) (#4524) 2020-05-04 09:48:40 -07:00
Danny Allen
d2acc7f9b0
[syncd-rpc] Fix PTF dependency in syncd-rpc image (#4438)
Signed-off-by: Danny Allen <daall@microsoft.com>
2020-04-17 10:13:32 -07:00
Stepan Blyshchak
fd2ca95f06
[mlnx] add option to build sdk from sources (#4378)
Signed-off-by: Stepan Blyschak <stepanb@mellanox.com>
2020-04-13 08:29:54 -07:00
Nazarii Hnydyn
096a0e1e18
[mellanox]: Add SSD FW update tool (#4352)
* [mellanox]: Add SSD FW update tool.

Signed-off-by: Nazarii Hnydyn <nazariig@mellanox.com>

* [mellanox]: Update SSD tool.

Signed-off-by: Nazarii Hnydyn <nazariig@mellanox.com>
2020-04-13 18:12:16 +03:00
Stepan Blyshchak
46e5753656
[mellanox] update SDK to 4.3.1646 (#4381)
* fixes an issue when /host/warmboot/issu_bank.txt is empty/corrupted
switch is not able to over come this and enters continuos reload/reboot
failure.

Signed-off-by: Stepan Blyschak <stepanb@mellanox.com>
2020-04-07 15:01:46 +03:00
Stepan Blyshchak
4bb2190ba6
[mellanox] update CPLD archive (#4349)
New CPLD includes support for watchdog type 3 with maximum timeout 65536 sec.

Signed-off-by: Stepan Blyschak <stepanb@mellanox.com>
2020-04-03 03:47:40 -07:00
Joe LeVeque
cbf7c7d80d [rsyslog] Suppress duplicate messages from base image and all Docker containers (#2497) 2020-04-02 21:42:01 +00:00
Stepan Blyshchak
a4dd0aa09f
[mellanox] add hardware watchdog script (#4274)
admin@sonic:~$ sudo hw-management-wd.sh
Usage: hw-management-wd.sh start [timeout] | stop | tleft | check_reset | help
start - start watchdog
        timeout is optional. Default value will be used in case if it's omitted
        timeout provided in seconds
stop - stop watchdog
tleft - check watchdog timeout left
check_reset - check if previous reset was caused by watchdog
        Prints only in case of watchdog reset
help -this help

Signed-off-by: Stepan Blyschak <stepanb@mellanox.com>
2020-03-31 20:34:55 -07:00
Volodymyr Samotiy
c8d8f1c6f0
[Mellanox]: Update FW to 13.2000.1658 version (#4343)
This new FW version includes the following fixes:

SFP thermal shutdown issue

Signed-off-by: Volodymyr Samotiy <volodymyrs@mellanox.com>
2020-03-31 20:33:25 -07:00
Samuel Angebault
6c48a3f113
[arista] Update drivers submodule (#4272)
- Add mac address check for management port
- Fix sfputil reset logic
2020-03-17 21:14:48 -07:00
noaOrMlnx
aa73dbd6e6 [Mellanox] Update MFT version to 4.13.5 (#4199) 2020-03-02 18:01:35 +00:00