Commit Graph

246 Commits

Author SHA1 Message Date
xumia
9278653d24
[201803] Change submodule path from Azure to sonic-net (#12318)
Why I did it
Change the path of sonic submodules that point to "Azure" to point to "sonic-net"

How I did it
Replace "Azure" with "sonic-net" on all relevant paths of sonic submodules
2022-10-12 21:06:35 +08:00
Wenda Ni
8b45d2328e In the case of upgrade, have pfcwd enabled on the upgraded sonic (#2192)
Signed-off-by: Wenda <wenni@microsoft.com>
2019-07-03 18:59:06 +00:00
Qi Luo
f93bd6ff37 [security] Upgrade kernel from 3.16.57-2 to 3.16.68-2 on 201803 branch (#3052)
* [security] Upgrade kernel from 3.16.57-2 to 3.16.68-2 on 201803 branch
2019-06-23 11:53:40 -07:00
Joe LeVeque
769d3b303c [201803] [logrotate] Enhance robustness (#2941)
* [logrotate] Decrease frequency to every 10 minutes; kill any lingering logrotate processes

* [logrotate] Decrease usable space to 90%; Delete all *.1.gz files as firstaction
2019-05-25 18:00:33 -07:00
Renuka Manavalan
ca4f587597 [tacacs]: skip accessing tacacs servers for local non-tacacs users (#2843)
* Switch the nss look up order as "compat" followed by "tacplus".
This helps use the legacy passwd file for user info and go to tacacs only if not found.
This means, we never contact tacacs for local users like "admin".
This isolates local users from any issues with tacacs servers.
W/o this fix, the sudo commands by local users could take <count of servers> * <tacacs timeout> seconds, if the tacacs servers are unreachable.

* Skip tacacs server access for local non-tacacs users.
Revert the order of 'compat tacplus' to original 'tacplus compat' as tacplus
access is required for all tacacs users, who also get created locally.
2019-05-17 21:01:03 +00:00
Qi Luo
ac35206447 [docker-base]: Remove unused packages in docker images and host (#2807) (#2837)
* Remove unneeded packages in docker images and host
* Remove libpython3.6 from snmp docker image
2019-05-01 08:05:08 -07:00
Guohan Lu
9299a249d3 Revert "[201803] [services] Restart SwSS service upon unexpected critical process exit (#2546)"
This reverts commit 2a8af2705e.
2019-04-06 22:21:30 +00:00
Guohan Lu
968a0dfbd0 Revert "[201803] Restart SwSS, syncd and dependent services if a critical process in syncd container exits (#2615)"
This reverts commit 9e8518c855.
2019-04-06 22:21:16 +00:00
Joe LeVeque
99ed954203 [201803] Modify Debian apt repos to reflect changes made by maintainers (#2709)
* [201803][apt] Point to archive.debian.org for jessie-backports; remove jessie-updates

* Add no-check-valid-until for apt in base image

* Add no-check-valid-until for apt in docker-base

* [sonic-slave] Remove jessie-backports, as it is unused in the slave container
2019-03-27 11:23:38 -07:00
Ying Xie
b38bab0e4b [bde driver] black list linux_kernel_bde driver (#2284)
This driver should be loaded by sonic service. If kernel tries to load
it, the driver would be loaded with default parameters, which is not
right for sonic.

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2019-03-18 21:28:35 +00:00
Wenda Ni
d904b3e596 Add hook to allow customizing link cable lengths
Signed-off-by: Wenda Ni <wenni@microsoft.com>
2019-03-09 13:43:32 +00:00
Joe LeVeque
9e8518c855 [201803] Restart SwSS, syncd and dependent services if a critical process in syncd container exits (#2615)
* Add process exit listeners to syncd containers so that containers stop upon unexpected critical process exit

* swss.service unit file now starts both swss and syncd containers as ExecStartPre steps, and for ExecStart, it indefinitely checks that the containers are running

* Add 'docker-wait-any' script, use it to wait for containers to exit
2019-03-06 23:49:58 -08:00
Joe LeVeque
2a8af2705e [201803] [services] Restart SwSS service upon unexpected critical process exit (#2546)
* [service] Restart SwSS Docker container if orchagent exits unexpectedly

* [dhcp_relay] Use STATE_DB to determine whether interfaces are ready

* Supervisor now autorestarts rsyslogd upon unexpected exit

* Add other critical processes to event listener

* Make supervisor-proc-exit-listener script global, have it read from 'critical_processes' file inside container

* Add SwSS to 'WantedBy=' option of services which should be started along with SwSS
2019-02-26 11:26:14 -08:00
Praveen Chaudhary
e117802c51 [ntp]: Do not disable reader for error ENOBUFS (#2529)
Need to build NTP 4.2.6 locally due to a Bug in NTP 4.2.6.

Changes:
1.) Remove 'apt-get ntp' step from build_debian.sh.
2.) Add NTP package as part of base image in slave.mk.
3.) NTP Makefile for Sonic Build System rules/ntp.mk.
4.) NTP Source Makefile src/ntp/Makefile.
5.) Patch to fix the issure src/ntp/patch.

Signed-off-by: Praveen Chaudhary<pchaudhary@linkedin.com>

* [src/ntp/Makefile]: DSC file URL points to sonic storage.

Signed-off-by: Praveen Chaudhary<pchaudhary@linkedin.com>

* [src/ntp/Makefile]: Changes to build with sonic blob.
2019-02-06 17:33:56 -08:00
Wenda Ni
8fda69389e [QoS]: Qos config change (#2504)
* a7050-QX-32S, a7050-QX32, s6000: 1) dscp mapping; 2) DWRR

a7060-Q32, a7060-C32, a7060-D48C8, s6100, z9100-C32, z9100-D48C8, a7260-D108C8 mmu setting change:
1) Update headroom pool size; Update ingress service pool and egress lossy pool sizes accordingly;
2) Add DSCP 48 to TC 6 mapping;
3) scheduler weight redistribute

Signed-off-by: Wenda Ni <wenni@microsoft.com>

* Correct format

Signed-off-by: Wenda <wenni@microsoft.com>

* Link queue 2 to lossy buffer
Add scheduling weight to queue 2

Signed-off-by: Wenda <wenni@microsoft.com>

* Link pg 2 to lossy buffer

Signed-off-by: Wenda <wenni@microsoft.com>

* Add tc to pg, tc to queue, pfc to queue mapping for 2, 6 (and 7)

Signed-off-by: Wenda <wenni@microsoft.com>
2019-02-01 17:25:10 -08:00
Joe LeVeque
c82be495b9 [caclmgrd] Don't crash if we find empty/null rule_props (#2475)
* [caclmgrd] Don't crash if we find empty/null rule_props
2019-01-25 20:44:01 +00:00
shawnli-van
69d5d61eb8 [devices]: Fix arista-convertfs for aligning flash via sfdisk in Jessie (#2402) (#2406)
The sfdisk in Jessie use unit Cylinder by default. To perform 1M
aligning partition, 2048 sector unit is desirable instead.
2018-12-31 09:40:11 -08:00
Guohan Lu
2933665c16 [baseimage]: force create symlink in union-fsck
fix error in update-initramfs when install kernel package in sonic

Signed-off-by: Guohan Lu <gulv@microsoft.com>
2018-12-14 09:30:27 +00:00
Joe LeVeque
5479286502 [boot] Start ntp-config service after all Docker containers are started (#2303) 2018-12-11 14:25:20 +00:00
Qi Luo
6a24eb4151 Fix redis-py version to 2.10.6 (#2273)
* Fix redis-py version

Signed-off-by: Qi Luo <qiluo-msft@users.noreply.github.com>

* Update submodule sonic-py-swsssdk: Fix redis-py version to 2.10.6

Signed-off-by: Qi Luo <qiluo-msft@users.noreply.github.com>
2018-11-19 20:39:44 +00:00
Joe LeVeque
e710527382 [caclmgrd] Add a rule to allow all connections from localhost (#1858) 2018-11-03 12:54:28 +00:00
Ying Xie
f089d14970 [console speed] lock console speed to start up speed (#2222)
Auto negotiating console speed could cause sonic to lock on a wrong
speed under rare conditions. The only way to come out of the wrong
speed is to issue line break or restart console service with forced
speed, or reboot sonic.

Lock down the console speed to avoid these situations.

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2018-11-01 15:11:27 -07:00
Taoyu Li
2aac4c4822 [tacplus nss conf] tacplus should be before compat (#2163) 2018-10-18 22:22:38 +00:00
Samuel Angebault
15ae2a799c [devices]: Align flash partition at 1M (#2104)
Flashes used for the 7050QX-32 and 7050QX-32S have a fw issue.
The best option to solve the problem is to upgrade to a newer firmware.
However this can only be done while in memory and take 10 seconds.
Adding an upgrade mechanism is possible but would need more
consideration as flashing the firmware and reformating the flash will
exceed the fast-reboot requirements.

A quick mitigation is to align the ext4 partition that we create on
these vfat based system on a 4k boundary.
Here we chose 1M instead but it's the same.
Newer version of sfdisk do this automatically but the one in SONiC
today doesn't have this behavior.

This workaround will only reduce the pace of the flash health
degradation. The only long term fix is to flash the firmware.
2018-10-15 22:24:08 +00:00
Andriy Moroz
5d0748df03 [Mellanox] Use MAC from EEPROM for PortChannels and VLAN Interfaces (#1793)
* Use MAC from EEPROM for PortChannels

Signed-off-by: Andriy Moroz <c_andriym@mellanox.com>

* Use MAC from EEPROM in DEVICE_METADATA

Will affect MAC for VLAN interfaces

Signed-off-by: Andriy Moroz <c_andriym@mellanox.com>

* Get MAC via decode-syseeprom

Signed-off-by: Andriy Moroz <c_andriym@mellanox.com>

* hw-management is now a service

Signed-off-by: Andriy Moroz <c_andriym@mellanox.com>

* Add error handling for MAC fetch process

Signed-off-by: Andriy Moroz <c_andriym@mellanox.com>
2018-10-13 20:28:07 +00:00
zhenggen-xu
773ed99d0a [security] Upgrade kernel to 3.16.57-2 on 201803 branch (#2116)
* [security] Upgrade kernel to 3.16.57-2

Fix issues below:
https://www.debian.org/security/2018/dsa-4120
https://www.debian.org/security/2018/dsa-4179
https://www.debian.org/security/2018/dsa-4187
https://www.debian.org/security/2018/dsa-4188
https://www.debian.org/security/2018/dsa-4196

and more.

* update opennsl-modules-3.16.0-6-amd64_3.4.1.11-7_amd64.deb package

Signed-off-by: Guohan Lu <gulv@microsoft.com>

* [mellanox] update sdk base url (new kernel version)
2018-10-10 23:41:27 -07:00
Samuel Angebault
f9bca94d72 [arista]: Fix mount point discovery in boot0 for overlayfs (#1959)
On overlay filesystem the name of the mountpoint will also match in the
mount command for overlayfs as upperdir=
To prevent detecting the wrong partition we now look for space before.
This ensure that we match mountpoint and not devices in df and mount
outputs.
2018-09-26 02:41:50 +00:00
Wenda Ni
09a61f1846 [devices]: Buffers & qos config change on a6100 and a7050 (#2037)
* a7050 & a6100
ecn profile th_min = 1MB = 1048576 bytes, th_max = 2MB = 2097152 bytes, pmax = 5%

a6100
dscp 46 -> 5; apply lossy traffic profile
dwrr equal weight on 5 queues
ingress lossless alpha 2^(-3)

Signed-off-by: Wenda Ni <wenni@microsoft.com>

* Adjust config format for wred profile on a7050 & a6100

Signed-off-by: Wenda <wenni@microsoft.com>

* Update sonic-utilities submodule reference point

Signed-off-by: Wenda <wenni@microsoft.com>
2018-09-11 22:15:09 -07:00
Taoyu Li
2f0a3c63fa Don't reuse init_cfg.json from old image during upgrade (#2036) 2018-09-12 05:13:48 +00:00
Joe LeVeque
5817dcf72e Download newer version (8.23.0-2) of rsyslog from jessie-backports in hopes of eliminating memory leaks (#1912) 2018-08-17 09:54:28 +00:00
pavel-shirshov
36604f76b9 [swss]: Start counter from swss container (#1875)
* sonic-quagga update. Don't spam with 'Vtysh connected from' message

* Enable counters inside swss container. systemd is not flexible enough to follow our business rules
2018-07-28 00:55:48 +00:00
Taoyu Li
1f19c27006 Modify sudo lecture message to be more informational (#1800) 2018-07-28 00:55:03 +00:00
Rodny Molina
502ba0e86d Fix for bash's memory-leak (#1879)
* Fix for bash's memory-leak

Memory leak is observed during the execution of scripts that make use of bash-arrays. In scenarios where the offending script is executed on a regular basis (e.g. fancontrol), the leaking process may end up consuming most of the system resources.

In this PR i'm replacing bash in all the contexts where it executes (both host and dockers). The official patch for this issue is here: https://ftp.gnu.org/gnu/bash/bash-4.3-patches/bash43-040

* Fixing minor issue during code-merge

Signed-off-by: Rodny Molina <rmolina@linkedin.com>
2018-07-28 00:50:17 +00:00
pavel-shirshov
b42bd5b7b7 Convert arp_update into a 'start-it-once' mode (#1864)
* Run arp_update just once, don't restart it. It will run continuosly with 5 min pauses
2018-07-19 15:08:47 +00:00
pavel-shirshov
1e43b9950b Pavelsh/restart swss counters (#1854)
* sonic-quagga update. Don't spam with 'Vtysh connected from' message

* Restart counters in 60 seconds after manual swss restart
2018-07-18 02:03:01 +00:00
Taoyu Li
c8b2810d48 [sudoer] Enable read-only commands for all users (#1558) 2018-07-10 05:12:46 +00:00
pavel-shirshov
6568203c15 Enable sairedis counter thread in 3 minutes after SONiC was started (#1813)
* Enable sairedis counter thread in 3 minutes after SONiC was started

* Enable all counters
2018-06-26 12:17:51 -07:00
Taoyu Li
3779c5824a [interface-config] Force eth0 before reconfiguration (#1802) 2018-06-25 19:49:18 +00:00
Kebo Liu
c8753d037c [build templates] Add environment variables for mellanox syncd container to support mellanox sdk and prm sniffer. (#1551)
modified:   files/build_templates/docker_image_ctl.j2

  Signed-off-by:  Kebo Liu kebol@mellanox.com
2018-06-22 19:11:28 +00:00
pavel-shirshov
4d795ec995 Update 201803 branch with masters of sonic-swss, sonic-sairedis, sonic-swss-common, and sonic-utilities (#1791) 2018-06-15 11:49:10 -07:00
Joe LeVeque
704f2fa6df [caclmgrd] Heuristically determine whether ACL is IPv4 or IPv6, use iptables/ip6tables accordingly (#1767)
* [caclmgrd] Heuristically determine whether ACL is IPv4 or IPv6, use iptables/ip6tables accordingly

* Check all rules in table until we find one with a SRC_IP
2018-06-05 10:25:03 +00:00
Qi Luo
53373b6383 Revert the feature: serial port watchdog service (#1766)
* Revert "[serial watchdog] remove serial watchdog service dependency to rc.local (#1752)"
* Revert "[service] introducing serial port watchdog service (#1743)"
2018-06-05 05:54:55 +00:00
Ying Xie
85c45818c4 [serial watchdog] remove serial watchdog service dependency to rc.local (#1752)
* [serial watchdog] remove serial watchdog service dependency to rc.local

When restarting this service in rc.local, the dependency causes an error
in syslog. Removing the dependency to mute the error log entry.

* remove lines with empty inputs
2018-05-31 15:09:59 +00:00
Shuotian Cheng
188afc1e09 [docker_image_ctl]: Add --force while removing obsolete dockers (#1744) 2018-05-26 07:35:12 +00:00
Ying Xie
2aa1de5d0c [service] introducing serial port watchdog service (#1743)
* [rc.local] refactor platform identification code to separate function

Signed-off-by: Ying Xie <ying.xie@microsoft.com>

* [rc.local] infrastructure to take action according to installer.conf

* [serial port watchdog] add service to watch serial port processes

Monitor serial port processes. Kill ones stuck for too long.

Signed-off-by: Ying Xie <ying.xie@microsoft.com>

* [rc.local] start watchdog on serial port specified by installer.conf

Signed-off-by: Ying Xie <ying.xie@microsoft.com>
2018-05-26 07:35:08 +00:00
lguohan
a2d014450b [fast-reboot]: support encoded & gzipped minigraph in fast reboot (#1716)
* [fast-reboot]: support encoded & gzipped minigraph in fast reboot

Signed-off-by: Guohan Lu <gulv@microsoft.com>

* add acl.json and snmp.yml into fast-reboot

Signed-off-by: Guohan Lu <gulv@microsoft.com>
2018-05-19 09:35:28 +00:00
Andriy Moroz
323cafd7d9 Update buffers config for Mellanox 27xx devices (#1649)
* Update buffers config for Mellanox 27xx devices

Signed-off-by: Andriy Moroz <c_andriym@mellanox.com>

* Remove buffers template test for msn27xx

Signed-off-by: Andriy Moroz <c_andriym@mellanox.com>
2018-05-10 08:59:57 +00:00
Prince Sunny
90c94334e5 [updategraph]: Keep updategraph service active after start (#1651) 2018-04-26 23:39:20 +00:00
Joe LeVeque
7b3674441a Allow one Service ACL to bind to multiple services (#1576)
* [caclmgrd] Also ignore IP protocol if found in rule; we will only use our predefined protocols
2018-04-11 02:00:36 +00:00
lguohan
6fe95bce89 [baseimage]: bring down eth0 before restart networking (#1555)
cfggen generates new eth0 configuration. Need to first
clean existing configuration on eth0 before bring up
new configuration on eth0. Thus, we need to first bring
down eth0 before putting new configuration into /etc/network/
interfaces

Signed-off-by: Guohan Lu <gulv@microsoft.com>
2018-04-06 12:59:15 +00:00