On systems where disks of model HFS960G32FEH-BA1 are attached to a
Fusion-MPT 12GSAS/PCIe Secure SAS38xx controller (HBA355i Front), the
system enters a degraded state, notably after a swact, with errors in
sm services and in the systemd-udev daemon.

The root cause is that these disks were being spun down, which caused
latency when they were spun up again and widespread errors across the
system.

The solution is to disable disk spindown in the TuneD starlingx
profile.
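
As a rough illustration only (an assumption, not necessarily the exact
change made here), disabling spindown through the TuneD disk plugin
could look like this in the profile's tuned.conf:

    [disk]
    # Assumed value: 0 turns off the standby (spindown) timeout,
    # in the same way as "hdparm -S 0".
    spindown=0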

This is a temporary fix and may reduce the MTBF of spinning hard disk
drives (HDDs), since they are no longer spun down when idle. However,
HDDs are rarely used in servers nowadays.

Test Plan:
PASS: 10x swact, without Rook Ceph
PASS: 5x swact, with Rook Ceph, 4 OSDs (HFS960G32FEH-BA) in each
      host, writing data intensively
PASS: 10x swact, with Rook Ceph, 11 OSDs (HFS960G32FEH-BA) in each
      host, writing data intensively

Closes-Bug: 2098400
Change-Id: I1a5f52dd5f30976adaf4ea09347f97c9179c260f
Signed-off-by: Ítalo Vieira <italo.gomesvieira@windriver.com>