Three-minute disk stalls

Hamparle

« on: 02. 12. 2020, 21:34:29 »
Every now and then I see my disk freeze completely for about 3 minutes; iostat reports 100% util. Programs either hang as well or show some message saying that something failed. After that hot moment the disk works again, the pending commands finish, the messages show up...

Is this a symptom of SMR?


It just happened after I created a 700GB FAT32 partition in gnome-disks. A message then appeared, something along the lines of: 4.1kB (4096b) could not be read, g-io-disks-quark. It was probably still formatting, and I also tried hitting Cancel on Erasing.

dmesg:
[ 7006.557226] usb 2-3.3: USB disconnect, device number 32
[ 7095.201310] sd 0:0:0:0: [sda] Synchronizing SCSI cache
[ 7095.322234] sd 0:0:0:0: [sda] Synchronize Cache(10) failed: Result: hostbyte=DID_ERROR driverbyte=DRIVER_OK
[ 7095.369186] usb 2-4: USB disconnect, device number 28
[ 7129.632135]  sdd: sdd1
[ 7561.205717]  sdd: sdd1
[ 7656.926673] sd 3:0:0:0: [sdd] tag#1 uas_eh_abort_handler 0 uas-tag 2 inflight: CMD
[ 7656.926683] sd 3:0:0:0: [sdd] tag#1 CDB: ATA command pass through(16) 85 06 20 00 00 00 00 00 00 00 00 00 00 00 e5 00
[ 7709.651417] sd 3:0:0:0: [sdd] tag#0 uas_eh_abort_handler 0 uas-tag 1 inflight: CMD
[ 7709.651423] sd 3:0:0:0: [sdd] tag#0 CDB: Synchronize Cache(10) 35 00 00 00 00 00 00 00 00 00
[ 7709.651427] scsi host3: uas_eh_bus_reset_handler start
[ 7714.771360] usb 2-1: Disable of device-initiated U1 failed.
[ 7719.891357] usb 2-1: Disable of device-initiated U2 failed.
[ 7719.995699] usb 2-1: reset SuperSpeed USB device number 33 using xhci_hcd
[ 7720.010656] scsi host3: uas_eh_bus_reset_handler success
[ 7720.010905] sd 3:0:0:0: [sdd] tag#1 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT
[ 7720.010913] sd 3:0:0:0: [sdd] tag#1 CDB: ATA command pass through(16) 85 06 20 00 00 00 00 00 00 00 00 00 00 00 e5 00
[ 7824.733537] sd 3:0:0:0: [sdd] tag#4 uas_eh_abort_handler 0 uas-tag 5 inflight: CMD
[ 7824.733547] sd 3:0:0:0: [sdd] tag#4 CDB: ATA command pass through(16) 85 06 20 00 00 00 00 00 00 00 00 00 00 00 e5 00
[ 7850.706898] sd 3:0:0:0: [sdd] tag#5 uas_eh_abort_handler 0 uas-tag 6 inflight: CMD IN
[ 7850.706908] sd 3:0:0:0: [sdd] tag#5 CDB: Read(16) 88 00 00 00 00 00 00 00 00 00 00 00 04 00 00 00
[ 7850.706955] scsi host3: uas_eh_bus_reset_handler start
[ 7850.709510] sd 3:0:0:0: [sdd] tag#6 uas_zap_pending 0 uas-tag 7 inflight: CMD
[ 7850.709520] sd 3:0:0:0: [sdd] tag#6 CDB: Read(16) 88 00 00 00 00 00 00 00 04 00 00 00 04 00 00 00
[ 7850.709526] sd 3:0:0:0: [sdd] tag#7 uas_zap_pending 0 uas-tag 8 inflight: CMD
[ 7850.709530] sd 3:0:0:0: [sdd] tag#7 CDB: Read(16) 88 00 00 00 00 00 00 00 08 00 00 00 04 00 00 00
[ 7850.709534] sd 3:0:0:0: [sdd] tag#8 uas_zap_pending 0 uas-tag 9 inflight: CMD

...and so on up to tag 29


[ 7850.814639] usb 2-1: reset SuperSpeed USB device number 33 using xhci_hcd
[ 7850.828712] scsi host3: uas_eh_bus_reset_handler success


now from tag 29 down to 1

[ 7882.205969] sd 3:0:0:0: [sdd] tag#13 uas_eh_abort_handler 0 uas-tag 14 inflight: CMD IN
[ 7882.205972] sd 3:0:0:0: [sdd] tag#13 CDB: Read(16) 88 00 00 00 00 00 00 00 34 00 00 00 04 00 00 00
[ 7882.206011] sd 3:0:0:0: [sdd] tag#12 uas_eh_abort_handler 0 uas-tag 13 inflight: CMD IN
[ 7882.206014] sd 3:0:0:0: [sdd] tag#12 CDB: Read(16) 88 00 00 00 00 00 00 00 30 00 00 00 04 00 00 00
[ 7882.206053] sd 3:0:0:0: [sdd] tag#11 uas_eh_abort_handler 0 uas-tag 12 inflight: CMD IN
[ 7882.206056] sd 3:0:0:0: [sdd] tag#11 CDB: Read(16) 88 00 00 00 00 00 00 00 2c 00 00 00 04 00 00 00
[ 7882.206106] sd 3:0:0:0: [sdd] tag#10 uas_eh_abort_handler 0 uas-tag 11 inflight: CMD IN
[ 7882.206112] sd 3:0:0:0: [sdd] tag#10 CDB: Read(16) 88 00 00 00 00 00 00 00 28 00 00 00 04 00 00 00
[ 7882.206162] sd 3:0:0:0: [sdd] tag#9 uas_eh_abort_handler 0 uas-tag 10 inflight: CMD IN
[ 7882.206167] sd 3:0:0:0: [sdd] tag#9 CDB: Read(16) 88 00 00 00 00 00 00 00 24 00 00 00 04 00 00 00
[ 7882.206213] sd 3:0:0:0: [sdd] tag#8 uas_eh_abort_handler 0 uas-tag 9 inflight: CMD IN
[ 7882.206218] sd 3:0:0:0: [sdd] tag#8 CDB: Read(16) 88 00 00 00 00 00 00 00 20 00 00 00 04 00 00 00


... somewhere around here it came back to life

[ 7885.545762] usb 2-1: stat urb: no pending cmd for uas-tag 5
[ 7892.495569] usb 2-1: stat urb: no pending cmd for uas-tag 6
[ 7899.445371] usb 2-1: stat urb: no pending cmd for uas-tag 7
[ 7906.395204] usb 2-1: stat urb: no pending cmd for uas-tag 8
[ 7911.890101] sd 3:0:0:0: [sdd] tag#27 uas_eh_abort_handler 0 uas-tag 28 inflight: CMD
[ 7911.890111] sd 3:0:0:0: [sdd] tag#27 CDB: Synchronize Cache(10) 35 00 00 00 00 00 00 00 00 00
[ 7911.890121] scsi host3: uas_eh_bus_reset_handler start
[ 7911.996318] usb 2-1: reset SuperSpeed USB device number 33 using xhci_hcd
[ 7912.010877] scsi host3: uas_eh_bus_reset_handler success
[ 7912.011108] sd 3:0:0:0: [sdd] tag#27 Medium access timeout failure. Offlining disk!
[ 7912.011122] sd 3:0:0:0: Device offlined - not ready after error recovery
[ 7912.011139] sd 3:0:0:0: [sdd] tag#4 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT
[ 7912.011147] sd 3:0:0:0: [sdd] tag#4 CDB: Read(16) 88 00 00 00 00 00 00 00 10 00 00 00 04 00 00 00
[ 7912.011153] blk_update_request: I/O error, dev sdd, sector 4096
[ 7912.011182] sd 3:0:0:0: rejecting I/O to offline device
[ 7912.011189] sd 3:0:0:0: [sdd] killing request
[ 7912.011197] sd 3:0:0:0: rejecting I/O to offline device
[ 7912.011203] sd 3:0:0:0: [sdd] killing request
[ 7912.011208] sd 3:0:0:0: rejecting I/O to offline device
[ 7912.011215] sd 3:0:0:0: [sdd] killing request
[ 7912.011220] sd 3:0:0:0: rejecting I/O to offline device
[ 7912.011227] sd 3:0:0:0: [sdd] killing request
[ 7912.011231] sd 3:0:0:0: rejecting I/O to offline device
[ 7912.011236] sd 3:0:0:0: killing request
[ 7912.011241] sd 3:0:0:0: rejecting I/O to offline device
[ 7912.011247] sd 3:0:0:0: [sdd] killing request
[ 7912.011257] sd 3:0:0:0: rejecting I/O to offline device
[ 7912.011269] sd 3:0:0:0: rejecting I/O to offline device
[ 7912.011280] sd 3:0:0:0: rejecting I/O to offline device
[ 7912.011289] sd 3:0:0:0: rejecting I/O to offline device
[ 7912.011298] sd 3:0:0:0: rejecting I/O to offline device
[ 7912.011316] sd 3:0:0:0: [sdd] tag#5 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT
[ 7912.011322] sd 3:0:0:0: [sdd] tag#5 CDB: Read(16) 88 00 00 00 00 00 00 00 14 00 00 00 04 00 00 00
[ 7912.011326] blk_update_request: I/O error, dev sdd, sector 5120
...
[ 7912.011518] sd 3:0:0:0: [sdd] tag#13 FAILED Result: hostbyte=DID_OK driverbyte=DRIVER_TIMEOUT
[ 7912.011523] sd 3:0:0:0: [sdd] tag#13 CDB: Read(16) 88 00 00 00 00 00 00 00 34 00 00 00 04 00 00 00
[ 7912.011527] blk_update_request: I/O error, dev sdd, sector 13312
[ 7912.011699] sd 3:0:0:0: rejecting I/O to offline device
[ 7912.011705] sd 3:0:0:0: [sdd] killing request
[ 7912.011828] sd 3:0:0:0: rejecting I/O to offline device
[ 7912.014828] sd 3:0:0:0: rejecting I/O to offline device


Can someone translate this from kernel-speak into plain language: what was going on while the disk was thrashing? I don't need a translation of the English itself.
Specifically (taken out of context like this, the terms are only good for searching; the fragments alone probably say little, what I am after is which two states the disk was caught between and which layers were fighting each other):
uas_eh_abort_handler
CDB
uas reset handler
termín tag#
uas_zap_pending
Medium access timeout failure. Offlining disk!
not ready after recovery
hostbyte=DID_
blk_update_request: I/O error
rejecting io to offline
killing request
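
For the CDB lines in the log above, a minimal sketch of how the Read(16) bytes can be read: opcode 0x88, then an 8-byte big-endian LBA in bytes 2 to 9 and a 4-byte big-endian block count in bytes 10 to 13. The function name `decode_read16` is made up for this illustration. That the decoded LBA lines up with the "sector" number in `blk_update_request` assumes 512-byte logical blocks, which the log itself does not state.

```python
READ_16 = 0x88  # SCSI opcode shown as "Read(16) 88 ..." in the log


def decode_read16(cdb_hex):
    """Return (lba, blocks) for a Read(16) CDB given as a hex string."""
    raw = bytes.fromhex(cdb_hex)  # spaces between the bytes are ignored
    if len(raw) != 16 or raw[0] != READ_16:
        raise ValueError("not a 16-byte Read(16) CDB")
    lba = int.from_bytes(raw[2:10], "big")      # starting logical block
    blocks = int.from_bytes(raw[10:14], "big")  # number of blocks requested
    return lba, blocks


if __name__ == "__main__":
    # Three CDBs copied from the dmesg output above (tag#4, tag#5, tag#13).
    for cdb in (
        "88 00 00 00 00 00 00 00 10 00 00 00 04 00 00 00",
        "88 00 00 00 00 00 00 00 14 00 00 00 04 00 00 00",
        "88 00 00 00 00 00 00 00 34 00 00 00 04 00 00 00",
    ):
        lba, blocks = decode_read16(cdb)
        print(f"LBA {lba}, {blocks} blocks")
```

Read this way, the failed reads are plain sequential 1024-block requests near the very start of the device, and the LBAs 4096, 5120 and 13312 are the same numbers the kernel prints in the "I/O error, dev sdd, sector ..." lines.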

Is this normal behavior for a disk? Or is the disk faulty, or is this a fit of SMR?
« Last edit: 02. 12. 2020, 22:02:31 by Petr Krčmář »


RDa

Re: Three-minute disk stalls
« Reply #1 on: 02. 12. 2020, 23:15:13 »
UASP is often buggy, both on the controller side and on the disk side.
Step one is to add your VID:PID to the UAS blacklist, which forces the classic mass-storage mode.
Step zero: why don't you tell us the model of the disk / bridge (and of the computer/controller), omg!
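
A rough sketch of what "step one" usually looks like on Linux: the `usb-storage` module accepts a `quirks=VID:PID:flags` option, and the `u` flag tells the kernel not to bind the device to the uas driver. The helper below only builds that line. The function name `uas_quirk_line` and the IDs `1234:abcd` are placeholders, since the thread never gives the real VID:PID of the bridge (`lsusb` would show it).

```python
import re

HEX4 = re.compile(r"[0-9a-fA-F]{4}")


def uas_quirk_line(vid, pid):
    """Build a modprobe options line that keeps VID:PID off the uas driver."""
    for value in (vid, pid):
        if not HEX4.fullmatch(value):
            raise ValueError(f"expected 4 hex digits, got {value!r}")
    # 'u' is the usb-storage quirk flag for "ignore UAS" on this device.
    return f"options usb-storage quirks={vid.lower()}:{pid.lower()}:u"


if __name__ == "__main__":
    # Placeholder IDs, NOT taken from the thread.
    print(uas_quirk_line("1234", "abcd"))
```

The resulting line would typically go into a file under /etc/modprobe.d/ (the file name is up to you), followed by reloading the module or rebooting. If usb-storage is loaded from the initramfs, that may need to be regenerated as well.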