Fórum Root.cz
Hlavní témata => Hardware => Téma založeno: intelik 20. 04. 2016, 13:51:40
-
Ahojte,
po update Debianu ze Squeeze na Jessie mam na routeru vypadky sitoveho rozhrani, nahodne nekdy i nekolikrat za hodinu, jindy to jede 5 hodin ok. Problem je, ze behem toho vypadku samozrejme vypadne pripojeni na net a obcas je i dost slozite ted na tom routeru neco obslouzit pres SSH :-(
Dokud jel router na Squeeze vse byl ok.
Zkousel jsem zkompilovat a nahrat posledni ovladace od intelu vydane letos v lednu a take nic.
Sitovka je presne: Intel Corporation 82567LM-3 Gigabit Network
Mate nekdo nejakou radu? Navic nejsem sam (https://lists.debian.org/debian-kernel/2015/05/msg00045.html (https://lists.debian.org/debian-kernel/2015/05/msg00045.html)), kdo toto resi a nemohu zatim najit reseni :(
Vypis ze syslogu behem vypadku rozhranni:
[tt]Apr 20 13:43:07 elly kernel: [78550.820228] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
Apr 20 13:43:07 elly kernel: [78550.820228] TDH <16>
Apr 20 13:43:07 elly kernel: [78550.820228] TDT <21>
Apr 20 13:43:07 elly kernel: [78550.820228] next_to_use <21>
Apr 20 13:43:07 elly kernel: [78550.820228] next_to_clean <11>
Apr 20 13:43:07 elly kernel: [78550.820228] buffer_info[next_to_clean]:
Apr 20 13:43:07 elly kernel: [78550.820228] time_stamp <1012a7fad>
Apr 20 13:43:07 elly kernel: [78550.820228] next_to_watch <16>
Apr 20 13:43:07 elly kernel: [78550.820228] jiffies <1012a80d1>
Apr 20 13:43:07 elly kernel: [78550.820228] next_to_watch.status <0>
Apr 20 13:43:07 elly kernel: [78550.820228] MAC Status <80083>
Apr 20 13:43:07 elly kernel: [78550.820228] PHY Status <796d>
Apr 20 13:43:07 elly kernel: [78550.820228] PHY 1000BASE-T Status <7800>
Apr 20 13:43:07 elly kernel: [78550.820228] PHY Extended Status <3000>
Apr 20 13:43:07 elly kernel: [78550.820228] PCI Status <10>
Apr 20 13:43:09 elly kernel: [78552.820225] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
Apr 20 13:43:09 elly kernel: [78552.820225] TDH <16>
Apr 20 13:43:09 elly kernel: [78552.820225] TDT <21>
Apr 20 13:43:09 elly kernel: [78552.820225] next_to_use <21>
Apr 20 13:43:09 elly kernel: [78552.820225] next_to_clean <11>
Apr 20 13:43:09 elly kernel: [78552.820225] buffer_info[next_to_clean]:
Apr 20 13:43:09 elly kernel: [78552.820225] time_stamp <1012a7fad>
Apr 20 13:43:09 elly kernel: [78552.820225] next_to_watch <16>
Apr 20 13:43:09 elly kernel: [78552.820225] jiffies <1012a82c5>
Apr 20 13:43:09 elly kernel: [78552.820225] next_to_watch.status <0>
Apr 20 13:43:09 elly kernel: [78552.820225] MAC Status <80083>
Apr 20 13:43:09 elly kernel: [78552.820225] PHY Status <796d>
Apr 20 13:43:09 elly kernel: [78552.820225] PHY 1000BASE-T Status <7800>
Apr 20 13:43:09 elly kernel: [78552.820225] PHY Extended Status <3000>
Apr 20 13:43:09 elly kernel: [78552.820225] PCI Status <10>
Apr 20 13:43:11 elly kernel: [78554.820218] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
Apr 20 13:43:11 elly kernel: [78554.820218] TDH <16>
Apr 20 13:43:11 elly kernel: [78554.820218] TDT <21>
Apr 20 13:43:11 elly kernel: [78554.820218] next_to_use <21>
Apr 20 13:43:11 elly kernel: [78554.820218] next_to_clean <11>
Apr 20 13:43:11 elly kernel: [78554.820218] buffer_info[next_to_clean]:
Apr 20 13:43:11 elly kernel: [78554.820218] time_stamp <1012a7fad>
Apr 20 13:43:11 elly kernel: [78554.820218] next_to_watch <16>
Apr 20 13:43:11 elly kernel: [78554.820218] jiffies <1012a84b9>
Apr 20 13:43:11 elly kernel: [78554.820218] next_to_watch.status <0>
Apr 20 13:43:11 elly kernel: [78554.820218] MAC Status <80083>
Apr 20 13:43:11 elly kernel: [78554.820218] PHY Status <796d>
Apr 20 13:43:11 elly kernel: [78554.820218] PHY 1000BASE-T Status <7800>
Apr 20 13:43:11 elly kernel: [78554.820218] PHY Extended Status <3000>
Apr 20 13:43:11 elly kernel: [78554.820218] PCI Status <10>
Apr 20 13:43:13 elly kernel: [78556.820258] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
Apr 20 13:43:13 elly kernel: [78556.820258] TDH <16>
Apr 20 13:43:13 elly kernel: [78556.820258] TDT <21>
Apr 20 13:43:13 elly kernel: [78556.820258] next_to_use <21>
Apr 20 13:43:13 elly kernel: [78556.820258] next_to_clean <11>
Apr 20 13:43:13 elly kernel: [78556.820258] buffer_info[next_to_clean]:
Apr 20 13:43:13 elly kernel: [78556.820258] time_stamp <1012a7fad>
Apr 20 13:43:13 elly kernel: [78556.820258] next_to_watch <16>
Apr 20 13:43:13 elly kernel: [78556.820258] jiffies <1012a86ad>
Apr 20 13:43:13 elly kernel: [78556.820258] next_to_watch.status <0>
Apr 20 13:43:13 elly kernel: [78556.820258] MAC Status <80083>
Apr 20 13:43:13 elly kernel: [78556.820258] PHY Status <796d>
Apr 20 13:43:13 elly kernel: [78556.820258] PHY 1000BASE-T Status <7800>
Apr 20 13:43:13 elly kernel: [78556.820258] PHY Extended Status <3000>
Apr 20 13:43:13 elly kernel: [78556.820258] PCI Status <10>
Apr 20 13:43:15 elly kernel: [78558.820246] e1000e 0000:00:19.0 eth0: Detected Hardware Unit Hang:
Apr 20 13:43:15 elly kernel: [78558.820246] TDH <16>
Apr 20 13:43:15 elly kernel: [78558.820246] TDT <21>
Apr 20 13:43:15 elly kernel: [78558.820246] next_to_use <21>
Apr 20 13:43:15 elly kernel: [78558.820246] next_to_clean <11>
Apr 20 13:43:15 elly kernel: [78558.820246] buffer_info[next_to_clean]:
Apr 20 13:43:15 elly kernel: [78558.820246] time_stamp <1012a7fad>
Apr 20 13:43:15 elly kernel: [78558.820246] next_to_watch <16>
Apr 20 13:43:15 elly kernel: [78558.820246] jiffies <1012a88a1>
Apr 20 13:43:15 elly kernel: [78558.820246] next_to_watch.status <0>
Apr 20 13:43:15 elly kernel: [78558.820246] MAC Status <80083>
Apr 20 13:43:15 elly kernel: [78558.820246] PHY Status <796d>
Apr 20 13:43:15 elly kernel: [78558.820246] PHY 1000BASE-T Status <7800>
Apr 20 13:43:15 elly kernel: [78558.820246] PHY Extended Status <3000>
Apr 20 13:43:15 elly kernel: [78558.820246] PCI Status <10>
Apr 20 13:43:16 elly kernel: [78559.824096] e1000e 0000:00:19.0 eth0: Reset adapter unexpectedly
Apr 20 13:43:18 elly kernel: [78562.672861] e1000e: eth0 NIC Link is Up 1000 Mbps Full Duplex, Flow Control: None
[/tt]
-
Tak po dalsim dni googleni, jsem se dostal k tomu, ze by za tim mohlo byt povolene TSO (tcp segmentation offload) na sitovce, kde novy kernel s tim ma udajne u nekterych sitovek problem.
Je tu nekdo s hlubsi sitovou znalosti (TCP) a mel nervy a case me trosku "cesky"objasnit k cemu presne je TSO potreba, jak funguje, a pripadne jak moc jeho vypnuti muze treba snizit vykon systemu? :-)
Diky.
-
Tak po dalsim dni googleni, jsem se dostal k tomu, ze by za tim mohlo byt povolene TSO (tcp segmentation offload) na sitovce, kde novy kernel s tim ma udajne u nekterych sitovek problem.
Je tu nekdo s hlubsi sitovou znalosti (TCP) a mel nervy a case me trosku "cesky"objasnit k cemu presne je TSO potreba, jak funguje, a pripadne jak moc jeho vypnuti muze treba snizit vykon systemu? :-)
Diky.
Novější kernel a případně firmware z Debian Backports nepomůže?
-
Tak po dalsim dni googleni, jsem se dostal k tomu, ze by za tim mohlo byt povolene TSO (tcp segmentation offload) na sitovce, kde novy kernel s tim ma udajne u nekterych sitovek problem.
Je tu nekdo s hlubsi sitovou znalosti (TCP) a mel nervy a case me trosku "cesky"objasnit k cemu presne je TSO potreba, jak funguje, a pripadne jak moc jeho vypnuti muze treba snizit vykon systemu? :-)
Diky.
Novější kernel a případně firmware z Debian Backports nepomůže?
No popravde jsem zatim netestoval, ale kdyz nepomuze vypnuti TSO, tak asi zkusim...
-
trochu ina sietovka, ale rovnaky problem https://github.com/gislab-npo/gislab/issues/407 (https://github.com/gislab-npo/gislab/issues/407)
-
Tak po dalsim dni googleni, jsem se dostal k tomu, ze by za tim mohlo byt povolene TSO (tcp segmentation offload) na sitovce, kde novy kernel s tim ma udajne u nekterych sitovek problem.
Je tu nekdo s hlubsi sitovou znalosti (TCP) a mel nervy a case me trosku "cesky"objasnit k cemu presne je TSO potreba, jak funguje, a pripadne jak moc jeho vypnuti muze treba snizit vykon systemu? :-)
Diky.
Tak vypnuti TSO pomohlo a rozhranni drzi bez jedineho vypadku jiz 4. den :-) Takze snad to pomuze i nekomu jinemu :-)
-
Stejný problém na Debian Jessie. Adaptér je Intel 82578DM. Taky mě pomohlo vypnutí tso - vlastně na základě googlení vypínám tso, gso a gro.
ethtool -K eth0 gso off gro off tso off
Už několik měsíců bez výpadku. Někdy zkusím, jestli stačí vypnout jen tso. Dopad na výkon nepozoruji, ale to není síťově moc zatížený server.