<br><br><div class="gmail_quote">On Tue, Jun 22, 2010 at 4:05 PM, Robert Lindgren <span dir="ltr"><<a href="mailto:robert.lindgren@gmail.com">robert.lindgren@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">
Hi All,<div><br></div><div>Sorry if this topic has been up before, I'm new to this list.</div><div><br></div><div>I have a problem with gfs2_quotad turning up in D state, uninterpretable sleep, when I set one of my nodes in standby. Hence VirtualDomain resource agent stops working, since libvirt fails to read from gfs (drbd primary/primary). I'm running Ubuntu Lucid with the gfs tools from Cluster-stack ppa.</div>
<div><br></div><div>If one one node is started gfs doesn't behave like this, and when one host is lost due to standby or powercord yank.</div><div><br></div><div>Any hints would be appreciated.</div></blockquote><div>
<br></div><div>Last things in dmesg are:</div><div> </div><div>[86392.002282] block drbd0: conn( Unconnected -> WFConnection ) <br>[86395.120629] dlm: closing connection to node 34212362<br>[86395.162508] GFS2: fsid=pcmk:pcmk.0: jid=1: Trying to acquire journal lock...<br>
[86520.420036] INFO: task kslowd000:2290 blocked for more than 120 seconds.<br></div></div><div><br></div><div>After that I see that gfs doesn't respond:<br></div><div>root@sugadaddy:/var/log# ps -efl | grep D<br>F S UID PID PPID C PRI NI ADDR SZ WCHAN STIME TTY TIME CMD<br>
1 D root 2290 2 0 75 -5 - 0 dlm_lo Jun22 ? 00:00:00 [kslowd000]<br>1 D root 2310 2 0 80 0 - 0 dlm_lo Jun22 ? 00:00:00 [gfs2_quotad]<br>0 D root 18713 1147 0 80 0 - 4658 dlm_lo 15:54 ? 00:00:00 /usr/lib/libvirt/virt-aa-helper -r -u libvirt-c0b6fc07-7195-4e31-7ce7-e12d5b71bdee<br>
</div><div><br></div><div>Looks to be the gfs2_quotad stalling but since parentpid 2 is kinda hard to see if it's really quotad stalling or if it's waiting for the journal lock.</div><div><br></div><div>Not sure why I don't get the journal lock...</div>