All of lore.kernel.org
 help / color / mirror / Atom feed
From: Kent Overstreet <koverstreet-hpIqsD4AKlfQT0dZR+AlfA@public.gmane.org>
To: Joseph Glanville
	<joseph.glanville-2MxvZkOi9dvvnOemgxGiVw@public.gmane.org>
Cc: linux-bcache-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
Subject: Re: bcache-3.2 branch
Date: Fri, 13 Jul 2012 02:01:50 -0700	[thread overview]
Message-ID: <CAH+dOxKJs5WmHk2hb+6fLCs=fCK_6mezPueaewaHwSn3jZ2m0A@mail.gmail.com> (raw)
In-Reply-To: <CAOzFzEjUy9zakyBhE5CNhcP-Dv7+hQBz3uzhKsMy5kj5GxfGwg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>

Argh, weird.

That kinda sounds like it'd be a massive pain for me to reproduce too...

So you're only seeing errors with Xen, correct?

Probably have to figure out either what xen_blkback is doing different
from everything else (in which case we should be able to reproduce the
errors without it) or track down where in the io stack the errors are
coming from.

Neither sound very appealing :/ I've had to chase bugs that showed up
like that before, the io stack is big and messy.

If you can get a test system set up though I can try and help narrow it down.

Something that would be really useful for narrowing it down is finding
out whether LVM is required - i.e. whether xen_blkback + bcache on a
partition works.

3.2 should be fine for debugging this (I'm keeping it up to date, and
running it on my workstation at work).

On Tue, Jul 10, 2012 at 11:52 AM, Joseph Glanville
<joseph.glanville-2MxvZkOi9dvvnOemgxGiVw@public.gmane.org> wrote:
> On 10 July 2012 03:07, Kent Overstreet <koverstreet-hpIqsD4AKlfQT0dZR+AlfA@public.gmane.org> wrote:
>> On Tue, Jul 10, 2012 at 02:32:36AM +1000, Joseph Glanville wrote:
>>> On 10 July 2012 01:57, Kent Overstreet <koverstreet-hpIqsD4AKlfQT0dZR+AlfA@public.gmane.org> wrote:
>>> > On Wed, Jun 20, 2012 at 10:08:51PM +1000, Joseph Glanville wrote:
>>> >> Hi Kent and list,
>>> >>
>>> >> I have pulled down the latest bcache code and have been playing around
>>> >> with it when I noticed that I am having issues starting Xen virtual
>>> >> machines using bcache + LVM.
>>> >> What is interesting is the QEMU storage emulation in userspace is able
>>> >> to access the device fine however blkback kernel module which uses the
>>> >> device directly seems to fail.
>>> >> How would I go about debugging any of this?
>>> >>
>>> >> Older versions of bcache work fine so it's a regression as far as I can tell.
>>> >
>>> > Hey, sorry for the delay - I just got back from my first sort-of
>>> > vacation in... awhile :P
>>> >
>>> > I'm pretty sure I know the approximate source of the regression - I
>>> > fairly recently reworked some code in the generic block layer to handle
>>> > arbitrary size bios (which enabled some major cleanups in the bcache
>>> > code). I've chased down a few bugs with that code since then.
>>> >
>>> > Got some logs for me to look at? Or did you want me to give you pointers
>>> > on debugging kernel code? :)
>>>
>>> A few pointers would be great. :)
>>
>> More than happy to :) I'm not sure what sort of general pointers I could
>> give you off the top of my head - there's no Unified Theory of
>> Debugging, it's just a big bag of tricks you learn to narrow things down
>> until you figure it out. But I'll try to tell you everything I'd do with
>> this bug, at least (and whatever else you find :)
>>
>> Also just understanding how things work so you can figure out a root
>> cause from the symptom.
>>
>>>
>>> Also how do I best get it to do a really verbose log that I can use to
>>> help you track down bugs?
>>
>> I think for all the bugs that have shown up in the wild so far we
>> haven't needed any special logging, just the normal stuff has been fine.
>> There's all kinds of logging and tracing and whatnot buried in there but
>> for the most part you don't want to bother with the non default stuff
>> unless you have to.
>>
>> But anyways, just whatever the kernel spits out is the place to start.
>> If you've still got that, I'll take a look and tell you what I'd get out
>> of it.
>
> Unfortunately the kernel wasn't talking much, I didn't see anything
> unusual and everything else seemed to work fine. :(
> I was able to successfully use bcached LVM volumes with filesystems
> too, it only became an issue when trying to use them as block devices
> for virtual machines.
> From the virtual machine all I could see where I/O errors, probably
> caused by the xen_blkback module returning failed read.
> Debugging that beast is not all that fun but I will see how I can go
> setting up a test system sometime this week with the latest bcache
> code.
> We are pretty entrenched in 3.2 but would be be more useful if I
> carried out testing on latter kernels instead or is 3.2 fine?
>
>>
>>>
>>> >
>>> >>
>>> >> Joseph.
>>> >>
>>> >> --
>>> >> CTO | Orion Virtualisation Solutions | www.orionvm.com.au
>>> >> Phone: 1300 56 99 52 | Mobile: 0428 754 846
>>>
>>> Cheers,
>>> Joseph.
>>>
>>> --
>>> CTO | Orion Virtualisation Solutions | www.orionvm.com.au
>>> Phone: 1300 56 99 52 | Mobile: 0428 754 846
>
> Joseph.
>
> --
> CTO | Orion Virtualisation Solutions | www.orionvm.com.au
> Phone: 1300 56 99 52 | Mobile: 0428 754 846

  parent reply	other threads:[~2012-07-13  9:01 UTC|newest]

Thread overview: 11+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2012-06-20 12:08 bcache-3.2 branch Joseph Glanville
     [not found] ` <CAOzFzEh8pO37dVWoMoD+hFoUGrBoubSdktdu7SQS0UcXLcC66w-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2012-07-09 15:57   ` Kent Overstreet
     [not found]     ` <20120709155734.GA23774-hpIqsD4AKlfQT0dZR+AlfA@public.gmane.org>
2012-07-09 16:32       ` Joseph Glanville
     [not found]         ` <CAOzFzEjovYu4eE9E_asOBVyBhqFuvhgzJ7UFyESLY0XycAfkuA-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2012-07-09 17:07           ` Kent Overstreet
     [not found]             ` <20120709170742.GA26798-hpIqsD4AKlfQT0dZR+AlfA@public.gmane.org>
2012-07-09 18:07               ` bcache & kernel branch that will build together Jason Warr
     [not found]                 ` <4FFB1DD4.7030304-/cow75dQlsI@public.gmane.org>
2012-07-09 18:41                   ` Kent Overstreet
     [not found]                     ` <20120709184149.GA3234-hpIqsD4AKlfQT0dZR+AlfA@public.gmane.org>
2012-07-09 20:16                       ` Kent Overstreet
     [not found]                         ` <CAC7rs0vkasCKYwBTrvhWg8pzvHuix1ZaBJxTLBW+5AVG_31hEQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2012-07-09 21:48                           ` Jason Warr
2012-07-10 18:52               ` bcache-3.2 branch Joseph Glanville
     [not found]                 ` <CAOzFzEjUy9zakyBhE5CNhcP-Dv7+hQBz3uzhKsMy5kj5GxfGwg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2012-07-13  9:01                   ` Kent Overstreet [this message]
     [not found]                     ` <CAH+dOxKJs5WmHk2hb+6fLCs=fCK_6mezPueaewaHwSn3jZ2m0A-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2012-07-13 21:10                       ` Joseph Glanville

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='CAH+dOxKJs5WmHk2hb+6fLCs=fCK_6mezPueaewaHwSn3jZ2m0A@mail.gmail.com' \
    --to=koverstreet-hpiqsd4aklfqt0dzr+alfa@public.gmane.org \
    --cc=joseph.glanville-2MxvZkOi9dvvnOemgxGiVw@public.gmane.org \
    --cc=linux-bcache-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.