All of lore.kernel.org
 help / color / mirror / Atom feed
From: Bart Van Assche <bart.vanassche-Re5JQEeQqe8AvxtiuMwx3w@public.gmane.org>
To: Philip Pokorny
	<ppokorny-pabcTyWEv4ZW60MLeMDbCVaTQe2KTcn/@public.gmane.org>
Cc: Vladislav Bolkhovitin <vst-d+Crzxg7Rs0@public.gmane.org>,
	scst-devel-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org,
	linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org,
	Arend Dittmer
	<adittmer-pabcTyWEv4ZW60MLeMDbCVaTQe2KTcn/@public.gmane.org>,
	Vu Pham <vuhuong-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org>
Subject: Re: SRPT and SCST
Date: Fri, 6 Nov 2009 15:53:43 +0100	[thread overview]
Message-ID: <e2e108260911060653g6832c124uaa6e11072a12e448@mail.gmail.com> (raw)
In-Reply-To: <654FA770A883FB43BAF3CB0B1E1DAC8C01C8C4FA-/U8SqUwOx9/OOpeOfUw7maQk6oIRg43YAL8bYrjMMd8@public.gmane.org>

On Fri, Nov 6, 2009 at 3:39 PM, Philip Pokorny
<ppokorny-pabcTyWEv4ZW60MLeMDbCVaTQe2KTcn/@public.gmane.org> wrote:
>
> >> This tells me that there is a pending I/O waiting to be completed but it
> >> seems to have been lost on the server, because this is taking much too
> >> long.  There are 7 seconds "between" each line of output above so that's
> >> almost 30 seconds of output with *no* change in the I/O status.
> >>
> >> The "gzip | tar -x" I was running is "hung"
>
> Upon further investigation, I found that the clients had actually aborted SCSI commands that took too long:
>
> sd 7:0:0:1: timing out command, waited 180s
> sd 7:0:0:1: SCSI error: return code = 0x06000000
> end_request: I/O error, dev sdb, sector 60377910
> Buffer I/O error on device sdb1, logical block 30188699
> lost page write due to I/O error on sdb1
> sd 7:0:0:1: timing out command, waited 180s
> sd 7:0:0:1: SCSI error: return code = 0x06000000
> end_request: I/O error, dev sdb, sector 186810934
> EXT3-fs error (device sdb1): ext3_get_inode_loc: unable to read inode block - inode=11675841, block=93405211
> Aborting journal on device sdb1.
>
> I should point out that the IB_SRP CLIENT we are using is from OFED 1.3.2
>
> [root@head0 ~]# modinfo ib_srp
> filename:       /lib/modules/2.6.18-128.1.1.el5.530g0000/kernel/drivers/infiniband/ulp/srp/ib_srp.ko
> license:        Dual BSD/GPL
> description:    InfiniBand SCSI RDMA Protocol initiator v0.2 (November 1, 2005)
>
> These are Red Hat 5 clients and we can upgrade to Red Hat 5.4 with the Red Hat IB_SRP, but it may be the same code.  Anything else will be more work.

It might be a good idea to repeat the test with the SRP initiator
included with RHEL 5 instead of the OFED SRP initiator. At least one
bug that is present in the OFED SRP initiator is not present in the
RHEL 5 SRP initiator. See also
https://bugs.openfabrics.org/show_bug.cgi?id=1745 for an example.

> > Can you please post the SCST target logs available for the above scenario ?
>
> Yes, and please make sure you are running the debug build.
>
> =====
> Sure.  We *are* running the CONFIG_SCST_DEBUG build.
>
> How do I collect the target logs?  I don't see anything obvious in /proc/scsi_tgt/...

The following commands will enable lots of additional tracing
information (probably way too much):
cat /proc/scsi_tgt/help
echo all >/proc/scsi_tgt/trace_level
echo all >/proc/scsi_tgt/vdisk/trace_level
echo all >/proc/scsi_tgt/ib_srpt/trace_level

Bart.
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo-u79uwXL29TY76Z2rM5mHXA@public.gmane.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

  parent reply	other threads:[~2009-11-06 14:53 UTC|newest]

Thread overview: 15+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
     [not found] <3142CEFB1403044F9954E2DF6C85660FBB34BD@orca.penguincomputing.com>
     [not found] ` <f3177b9e0911040802o7fce0f4fte02c52dfe940f582@mail.gmail.com>
     [not found]   ` <3142CEFB1403044F9954E2DF6C85660FBB34BF@orca.penguincomputing.com>
     [not found]     ` <f3177b9e0911041004t2e75d545v5cc10d5375550bde@mail.gmail.com>
     [not found]       ` <654FA770A883FB43BAF3CB0B1E1DAC8C01C8C4DD@orca.penguincomputing.com>
     [not found]         ` <654FA770A883FB43BAF3CB0B1E1DAC8C01C8C4DD-/U8SqUwOx9/OOpeOfUw7maQk6oIRg43YAL8bYrjMMd8@public.gmane.org>
2009-11-05  8:51           ` SRPT and SCST Philip Pokorny
     [not found]             ` <4AF29201.6000606-pabcTyWEv4ZW60MLeMDbCVaTQe2KTcn/@public.gmane.org>
2009-11-05 13:27               ` Vladislav Bolkhovitin
     [not found]                 ` <4AF2D2B8.5080304-d+Crzxg7Rs0@public.gmane.org>
2009-11-05 18:34                   ` Bart Van Assche
     [not found]                     ` <654FA770A883FB43BAF3CB0B1E1DAC8C01C8C4F9@orca.penguincomputing.com>
     [not found]                       ` <654FA770A883FB43BAF3CB0B1E1DAC8C01C8C4F9-/U8SqUwOx9/OOpeOfUw7maQk6oIRg43YAL8bYrjMMd8@public.gmane.org>
2009-11-06  7:06                         ` Bart Van Assche
     [not found]                           ` <e2e108260911052306l230d8d7cxbae68bf08678d6fe-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2009-11-06 11:59                             ` Vladislav Bolkhovitin
     [not found]                               ` <654FA770A883FB43BAF3CB0B1E1DAC8C01C8C4FA@orca.penguincomputing.com>
     [not found]                                 ` <654FA770A883FB43BAF3CB0B1E1DAC8C01C8C4FA-/U8SqUwOx9/OOpeOfUw7maQk6oIRg43YAL8bYrjMMd8@public.gmane.org>
2009-11-06 14:53                                   ` Bart Van Assche [this message]
     [not found]                                     ` <e2e108260911060653g6832c124uaa6e11072a12e448-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2009-11-06 16:39                                       ` Vladislav Bolkhovitin
     [not found]                                         ` <3142CEFB1403044F9954E2DF6C85660FBB34E6@orca.penguincomputing.com>
     [not found]                                           ` <3142CEFB1403044F9954E2DF6C85660FBB34E6-/U8SqUwOx9/OOpeOfUw7maQk6oIRg43YAL8bYrjMMd8@public.gmane.org>
2009-11-08  9:49                                             ` Bart Van Assche
     [not found]                                               ` <e2e108260911080149t569fc016p6e38d86a15cb7d05-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2009-11-09 20:26                                                 ` Vladislav Bolkhovitin
     [not found]                                                   ` <4AF87B05.1050902-d+Crzxg7Rs0@public.gmane.org>
2009-11-09 20:43                                                     ` Chris Worley
2009-11-11  0:33                                                       ` Arend Dittmer
     [not found]                                                         ` <3142CEFB1403044F9954E2DF6C85660FB801C9-/U8SqUwOx9/OOpeOfUw7maQk6oIRg43YAL8bYrjMMd8@public.gmane.org>
2009-11-11 12:36                                                           ` Vladislav Bolkhovitin
2009-11-09  7:27                         ` Bart Van Assche
2009-12-14 20:41               ` Bart Van Assche
2010-05-30  8:01               ` Bart Van Assche

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=e2e108260911060653g6832c124uaa6e11072a12e448@mail.gmail.com \
    --to=bart.vanassche-re5jqeeqqe8avxtiumwx3w@public.gmane.org \
    --cc=adittmer-pabcTyWEv4ZW60MLeMDbCVaTQe2KTcn/@public.gmane.org \
    --cc=linux-rdma-u79uwXL29TY76Z2rM5mHXA@public.gmane.org \
    --cc=ppokorny-pabcTyWEv4ZW60MLeMDbCVaTQe2KTcn/@public.gmane.org \
    --cc=scst-devel-5NWGOfrQmneRv+LV9MX5uipxlwaOVQ5f@public.gmane.org \
    --cc=vst-d+Crzxg7Rs0@public.gmane.org \
    --cc=vuhuong-VPRAkNaXOzVWk0Htik3J/w@public.gmane.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.