linux-kernel.vger.kernel.org archive mirror
 help / color / mirror / Atom feed
From: "Sean Hefty" <sean.hefty@intel.com>
To: "'Roland Dreier'" <rdreier@cisco.com>,
	"Or Gerlitz" <or.gerlitz@gmail.com>
Cc: <linux-kernel@vger.kernel.org>, <general@lists.openfabrics.org>
Subject: RE: [ofa-general] Further 2.6.23 merge plans...
Date: Tue, 17 Jul 2007 22:26:35 -0700	[thread overview]
Message-ID: <000001c7c8fc$360eec90$5dcc180a@amr.corp.intel.com> (raw)
In-Reply-To: <adaabtuo0n9.fsf@cisco.com>

>I think this is an important question.  If we merge the local SA
>stuff, then are we creating a problem for dealing with QoS?

Yes - I do believe that merging PR caching and QoS together will be difficult.
I don't think the problems are insurmountable, but I can't say that I have a
definite solution for how to deal with this. 

My current thoughts are that the purpose of the cache is to increase SA
scalability on large clusters.  We've seen issues running MPI, trying to
establish all-to-all connections, on our 256 node cluster.  (With 4 processes
per node, this results in about 500,000+ PR queries hitting the SA.)  The SA was
swamped with work, and it wasn't trying to enforce QoS requirements across the
cluster.

I just don't see how an SA that is already having trouble scaling to this number
of nodes will be able to perform the additional task of providing QoS across the
cluster.  It may be that, at least initially, an administrator may need to
select between enabling PR caching or QoS.

>Are we going to have to revert the local SA stuff once the QoS stuff is
>available?

In the best case, the local SA will need enhancements added to the base support.
In the worst case, a user would have to choose between QoS or PR caching.  If
all users choose QoS, then it would make sense to remove the local SA. 

>Or is there at least a sketch of a plan on how to handle this?

This is only a rough idea, and it depends on how the QoS is implemented.  The
idea is to create a local QoS module on each node.  The local QoS modules would
be programmed with basic QoS information.  For example, which types of queries
to handle locally, versus which ones to forward to the SA.  Locally handled
queries would return PRs based on some QoS mapping table.  (I haven't looked
into any details of this.)

Ideally, local QoS modules would be programmed by a QoS master.  This would
require a new vendor-specific protocol, but would allow for a simple distributed
QoS manager.

We will have a better idea of the issues and possible solutions once the QoS
spec is released, and we can hold discussions on it.  I will be working more
details on QoS enhancements starting in the next couple of weeks. 

- Sean

  reply	other threads:[~2007-07-18  5:26 UTC|newest]

Thread overview: 31+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2007-07-12 23:07 [GIT PULL] please pull infiniband.git Roland Dreier
2007-07-12 23:15 ` Further 2.6.23 merge plans Roland Dreier
2007-07-13  0:17   ` [ofa-general] " Hal Rosenstock
2007-07-13  1:14   ` Sean Hefty
     [not found]     ` <15ddcffd0707172020j5b68fcb2v7d3ca77863998020@mail.gmail.com>
2007-07-18  3:23       ` Roland Dreier
2007-07-18  5:26         ` Sean Hefty [this message]
2007-07-18 17:05           ` Sean Hefty
2007-07-18 20:54             ` Sean Hefty
2007-07-13  5:47   ` Michael S. Tsirkin
2007-07-13 18:14     ` Roland Dreier
2007-07-13 18:50       ` [ofa-general] " Shirley Ma
2007-07-17 18:06         ` Roland Dreier
2007-07-14 17:54       ` Michael S. Tsirkin
     [not found]         ` <OF72F6B9D1.F60C4EEF-ON8725731A.00506757-8825731A.0024BD1C@us.ibm.com>
2007-07-16 14:55           ` [ofa-general] " Michael S. Tsirkin
2007-07-16 16:42         ` Roland Dreier
2007-07-16 20:05           ` Michael S. Tsirkin
2007-07-17 17:53             ` Roland Dreier
2007-07-13 18:56   ` [ofa-general] " Shirley Ma
2007-07-16 16:47     ` Roland Dreier
2007-07-15 12:26   ` Tziporet Koren
2007-07-16 16:42     ` Roland Dreier
2007-07-17 18:07   ` Roland Dreier
2007-07-17 20:43     ` Matt Leininger
2007-07-17 20:45       ` Roland Dreier
2007-07-18  6:36         ` Or Gerlitz
2007-07-17 21:44     ` Michael S. Tsirkin
2007-07-18  7:34       ` [ofa-general] " Tziporet Koren
2007-07-18  7:38         ` Michael S. Tsirkin
2007-07-18  8:48           ` Tziporet Koren
2007-07-18 16:16             ` Sean Hefty
2007-07-18 16:20               ` Roland Dreier

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to='000001c7c8fc$360eec90$5dcc180a@amr.corp.intel.com' \
    --to=sean.hefty@intel.com \
    --cc=general@lists.openfabrics.org \
    --cc=linux-kernel@vger.kernel.org \
    --cc=or.gerlitz@gmail.com \
    --cc=rdreier@cisco.com \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).