All of lore.kernel.org
 help / color / mirror / Atom feed
* Nightly "Kernel branch not found" errors
@ 2015-05-31 19:54 Gregory Farnum
  2015-05-31 20:40 ` Ilya Dryomov
  0 siblings, 1 reply; 7+ messages in thread
From: Gregory Farnum @ 2015-05-31 19:54 UTC (permalink / raw)
  To: idryomov, Zack Cerza, Andrew Schoen; +Cc: ceph-qa, ceph-devel

We are getting this error in what looks like everything that specifies
the testing kernel. (That turns out to be almost all of the FS tests
and a surprising number of the non-rados runs; e.g. rgw.) I've checked
that the testing branch of ceph-client.git still exists and when
looking at the teuthology git logs the recent ping-pong of commits on
kernel flavors et al stand out. Any ideas? :)
-Greg

On Fri, May 29, 2015 at 11:10 PM,  <teuthology@ceph.com> wrote:
> Kernel branch 'testing' not found
> _______________________________________________
> Ceph-qa mailing list
> Ceph-qa@lists.ceph.com
> http://lists.ceph.com/listinfo.cgi/ceph-qa-ceph.com

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Nightly "Kernel branch not found" errors
  2015-05-31 19:54 Nightly "Kernel branch not found" errors Gregory Farnum
@ 2015-05-31 20:40 ` Ilya Dryomov
  2015-05-31 20:49   ` Gregory Farnum
  0 siblings, 1 reply; 7+ messages in thread
From: Ilya Dryomov @ 2015-05-31 20:40 UTC (permalink / raw)
  To: Gregory Farnum; +Cc: Zack Cerza, Andrew Schoen, ceph-qa, ceph-devel

On Sun, May 31, 2015 at 10:54 PM, Gregory Farnum <greg@gregs42.com> wrote:
> We are getting this error in what looks like everything that specifies
> the testing kernel. (That turns out to be almost all of the FS tests
> and a surprising number of the non-rados runs; e.g. rgw.) I've checked
> that the testing branch of ceph-client.git still exists and when
> looking at the teuthology git logs the recent ping-pong of commits on
> kernel flavors et al stand out. Any ideas? :)

Which lab is this in?  Latest rgw and fs in sepia look fine.

Kernel task was broken the entire last week but in a different way -
when scheduled with teuthology-suite it wouldn't install anything even
if you told it to install e.g. testing.  I fixed that on Friday.

Looks like this is coming from create_initial_config().  Could be an
environment issue, like down gitbuilders or a problem with requests
module?  Kernel branch is checked before the others so it may not have
anything to do with it at all.

In any case, I'm half asleep so somebody else should look into this ;)

Thanks,

                Ilya

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Nightly "Kernel branch not found" errors
  2015-05-31 20:40 ` Ilya Dryomov
@ 2015-05-31 20:49   ` Gregory Farnum
  2015-05-31 22:19     ` Sage Weil
  0 siblings, 1 reply; 7+ messages in thread
From: Gregory Farnum @ 2015-05-31 20:49 UTC (permalink / raw)
  To: Ilya Dryomov; +Cc: Zack Cerza, Andrew Schoen, ceph-qa, ceph-devel

On Sun, May 31, 2015 at 1:40 PM, Ilya Dryomov <idryomov@gmail.com> wrote:
> On Sun, May 31, 2015 at 10:54 PM, Gregory Farnum <greg@gregs42.com> wrote:
>> We are getting this error in what looks like everything that specifies
>> the testing kernel. (That turns out to be almost all of the FS tests
>> and a surprising number of the non-rados runs; e.g. rgw.) I've checked
>> that the testing branch of ceph-client.git still exists and when
>> looking at the teuthology git logs the recent ping-pong of commits on
>> kernel flavors et al stand out. Any ideas? :)
>
> Which lab is this in?  Latest rgw and fs in sepia look fine.
>
> Kernel task was broken the entire last week but in a different way -
> when scheduled with teuthology-suite it wouldn't install anything even
> if you told it to install e.g. testing.  I fixed that on Friday.
>
> Looks like this is coming from create_initial_config().  Could be an
> environment issue, like down gitbuilders or a problem with requests
> module?  Kernel branch is checked before the others so it may not have
> anything to do with it at all.

It's happening across labs for tests that were supposed to be
scheduled starting on the 29 (at least, that I've noticed). I think
this issue is before they get into pulpito, which is why the latest fs
suite run there was scheduled on May 27. :(
Looking at the gitbuliders I do see that the CentOS6 testing branch is
red and rhel7 appears to be down...maybe we're checking on more of
them now and then failing when those don't appear? :/
-Greg

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Nightly "Kernel branch not found" errors
  2015-05-31 20:49   ` Gregory Farnum
@ 2015-05-31 22:19     ` Sage Weil
  2015-05-31 22:28       ` Gregory Farnum
  2015-06-01 12:56       ` Ilya Dryomov
  0 siblings, 2 replies; 7+ messages in thread
From: Sage Weil @ 2015-05-31 22:19 UTC (permalink / raw)
  To: Gregory Farnum
  Cc: Ilya Dryomov, Zack Cerza, Andrew Schoen, ceph-qa, ceph-devel

On Sun, 31 May 2015, Gregory Farnum wrote:
> On Sun, May 31, 2015 at 1:40 PM, Ilya Dryomov <idryomov@gmail.com> wrote:
> > On Sun, May 31, 2015 at 10:54 PM, Gregory Farnum <greg@gregs42.com> wrote:
> >> We are getting this error in what looks like everything that specifies
> >> the testing kernel. (That turns out to be almost all of the FS tests
> >> and a surprising number of the non-rados runs; e.g. rgw.) I've checked
> >> that the testing branch of ceph-client.git still exists and when
> >> looking at the teuthology git logs the recent ping-pong of commits on
> >> kernel flavors et al stand out. Any ideas? :)
> >
> > Which lab is this in?  Latest rgw and fs in sepia look fine.
> >
> > Kernel task was broken the entire last week but in a different way -
> > when scheduled with teuthology-suite it wouldn't install anything even
> > if you told it to install e.g. testing.  I fixed that on Friday.
> >
> > Looks like this is coming from create_initial_config().  Could be an
> > environment issue, like down gitbuilders or a problem with requests
> > module?  Kernel branch is checked before the others so it may not have
> > anything to do with it at all.
> 
> It's happening across labs for tests that were supposed to be
> scheduled starting on the 29 (at least, that I've noticed). I think
> this issue is before they get into pulpito, which is why the latest fs
> suite run there was scheduled on May 27. :(
> Looking at the gitbuliders I do see that the CentOS6 testing branch is
> red and rhel7 appears to be down...maybe we're checking on more of
> them now and then failing when those don't appear? :/

We're half-way through creating the centos7 kernel builder to replace the 
rhel ones so I expect things are broken on the rpm side.  Not sure if 
that is the root cause here, but we can probably wait for that to get 
fixed first before looking further

s

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Nightly "Kernel branch not found" errors
  2015-05-31 22:19     ` Sage Weil
@ 2015-05-31 22:28       ` Gregory Farnum
  2015-06-01 13:13         ` Ilya Dryomov
  2015-06-01 12:56       ` Ilya Dryomov
  1 sibling, 1 reply; 7+ messages in thread
From: Gregory Farnum @ 2015-05-31 22:28 UTC (permalink / raw)
  To: Sage Weil; +Cc: Ilya Dryomov, Zack Cerza, Andrew Schoen, ceph-qa, ceph-devel

On Sun, May 31, 2015 at 3:19 PM, Sage Weil <sage@newdream.net> wrote:
> On Sun, 31 May 2015, Gregory Farnum wrote:
>> On Sun, May 31, 2015 at 1:40 PM, Ilya Dryomov <idryomov@gmail.com> wrote:
>> > On Sun, May 31, 2015 at 10:54 PM, Gregory Farnum <greg@gregs42.com> wrote:
>> >> We are getting this error in what looks like everything that specifies
>> >> the testing kernel. (That turns out to be almost all of the FS tests
>> >> and a surprising number of the non-rados runs; e.g. rgw.) I've checked
>> >> that the testing branch of ceph-client.git still exists and when
>> >> looking at the teuthology git logs the recent ping-pong of commits on
>> >> kernel flavors et al stand out. Any ideas? :)
>> >
>> > Which lab is this in?  Latest rgw and fs in sepia look fine.
>> >
>> > Kernel task was broken the entire last week but in a different way -
>> > when scheduled with teuthology-suite it wouldn't install anything even
>> > if you told it to install e.g. testing.  I fixed that on Friday.
>> >
>> > Looks like this is coming from create_initial_config().  Could be an
>> > environment issue, like down gitbuilders or a problem with requests
>> > module?  Kernel branch is checked before the others so it may not have
>> > anything to do with it at all.
>>
>> It's happening across labs for tests that were supposed to be
>> scheduled starting on the 29 (at least, that I've noticed). I think
>> this issue is before they get into pulpito, which is why the latest fs
>> suite run there was scheduled on May 27. :(
>> Looking at the gitbuliders I do see that the CentOS6 testing branch is
>> red and rhel7 appears to be down...maybe we're checking on more of
>> them now and then failing when those don't appear? :/
>
> We're half-way through creating the centos7 kernel builder to replace the
> rhel ones so I expect things are broken on the rpm side.  Not sure if
> that is the root cause here, but we can probably wait for that to get
> fixed first before looking further

*None* of our FS tests are running while this problem persists (and
they're not alone). That's not the sort of thing we can wait on...
Maybe we have some gratuitous non-use of the testing kernel we can
remove (I'm not sure), but that's the sort of thing that needs to be
discussed across teams so we can deal with it proactively instead of
just finding out when the nightlies start failing.
-Greg

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Nightly "Kernel branch not found" errors
  2015-05-31 22:19     ` Sage Weil
  2015-05-31 22:28       ` Gregory Farnum
@ 2015-06-01 12:56       ` Ilya Dryomov
  1 sibling, 0 replies; 7+ messages in thread
From: Ilya Dryomov @ 2015-06-01 12:56 UTC (permalink / raw)
  To: Sage Weil; +Cc: Gregory Farnum, Zack Cerza, Andrew Schoen, ceph-qa, ceph-devel

On Mon, Jun 1, 2015 at 1:19 AM, Sage Weil <sage@newdream.net> wrote:
> On Sun, 31 May 2015, Gregory Farnum wrote:
>> On Sun, May 31, 2015 at 1:40 PM, Ilya Dryomov <idryomov@gmail.com> wrote:
>> > On Sun, May 31, 2015 at 10:54 PM, Gregory Farnum <greg@gregs42.com> wrote:
>> >> We are getting this error in what looks like everything that specifies
>> >> the testing kernel. (That turns out to be almost all of the FS tests
>> >> and a surprising number of the non-rados runs; e.g. rgw.) I've checked
>> >> that the testing branch of ceph-client.git still exists and when
>> >> looking at the teuthology git logs the recent ping-pong of commits on
>> >> kernel flavors et al stand out. Any ideas? :)
>> >
>> > Which lab is this in?  Latest rgw and fs in sepia look fine.
>> >
>> > Kernel task was broken the entire last week but in a different way -
>> > when scheduled with teuthology-suite it wouldn't install anything even
>> > if you told it to install e.g. testing.  I fixed that on Friday.
>> >
>> > Looks like this is coming from create_initial_config().  Could be an
>> > environment issue, like down gitbuilders or a problem with requests
>> > module?  Kernel branch is checked before the others so it may not have
>> > anything to do with it at all.
>>
>> It's happening across labs for tests that were supposed to be
>> scheduled starting on the 29 (at least, that I've noticed). I think
>> this issue is before they get into pulpito, which is why the latest fs
>> suite run there was scheduled on May 27. :(
>> Looking at the gitbuliders I do see that the CentOS6 testing branch is
>> red and rhel7 appears to be down...maybe we're checking on more of
>> them now and then failing when those don't appear? :/
>
> We're half-way through creating the centos7 kernel builder to replace the
> rhel ones so I expect things are broken on the rpm side.  Not sure if
> that is the root cause here, but we can probably wait for that to get
> fixed first before looking further

I think it's definitely related to this.  commit f2ce5e1ed3d4 ("Treat
RHEL as CentOS when scheduling") by Zack makes it so centos7
gitbuilders are poked in case distro wasn't specified explicitly, which
is how I assume all those runs are scheduled.  kernel centos6
gitbuilder is down and AFAICT the centos7 one doesn't exist.

Thanks,

                Ilya

^ permalink raw reply	[flat|nested] 7+ messages in thread

* Re: Nightly "Kernel branch not found" errors
  2015-05-31 22:28       ` Gregory Farnum
@ 2015-06-01 13:13         ` Ilya Dryomov
  0 siblings, 0 replies; 7+ messages in thread
From: Ilya Dryomov @ 2015-06-01 13:13 UTC (permalink / raw)
  To: Gregory Farnum; +Cc: Sage Weil, Zack Cerza, Andrew Schoen, ceph-qa, ceph-devel

On Mon, Jun 1, 2015 at 1:28 AM, Gregory Farnum <greg@gregs42.com> wrote:
> On Sun, May 31, 2015 at 3:19 PM, Sage Weil <sage@newdream.net> wrote:
>> On Sun, 31 May 2015, Gregory Farnum wrote:
>>> On Sun, May 31, 2015 at 1:40 PM, Ilya Dryomov <idryomov@gmail.com> wrote:
>>> > On Sun, May 31, 2015 at 10:54 PM, Gregory Farnum <greg@gregs42.com> wrote:
>>> >> We are getting this error in what looks like everything that specifies
>>> >> the testing kernel. (That turns out to be almost all of the FS tests
>>> >> and a surprising number of the non-rados runs; e.g. rgw.) I've checked
>>> >> that the testing branch of ceph-client.git still exists and when
>>> >> looking at the teuthology git logs the recent ping-pong of commits on
>>> >> kernel flavors et al stand out. Any ideas? :)
>>> >
>>> > Which lab is this in?  Latest rgw and fs in sepia look fine.
>>> >
>>> > Kernel task was broken the entire last week but in a different way -
>>> > when scheduled with teuthology-suite it wouldn't install anything even
>>> > if you told it to install e.g. testing.  I fixed that on Friday.
>>> >
>>> > Looks like this is coming from create_initial_config().  Could be an
>>> > environment issue, like down gitbuilders or a problem with requests
>>> > module?  Kernel branch is checked before the others so it may not have
>>> > anything to do with it at all.
>>>
>>> It's happening across labs for tests that were supposed to be
>>> scheduled starting on the 29 (at least, that I've noticed). I think
>>> this issue is before they get into pulpito, which is why the latest fs
>>> suite run there was scheduled on May 27. :(
>>> Looking at the gitbuliders I do see that the CentOS6 testing branch is
>>> red and rhel7 appears to be down...maybe we're checking on more of
>>> them now and then failing when those don't appear? :/
>>
>> We're half-way through creating the centos7 kernel builder to replace the
>> rhel ones so I expect things are broken on the rpm side.  Not sure if
>> that is the root cause here, but we can probably wait for that to get
>> fixed first before looking further
>
> *None* of our FS tests are running while this problem persists (and
> they're not alone). That's not the sort of thing we can wait on...
> Maybe we have some gratuitous non-use of the testing kernel we can
> remove (I'm not sure), but that's the sort of thing that needs to be
> discussed across teams so we can deal with it proactively instead of
> just finding out when the nightlies start failing.

It's not the kernel to blame here, so unless there is a specific reason
let's not remove use of testing kernel.

What I think we need to do is to make sure that teuthology test suite
includes "teuthology-suite --dry-run ..." with "--kernel testing" at
the very least.  It would be good to also verify that testing kernel is
actually going to be installed by the kernel task to avoid last week's
breakage, but that's a matter of adding unit tests and is a different
issue.

Unless I'm missing something, the only problem here is really that the
commit I mentioned in another mail got pushed before the gitbuilder was
setup.

Thanks,

                Ilya

^ permalink raw reply	[flat|nested] 7+ messages in thread

end of thread, other threads:[~2015-06-01 13:13 UTC | newest]

Thread overview: 7+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2015-05-31 19:54 Nightly "Kernel branch not found" errors Gregory Farnum
2015-05-31 20:40 ` Ilya Dryomov
2015-05-31 20:49   ` Gregory Farnum
2015-05-31 22:19     ` Sage Weil
2015-05-31 22:28       ` Gregory Farnum
2015-06-01 13:13         ` Ilya Dryomov
2015-06-01 12:56       ` Ilya Dryomov

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.