From mboxrd@z Thu Jan 1 00:00:00 1970 From: John Spray Subject: Re: [sepia] CentOS builds failing in Shaman since Friday evening Date: Sun, 4 Jun 2017 10:26:12 +0100 Message-ID: References: <4a0aa923-028a-a9bf-d988-ead0f51fe831@suse.cz> Mime-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Return-path: Received: from mail-wr0-f179.google.com ([209.85.128.179]:35508 "EHLO mail-wr0-f179.google.com" rhost-flags-OK-OK-OK-OK) by vger.kernel.org with ESMTP id S1750847AbdFDJ0e (ORCPT ); Sun, 4 Jun 2017 05:26:34 -0400 Received: by mail-wr0-f179.google.com with SMTP id q97so24826628wrb.2 for ; Sun, 04 Jun 2017 02:26:33 -0700 (PDT) In-Reply-To: Sender: ceph-devel-owner@vger.kernel.org List-ID: To: Sage Weil Cc: Gregory Farnum , "sepia@ceph.com" , "ceph-devel@vger.kernel.org" , Nathan Cutler On Sat, Jun 3, 2017 at 9:45 PM, Sage Weil wrote: > I'm seeing the builds all complete: > > https://shaman.ceph.com/repos/ceph/wip-sage-testing/468be5dab6a2d8421a4fc35744463d47a80f47c2/ > > but it won't schedule: > > $ teuthology-suite -s rados -c wip-sage-testing2 --subset 111/444 -p 100 > -k distro > 2017-06-03 20:44:27,112.112 INFO:teuthology.suite.run:kernel sha1: distro > 2017-06-03 20:44:27,389.389 INFO:teuthology.suite.run:ceph sha1: > b03f3062d40c35e4898d77604d62e7e7c4e88afd > Traceback (most recent call last): > File "/home/sage/src/teuthology/virtualenv/bin/teuthology-suite", line 11, in > load_entry_point('teuthology', 'console_scripts', 'teuthology-suite')() > File "/home/sage/src/teuthology/scripts/suite.py", line 137, in main > return teuthology.suite.main(args) > File "/home/sage/src/teuthology/teuthology/suite/__init__.py", line 86, in main > run = Run(conf) > File "/home/sage/src/teuthology/teuthology/suite/run.py", line 46, in __init__ > self.base_config = self.create_initial_config() > File "/home/sage/src/teuthology/teuthology/suite/run.py", line 92, in create_initial_config > self.choose_ceph_version(ceph_hash) > File "/home/sage/src/teuthology/teuthology/suite/run.py", line 185, in choose_ceph_version > util.schedule_fail(str(exc), self.name) > File "/home/sage/src/teuthology/teuthology/suite/util.py", line 72, in schedule_fail > raise ScheduleFailError(message, name) > teuthology.exceptions.ScheduleFailError: Scheduling sage-2017-06-03_20:44:27-rados-wip-sage-testing2-distro-basic-smithi > failed: 'package_manager_version' > > :/ Same here. The tip of my branch is 50b0654e, I can see teuthology finding that and going to query shaman at this URL: https://shaman.ceph.com/api/search/?status=ready&project=ceph&flavor=default&distros=centos%2F7%2Fx86_64&sha1=50b0654ed19cb083af494878738c7debe80db31e The result has an empty dict for the 'extra' field, where teuthology is expecting to see package_manager_version. That stuff is supposed to be populated by ceph-build/build/build-rpm posting a repo-extra.json file to chacra.ceph.com I see my build log here: https://jenkins.ceph.com/job/ceph-dev-new-build/ARCH=x86_64,AVAILABLE_ARCH=x86_64,AVAILABLE_DIST=centos7,DIST=centos7,MACHINE_SIZE=huge/3848//consoleFull And I see the POST to chacra failing here: """ build@2/ARCH/x86_64/AVAILABLE_ARCH/x86_64/AVAILABLE_DIST/centos7/DIST/centos7/MACHINE_SIZE/huge/repo-extra.json -u admin:[*******] https://1.chacra.ceph.com/repos/ceph/wip-jcsp-testing-20170604/50b0654ed19cb083af494878738c7debe80db31e/centos/7/flavors/default/extra/ % Total % Received % Xferd Average Speed Time Time Time Current Dload Upload Total Spent Left Speed 0 0 0 0 0 0 0 0 --:--:-- --:--:-- --:--:-- 0 100 505 100 52 100 453 145 1267 --:--:-- --:--:-- --:--:-- 1268 404 Not Found The resource could not be found. """ So the ceph-build script is succeeding where it should be failing (does curl not return an error or is the script ignoring it?) and something is wrong with chacra.ceph.com that's making it 404 here (I don't know where to begin to debug that). John P.S. Probably a topic for another day, but I didn't love having to traverse several different git repos to try and work out what was happening during a build, wouldn't it be simpler to have a single repo for the build infrastructure? > > On Sat, 3 Jun 2017, Gregory Farnum wrote: > >> Adding sepia list for more infrastructure dev attention. (No idea where that >> problem is coming from.) >> >> On Sat, Jun 3, 2017 at 5:52 AM Nathan Cutler wrote: >> CentOS builds in Shaman started failing with this error: >> >> {standard input}: Assembler messages: >> {standard input}:186778: Warning: end of file not at end of a >> line; >> newline inserted >> {standard input}: Error: open CFI at the end of file; missing >> .cfi_endproc directive >> c++: internal compiler error: Killed (program cc1plus) >> Please submit a full bug report, >> with preprocessed source if appropriate. >> See for instructions. >> >> AFAICT the first occurrence was in [1] and the error has been >> haunting >> the build queue since then. >> >> [1] >> https://shaman.ceph.com/builds/ceph/wip-sage-testing2/f93ad23a8fec219667a03 >> 695136842edb0cceace/default/45729/ >> >> Nathan >> -- >> To unsubscribe from this list: send the line "unsubscribe >> ceph-devel" in >> the body of a message to majordomo@vger.kernel.org >> More majordomo info at >> http://vger.kernel.org/majordomo-info.html >> >> >> > > _______________________________________________ > Sepia mailing list > Sepia@lists.ceph.com > http://lists.ceph.com/listinfo.cgi/sepia-ceph.com >