All of lore.kernel.org
 help / color / mirror / Atom feed
* OT: xen libvirt issue
@ 2013-04-04 19:49 AL13N
  2013-04-05  8:21 ` Dario Faggioli
  0 siblings, 1 reply; 20+ messages in thread
From: AL13N @ 2013-04-04 19:49 UTC (permalink / raw)
  To: xen-devel

i realise this is completely off-topic, but if someone on this list has some 
knowledge on this, see:

https://www.redhat.com/archives/libvir-list/2013-April/msg00189.html

the issue is that shutting down xen domains, segfaults libvirtd... which is 
annoying

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-04 19:49 OT: xen libvirt issue AL13N
@ 2013-04-05  8:21 ` Dario Faggioli
  2013-04-05 11:13   ` George Dunlap
  0 siblings, 1 reply; 20+ messages in thread
From: Dario Faggioli @ 2013-04-05  8:21 UTC (permalink / raw)
  To: AL13N; +Cc: Jim Fehlig, xen-devel


[-- Attachment #1.1: Type: text/plain, Size: 687 bytes --]

On gio, 2013-04-04 at 21:49 +0200, AL13N wrote:
> i realise this is completely off-topic, but if someone on this list has some 
> knowledge on this, see:
> 
> https://www.redhat.com/archives/libvir-list/2013-April/msg00189.html
> 
> the issue is that shutting down xen domains, segfaults libvirtd... which is 
> annoying
> 
I don't have any clue on this... But, perhaps, Jim does (Cc-ing him)?

Regards,
Dario

-- 
<<This happens because I choose it to happen!>> (Raistlin Majere)
-----------------------------------------------------------------
Dario Faggioli, Ph.D, http://about.me/dario.faggioli
Senior Software Engineer, Citrix Systems R&D Ltd., Cambridge (UK)


[-- Attachment #1.2: This is a digitally signed message part --]
[-- Type: application/pgp-signature, Size: 198 bytes --]

[-- Attachment #2: Type: text/plain, Size: 126 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xen.org
http://lists.xen.org/xen-devel

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-05  8:21 ` Dario Faggioli
@ 2013-04-05 11:13   ` George Dunlap
  2013-04-08  4:22     ` Marek Marczykowski
  0 siblings, 1 reply; 20+ messages in thread
From: George Dunlap @ 2013-04-05 11:13 UTC (permalink / raw)
  To: Dario Faggioli; +Cc: Jim Fehlig, AL13N, xen-devel

On Fri, Apr 5, 2013 at 9:21 AM, Dario Faggioli
<dario.faggioli@citrix.com> wrote:
> On gio, 2013-04-04 at 21:49 +0200, AL13N wrote:
>> i realise this is completely off-topic, but if someone on this list has some
>> knowledge on this, see:
>>
>> https://www.redhat.com/archives/libvir-list/2013-April/msg00189.html
>>
>> the issue is that shutting down xen domains, segfaults libvirtd... which is
>> annoying
>>
> I don't have any clue on this... But, perhaps, Jim does (Cc-ing him)?

The e-mail that AL13N linked to was actually from Jim, saying he would
be really busy for a while and unable to look at it.

The question isn't off-topic, as libxl and xend have to work closely
with libvirt.  Unfortunately, I don't think any of the active
developers on this list has much familiarity with libvirt.  It Would
Be Good if someone could step up and learn, but with our feature
freeze next week, we're also kind of heads-down getting stuff
implemented...

 -George

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-05 11:13   ` George Dunlap
@ 2013-04-08  4:22     ` Marek Marczykowski
  2013-04-08  6:26       ` AL13N
  0 siblings, 1 reply; 20+ messages in thread
From: Marek Marczykowski @ 2013-04-08  4:22 UTC (permalink / raw)
  To: George Dunlap; +Cc: Dario Faggioli, xen-devel, Jim Fehlig, AL13N


[-- Attachment #1.1: Type: text/plain, Size: 1320 bytes --]

On 05.04.2013 13:13, George Dunlap wrote:
> On Fri, Apr 5, 2013 at 9:21 AM, Dario Faggioli
> <dario.faggioli@citrix.com> wrote:
>> On gio, 2013-04-04 at 21:49 +0200, AL13N wrote:
>>> i realise this is completely off-topic, but if someone on this list has some
>>> knowledge on this, see:
>>>
>>> https://www.redhat.com/archives/libvir-list/2013-April/msg00189.html
>>>
>>> the issue is that shutting down xen domains, segfaults libvirtd... which is
>>> annoying
>>>
>> I don't have any clue on this... But, perhaps, Jim does (Cc-ing him)?
> 
> The e-mail that AL13N linked to was actually from Jim, saying he would
> be really busy for a while and unable to look at it.
> 
> The question isn't off-topic, as libxl and xend have to work closely
> with libvirt.  Unfortunately, I don't think any of the active
> developers on this list has much familiarity with libvirt.  It Would
> Be Good if someone could step up and learn, but with our feature
> freeze next week, we're also kind of heads-down getting stuff
> implemented...

I believe it is already fixed in unstable by this commit:
5f5ef65babc2ca15f43b775c4b47b0102fa2a632 "libxl: fix stale timeout event
callback race"

Sadly backport to 4.2 isn't trivial.

-- 
Best Regards / Pozdrawiam,
Marek Marczykowski
Invisible Things Lab


[-- Attachment #1.2: OpenPGP digital signature --]
[-- Type: application/pgp-signature, Size: 553 bytes --]

[-- Attachment #2: Type: text/plain, Size: 126 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xen.org
http://lists.xen.org/xen-devel

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-08  4:22     ` Marek Marczykowski
@ 2013-04-08  6:26       ` AL13N
  2013-04-08 15:36         ` Jim Fehlig
                           ` (2 more replies)
  0 siblings, 3 replies; 20+ messages in thread
From: AL13N @ 2013-04-08  6:26 UTC (permalink / raw)
  To: xen-devel; +Cc: George Dunlap, Dario Faggioli, Jim Fehlig, Marek Marczykowski

Op maandag 8 april 2013 06:22:49 schreef Marek Marczykowski:
> On 05.04.2013 13:13, George Dunlap wrote:
> > On Fri, Apr 5, 2013 at 9:21 AM, Dario Faggioli
> > 
> > <dario.faggioli@citrix.com> wrote:
> >> On gio, 2013-04-04 at 21:49 +0200, AL13N wrote:
> >>> i realise this is completely off-topic, but if someone on this list has
> >>> some knowledge on this, see:
> >>> 
> >>> https://www.redhat.com/archives/libvir-list/2013-April/msg00189.html
> >>> 
> >>> the issue is that shutting down xen domains, segfaults libvirtd... which
> >>> is
> >>> annoying
> >> 
> >> I don't have any clue on this... But, perhaps, Jim does (Cc-ing him)?
> > 
> > The e-mail that AL13N linked to was actually from Jim, saying he would
> > be really busy for a while and unable to look at it.
> > 
> > The question isn't off-topic, as libxl and xend have to work closely
> > with libvirt.  Unfortunately, I don't think any of the active
> > developers on this list has much familiarity with libvirt.  It Would
> > Be Good if someone could step up and learn, but with our feature
> > freeze next week, we're also kind of heads-down getting stuff
> > implemented...
> 
> I believe it is already fixed in unstable by this commit:
> 5f5ef65babc2ca15f43b775c4b47b0102fa2a632 "libxl: fix stale timeout event
> callback race"
> 
> Sadly backport to 4.2 isn't trivial.

Looking at the patch i totally agree that it isn't trivial...

i'll hold off for now. and will work more towards libvirt integration for 
Mageia 4 (our release freeze is now in effect for Mageia 3).

thanks for all the help!

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-08  6:26       ` AL13N
@ 2013-04-08 15:36         ` Jim Fehlig
  2013-04-08 17:22           ` Stefano Stabellini
  2013-04-08 17:57           ` AL13N
  2013-04-08 15:46         ` Jim Fehlig
  2013-04-08 16:09         ` Ian Jackson
  2 siblings, 2 replies; 20+ messages in thread
From: Jim Fehlig @ 2013-04-08 15:36 UTC (permalink / raw)
  To: AL13N
  Cc: George Dunlap, Dario Faggioli, Bamvor Jian Zhang,
	Marek Marczykowski, xen-devel

AL13N wrote:
> Op maandag 8 april 2013 06:22:49 schreef Marek Marczykowski:
>   
>> On 05.04.2013 13:13, George Dunlap wrote:
>>     
>>> On Fri, Apr 5, 2013 at 9:21 AM, Dario Faggioli
>>>
>>> <dario.faggioli@citrix.com> wrote:
>>>       
>>>> On gio, 2013-04-04 at 21:49 +0200, AL13N wrote:
>>>>         
>>>>> i realise this is completely off-topic, but if someone on this list has
>>>>> some knowledge on this, see:
>>>>>
>>>>> https://www.redhat.com/archives/libvir-list/2013-April/msg00189.html
>>>>>
>>>>> the issue is that shutting down xen domains, segfaults libvirtd... which
>>>>> is
>>>>> annoying
>>>>>           
>>>> I don't have any clue on this... But, perhaps, Jim does (Cc-ing him)?
>>>>         
>>> The e-mail that AL13N linked to was actually from Jim, saying he would
>>> be really busy for a while and unable to look at it.
>>>
>>> The question isn't off-topic, as libxl and xend have to work closely
>>> with libvirt.  Unfortunately, I don't think any of the active
>>> developers on this list has much familiarity with libvirt.  It Would
>>> Be Good if someone could step up and learn, but with our feature
>>> freeze next week, we're also kind of heads-down getting stuff
>>> implemented...
>>>       
>> I believe it is already fixed in unstable by this commit:
>> 5f5ef65babc2ca15f43b775c4b47b0102fa2a632 "libxl: fix stale timeout event
>> callback race"
>>
>> Sadly backport to 4.2 isn't trivial.
>>     
>
> Looking at the patch i totally agree that it isn't trivial...
>   

It will certainly help, but I've heard reports there are still problems
even with that patch.  IIRC, Bamvor has seen a similar segfault using
git master of libvirt and xen-unstable, although it is a bit harder to
trigger.

I think we need to rework the code for handling shutdown events.  The
current code worked with libxl in Xen 4.1, but has proven to be racy
with libxl in Xen 4.2.  I plan to work on this, but unfortunately not
for a few weeks.  I'm busy with another project this week and will be
traveling the week of April 15.

> i'll hold off for now. and will work more towards libvirt integration for 
> Mageia 4 (our release freeze is now in effect for Mageia 3).
>   

FYI, although it is deprecated, the xm/xend toolstack works well with
Xen 4.2, and the legacy libvirt xen driver is quite stable.  It was the
first hypervisor driver in libvirt  :).

Regards,
Jim

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-08  6:26       ` AL13N
  2013-04-08 15:36         ` Jim Fehlig
@ 2013-04-08 15:46         ` Jim Fehlig
  2013-04-08 16:09         ` Ian Jackson
  2 siblings, 0 replies; 20+ messages in thread
From: Jim Fehlig @ 2013-04-08 15:46 UTC (permalink / raw)
  To: AL13N
  Cc: George Dunlap, Dario Faggioli, Bamvor Jian Zhang,
	Marek Marczykowski, xen-devel

AL13N wrote:
> Op maandag 8 april 2013 06:22:49 schreef Marek Marczykowski:
>   
>> On 05.04.2013 13:13, George Dunlap wrote:
>>     
>>> On Fri, Apr 5, 2013 at 9:21 AM, Dario Faggioli
>>>
>>> <dario.faggioli@citrix.com> wrote:
>>>       
>>>> On gio, 2013-04-04 at 21:49 +0200, AL13N wrote:
>>>>         
>>>>> i realise this is completely off-topic, but if someone on this list has
>>>>> some knowledge on this, see:
>>>>>
>>>>> https://www.redhat.com/archives/libvir-list/2013-April/msg00189.html
>>>>>
>>>>> the issue is that shutting down xen domains, segfaults libvirtd... which
>>>>> is
>>>>> annoying
>>>>>           
>>>> I don't have any clue on this... But, perhaps, Jim does (Cc-ing him)?
>>>>         
>>> The e-mail that AL13N linked to was actually from Jim, saying he would
>>> be really busy for a while and unable to look at it.
>>>
>>> The question isn't off-topic, as libxl and xend have to work closely
>>> with libvirt.  Unfortunately, I don't think any of the active
>>> developers on this list has much familiarity with libvirt.  It Would
>>> Be Good if someone could step up and learn, but with our feature
>>> freeze next week, we're also kind of heads-down getting stuff
>>> implemented...
>>>       
>> I believe it is already fixed in unstable by this commit:
>> 5f5ef65babc2ca15f43b775c4b47b0102fa2a632 "libxl: fix stale timeout event
>> callback race"
>>
>> Sadly backport to 4.2 isn't trivial.
>>     
>
> Looking at the patch i totally agree that it isn't trivial...
>   

I forgot to mention, that commit plus bc7e8a2a have been backported to
our openSUSE Xen 4.2 packages

https://build.opensuse.org/package/show?package=xen&project=Virtualization

See 26468-libxl-race.patch and 26469-libxl-race.patch.

Regards,
Jim

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-08  6:26       ` AL13N
  2013-04-08 15:36         ` Jim Fehlig
  2013-04-08 15:46         ` Jim Fehlig
@ 2013-04-08 16:09         ` Ian Jackson
  2013-04-08 17:53           ` AL13N
  2 siblings, 1 reply; 20+ messages in thread
From: Ian Jackson @ 2013-04-08 16:09 UTC (permalink / raw)
  To: AL13N
  Cc: George Dunlap, Marek Marczykowski, Dario Faggioli, Jim Fehlig, xen-devel

AL13N writes ("Re: [Xen-devel] OT: xen libvirt issue"):
> Op maandag 8 april 2013 06:22:49 schreef Marek Marczykowski:
> > I believe it is already fixed in unstable by this commit:
> > 5f5ef65babc2ca15f43b775c4b47b0102fa2a632 "libxl: fix stale timeout event
> > callback race"
> > 
> > Sadly backport to 4.2 isn't trivial.
> 
> Looking at the patch i totally agree that it isn't trivial...
> 
> i'll hold off for now. and will work more towards libvirt integration for 
> Mageia 4 (our release freeze is now in effect for Mageia 3).

We have already done the backport.  The fix for this has been in Xen
upstream 4.2-staging since the 22nd of February.  You want these two
commits from xen.git:

  commit a87ef897295ec17788e41e9a8f4c0ada7a5a45f8
  Author: Ian Jackson <ian.jackson@eu.citrix.com>
  Date:   Wed Jan 23 16:53:11 2013 +0000

      libxl: fix stale fd event callback race

  commit 6f0f339dd4378d062a211969f45cd23af12bf386
  Author: Ian Jackson <ian.jackson@eu.citrix.com>
  Date:   Wed Jan 23 16:53:11 2013 +0000

      libxl: fix stale timeout event callback race

I don't know if that's any help for your Mageia release, of course.

Regards,
Ian.

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-08 15:36         ` Jim Fehlig
@ 2013-04-08 17:22           ` Stefano Stabellini
  2013-04-08 17:57           ` AL13N
  1 sibling, 0 replies; 20+ messages in thread
From: Stefano Stabellini @ 2013-04-08 17:22 UTC (permalink / raw)
  To: Jim Fehlig
  Cc: George Dunlap, Dario Faggioli, Marek Marczykowski, xen-devel,
	AL13N, Bamvor Jian Zhang

On Mon, 8 Apr 2013, Jim Fehlig wrote:
> AL13N wrote:
> > Op maandag 8 april 2013 06:22:49 schreef Marek Marczykowski:
> >   
> >> On 05.04.2013 13:13, George Dunlap wrote:
> >>     
> >>> On Fri, Apr 5, 2013 at 9:21 AM, Dario Faggioli
> >>>
> >>> <dario.faggioli@citrix.com> wrote:
> >>>       
> >>>> On gio, 2013-04-04 at 21:49 +0200, AL13N wrote:
> >>>>         
> >>>>> i realise this is completely off-topic, but if someone on this list has
> >>>>> some knowledge on this, see:
> >>>>>
> >>>>> https://www.redhat.com/archives/libvir-list/2013-April/msg00189.html
> >>>>>
> >>>>> the issue is that shutting down xen domains, segfaults libvirtd... which
> >>>>> is
> >>>>> annoying
> >>>>>           
> >>>> I don't have any clue on this... But, perhaps, Jim does (Cc-ing him)?
> >>>>         
> >>> The e-mail that AL13N linked to was actually from Jim, saying he would
> >>> be really busy for a while and unable to look at it.
> >>>
> >>> The question isn't off-topic, as libxl and xend have to work closely
> >>> with libvirt.  Unfortunately, I don't think any of the active
> >>> developers on this list has much familiarity with libvirt.  It Would
> >>> Be Good if someone could step up and learn, but with our feature
> >>> freeze next week, we're also kind of heads-down getting stuff
> >>> implemented...
> >>>       
> >> I believe it is already fixed in unstable by this commit:
> >> 5f5ef65babc2ca15f43b775c4b47b0102fa2a632 "libxl: fix stale timeout event
> >> callback race"
> >>
> >> Sadly backport to 4.2 isn't trivial.
> >>     
> >
> > Looking at the patch i totally agree that it isn't trivial...
> >   
> 
> It will certainly help, but I've heard reports there are still problems
> even with that patch.  IIRC, Bamvor has seen a similar segfault using
> git master of libvirt and xen-unstable, although it is a bit harder to
> trigger.
 
Do you have a link to a bug report somewhere?


> I think we need to rework the code for handling shutdown events.  The
> current code worked with libxl in Xen 4.1, but has proven to be racy
> with libxl in Xen 4.2.  I plan to work on this, but unfortunately not
> for a few weeks.  I'm busy with another project this week and will be
> traveling the week of April 15.

I realize that it actually takes time but it would be great if you could
write down in a bit more details the proposed fix, in case somebody else
volunteers to fix the issue in the meantime.


> > i'll hold off for now. and will work more towards libvirt integration for 
> > Mageia 4 (our release freeze is now in effect for Mageia 3).
> >   
> 
> FYI, although it is deprecated, the xm/xend toolstack works well with
> Xen 4.2, and the legacy libvirt xen driver is quite stable.  It was the
> first hypervisor driver in libvirt  :).

The problem is that xend doesn't support upstream QEMU as a disk
backend, and the status of blktap in most distros is pretty poor.

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-08 16:09         ` Ian Jackson
@ 2013-04-08 17:53           ` AL13N
  2013-04-09 12:39             ` Ian Jackson
  0 siblings, 1 reply; 20+ messages in thread
From: AL13N @ 2013-04-08 17:53 UTC (permalink / raw)
  To: xen-devel
  Cc: George Dunlap, Dario Faggioli, Ian Jackson, Marek Marczykowski,
	Jim Fehlig

Op maandag 8 april 2013 17:09:52 schreef Ian Jackson:
> AL13N writes ("Re: [Xen-devel] OT: xen libvirt issue"):
> > Op maandag 8 april 2013 06:22:49 schreef Marek Marczykowski:
> > > I believe it is already fixed in unstable by this commit:
> > > 5f5ef65babc2ca15f43b775c4b47b0102fa2a632 "libxl: fix stale timeout event
> > > callback race"
> > > 
> > > Sadly backport to 4.2 isn't trivial.
> > 
> > Looking at the patch i totally agree that it isn't trivial...
> > 
> > i'll hold off for now. and will work more towards libvirt integration for
> > Mageia 4 (our release freeze is now in effect for Mageia 3).
> 
> We have already done the backport.  The fix for this has been in Xen
> upstream 4.2-staging since the 22nd of February.  You want these two
> commits from xen.git:
> 
>   commit a87ef897295ec17788e41e9a8f4c0ada7a5a45f8
>   Author: Ian Jackson <ian.jackson@eu.citrix.com>
>   Date:   Wed Jan 23 16:53:11 2013 +0000
> 
>       libxl: fix stale fd event callback race
> 
>   commit 6f0f339dd4378d062a211969f45cd23af12bf386
>   Author: Ian Jackson <ian.jackson@eu.citrix.com>
>   Date:   Wed Jan 23 16:53:11 2013 +0000
> 
>       libxl: fix stale timeout event callback race
> 
> I don't know if that's any help for your Mageia release, of course.

i was more worried about the comments in the patch, ie: that it changes how 
tools using this (libvirt maybe) would need to be recoded.

but since it's backported, i might as well try them

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-08 15:36         ` Jim Fehlig
  2013-04-08 17:22           ` Stefano Stabellini
@ 2013-04-08 17:57           ` AL13N
  1 sibling, 0 replies; 20+ messages in thread
From: AL13N @ 2013-04-08 17:57 UTC (permalink / raw)
  To: xen-devel
  Cc: George Dunlap, Marek Marczykowski, Jim Fehlig, Dario Faggioli,
	Bamvor Jian Zhang

Op maandag 8 april 2013 09:36:15 schreef Jim Fehlig:
> AL13N wrote:
> > Op maandag 8 april 2013 06:22:49 schreef Marek Marczykowski:
> >> On 05.04.2013 13:13, George Dunlap wrote:
> >>> On Fri, Apr 5, 2013 at 9:21 AM, Dario Faggioli
> >>> 
> >>> <dario.faggioli@citrix.com> wrote:
> >>>> On gio, 2013-04-04 at 21:49 +0200, AL13N wrote:
> >>>>> i realise this is completely off-topic, but if someone on this list
> >>>>> has
> >>>>> some knowledge on this, see:
> >>>>> 
> >>>>> https://www.redhat.com/archives/libvir-list/2013-April/msg00189.html
> >>>>> 
> >>>>> the issue is that shutting down xen domains, segfaults libvirtd...
> >>>>> which
> >>>>> is
> >>>>> annoying
> >>>> 
> >>>> I don't have any clue on this... But, perhaps, Jim does (Cc-ing him)?
> >>> 
> >>> The e-mail that AL13N linked to was actually from Jim, saying he would
> >>> be really busy for a while and unable to look at it.
> >>> 
> >>> The question isn't off-topic, as libxl and xend have to work closely
> >>> with libvirt.  Unfortunately, I don't think any of the active
> >>> developers on this list has much familiarity with libvirt.  It Would
> >>> Be Good if someone could step up and learn, but with our feature
> >>> freeze next week, we're also kind of heads-down getting stuff
> >>> implemented...
> >> 
> >> I believe it is already fixed in unstable by this commit:
> >> 5f5ef65babc2ca15f43b775c4b47b0102fa2a632 "libxl: fix stale timeout event
> >> callback race"
> >> 
> >> Sadly backport to 4.2 isn't trivial.
> > 
> > Looking at the patch i totally agree that it isn't trivial...
> 
> It will certainly help, but I've heard reports there are still problems
> even with that patch.  IIRC, Bamvor has seen a similar segfault using
> git master of libvirt and xen-unstable, although it is a bit harder to
> trigger.
> 
> I think we need to rework the code for handling shutdown events.  The
> current code worked with libxl in Xen 4.1, but has proven to be racy
> with libxl in Xen 4.2.  I plan to work on this, but unfortunately not
> for a few weeks.  I'm busy with another project this week and will be
> traveling the week of April 15.
> 
> > i'll hold off for now. and will work more towards libvirt integration for
> > Mageia 4 (our release freeze is now in effect for Mageia 3).
> 
> FYI, although it is deprecated, the xm/xend toolstack works well with
> Xen 4.2, and the legacy libvirt xen driver is quite stable.  It was the
> first hypervisor driver in libvirt  :).

i'm aware of that, and kudos to you for this...

It's just that xm/xend is deprecated as you say... and xl looks quite nice... 
:-)

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-08 17:53           ` AL13N
@ 2013-04-09 12:39             ` Ian Jackson
  2013-04-09 12:52               ` AL13N
  2013-04-09 14:15               ` Jim Fehlig
  0 siblings, 2 replies; 20+ messages in thread
From: Ian Jackson @ 2013-04-09 12:39 UTC (permalink / raw)
  To: AL13N
  Cc: George Dunlap, Dario Faggioli, Jim Fehlig, Marek Marczykowski, xen-devel

AL13N writes ("Re: [Xen-devel] OT: xen libvirt issue"):
> i was more worried about the comments in the patch, ie: that it
> changes how tools using this (libvirt maybe) would need to be
> recoded.

Yes, in order to fully fix these races there are a number of libvirt
patches needed as well.  I don't know exactly which libvirt trees
these are in but the libvirt fixes are pure fixes which won't break
anything that's not already broken.

It is also the case that in theory the libxl fixes won't break
anything that's not already broken.  However, it turns out that some
versions of libvirt were already broken: at least some versions of
libvirt's libxl bindings had a bug in its timeout calculation code
which is triggered by timeout_modify(...{0,0}...), and the libxl patch
"libxl: fix stale timeout event callback race" exposes that bug which
was previously latent.

Ian.

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-09 12:39             ` Ian Jackson
@ 2013-04-09 12:52               ` AL13N
  2013-04-09 14:41                 ` Jim Fehlig
  2013-04-09 14:15               ` Jim Fehlig
  1 sibling, 1 reply; 20+ messages in thread
From: AL13N @ 2013-04-09 12:52 UTC (permalink / raw)
  To: xen-devel
  Cc: George Dunlap, Dario Faggioli, Ian Jackson, Marek Marczykowski,
	Jim Fehlig, AL13N

> AL13N writes ("Re: [Xen-devel] OT: xen libvirt issue"):
>> i was more worried about the comments in the patch, ie: that it
>> changes how tools using this (libvirt maybe) would need to be
>> recoded.
>
> Yes, in order to fully fix these races there are a number of libvirt
> patches needed as well.  I don't know exactly which libvirt trees
> these are in but the libvirt fixes are pure fixes which won't break
> anything that's not already broken.
>
> It is also the case that in theory the libxl fixes won't break
> anything that's not already broken.  However, it turns out that some
> versions of libvirt were already broken: at least some versions of
> libvirt's libxl bindings had a bug in its timeout calculation code
> which is triggered by timeout_modify(...{0,0}...), and the libxl patch
> "libxl: fix stale timeout event callback race" exposes that bug which
> was previously latent.

FYI, applying these patches for me fixed the problem completely, i wasn't
able to segfault libvirtd anymore. i asked to have pass our
release-freeze. we'll see what they decide.

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-09 12:39             ` Ian Jackson
  2013-04-09 12:52               ` AL13N
@ 2013-04-09 14:15               ` Jim Fehlig
  2013-04-09 14:18                 ` Ian Jackson
  2013-04-09 21:48                 ` AL13N
  1 sibling, 2 replies; 20+ messages in thread
From: Jim Fehlig @ 2013-04-09 14:15 UTC (permalink / raw)
  To: Ian Jackson
  Cc: George Dunlap, Marek Marczykowski, Dario Faggioli, AL13N, xen-devel

Ian Jackson wrote:
> AL13N writes ("Re: [Xen-devel] OT: xen libvirt issue"):
>   
>> i was more worried about the comments in the patch, ie: that it
>> changes how tools using this (libvirt maybe) would need to be
>> recoded.
>>     
>
> Yes, in order to fully fix these races there are a number of libvirt
> patches needed as well.  I don't know exactly which libvirt trees
> these are in but the libvirt fixes are pure fixes which won't break
> anything that's not already broken.
>   

libvirt >= 1.0.2 contains all of the related fixes.

Jim

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-09 14:15               ` Jim Fehlig
@ 2013-04-09 14:18                 ` Ian Jackson
  2013-04-09 21:48                 ` AL13N
  1 sibling, 0 replies; 20+ messages in thread
From: Ian Jackson @ 2013-04-09 14:18 UTC (permalink / raw)
  To: Jim Fehlig
  Cc: George Dunlap, Marek Marczykowski, Dario Faggioli, AL13N, xen-devel

Jim Fehlig writes ("Re: [Xen-devel] OT: xen libvirt issue"):
> Ian Jackson wrote:
> > Yes, in order to fully fix these races there are a number of libvirt
> > patches needed as well.  I don't know exactly which libvirt trees
> > these are in but the libvirt fixes are pure fixes which won't break
> > anything that's not already broken.
> 
> libvirt >= 1.0.2 contains all of the related fixes.

Great, thanks for that information.

Ian.

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-09 12:52               ` AL13N
@ 2013-04-09 14:41                 ` Jim Fehlig
  2013-04-09 21:50                   ` AL13N
  0 siblings, 1 reply; 20+ messages in thread
From: Jim Fehlig @ 2013-04-09 14:41 UTC (permalink / raw)
  To: AL13N
  Cc: George Dunlap, Dario Faggioli, Ian Jackson, Marek Marczykowski,
	xen-devel

AL13N wrote:
>> AL13N writes ("Re: [Xen-devel] OT: xen libvirt issue"):
>>     
>>> i was more worried about the comments in the patch, ie: that it
>>> changes how tools using this (libvirt maybe) would need to be
>>> recoded.
>>>       
>> Yes, in order to fully fix these races there are a number of libvirt
>> patches needed as well.  I don't know exactly which libvirt trees
>> these are in but the libvirt fixes are pure fixes which won't break
>> anything that's not already broken.
>>
>> It is also the case that in theory the libxl fixes won't break
>> anything that's not already broken.  However, it turns out that some
>> versions of libvirt were already broken: at least some versions of
>> libvirt's libxl bindings had a bug in its timeout calculation code
>> which is triggered by timeout_modify(...{0,0}...), and the libxl patch
>> "libxl: fix stale timeout event callback race" exposes that bug which
>> was previously latent.
>>     
>
> FYI, applying these patches for me fixed the problem completely, i wasn't
> able to segfault libvirtd anymore.

I don't think you tried hard enough :).  But glad it is working for
you!  I still plan to improve shutdown event handling when I have some
time to work on the libxl driver.

Jim

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-09 14:15               ` Jim Fehlig
  2013-04-09 14:18                 ` Ian Jackson
@ 2013-04-09 21:48                 ` AL13N
  1 sibling, 0 replies; 20+ messages in thread
From: AL13N @ 2013-04-09 21:48 UTC (permalink / raw)
  To: xen-devel
  Cc: George Dunlap, Dario Faggioli, Jim Fehlig, Ian Jackson,
	Marek Marczykowski

Op dinsdag 9 april 2013 08:15:26 schreef Jim Fehlig:
> Ian Jackson wrote:
> > AL13N writes ("Re: [Xen-devel] OT: xen libvirt issue"):
> >> i was more worried about the comments in the patch, ie: that it
> >> changes how tools using this (libvirt maybe) would need to be
> >> recoded.
> > 
> > Yes, in order to fully fix these races there are a number of libvirt
> > patches needed as well.  I don't know exactly which libvirt trees
> > these are in but the libvirt fixes are pure fixes which won't break
> > anything that's not already broken.
> 
> libvirt >= 1.0.2 contains all of the related fixes.

that's good news, cause we have libvirt 1.0.2 (with some patches) :-)

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-09 14:41                 ` Jim Fehlig
@ 2013-04-09 21:50                   ` AL13N
  2013-04-10  0:42                     ` Jim Fehlig
  0 siblings, 1 reply; 20+ messages in thread
From: AL13N @ 2013-04-09 21:50 UTC (permalink / raw)
  To: xen-devel
  Cc: George Dunlap, Jim Fehlig, Ian Jackson, Dario Faggioli,
	Marek Marczykowski

Op dinsdag 9 april 2013 08:41:56 schreef Jim Fehlig:
[...]
> I don't think you tried hard enough :).  But glad it is working for
> you!  I still plan to improve shutdown event handling when I have some
> time to work on the libxl driver.

i should say, "in that particular way" but i've restarted and shutdowned some 
domains more than 10 times in the ways i could do it before...

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-09 21:50                   ` AL13N
@ 2013-04-10  0:42                     ` Jim Fehlig
  2013-04-10 10:23                       ` AL13N
  0 siblings, 1 reply; 20+ messages in thread
From: Jim Fehlig @ 2013-04-10  0:42 UTC (permalink / raw)
  To: AL13N
  Cc: George Dunlap, Dario Faggioli, Ian Jackson, Marek Marczykowski,
	xen-devel

AL13N wrote:
> Op dinsdag 9 april 2013 08:41:56 schreef Jim Fehlig:
> [...]
>   
>> I don't think you tried hard enough :).  But glad it is working for
>> you!  I still plan to improve shutdown event handling when I have some
>> time to work on the libxl driver.
>>     
>
> i should say, "in that particular way" but i've restarted and shutdowned some 
> domains more than 10 times in the ways i could do it before...
>   

Good news.  Have you tried save/restore in a loop?  Shutdown handling
when save completes might be more susceptible to the race.

Regards,
Jim

^ permalink raw reply	[flat|nested] 20+ messages in thread

* Re: OT: xen libvirt issue
  2013-04-10  0:42                     ` Jim Fehlig
@ 2013-04-10 10:23                       ` AL13N
  0 siblings, 0 replies; 20+ messages in thread
From: AL13N @ 2013-04-10 10:23 UTC (permalink / raw)
  To: Jim Fehlig
  Cc: George Dunlap, Dario Faggioli, Ian Jackson, Marek Marczykowski,
	xen-devel, AL13N

> AL13N wrote:
>> Op dinsdag 9 april 2013 08:41:56 schreef Jim Fehlig:
>> [...]
>>
>>> I don't think you tried hard enough :).  But glad it is working for
>>> you!  I still plan to improve shutdown event handling when I have some
>>> time to work on the libxl driver.
>>>
>>
>> i should say, "in that particular way" but i've restarted and shutdowned
>> some
>> domains more than 10 times in the ways i could do it before...
>>
>
> Good news.  Have you tried save/restore in a loop?  Shutdown handling
> when save completes might be more susceptible to the race.

hmm, no i actually didn't

^ permalink raw reply	[flat|nested] 20+ messages in thread

end of thread, other threads:[~2013-04-10 10:23 UTC | newest]

Thread overview: 20+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2013-04-04 19:49 OT: xen libvirt issue AL13N
2013-04-05  8:21 ` Dario Faggioli
2013-04-05 11:13   ` George Dunlap
2013-04-08  4:22     ` Marek Marczykowski
2013-04-08  6:26       ` AL13N
2013-04-08 15:36         ` Jim Fehlig
2013-04-08 17:22           ` Stefano Stabellini
2013-04-08 17:57           ` AL13N
2013-04-08 15:46         ` Jim Fehlig
2013-04-08 16:09         ` Ian Jackson
2013-04-08 17:53           ` AL13N
2013-04-09 12:39             ` Ian Jackson
2013-04-09 12:52               ` AL13N
2013-04-09 14:41                 ` Jim Fehlig
2013-04-09 21:50                   ` AL13N
2013-04-10  0:42                     ` Jim Fehlig
2013-04-10 10:23                       ` AL13N
2013-04-09 14:15               ` Jim Fehlig
2013-04-09 14:18                 ` Ian Jackson
2013-04-09 21:48                 ` AL13N

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.