xen-devel.lists.xenproject.org archive mirror
 help / color / mirror / Atom feed
From: "Ingo Jürgensmann" <ij@2013.bluespice.org>
To: xen-devel@lists.xen.org
Subject: Re: Kernel panic on Xen virtualisation in Debian
Date: Sun, 10 Jul 2016 15:18:37 +0200	[thread overview]
Message-ID: <17804F34-7019-4595-91D5-42FC9866A4C2@2013.bluespice.org> (raw)
In-Reply-To: <133d4197-8155-1219-46af-bb51e1092245@andreas-ziegler.de>


[-- Attachment #1.1: Type: text/plain, Size: 4997 bytes --]

Am 10.07.2016 um 00:29 schrieb Andreas Ziegler <ml@andreas-ziegler.de>:

> In May, Ingo Jürgensmann also started experiencing this problem and
> blogged about it:
> https://blog.windfluechter.net/content/blog/2016/03/23/1721-xen-randomly-crashing-server
> https://blog.windfluechter.net/content/blog/2016/05/12/1723-xen-randomly-crashing-server-part-2

Actually I’m suffering from this problem since April 2013. Here’s my story… ;)

Everything was working smoothly when I was still using a rootserver at Hetzner. The setup there was some sort of non-standard, as I needed to have eth0 as outgoing interface not being part of the Xen bridge. So I used a mixture of bridge and routed in xend-config.sxp. This setup worked for years without problems.

However: as Hetzner started to bill for every single IPv4 address, I moved to my new provider where I could get the same address space (/26) without being forced to pay for every IPv4 address. The server back then was a Cisco C200 M2.

Since I got my own VLAN at the new location, I was then able to dismiss the mixed setup of routing and bridging and used only bridging with eth0 now being part of the Xen bridge. The whole setup consists of two bridges: one for the external IP addresses (xenbr0) and one for internal traffic (xenbr1). This was already that way with Hetzner.

However, shortly after I moved to the new provider, the issues started: random crashes of the host. With the new provider, who was and is still very helpful, we exchanged for example the memory. The provider reported as well that other Cisco C200 server with Ubutu LTS didn’t show this issue.

Over time a pattern showed up that might cause the frequent crashes (sometimes several times in a row, let’s say 2-10 times a day!):

My setup is this:

Debian stable with packaged Xen hypervisor and these VMs:
1) Mail, Database, Nameserver, OpenVPN
2) Webserver, Squid3
3) Login server
4) … some more servers (10 in total), e.g. Tor Relay…

IPv4 /26 network, IPv6 /48 network

From my workplace I need to login to 3) and have a tunnel to the Squid on 2) via the internal addresses on xenbr1. Of course Squid queries the nameserver on 1), so there is some internal traffic going back and forth on the internal bridge and traffic originating from the external bridge (xenbr0). Using Squid I access my Roundcube on my small homebrew server that is connected to 1) via OpenVPN. Of course the webserver on 2) queries the database on 1)

So, the most crashes do happen while I’m using the SSH tunnel from my workplace. If a crash happen, it’s most likely that at least two in a row will happen in a short time frame (within 1-2 hours), sometimes even within 10 mins after the server came back. From time to time my impression was, that the server crashes the second time instantly when I try to access my Roundcube at home.

Furthermore, I switched from using the Cisco C200 server to my own server with Supermicro X9SRi-F mainboard and a XEON E5-2630L V2, but still the same provider, and the same issue: the new server crashes the same way as the Cisco server did. With the new server we did a replacement of the memory as well: from 32G to 128G. So over time we have switched memory twice and hardware once. Since then I don’t assume anymore that this might be hardware related.

In the meantime I switched from using Squid on 2) to tinyproxy running on 2) as well as running tinyproxy on another third party VPS. Still the crashes happen, regardless of using Squid on 2) or not.

In May the server crashed again several times a week and several times a day. Really, really annoying!
So together with my provider we setup a netconsole to catch some more information about the crash than just the few lines from the IPMI console.

Trying linux-image 4.4 from backports didn’t help either. I switched from PV to PVHVM as well some months ago.

> He is pretty sure, that the problem went away after disabling IPv6.

Indeed. Since I disabled IPv6 for all of my VMs (it’s still active on dom0, but not routed to the domUs anymore) no single crash happened again.

> But: we can't say for sure, because on our server it sometimes happened
> often in a short period of time, but then it didn't for months.
> and: disabling IPv6 is no option for me at all.

I won’t state that I have an exact way of reproducing the crashes, but it happens fairly often when doing as described above.

What I can offer is:
- activate IPv6 again
- install a kernel with debugging symbols (*-dbg)
- try to provoke another crash
- send netconsole output if happened

What I cannot do:
- interpret the debug symbols
- access IPMI console from workplace (firewalled)

I’m with Andreas that disabling IPv6 cannot be an option.

--
Ciao...          //        http://blog.windfluechter.net
      Ingo     \X/     XMPP: ij@jabber.windfluechter.net

gpg pubkey:  http://www.juergensmann.de/ij_public_key.asc




[-- Attachment #1.2: Message signed with OpenPGP using GPGMail --]
[-- Type: application/pgp-signature, Size: 801 bytes --]

[-- Attachment #2: Type: text/plain, Size: 127 bytes --]

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xen.org
https://lists.xen.org/xen-devel

  reply	other threads:[~2016-07-10 13:18 UTC|newest]

Thread overview: 32+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2016-07-09 22:29 Kernel panic on Xen virtualisation in Debian Andreas Ziegler
2016-07-10 13:18 ` Ingo Jürgensmann [this message]
2016-07-18 16:44   ` Andreas Ziegler
2016-07-22  9:03     ` Ingo Jürgensmann
2016-07-22 10:21       ` Ingo Jürgensmann
2016-07-25  9:38         ` Ingo Jürgensmann
2016-07-25 10:23           ` Wei Liu
2016-07-25 10:42             ` Andreas Ziegler
2016-07-25 10:50               ` Andreas Ziegler
2016-07-25 10:51               ` Wei Liu
2016-07-25 11:27                 ` Ingo Jürgensmann
2016-07-25 12:35                   ` Sander Eikelenboom
2016-07-25 11:41             ` Ingo Jürgensmann
2016-07-25 13:13               ` Wei Liu
2016-07-29 11:45                 ` Andreas Ziegler
2016-07-29 13:01                   ` Wei Liu
2016-07-29 20:17                   ` Ingo Jürgensmann
2016-08-02  9:20                     ` Wei Liu
2016-08-02 10:30                       ` Ingo Jürgensmann
2016-08-02 12:37                         ` Wei Liu
2016-11-14 15:55                           ` Andreas Ziegler
2016-11-29  9:08                             ` Wei Liu
2016-11-29 20:20                               ` Ingo Jürgensmann
2016-12-01 13:26                                 ` Wei Liu
2016-12-01 13:59                                   ` Ingo Jürgensmann
2016-12-01 14:07                                     ` Wei Liu
2016-12-01 14:50                                     ` Sander Eikelenboom
2016-07-25 10:49           ` David Vrabel
  -- strict thread matches above, loose matches on Subject: below --
2016-07-04 18:06 Jan Prunk
2016-07-06 14:14 ` George Dunlap
2016-07-08 11:14   ` Wei Liu
2016-07-08 12:22     ` Jan Prunk

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=17804F34-7019-4595-91D5-42FC9866A4C2@2013.bluespice.org \
    --to=ij@2013.bluespice.org \
    --cc=xen-devel@lists.xen.org \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).