From mboxrd@z Thu Jan 1 00:00:00 1970
From: MaoXiaoyun
Subject: RE: kernel BUG at arch/x86/xen/mmu.c:1872
Date: Sun, 10 Apr 2011 21:57:10 +0800
Message-ID:
References: , ,
Mime-Version: 1.0
Content-Type: multipart/mixed; boundary="===============0049315248=="
Return-path:
In-Reply-To:
List-Unsubscribe: ,
List-Post:
List-Help:
List-Subscribe: ,
Sender: xen-devel-bounces@lists.xensource.com
Errors-To: xen-devel-bounces@lists.xensource.com
To: xen devel
Cc: jeremy@goop.org, keir@xen.org, ian.campbell@citrix.com, konrad.wilk@oracle.com, giamteckchoon@gmail.com, dave@ivt.com.au
List-Id: xen-devel@lists.xenproject.org

--===============0049315248==
Content-Type: multipart/alternative; boundary="_f86da273-4979-42c1-8df3-f2363d70d864_"

--_f86da273-4979-42c1-8df3-f2363d70d864_
Content-Type: text/plain; charset="gb2312"
Content-Transfer-Encoding: quoted-printable

Hi Konrad & Jeremy:

I think we have finally located the missing patch for this commit.

We tested commit
http://git.kernel.org/?p=linux/kernel/git/jeremy/xen.git;a=commit;h=c97f681f138039425c87f35ea46a92385d81e70e
which works.

We tested commit
http://git.kernel.org/?p=linux/kernel/git/jeremy/xen.git;a=commit;h=221c64dbf860d37f841f40893bddf8d804aa55bd
on which the server crashed.

Later I found the comments on this commit:
http://git.kernel.org/?p=linux/kernel/git/jeremy/xen.git;a=commit;h=64141da587241301ce8638cc945f8b67853156ec

So it looks like this fix is not applied in 2.6.32.36. Could you take a look at this?

Many thanks.

=====================================================
>Hi Konrad & Jeremy:
>
>I'd like to open this BUG in a new thread, since the old thread is too long to read easily.
>
>We recently wanted to upgrade our kernel to 2.6.32, but unfortunately we ran into a kernel crash bug.
>Our test case is simple: start 24 win2003 HVMs on one physical machine and have each HVM reboot
>every 15 minutes. The kernel crashes within half an hour (that is, when the VMs start for the second time).
>
>Our tests went much further.
>We tested different kernel versions:
>2.6.32.10 http://git.kernel.org/?p=linux/kernel/git/jeremy/xen.git;a=commit;h=d945b014ac5df9592c478bf9486d97e8914aab59
>2.6.32.11 http://git.kernel.org/?p=linux/kernel/git/jeremy/xen.git;a=commit;h=27f948a3bf365a5bc3d56119637a177d41147815
>2.6.32.12 http://git.kernel.org/?p=linux/kernel/git/jeremy/xen.git;a=commit;h=ba739f9abd3f659b907a824af1161926b420a2ce
>2.6.32.13 http://git.kernel.org/?p=linux/kernel/git/jeremy/xen.git;a=commit;h=f6fe6583b77a49b569eef1b66c3d761eec2e561b
>2.6.32.15 http://git.kernel.org/?p=linux/kernel/git/jeremy/xen.git;a=commit;h=27ed1b0e0dae5f1d5da5c76451bc84cb529128bd
>2.6.32.21 http://git.kernel.org/?p=linux/kernel/git/jeremy/xen.git;a=commit;h=69e50db231723596ed8ef9275d0068d6697f466a
>
>We saw basically three different results.
>
>i1) Grant table issue
>The host still functions, but xm dmesg shows abnormal log entries.
>Please refer to the attached grant table log.
>
>i2) Kernel crash at a different place
>The host dies during the test; after reboot, we see nothing abnormal in /var/log/messages.
>
>i3) kernel BUG at arch/x86/xen/mmu.c:1872
>The host dies during the test; after reboot, we see the crash log in messages. Refer to the attached log of 2.6.32.36.
>
>Summary of the test results, which fall into two groups:
>
>1) 2.6.32.10
>30 machines were involved in the test: three hit issue (i1), two hit issue (i2), and *none* hit issue (i3).
>The other machines have run the tests successfully so far, for more than 8 hours.
>
>2) 2.6.32.11 or later
>Each version was tested on 10 machines, and all machines crashed in less than half an hour.
>
>Conclusion:
>1) The grant table issue exists in all kernel versions.
>2) The kernel crash at a different place may exist in all kernel versions, but it does not happen as frequently (2 out of 30 machines).
>3) The major difference is issue i3): from the tests, it looks like it was introduced between versions
>2.6.32.10 and 2.6.32.11.
>
>Hope this helps to locate the bug.
>Many thanks.
>

--_f86da273-4979-42c1-8df3-f2363d70d864_--

--===============0049315248==
Content-Type: text/plain; charset="us-ascii"
MIME-Version: 1.0
Content-Transfer-Encoding: 7bit
Content-Disposition: inline

_______________________________________________
Xen-devel mailing list
Xen-devel@lists.xensource.com
http://lists.xensource.com/xen-devel

--===============0049315248==--