All of lore.kernel.org
 help / color / mirror / Atom feed
* [ath9k-devel] still same problem
@ 2009-01-04  1:26 Dmitri Seletski
  2009-01-04  1:53 ` Barry Green
  2009-01-04 10:33 ` W. van den Akker
  0 siblings, 2 replies; 28+ messages in thread
From: Dmitri Seletski @ 2009-01-04  1:26 UTC (permalink / raw)
  To: ath9k-devel

Hi guys.
After quite some time has passed since my last message, i have pulled 
wireless drivers from web site several times, still same problem,  
refresh of drivers didn't seem to affect the problem at all.
I am lost, since you don't  seem to take it seriously, but it hangs 
machine! Can someone give me an advice what I can do in order to help 
you to troubleshoot the issue?
I am not a programmer, but looks like a memory leak somewhere. If you 
give clear understandable instructions - I am glad to follow them!
Regards

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-04  1:26 [ath9k-devel] still same problem Dmitri Seletski
@ 2009-01-04  1:53 ` Barry Green
  2009-01-04  3:34   ` Luis R. Rodriguez
  2009-01-04 10:33 ` W. van den Akker
  1 sibling, 1 reply; 28+ messages in thread
From: Barry Green @ 2009-01-04  1:53 UTC (permalink / raw)
  To: ath9k-devel

Hi Dimitri,

I"m getting a similar problem to you, in that my computer is hanging 
shortly after loading ath9k and trying to set up a wireless connection. 
I've logged a bug report here:

http://bugzilla.kernel.org/show_bug.cgi?id=12110

but unfortunately, there hasn't been any progress or feedback on my 
comments since the 6th December.  I'm willing to test, but I need some 
guidance on exactly how to get some useful debugging information.

Barry.

Dmitri Seletski wrote:
> Hi guys.
> After quite some time has passed since my last message, i have pulled 
> wireless drivers from web site several times, still same problem,  
> refresh of drivers didn't seem to affect the problem at all.
> I am lost, since you don't  seem to take it seriously, but it hangs 
> machine! Can someone give me an advice what I can do in order to help 
> you to troubleshoot the issue?
> I am not a programmer, but looks like a memory leak somewhere. If you 
> give clear understandable instructions - I am glad to follow them!
> Regards
> _______________________________________________
> ath9k-devel mailing list
> ath9k-devel at lists.ath9k.org
> https://lists.ath9k.org/mailman/listinfo/ath9k-devel
> 

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-04  1:53 ` Barry Green
@ 2009-01-04  3:34   ` Luis R. Rodriguez
  2009-01-04  9:58     ` Jens Ott - Opteamax
  0 siblings, 1 reply; 28+ messages in thread
From: Luis R. Rodriguez @ 2009-01-04  3:34 UTC (permalink / raw)
  To: ath9k-devel

On Sat, Jan 3, 2009 at 5:53 PM, Barry Green <barry@lemonpi.net> wrote:
> Hi Dimitri,
>
> I"m getting a similar problem to you, in that my computer is hanging
> shortly after loading ath9k and trying to set up a wireless connection.
> I've logged a bug report here:
>
> http://bugzilla.kernel.org/show_bug.cgi?id=12110

This is for 2.6.27, the issue being reported here is for
wireless-testing. Regardless we don't have any oops messages for
either issues.

> but unfortunately, there hasn't been any progress or feedback on my
> comments since the 6th December.  I'm willing to test, but I need some
> guidance on exactly how to get some useful debugging information.

We need an oops trace of your crashes so far it seems you have not
been able to get one, lets work on that. Until then it is not easy for
us to provide you with patches to fix your issues.

Please test with the latest drivers from
wireless-testing/compat-wireless and restart in single user mode, we
want to ensure X is not running at all and you are in a virtual
terminal. We also do not want Network Manager or wpa_supplicant
running in the background. To increase the resolution upon bootup try
booting with vga=ask (vga=791 works well for me), then try to
associate to your access point without encryption manually using
iwconfig.

If there is an oops it should hopefully come up.

  Luis

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-04  3:34   ` Luis R. Rodriguez
@ 2009-01-04  9:58     ` Jens Ott - Opteamax
  2009-01-05 16:09       ` Luis R. Rodriguez
  2009-01-17 23:55       ` W. van den Akker
  0 siblings, 2 replies; 28+ messages in thread
From: Jens Ott - Opteamax @ 2009-01-04  9:58 UTC (permalink / raw)
  To: ath9k-devel

Hi,

Luis R. Rodriguez schrieb:
> On Sat, Jan 3, 2009 at 5:53 PM, Barry Green <barry@lemonpi.net> wrote:
> [...]
> We need an oops trace of your crashes so far it seems you have not
> been able to get one, lets work on that. Until then it is not easy for
> us to provide you with patches to fix your issues.

I also have similar problems with an D-LINK DWA-547 and I'd really like 
to provide the oops trace, but unfortunately the machine seems to hang 
that fast, that even no oops trace is shown. Also magic-sysreq don't 
work anymore, neither via serial console nor via keyboard. Only chance I 
have in this moment is to press reset-button, which already caused a 
full xfs-filsystem-corruption with dataloss once :-(

My problem is, that the hang happens not immediately after loading the 
driver but "somewhen" while it is loaded.

I have seen two types of crash:

1.) first investigations on my side seem to show that it happens, as 
soon as there is high dataload transfered, but I don't have detailed 
info yet.
2.) Running hostapd works fine while started first time, but no client 
is connected. When a client connects a system freeze occures 
"sometimes". Reproducably the system freezes immediately when starting 
hostapd, stop it and then restart hostapd. So every change in hostapd 
conf needs a full reboot, unloading the ath9k module and reload it does 
not prevent freeze, but after reboot I can start hostapd once without a 
problem (only the metioned "sometime" freezes on client connect or high 
data-volume).

As this machine is productive, I suspended testing for now. I will 
change the card into a non-productive environment in the next days. As 
soon as I find more details, I'll report.

> 
> Please test with the latest drivers from
> wireless-testing/compat-wireless and restart in single user mode, we
> want to ensure X is not running at all and you are in a virtual
> terminal. We also do not want Network Manager or wpa_supplicant
> running in the background. To increase the resolution upon bootup try
> booting with vga=ask (vga=791 works well for me), then try to
> associate to your access point without encryption manually using
> iwconfig.

Ok, that'll be a thing I'd need to test to, as in my case wpa_supplicant 
is running.

Best regards
Jens

> 
> If there is an oops it should hopefully come up.
> 
>   Luis
> _______________________________________________
> ath9k-devel mailing list
> ath9k-devel at lists.ath9k.org
> https://lists.ath9k.org/mailman/listinfo/ath9k-devel
> 
> !DSPAM:2,49602e56281331804284693!
> 

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-04  1:26 [ath9k-devel] still same problem Dmitri Seletski
  2009-01-04  1:53 ` Barry Green
@ 2009-01-04 10:33 ` W. van den Akker
  2009-01-04 10:47   ` W. van den Akker
  1 sibling, 1 reply; 28+ messages in thread
From: W. van den Akker @ 2009-01-04 10:33 UTC (permalink / raw)
  To: ath9k-devel

On Sunday 04 January 2009, Dmitri Seletski wrote:
> Hi guys.
> After quite some time has passed since my last message, i have pulled
> wireless drivers from web site several times, still same problem,
> refresh of drivers didn't seem to affect the problem at all.
> I am lost, since you don't  seem to take it seriously, but it hangs
> machine! Can someone give me an advice what I can do in order to help
> you to troubleshoot the issue?
> I am not a programmer, but looks like a memory leak somewhere. If you
> give clear understandable instructions - I am glad to follow them!
> Regards

I am facing the same problems. Kernel hangups without any oops or log-trace.
Sometimes it freezes directly after hostapd (0.6.6) is loading. But once 
running hostapd I can connect with one station to the AP (AR5416). A second
station can be added but sometimes it causes a hangup.

For what I can see the problems may also have to do with the total number of
sessions to the AP or the number of sessions setup in a short time.

If I for example start my notebook and connect to the AP all is (mostly) ok.
I can start a browser. But if I start Kmail with 6 mailboxes it hangs within a 
second.

I am desperate looking for a solution.....


-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-04 10:33 ` W. van den Akker
@ 2009-01-04 10:47   ` W. van den Akker
  0 siblings, 0 replies; 28+ messages in thread
From: W. van den Akker @ 2009-01-04 10:47 UTC (permalink / raw)
  To: ath9k-devel

On Sunday 04 January 2009, Dmitri Seletski wrote:
> Hi guys.
> After quite some time has passed since my last message, i have pulled
> wireless drivers from web site several times, still same problem,
> refresh of drivers didn't seem to affect the problem at all.
> I am lost, since you don't  seem to take it seriously, but it hangs
> machine! Can someone give me an advice what I can do in order to help
> you to troubleshoot the issue?
> I am not a programmer, but looks like a memory leak somewhere. If you
> give clear understandable instructions - I am glad to follow them!
> Regards

I am facing the same problems. Kernel hangups without any oops or
log-trace. Sometimes it freezes directly after hostapd (0.6.6) is loading.
But once running hostapd I can connect with one station to the AP (AR5416).
A second station can be added but sometimes it causes a hangup.

For what I can see the problems may also have to do with the total number
of sessions to the AP or the number of sessions setup in a short time.

If I for example start my notebook and connect to the AP all is (mostly)
ok. I can start a browser. But if I start Kmail with 6 mailboxes it hangs
within a second.

I am desperate looking for a solution.....

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-04  9:58     ` Jens Ott - Opteamax
@ 2009-01-05 16:09       ` Luis R. Rodriguez
  2009-01-17 23:55       ` W. van den Akker
  1 sibling, 0 replies; 28+ messages in thread
From: Luis R. Rodriguez @ 2009-01-05 16:09 UTC (permalink / raw)
  To: ath9k-devel

On Sun, Jan 4, 2009 at 1:58 AM, Jens Ott - Opteamax <jo@opteamax.de> wrote:
> Hi,
>
> Luis R. Rodriguez schrieb:
>>
>> On Sat, Jan 3, 2009 at 5:53 PM, Barry Green <barry@lemonpi.net> wrote:
>> [...]
>> We need an oops trace of your crashes so far it seems you have not
>> been able to get one, lets work on that. Until then it is not easy for
>> us to provide you with patches to fix your issues.
>
> I also have similar problems with an D-LINK DWA-547 and I'd really like to
> provide the oops trace, but unfortunately the machine seems to hang that
> fast, that even no oops trace is shown. Also magic-sysreq don't work
> anymore, neither via serial console nor via keyboard. Only chance I have in
> this moment is to press reset-button, which already caused a full
> xfs-filsystem-corruption with dataloss once :-(

What kernel are you using? Please note that 2.6.27 had some DMA issues
which were fixed since 2.6.27.8. So if you are using an older kenrel
than 2.6.27.8 you need to upgrade.

If you are using compat-wireless drivers then that's the latest and greatest.

> My problem is, that the hang happens not immediately after loading the
> driver but "somewhen" while it is loaded.
>
> I have seen two types of crash:
>
> 1.) first investigations on my side seem to show that it happens, as soon as
> there is high dataload transfered, but I don't have detailed info yet.
> 2.) Running hostapd works fine while started first time, but no client is
> connected. When a client connects a system freeze occures "sometimes".
> Reproducably the system freezes immediately when starting hostapd, stop it
> and then restart hostapd. So every change in hostapd conf needs a full
> reboot, unloading the ath9k module and reload it does not prevent freeze,
> but after reboot I can start hostapd once without a problem (only the
> metioned "sometime" freezes on client connect or high data-volume).
>
> As this machine is productive, I suspended testing for now. I will change
> the card into a non-productive environment in the next days. As soon as I
> find more details, I'll report.

I'll reply to the thread with some more suggestions as to what can be
done to help us debug this issue in another e-mail, please see that.
You may be able to help as you can reproduce these issues easily it
seems.

  Luis

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-04  9:58     ` Jens Ott - Opteamax
  2009-01-05 16:09       ` Luis R. Rodriguez
@ 2009-01-17 23:55       ` W. van den Akker
  2009-01-18  1:41         ` Jens Ott - Opteamax
  2009-01-18  5:17         ` Dmitri Seletski
  1 sibling, 2 replies; 28+ messages in thread
From: W. van den Akker @ 2009-01-17 23:55 UTC (permalink / raw)
  To: ath9k-devel

On Sunday 04 January 2009 10:58:02 Jens Ott - Opteamax wrote:
> Hi,
>
> Luis R. Rodriguez schrieb:
> > On Sat, Jan 3, 2009 at 5:53 PM, Barry Green <barry@lemonpi.net> wrote:
> > [...]
> > We need an oops trace of your crashes so far it seems you have not
> > been able to get one, lets work on that. Until then it is not easy for
> > us to provide you with patches to fix your issues.
>
> I also have similar problems with an D-LINK DWA-547 and I'd really like
> to provide the oops trace, but unfortunately the machine seems to hang
> that fast, that even no oops trace is shown. Also magic-sysreq don't
> work anymore, neither via serial console nor via keyboard. Only chance I
> have in this moment is to press reset-button, which already caused a
> full xfs-filsystem-corruption with dataloss once :-(
>
> My problem is, that the hang happens not immediately after loading the
> driver but "somewhen" while it is loaded.
>
> I have seen two types of crash:
>
> 1.) first investigations on my side seem to show that it happens, as
> soon as there is high dataload transfered, but I don't have detailed
> info yet.
> 2.) Running hostapd works fine while started first time, but no client
> is connected. When a client connects a system freeze occures
> "sometimes". Reproducably the system freezes immediately when starting
> hostapd, stop it and then restart hostapd. So every change in hostapd
> conf needs a full reboot, unloading the ath9k module and reload it does
> not prevent freeze, but after reboot I can start hostapd once without a
> problem (only the metioned "sometime" freezes on client connect or high
> data-volume).
>
> As this machine is productive, I suspended testing for now. I will
> change the card into a non-productive environment in the next days. As
> soon as I find more details, I'll report.
>
> > Please test with the latest drivers from
> > wireless-testing/compat-wireless and restart in single user mode, we
> > want to ensure X is not running at all and you are in a virtual
> > terminal. We also do not want Network Manager or wpa_supplicant
> > running in the background. To increase the resolution upon bootup try
> > booting with vga=ask (vga=791 works well for me), then try to
> > associate to your access point without encryption manually using
> > iwconfig.
>
> Ok, that'll be a thing I'd need to test to, as in my case wpa_supplicant
> is running.
>

Hi,

What is the status of your problem? I still have problems with it and have no
progress...

gr,
Willem

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-17 23:55       ` W. van den Akker
@ 2009-01-18  1:41         ` Jens Ott - Opteamax
  2009-01-18  9:14           ` W. van den Akker
  2009-01-18  5:17         ` Dmitri Seletski
  1 sibling, 1 reply; 28+ messages in thread
From: Jens Ott - Opteamax @ 2009-01-18  1:41 UTC (permalink / raw)
  To: ath9k-devel

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hi,

I did not find the time to further investigate this issue. Well what I
did today was compile the latest git-update and the latest hostapd. At
least the machine does not crash that often, but with 802.11n I can't
connect to the card ... but this might also be a problem with the card
in my laptop. I hope I'll find the time to go on testing next week.

Best regards
Jens

W. van den Akker schrieb:
> On Sunday 04 January 2009 10:58:02 Jens Ott - Opteamax wrote:
>> Hi,
>>
>> Luis R. Rodriguez schrieb:
>>> On Sat, Jan 3, 2009 at 5:53 PM, Barry Green <barry@lemonpi.net> wrote:
>>> [...]
>>> We need an oops trace of your crashes so far it seems you have not
>>> been able to get one, lets work on that. Until then it is not easy for
>>> us to provide you with patches to fix your issues.
>> I also have similar problems with an D-LINK DWA-547 and I'd really like
>> to provide the oops trace, but unfortunately the machine seems to hang
>> that fast, that even no oops trace is shown. Also magic-sysreq don't
>> work anymore, neither via serial console nor via keyboard. Only chance I
>> have in this moment is to press reset-button, which already caused a
>> full xfs-filsystem-corruption with dataloss once :-(
>>
>> My problem is, that the hang happens not immediately after loading the
>> driver but "somewhen" while it is loaded.
>>
>> I have seen two types of crash:
>>
>> 1.) first investigations on my side seem to show that it happens, as
>> soon as there is high dataload transfered, but I don't have detailed
>> info yet.
>> 2.) Running hostapd works fine while started first time, but no client
>> is connected. When a client connects a system freeze occures
>> "sometimes". Reproducably the system freezes immediately when starting
>> hostapd, stop it and then restart hostapd. So every change in hostapd
>> conf needs a full reboot, unloading the ath9k module and reload it does
>> not prevent freeze, but after reboot I can start hostapd once without a
>> problem (only the metioned "sometime" freezes on client connect or high
>> data-volume).
>>
>> As this machine is productive, I suspended testing for now. I will
>> change the card into a non-productive environment in the next days. As
>> soon as I find more details, I'll report.
>>
>>> Please test with the latest drivers from
>>> wireless-testing/compat-wireless and restart in single user mode, we
>>> want to ensure X is not running at all and you are in a virtual
>>> terminal. We also do not want Network Manager or wpa_supplicant
>>> running in the background. To increase the resolution upon bootup try
>>> booting with vga=ask (vga=791 works well for me), then try to
>>> associate to your access point without encryption manually using
>>> iwconfig.
>> Ok, that'll be a thing I'd need to test to, as in my case wpa_supplicant
>> is running.
>>
> 
> Hi,
> 
> What is the status of your problem? I still have problems with it and have no
> progress...
> 
> gr,
> Willem
> 

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.9 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org

iEYEARECAAYFAklyiMoACgkQMf0yjMLKfXpcewCfWW65ZXnBL2tuuuBYAGWLlb+g
u7YAn1TrkV80Czyy1GTKqxiLeShnE+KL
=0wWG
-----END PGP SIGNATURE-----

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-17 23:55       ` W. van den Akker
  2009-01-18  1:41         ` Jens Ott - Opteamax
@ 2009-01-18  5:17         ` Dmitri Seletski
  2009-01-18  9:34           ` W. van den Akker
  1 sibling, 1 reply; 28+ messages in thread
From: Dmitri Seletski @ 2009-01-18  5:17 UTC (permalink / raw)
  To: ath9k-devel

W. van den Akker wrote:
> On Sunday 04 January 2009 10:58:02 Jens Ott - Opteamax wrote:
>   
>> Hi,
>>
>> Luis R. Rodriguez schrieb:
>>     
>>> On Sat, Jan 3, 2009 at 5:53 PM, Barry Green <barry@lemonpi.net> wrote:
>>> [...]
>>> We need an oops trace of your crashes so far it seems you have not
>>> been able to get one, lets work on that. Until then it is not easy for
>>> us to provide you with patches to fix your issues.
>>>       
>> I also have similar problems with an D-LINK DWA-547 and I'd really like
>> to provide the oops trace, but unfortunately the machine seems to hang
>> that fast, that even no oops trace is shown. Also magic-sysreq don't
>> work anymore, neither via serial console nor via keyboard. Only chance I
>> have in this moment is to press reset-button, which already caused a
>> full xfs-filsystem-corruption with dataloss once :-(
>>
>> My problem is, that the hang happens not immediately after loading the
>> driver but "somewhen" while it is loaded.
>>
>> I have seen two types of crash:
>>
>> 1.) first investigations on my side seem to show that it happens, as
>> soon as there is high dataload transfered, but I don't have detailed
>> info yet.
>> 2.) Running hostapd works fine while started first time, but no client
>> is connected. When a client connects a system freeze occures
>> "sometimes". Reproducably the system freezes immediately when starting
>> hostapd, stop it and then restart hostapd. So every change in hostapd
>> conf needs a full reboot, unloading the ath9k module and reload it does
>> not prevent freeze, but after reboot I can start hostapd once without a
>> problem (only the metioned "sometime" freezes on client connect or high
>> data-volume).
>>
>> As this machine is productive, I suspended testing for now. I will
>> change the card into a non-productive environment in the next days. As
>> soon as I find more details, I'll report.
>>
>>     
>>> Please test with the latest drivers from
>>> wireless-testing/compat-wireless and restart in single user mode, we
>>> want to ensure X is not running at all and you are in a virtual
>>> terminal. We also do not want Network Manager or wpa_supplicant
>>> running in the background. To increase the resolution upon bootup try
>>> booting with vga=ask (vga=791 works well for me), then try to
>>> associate to your access point without encryption manually using
>>> iwconfig.
>>>       
>> Ok, that'll be a thing I'd need to test to, as in my case wpa_supplicant
>> is running.
>>
>>     
>
> Hi,
>
> What is the status of your problem? I still have problems with it and have no
> progress...
>
> gr,
> Willem
>
>   
Hello Willem.

I think I have had similar issue to yours.

Car was working for short period of time and after a while crashed computer.
I was advised to change kernel version, i updated it to 2.6.27.9 and 
problem has gone away since.
It's something to do with DMA.

Perhaps try to upgrade kernel?

Regards
Dmitri

^ permalink raw reply	[flat|nested] 28+ messages in thread

* Re: [ath9k-devel] still same problem
  2009-01-18  1:41         ` Jens Ott - Opteamax
@ 2009-01-18  9:14           ` W. van den Akker
  0 siblings, 0 replies; 28+ messages in thread
From: W. van den Akker @ 2009-01-18  9:14 UTC (permalink / raw)
  To: Jens Ott - Opteamax, linux-wireless

On Sunday 18 January 2009, you wrote:
> Hi,
>
> I did not find the time to further investigate this issue. Well what I
> did today was compile the latest git-update and the latest hostapd. At
> least the machine does not crash that often, but with 802.11n I can't
> connect to the card ... but this might also be a problem with the card
> in my laptop. I hope I'll find the time to go on testing next week.
>

I have the same problem with my laptop. Cant make a connection if 802.11n
is enabled. I have tested with hostapd 0.6.7 and I didnt notice any changes.

Let me know if you have any progress.

gr,
Willem

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.


^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-18  5:17         ` Dmitri Seletski
@ 2009-01-18  9:34           ` W. van den Akker
  2009-01-18 15:16             ` Luis R. Rodriguez
  0 siblings, 1 reply; 28+ messages in thread
From: W. van den Akker @ 2009-01-18  9:34 UTC (permalink / raw)
  To: ath9k-devel

On Sunday 18 January 2009, Dmitri Seletski wrote:
> W. van den Akker wrote:
> > On Sunday 04 January 2009 10:58:02 Jens Ott - Opteamax wrote:
> >> Hi,
> >>
> >> Luis R. Rodriguez schrieb:
> >>> On Sat, Jan 3, 2009 at 5:53 PM, Barry Green <barry@lemonpi.net> wrote:
> >>> [...]
> >>> We need an oops trace of your crashes so far it seems you have not
> >>> been able to get one, lets work on that. Until then it is not easy for
> >>> us to provide you with patches to fix your issues.
> >>
> >> I also have similar problems with an D-LINK DWA-547 and I'd really like
> >> to provide the oops trace, but unfortunately the machine seems to hang
> >> that fast, that even no oops trace is shown. Also magic-sysreq don't
> >> work anymore, neither via serial console nor via keyboard. Only chance I
> >> have in this moment is to press reset-button, which already caused a
> >> full xfs-filsystem-corruption with dataloss once :-(
> >>
> >> My problem is, that the hang happens not immediately after loading the
> >> driver but "somewhen" while it is loaded.
> >>
> >> I have seen two types of crash:
> >>
> >> 1.) first investigations on my side seem to show that it happens, as
> >> soon as there is high dataload transfered, but I don't have detailed
> >> info yet.
> >> 2.) Running hostapd works fine while started first time, but no client
> >> is connected. When a client connects a system freeze occures
> >> "sometimes". Reproducably the system freezes immediately when starting
> >> hostapd, stop it and then restart hostapd. So every change in hostapd
> >> conf needs a full reboot, unloading the ath9k module and reload it does
> >> not prevent freeze, but after reboot I can start hostapd once without a
> >> problem (only the metioned "sometime" freezes on client connect or high
> >> data-volume).
> >>
> >> As this machine is productive, I suspended testing for now. I will
> >> change the card into a non-productive environment in the next days. As
> >> soon as I find more details, I'll report.
> >>
> >>> Please test with the latest drivers from
> >>> wireless-testing/compat-wireless and restart in single user mode, we
> >>> want to ensure X is not running at all and you are in a virtual
> >>> terminal. We also do not want Network Manager or wpa_supplicant
> >>> running in the background. To increase the resolution upon bootup try
> >>> booting with vga=ask (vga=791 works well for me), then try to
> >>> associate to your access point without encryption manually using
> >>> iwconfig.
> >>
> >> Ok, that'll be a thing I'd need to test to, as in my case wpa_supplicant
> >> is running.
> >
> > Hi,
> >
> > What is the status of your problem? I still have problems with it and
> > have no progress...
> >
> > gr,
> > Willem
>
> Hello Willem.
>
> I think I have had similar issue to yours.
>
> Car was working for short period of time and after a while crashed
> computer. I was advised to change kernel version, i updated it to 2.6.27.9
> and problem has gone away since.
> It's something to do with DMA.
>
> Perhaps try to upgrade kernel?
>
> Regards
> Dmitri

I have tried all kernel versions till 2.6.28 and still have the same hang-ups.
I think indeed its a DMA-hardware-ath9k related problem. ?The patch in 
2.6.27.8 didnt work for me. I have filled a bugreport on bugzilla.kernel but 
because AP isnt supported untill 2.6.29 the report is closed.

I will test today 2.6.29.rc2 and see if it has any progress.

gr,
Willem

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-18  9:34           ` W. van den Akker
@ 2009-01-18 15:16             ` Luis R. Rodriguez
  2009-01-19  8:35               ` W. van den Akker
  2009-01-19 22:31               ` W. van den Akker
  0 siblings, 2 replies; 28+ messages in thread
From: Luis R. Rodriguez @ 2009-01-18 15:16 UTC (permalink / raw)
  To: ath9k-devel

On Sun, Jan 18, 2009 at 01:34:03AM -0800, W. van den Akker wrote:
> On Sunday 18 January 2009, Dmitri Seletski wrote:
> > W. van den Akker wrote:
> > > On Sunday 04 January 2009 10:58:02 Jens Ott - Opteamax wrote:
> > >> Hi,
> > >>
> > >> Luis R. Rodriguez schrieb:
> > >>> On Sat, Jan 3, 2009 at 5:53 PM, Barry Green <barry@lemonpi.net> wrote:
> > >>> [...]
> > >>> We need an oops trace of your crashes so far it seems you have not
> > >>> been able to get one, lets work on that. Until then it is not easy for
> > >>> us to provide you with patches to fix your issues.
> > >>
> > >> I also have similar problems with an D-LINK DWA-547 and I'd really like
> > >> to provide the oops trace, but unfortunately the machine seems to hang
> > >> that fast, that even no oops trace is shown. Also magic-sysreq don't
> > >> work anymore, neither via serial console nor via keyboard. Only chance I
> > >> have in this moment is to press reset-button, which already caused a
> > >> full xfs-filsystem-corruption with dataloss once :-(
> > >>
> > >> My problem is, that the hang happens not immediately after loading the
> > >> driver but "somewhen" while it is loaded.
> > >>
> > >> I have seen two types of crash:
> > >>
> > >> 1.) first investigations on my side seem to show that it happens, as
> > >> soon as there is high dataload transfered, but I don't have detailed
> > >> info yet.
> > >> 2.) Running hostapd works fine while started first time, but no client
> > >> is connected. When a client connects a system freeze occures
> > >> "sometimes". Reproducably the system freezes immediately when starting
> > >> hostapd, stop it and then restart hostapd. So every change in hostapd
> > >> conf needs a full reboot, unloading the ath9k module and reload it does
> > >> not prevent freeze, but after reboot I can start hostapd once without a
> > >> problem (only the metioned "sometime" freezes on client connect or high
> > >> data-volume).
> > >>
> > >> As this machine is productive, I suspended testing for now. I will
> > >> change the card into a non-productive environment in the next days. As
> > >> soon as I find more details, I'll report.
> > >>
> > >>> Please test with the latest drivers from
> > >>> wireless-testing/compat-wireless and restart in single user mode, we
> > >>> want to ensure X is not running at all and you are in a virtual
> > >>> terminal. We also do not want Network Manager or wpa_supplicant
> > >>> running in the background. To increase the resolution upon bootup try
> > >>> booting with vga=ask (vga=791 works well for me), then try to
> > >>> associate to your access point without encryption manually using
> > >>> iwconfig.
> > >>
> > >> Ok, that'll be a thing I'd need to test to, as in my case wpa_supplicant
> > >> is running.
> > >
> > > Hi,
> > >
> > > What is the status of your problem? I still have problems with it and
> > > have no progress...
> > >
> > > gr,
> > > Willem
> >
> > Hello Willem.
> >
> > I think I have had similar issue to yours.
> >
> > Car was working for short period of time and after a while crashed
> > computer. I was advised to change kernel version, i updated it to 2.6.27.9
> > and problem has gone away since.
> > It's something to do with DMA.
> >
> > Perhaps try to upgrade kernel?
> >
> > Regards
> > Dmitri
> 
> I have tried all kernel versions till 2.6.28 and still have the same hang-ups.
> I think indeed its a DMA-hardware-ath9k related problem.  The patch in
> 2.6.27.8 didnt work for me. I have filled a bugreport on bugzilla.kernel but
> because AP isnt supported untill 2.6.29 the report is closed.

A better description of this is that AP support does not work (is not allowed)
and therefore not supported on < 2.6.28.

> I will test today 2.6.29.rc2 and see if it has any progress.

You can also try wireless-testing (entire kernel) or compat-wireless (just updates
your wireless stuff) for bleeding edge stuff.

http://wireless.kernel.org/en/developers/Documentation/git-guide
http://wireless.kernel.org/en/users/Download

  Luis

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-18 15:16             ` Luis R. Rodriguez
@ 2009-01-19  8:35               ` W. van den Akker
  2009-01-19 22:31               ` W. van den Akker
  1 sibling, 0 replies; 28+ messages in thread
From: W. van den Akker @ 2009-01-19  8:35 UTC (permalink / raw)
  To: ath9k-devel

On Sunday 18 January 2009, Luis R. Rodriguez wrote:
> On Sun, Jan 18, 2009 at 01:34:03AM -0800, W. van den Akker wrote:
> > On Sunday 18 January 2009, Dmitri Seletski wrote:
> > > W. van den Akker wrote:
> > > > On Sunday 04 January 2009 10:58:02 Jens Ott - Opteamax wrote:
> > > >> Hi,
> > > >>
> > > >> Luis R. Rodriguez schrieb:
> > > >>> On Sat, Jan 3, 2009 at 5:53 PM, Barry Green <barry@lemonpi.net>
> > > >>> wrote: [...]
> > > >>> We need an oops trace of your crashes so far it seems you have not
> > > >>> been able to get one, lets work on that. Until then it is not easy
> > > >>> for us to provide you with patches to fix your issues.
> > > >>
> > > >> I also have similar problems with an D-LINK DWA-547 and I'd really
> > > >> like to provide the oops trace, but unfortunately the machine seems
> > > >> to hang that fast, that even no oops trace is shown. Also
> > > >> magic-sysreq don't work anymore, neither via serial console nor via
> > > >> keyboard. Only chance I have in this moment is to press
> > > >> reset-button, which already caused a full xfs-filsystem-corruption
> > > >> with dataloss once :-(
> > > >>
> > > >> My problem is, that the hang happens not immediately after loading
> > > >> the driver but "somewhen" while it is loaded.
> > > >>
> > > >> I have seen two types of crash:
> > > >>
> > > >> 1.) first investigations on my side seem to show that it happens, as
> > > >> soon as there is high dataload transfered, but I don't have detailed
> > > >> info yet.
> > > >> 2.) Running hostapd works fine while started first time, but no
> > > >> client is connected. When a client connects a system freeze occures
> > > >> "sometimes". Reproducably the system freezes immediately when
> > > >> starting hostapd, stop it and then restart hostapd. So every change
> > > >> in hostapd conf needs a full reboot, unloading the ath9k module and
> > > >> reload it does not prevent freeze, but after reboot I can start
> > > >> hostapd once without a problem (only the metioned "sometime" freezes
> > > >> on client connect or high data-volume).
> > > >>
> > > >> As this machine is productive, I suspended testing for now. I will
> > > >> change the card into a non-productive environment in the next days.
> > > >> As soon as I find more details, I'll report.
> > > >>
> > > >>> Please test with the latest drivers from
> > > >>> wireless-testing/compat-wireless and restart in single user mode,
> > > >>> we want to ensure X is not running at all and you are in a virtual
> > > >>> terminal. We also do not want Network Manager or wpa_supplicant
> > > >>> running in the background. To increase the resolution upon bootup
> > > >>> try booting with vga=ask (vga=791 works well for me), then try to
> > > >>> associate to your access point without encryption manually using
> > > >>> iwconfig.
> > > >>
> > > >> Ok, that'll be a thing I'd need to test to, as in my case
> > > >> wpa_supplicant is running.
> > > >
> > > > Hi,
> > > >
> > > > What is the status of your problem? I still have problems with it and
> > > > have no progress...
> > > >
> > > > gr,
> > > > Willem
> > >
> > > Hello Willem.
> > >
> > > I think I have had similar issue to yours.
> > >
> > > Car was working for short period of time and after a while crashed
> > > computer. I was advised to change kernel version, i updated it to
> > > 2.6.27.9 and problem has gone away since.
> > > It's something to do with DMA.
> > >
> > > Perhaps try to upgrade kernel?
> > >
> > > Regards
> > > Dmitri
> >
> > I have tried all kernel versions till 2.6.28 and still have the same
> > hang-ups. I think indeed its a DMA-hardware-ath9k related problem.  The
> > patch in 2.6.27.8 didnt work for me. I have filled a bugreport on
> > bugzilla.kernel but because AP isnt supported untill 2.6.29 the report is
> > closed.
>
> A better description of this is that AP support does not work (is not
> allowed) and therefore not supported on < 2.6.28.
>
> > I will test today 2.6.29.rc2 and see if it has any progress.
>
> You can also try wireless-testing (entire kernel) or compat-wireless (just
> updates your wireless stuff) for bleeding edge stuff.
>
> http://wireless.kernel.org/en/developers/Documentation/git-guide
> http://wireless.kernel.org/en/users/Download
>
>   Luis

I have tried all kernel versions till 2.6.28 and still have the same hang-ups.
I think indeed its a DMA-hardware-ath9k related problem. ?The patch in 
2.6.27.8 didnt work for me. I have filled a bugreport on bugzilla.kernel but 
because AP isnt supported untill 2.6.29 the report is closed.

I will test today 2.6.29.rc2 and see if it has any progress.

gr,
Willem

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-18 15:16             ` Luis R. Rodriguez
  2009-01-19  8:35               ` W. van den Akker
@ 2009-01-19 22:31               ` W. van den Akker
  2009-01-20 17:48                 ` Luis R. Rodriguez
  1 sibling, 1 reply; 28+ messages in thread
From: W. van den Akker @ 2009-01-19 22:31 UTC (permalink / raw)
  To: ath9k-devel

On Sunday 18 January 2009, Luis R. Rodriguez wrote:
> On Sun, Jan 18, 2009 at 01:34:03AM -0800, W. van den Akker wrote:
> > On Sunday 18 January 2009, Dmitri Seletski wrote:
> > > W. van den Akker wrote:
> > > > On Sunday 04 January 2009 10:58:02 Jens Ott - Opteamax wrote:
> > > >> Hi,
> > > >>
> > > >> Luis R. Rodriguez schrieb:
> > > >>> On Sat, Jan 3, 2009 at 5:53 PM, Barry Green <barry@lemonpi.net>
> > > >>> wrote: [...]
> > > >>> We need an oops trace of your crashes so far it seems you have not
> > > >>> been able to get one, lets work on that. Until then it is not easy
> > > >>> for us to provide you with patches to fix your issues.
> > > >>
> > > >> I also have similar problems with an D-LINK DWA-547 and I'd really
> > > >> like to provide the oops trace, but unfortunately the machine seems
> > > >> to hang that fast, that even no oops trace is shown. Also
> > > >> magic-sysreq don't work anymore, neither via serial console nor via
> > > >> keyboard. Only chance I have in this moment is to press
> > > >> reset-button, which already caused a full xfs-filsystem-corruption
> > > >> with dataloss once :-(
> > > >>
> > > >> My problem is, that the hang happens not immediately after loading
> > > >> the driver but "somewhen" while it is loaded.
> > > >>
> > > >> I have seen two types of crash:
> > > >>
> > > >> 1.) first investigations on my side seem to show that it happens, as
> > > >> soon as there is high dataload transfered, but I don't have detailed
> > > >> info yet.
> > > >> 2.) Running hostapd works fine while started first time, but no
> > > >> client is connected. When a client connects a system freeze occures
> > > >> "sometimes". Reproducably the system freezes immediately when
> > > >> starting hostapd, stop it and then restart hostapd. So every change
> > > >> in hostapd conf needs a full reboot, unloading the ath9k module and
> > > >> reload it does not prevent freeze, but after reboot I can start
> > > >> hostapd once without a problem (only the metioned "sometime" freezes
> > > >> on client connect or high data-volume).
> > > >>
> > > >> As this machine is productive, I suspended testing for now. I will
> > > >> change the card into a non-productive environment in the next days.
> > > >> As soon as I find more details, I'll report.
> > > >>
> > > >>> Please test with the latest drivers from
> > > >>> wireless-testing/compat-wireless and restart in single user mode,
> > > >>> we want to ensure X is not running at all and you are in a virtual
> > > >>> terminal. We also do not want Network Manager or wpa_supplicant
> > > >>> running in the background. To increase the resolution upon bootup
> > > >>> try booting with vga=ask (vga=791 works well for me), then try to
> > > >>> associate to your access point without encryption manually using
> > > >>> iwconfig.
> > > >>
> > > >> Ok, that'll be a thing I'd need to test to, as in my case
> > > >> wpa_supplicant is running.
> > > >
> > > > Hi,
> > > >
> > > > What is the status of your problem? I still have problems with it and
> > > > have no progress...
> > > >
> > > > gr,
> > > > Willem
> > >
> > > Hello Willem.
> > >
> > > I think I have had similar issue to yours.
> > >
> > > Car was working for short period of time and after a while crashed
> > > computer. I was advised to change kernel version, i updated it to
> > > 2.6.27.9 and problem has gone away since.
> > > It's something to do with DMA.
> > >
> > > Perhaps try to upgrade kernel?
> > >
> > > Regards
> > > Dmitri
> >
> > I have tried all kernel versions till 2.6.28 and still have the same
> > hang-ups. I think indeed its a DMA-hardware-ath9k related problem.  The
> > patch in 2.6.27.8 didnt work for me. I have filled a bugreport on
> > bugzilla.kernel but because AP isnt supported untill 2.6.29 the report is
> > closed.
>
> A better description of this is that AP support does not work (is not
> allowed) and therefore not supported on < 2.6.28.
>
> > I will test today 2.6.29.rc2 and see if it has any progress.
>
> You can also try wireless-testing (entire kernel) or compat-wireless (just
> updates your wireless stuff) for bleeding edge stuff.
>

I tested it today with kernel 2.6.29-rc2. But no change. The kernel hangs 
completly (without oops or what so ever, even nmi-watchdog has no results).
It happend when I started the connection (from a IW4965 card). Then started up
a Kmail connection, start up a browser sessie. Stop Kmail, and start Kmail.

It also happens when I do a hostapd start, hostapd stop, hostapd start. 

Its a IBM 206 eserver, with no X running on it. The card is a WMP300N.

Any hints?

gr,
Willem


-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-19 22:31               ` W. van den Akker
@ 2009-01-20 17:48                 ` Luis R. Rodriguez
  2009-01-20 18:32                   ` W. van den Akker
  2009-01-20 22:22                   ` W. van den Akker
  0 siblings, 2 replies; 28+ messages in thread
From: Luis R. Rodriguez @ 2009-01-20 17:48 UTC (permalink / raw)
  To: ath9k-devel

On Mon, Jan 19, 2009 at 2:31 PM, W. van den Akker <listsrv@wilsoft.nl> wrote:
> On Sunday 18 January 2009, Luis R. Rodriguez wrote:
>> On Sun, Jan 18, 2009 at 01:34:03AM -0800, W. van den Akker wrote:
>> > On Sunday 18 January 2009, Dmitri Seletski wrote:
>> > > W. van den Akker wrote:
>> > > > On Sunday 04 January 2009 10:58:02 Jens Ott - Opteamax wrote:
>> > > >> Hi,
>> > > >>
>> > > >> Luis R. Rodriguez schrieb:
>> > > >>> On Sat, Jan 3, 2009 at 5:53 PM, Barry Green <barry@lemonpi.net>
>> > > >>> wrote: [...]
>> > > >>> We need an oops trace of your crashes so far it seems you have not
>> > > >>> been able to get one, lets work on that. Until then it is not easy
>> > > >>> for us to provide you with patches to fix your issues.
>> > > >>
>> > > >> I also have similar problems with an D-LINK DWA-547 and I'd really
>> > > >> like to provide the oops trace, but unfortunately the machine seems
>> > > >> to hang that fast, that even no oops trace is shown. Also
>> > > >> magic-sysreq don't work anymore, neither via serial console nor via
>> > > >> keyboard. Only chance I have in this moment is to press
>> > > >> reset-button, which already caused a full xfs-filsystem-corruption
>> > > >> with dataloss once :-(
>> > > >>
>> > > >> My problem is, that the hang happens not immediately after loading
>> > > >> the driver but "somewhen" while it is loaded.
>> > > >>
>> > > >> I have seen two types of crash:
>> > > >>
>> > > >> 1.) first investigations on my side seem to show that it happens, as
>> > > >> soon as there is high dataload transfered, but I don't have detailed
>> > > >> info yet.
>> > > >> 2.) Running hostapd works fine while started first time, but no
>> > > >> client is connected. When a client connects a system freeze occures
>> > > >> "sometimes". Reproducably the system freezes immediately when
>> > > >> starting hostapd, stop it and then restart hostapd. So every change
>> > > >> in hostapd conf needs a full reboot, unloading the ath9k module and
>> > > >> reload it does not prevent freeze, but after reboot I can start
>> > > >> hostapd once without a problem (only the metioned "sometime" freezes
>> > > >> on client connect or high data-volume).
>> > > >>
>> > > >> As this machine is productive, I suspended testing for now. I will
>> > > >> change the card into a non-productive environment in the next days.
>> > > >> As soon as I find more details, I'll report.
>> > > >>
>> > > >>> Please test with the latest drivers from
>> > > >>> wireless-testing/compat-wireless and restart in single user mode,
>> > > >>> we want to ensure X is not running at all and you are in a virtual
>> > > >>> terminal. We also do not want Network Manager or wpa_supplicant
>> > > >>> running in the background. To increase the resolution upon bootup
>> > > >>> try booting with vga=ask (vga=791 works well for me), then try to
>> > > >>> associate to your access point without encryption manually using
>> > > >>> iwconfig.
>> > > >>
>> > > >> Ok, that'll be a thing I'd need to test to, as in my case
>> > > >> wpa_supplicant is running.
>> > > >
>> > > > Hi,
>> > > >
>> > > > What is the status of your problem? I still have problems with it and
>> > > > have no progress...
>> > > >
>> > > > gr,
>> > > > Willem
>> > >
>> > > Hello Willem.
>> > >
>> > > I think I have had similar issue to yours.
>> > >
>> > > Car was working for short period of time and after a while crashed
>> > > computer. I was advised to change kernel version, i updated it to
>> > > 2.6.27.9 and problem has gone away since.
>> > > It's something to do with DMA.
>> > >
>> > > Perhaps try to upgrade kernel?
>> > >
>> > > Regards
>> > > Dmitri
>> >
>> > I have tried all kernel versions till 2.6.28 and still have the same
>> > hang-ups. I think indeed its a DMA-hardware-ath9k related problem.  The
>> > patch in 2.6.27.8 didnt work for me. I have filled a bugreport on
>> > bugzilla.kernel but because AP isnt supported untill 2.6.29 the report is
>> > closed.
>>
>> A better description of this is that AP support does not work (is not
>> allowed) and therefore not supported on < 2.6.28.
>>
>> > I will test today 2.6.29.rc2 and see if it has any progress.
>>
>> You can also try wireless-testing (entire kernel) or compat-wireless (just
>> updates your wireless stuff) for bleeding edge stuff.
>>
>
> I tested it today with kernel 2.6.29-rc2. But no change. The kernel hangs
> completly (without oops or what so ever, even nmi-watchdog has no results).
> It happend when I started the connection (from a IW4965 card). Then started up
> a Kmail connection, start up a browser sessie. Stop Kmail, and start Kmail.
>
> It also happens when I do a hostapd start, hostapd stop, hostapd start.
>
> Its a IBM 206 eserver, with no X running on it. The card is a WMP300N.
>
> Any hints?

Since you are not getting any panic can you try wireless-testing git
tree [1], this will get you the latest and greatest wireless drivers
and mac80211/cfg80211 which are now being queued up for 2.6.30. See if
you see the issue there. If so then you can use git bisect.

[1] http://wireless.kernel.org/en/developers/Documentation/git-guide

  Luis

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-20 17:48                 ` Luis R. Rodriguez
@ 2009-01-20 18:32                   ` W. van den Akker
  2009-01-20 22:22                   ` W. van den Akker
  1 sibling, 0 replies; 28+ messages in thread
From: W. van den Akker @ 2009-01-20 18:32 UTC (permalink / raw)
  To: ath9k-devel

On Tuesday 20 January 2009, Luis R. Rodriguez wrote:
> On Mon, Jan 19, 2009 at 2:31 PM, W. van den Akker <listsrv@wilsoft.nl> 
wrote:
> > On Sunday 18 January 2009, Luis R. Rodriguez wrote:
> >> On Sun, Jan 18, 2009 at 01:34:03AM -0800, W. van den Akker wrote:
> >> > On Sunday 18 January 2009, Dmitri Seletski wrote:
> >> > > W. van den Akker wrote:
> >> > > > On Sunday 04 January 2009 10:58:02 Jens Ott - Opteamax wrote:
> >> > > >> Hi,
> >> > > >>
> >> > > >> Luis R. Rodriguez schrieb:
> >> > > >>> On Sat, Jan 3, 2009 at 5:53 PM, Barry Green <barry@lemonpi.net>
> >> > > >>> wrote: [...]
> >> > > >>> We need an oops trace of your crashes so far it seems you have
> >> > > >>> not been able to get one, lets work on that. Until then it is
> >> > > >>> not easy for us to provide you with patches to fix your issues.
> >> > > >>
> >> > > >> I also have similar problems with an D-LINK DWA-547 and I'd
> >> > > >> really like to provide the oops trace, but unfortunately the
> >> > > >> machine seems to hang that fast, that even no oops trace is
> >> > > >> shown. Also magic-sysreq don't work anymore, neither via serial
> >> > > >> console nor via keyboard. Only chance I have in this moment is to
> >> > > >> press
> >> > > >> reset-button, which already caused a full
> >> > > >> xfs-filsystem-corruption with dataloss once :-(
> >> > > >>
> >> > > >> My problem is, that the hang happens not immediately after
> >> > > >> loading the driver but "somewhen" while it is loaded.
> >> > > >>
> >> > > >> I have seen two types of crash:
> >> > > >>
> >> > > >> 1.) first investigations on my side seem to show that it happens,
> >> > > >> as soon as there is high dataload transfered, but I don't have
> >> > > >> detailed info yet.
> >> > > >> 2.) Running hostapd works fine while started first time, but no
> >> > > >> client is connected. When a client connects a system freeze
> >> > > >> occures "sometimes". Reproducably the system freezes immediately
> >> > > >> when starting hostapd, stop it and then restart hostapd. So every
> >> > > >> change in hostapd conf needs a full reboot, unloading the ath9k
> >> > > >> module and reload it does not prevent freeze, but after reboot I
> >> > > >> can start hostapd once without a problem (only the metioned
> >> > > >> "sometime" freezes on client connect or high data-volume).
> >> > > >>
> >> > > >> As this machine is productive, I suspended testing for now. I
> >> > > >> will change the card into a non-productive environment in the
> >> > > >> next days. As soon as I find more details, I'll report.
> >> > > >>
> >> > > >>> Please test with the latest drivers from
> >> > > >>> wireless-testing/compat-wireless and restart in single user
> >> > > >>> mode, we want to ensure X is not running at all and you are in a
> >> > > >>> virtual terminal. We also do not want Network Manager or
> >> > > >>> wpa_supplicant running in the background. To increase the
> >> > > >>> resolution upon bootup try booting with vga=ask (vga=791 works
> >> > > >>> well for me), then try to associate to your access point without
> >> > > >>> encryption manually using iwconfig.
> >> > > >>
> >> > > >> Ok, that'll be a thing I'd need to test to, as in my case
> >> > > >> wpa_supplicant is running.
> >> > > >
> >> > > > Hi,
> >> > > >
> >> > > > What is the status of your problem? I still have problems with it
> >> > > > and have no progress...
> >> > > >
> >> > > > gr,
> >> > > > Willem
> >> > >
> >> > > Hello Willem.
> >> > >
> >> > > I think I have had similar issue to yours.
> >> > >
> >> > > Car was working for short period of time and after a while crashed
> >> > > computer. I was advised to change kernel version, i updated it to
> >> > > 2.6.27.9 and problem has gone away since.
> >> > > It's something to do with DMA.
> >> > >
> >> > > Perhaps try to upgrade kernel?
> >> > >
> >> > > Regards
> >> > > Dmitri
> >> >
> >> > I have tried all kernel versions till 2.6.28 and still have the same
> >> > hang-ups. I think indeed its a DMA-hardware-ath9k related problem. 
> >> > The patch in 2.6.27.8 didnt work for me. I have filled a bugreport on
> >> > bugzilla.kernel but because AP isnt supported untill 2.6.29 the report
> >> > is closed.
> >>
> >> A better description of this is that AP support does not work (is not
> >> allowed) and therefore not supported on < 2.6.28.
> >>
> >> > I will test today 2.6.29.rc2 and see if it has any progress.
> >>
> >> You can also try wireless-testing (entire kernel) or compat-wireless
> >> (just updates your wireless stuff) for bleeding edge stuff.
> >
> > I tested it today with kernel 2.6.29-rc2. But no change. The kernel hangs
> > completly (without oops or what so ever, even nmi-watchdog has no
> > results). It happend when I started the connection (from a IW4965 card).
> > Then started up a Kmail connection, start up a browser sessie. Stop
> > Kmail, and start Kmail.
> >
> > It also happens when I do a hostapd start, hostapd stop, hostapd start.
> >
> > Its a IBM 206 eserver, with no X running on it. The card is a WMP300N.
> >
> > Any hints?
>
> Since you are not getting any panic can you try wireless-testing git
> tree [1], this will get you the latest and greatest wireless drivers
> and mac80211/cfg80211 which are now being queued up for 2.6.30. See if
> you see the issue there. If so then you can use git bisect.
>
> [1] http://wireless.kernel.org/en/developers/Documentation/git-guide

I will use the latest drivers from wl. The last I tried was 18-1 and that 
didnt had any success.

To be continued.

Willem



-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-20 17:48                 ` Luis R. Rodriguez
  2009-01-20 18:32                   ` W. van den Akker
@ 2009-01-20 22:22                   ` W. van den Akker
  2009-01-20 23:08                     ` Chris Kennedy
  1 sibling, 1 reply; 28+ messages in thread
From: W. van den Akker @ 2009-01-20 22:22 UTC (permalink / raw)
  To: ath9k-devel

On Tuesday 20 January 2009, Luis R. Rodriguez wrote:
> On Mon, Jan 19, 2009 at 2:31 PM, W. van den Akker <listsrv@wilsoft.nl> 
wrote:
> > On Sunday 18 January 2009, Luis R. Rodriguez wrote:
> >> On Sun, Jan 18, 2009 at 01:34:03AM -0800, W. van den Akker wrote:
> >> > On Sunday 18 January 2009, Dmitri Seletski wrote:
> >> > > W. van den Akker wrote:
> >> > > > On Sunday 04 January 2009 10:58:02 Jens Ott - Opteamax wrote:
> >> > > >> Hi,
> >> > > >>
> >> > > >> Luis R. Rodriguez schrieb:
> >> > > >>> On Sat, Jan 3, 2009 at 5:53 PM, Barry Green <barry@lemonpi.net>
> >> > > >>> wrote: [...]
> >> > > >>> We need an oops trace of your crashes so far it seems you have
> >> > > >>> not been able to get one, lets work on that. Until then it is
> >> > > >>> not easy for us to provide you with patches to fix your issues.
> >> > > >>
> >> > > >> I also have similar problems with an D-LINK DWA-547 and I'd
> >> > > >> really like to provide the oops trace, but unfortunately the
> >> > > >> machine seems to hang that fast, that even no oops trace is
> >> > > >> shown. Also magic-sysreq don't work anymore, neither via serial
> >> > > >> console nor via keyboard. Only chance I have in this moment is to
> >> > > >> press
> >> > > >> reset-button, which already caused a full
> >> > > >> xfs-filsystem-corruption with dataloss once :-(
> >> > > >>
> >> > > >> My problem is, that the hang happens not immediately after
> >> > > >> loading the driver but "somewhen" while it is loaded.
> >> > > >>
> >> > > >> I have seen two types of crash:
> >> > > >>
> >> > > >> 1.) first investigations on my side seem to show that it happens,
> >> > > >> as soon as there is high dataload transfered, but I don't have
> >> > > >> detailed info yet.
> >> > > >> 2.) Running hostapd works fine while started first time, but no
> >> > > >> client is connected. When a client connects a system freeze
> >> > > >> occures "sometimes". Reproducably the system freezes immediately
> >> > > >> when starting hostapd, stop it and then restart hostapd. So every
> >> > > >> change in hostapd conf needs a full reboot, unloading the ath9k
> >> > > >> module and reload it does not prevent freeze, but after reboot I
> >> > > >> can start hostapd once without a problem (only the metioned
> >> > > >> "sometime" freezes on client connect or high data-volume).
> >> > > >>
> >> > > >> As this machine is productive, I suspended testing for now. I
> >> > > >> will change the card into a non-productive environment in the
> >> > > >> next days. As soon as I find more details, I'll report.
> >> > > >>
> >> > > >>> Please test with the latest drivers from
> >> > > >>> wireless-testing/compat-wireless and restart in single user
> >> > > >>> mode, we want to ensure X is not running at all and you are in a
> >> > > >>> virtual terminal. We also do not want Network Manager or
> >> > > >>> wpa_supplicant running in the background. To increase the
> >> > > >>> resolution upon bootup try booting with vga=ask (vga=791 works
> >> > > >>> well for me), then try to associate to your access point without
> >> > > >>> encryption manually using iwconfig.
> >> > > >>
> >> > > >> Ok, that'll be a thing I'd need to test to, as in my case
> >> > > >> wpa_supplicant is running.
> >> > > >
> >> > > > Hi,
> >> > > >
> >> > > > What is the status of your problem? I still have problems with it
> >> > > > and have no progress...
> >> > > >
> >> > > > gr,
> >> > > > Willem
> >> > >
> >> > > Hello Willem.
> >> > >
> >> > > I think I have had similar issue to yours.
> >> > >
> >> > > Car was working for short period of time and after a while crashed
> >> > > computer. I was advised to change kernel version, i updated it to
> >> > > 2.6.27.9 and problem has gone away since.
> >> > > It's something to do with DMA.
> >> > >
> >> > > Perhaps try to upgrade kernel?
> >> > >
> >> > > Regards
> >> > > Dmitri
> >> >
> >> > I have tried all kernel versions till 2.6.28 and still have the same
> >> > hang-ups. I think indeed its a DMA-hardware-ath9k related problem. 
> >> > The patch in 2.6.27.8 didnt work for me. I have filled a bugreport on
> >> > bugzilla.kernel but because AP isnt supported untill 2.6.29 the report
> >> > is closed.
> >>
> >> A better description of this is that AP support does not work (is not
> >> allowed) and therefore not supported on < 2.6.28.
> >>
> >> > I will test today 2.6.29.rc2 and see if it has any progress.
> >>
> >> You can also try wireless-testing (entire kernel) or compat-wireless
> >> (just updates your wireless stuff) for bleeding edge stuff.
> >
> > I tested it today with kernel 2.6.29-rc2. But no change. The kernel hangs
> > completly (without oops or what so ever, even nmi-watchdog has no
> > results). It happend when I started the connection (from a IW4965 card).
> > Then started up a Kmail connection, start up a browser sessie. Stop
> > Kmail, and start Kmail.
> >
> > It also happens when I do a hostapd start, hostapd stop, hostapd start.
> >
> > Its a IBM 206 eserver, with no X running on it. The card is a WMP300N.
> >
> > Any hints?
>
> Since you are not getting any panic can you try wireless-testing git
> tree [1], this will get you the latest and greatest wireless drivers
> and mac80211/cfg80211 which are now being queued up for 2.6.30. See if
> you see the issue there. If so then you can use git bisect.
>

Also the latest wireless testing git has the same problem.
No Oops, trace or any panic visible. 

Some nasty problem here.....

gr,
Willem


-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-20 22:22                   ` W. van den Akker
@ 2009-01-20 23:08                     ` Chris Kennedy
  2009-01-21  7:54                       ` W. van den Akker
  0 siblings, 1 reply; 28+ messages in thread
From: Chris Kennedy @ 2009-01-20 23:08 UTC (permalink / raw)
  To: ath9k-devel


On Jan 20, 2009, at 4:22 PM, W. van den Akker wrote:

> On Tuesday 20 January 2009, Luis R. Rodriguez wrote:
>> On Mon, Jan 19, 2009 at 2:31 PM, W. van den Akker  
>> <listsrv@wilsoft.nl>
> wrote:
>>> On Sunday 18 January 2009, Luis R. Rodriguez wrote:
>>>> On Sun, Jan 18, 2009 at 01:34:03AM -0800, W. van den Akker wrote:
>>>>> On Sunday 18 January 2009, Dmitri Seletski wrote:
>>>>>> W. van den Akker wrote:
>>>>>>> On Sunday 04 January 2009 10:58:02 Jens Ott - Opteamax wrote:
>>>>>>>> Hi,
>>>>>>>>
>>>>>>>> Luis R. Rodriguez schrieb:
>>>>>>>>> On Sat, Jan 3, 2009 at 5:53 PM, Barry Green  
>>>>>>>>> <barry@lemonpi.net>
>>>>>>>>> wrote: [...]
>>>>>>>>> We need an oops trace of your crashes so far it seems you have
>>>>>>>>> not been able to get one, lets work on that. Until then it is
>>>>>>>>> not easy for us to provide you with patches to fix your  
>>>>>>>>> issues.
>>>>>>>>
>>>>>>>> I also have similar problems with an D-LINK DWA-547 and I'd
>>>>>>>> really like to provide the oops trace, but unfortunately the
>>>>>>>> machine seems to hang that fast, that even no oops trace is
>>>>>>>> shown. Also magic-sysreq don't work anymore, neither via serial
>>>>>>>> console nor via keyboard. Only chance I have in this moment  
>>>>>>>> is to
>>>>>>>> press
>>>>>>>> reset-button, which already caused a full
>>>>>>>> xfs-filsystem-corruption with dataloss once :-(
>>>>>>>>
>>>>>>>> My problem is, that the hang happens not immediately after
>>>>>>>> loading the driver but "somewhen" while it is loaded.
>>>>>>>>
>>>>>>>> I have seen two types of crash:
>>>>>>>>
>>>>>>>> 1.) first investigations on my side seem to show that it  
>>>>>>>> happens,
>>>>>>>> as soon as there is high dataload transfered, but I don't have
>>>>>>>> detailed info yet.
>>>>>>>> 2.) Running hostapd works fine while started first time, but no
>>>>>>>> client is connected. When a client connects a system freeze
>>>>>>>> occures "sometimes". Reproducably the system freezes  
>>>>>>>> immediately
>>>>>>>> when starting hostapd, stop it and then restart hostapd. So  
>>>>>>>> every
>>>>>>>> change in hostapd conf needs a full reboot, unloading the ath9k
>>>>>>>> module and reload it does not prevent freeze, but after  
>>>>>>>> reboot I
>>>>>>>> can start hostapd once without a problem (only the metioned
>>>>>>>> "sometime" freezes on client connect or high data-volume).
>>>>>>>>
>>>>>>>> As this machine is productive, I suspended testing for now. I
>>>>>>>> will change the card into a non-productive environment in the
>>>>>>>> next days. As soon as I find more details, I'll report.
>>>>>>>>
>>>>>>>>> Please test with the latest drivers from
>>>>>>>>> wireless-testing/compat-wireless and restart in single user
>>>>>>>>> mode, we want to ensure X is not running at all and you are  
>>>>>>>>> in a
>>>>>>>>> virtual terminal. We also do not want Network Manager or
>>>>>>>>> wpa_supplicant running in the background. To increase the
>>>>>>>>> resolution upon bootup try booting with vga=ask (vga=791 works
>>>>>>>>> well for me), then try to associate to your access point  
>>>>>>>>> without
>>>>>>>>> encryption manually using iwconfig.
>>>>>>>>
>>>>>>>> Ok, that'll be a thing I'd need to test to, as in my case
>>>>>>>> wpa_supplicant is running.
>>>>>>>
>>>>>>> Hi,
>>>>>>>
>>>>>>> What is the status of your problem? I still have problems  
>>>>>>> with it
>>>>>>> and have no progress...
>>>>>>>
>>>>>>> gr,
>>>>>>> Willem
>>>>>>
>>>>>> Hello Willem.
>>>>>>
>>>>>> I think I have had similar issue to yours.
>>>>>>
>>>>>> Car was working for short period of time and after a while  
>>>>>> crashed
>>>>>> computer. I was advised to change kernel version, i updated it to
>>>>>> 2.6.27.9 and problem has gone away since.
>>>>>> It's something to do with DMA.
>>>>>>
>>>>>> Perhaps try to upgrade kernel?
>>>>>>
>>>>>> Regards
>>>>>> Dmitri
>>>>>
>>>>> I have tried all kernel versions till 2.6.28 and still have the  
>>>>> same
>>>>> hang-ups. I think indeed its a DMA-hardware-ath9k related problem.
>>>>> The patch in 2.6.27.8 didnt work for me. I have filled a  
>>>>> bugreport on
>>>>> bugzilla.kernel but because AP isnt supported untill 2.6.29 the  
>>>>> report
>>>>> is closed.
>>>>
>>>> A better description of this is that AP support does not work  
>>>> (is not
>>>> allowed) and therefore not supported on < 2.6.28.
>>>>
>>>>> I will test today 2.6.29.rc2 and see if it has any progress.
>>>>
>>>> You can also try wireless-testing (entire kernel) or compat- 
>>>> wireless
>>>> (just updates your wireless stuff) for bleeding edge stuff.
>>>
>>> I tested it today with kernel 2.6.29-rc2. But no change. The  
>>> kernel hangs
>>> completly (without oops or what so ever, even nmi-watchdog has no
>>> results). It happend when I started the connection (from a IW4965  
>>> card).
>>> Then started up a Kmail connection, start up a browser sessie. Stop
>>> Kmail, and start Kmail.
>>>
>>> It also happens when I do a hostapd start, hostapd stop, hostapd  
>>> start.
>>>
>>> Its a IBM 206 eserver, with no X running on it. The card is a  
>>> WMP300N.
>>>
>>> Any hints?
>>
>> Since you are not getting any panic can you try wireless-testing git
>> tree [1], this will get you the latest and greatest wireless drivers
>> and mac80211/cfg80211 which are now being queued up for 2.6.30.  
>> See if
>> you see the issue there. If so then you can use git bisect.
>>
>
> Also the latest wireless testing git has the same problem.
> No Oops, trace or any panic visible.
>
> Some nasty problem here.....
>
> gr,
> Willem
>
>

This is interesting since I had the same lockups using the module  
ath9k, including Gentoo's newest 2.6.27-gentoo-r7, the compat- 
wireless newest git compiled with wireless-testing, and using  
wireless-testing itself (newest as of a day ago).  So all of those,  
when used on the console with just hostapd (version 0.6.7) to setup  
an AP, would freeze the system without any messages or information.   
This was on an HP xw8200 workstation using 2 Xeon hyperthreaded  
CPU's, so a very large system and lots of possible locking-bugs  
finding there.  So I built a test system with a Uni Processor system,  
an older compaq P4 workstation that isn't SMP and not multithreaded.   
Now it works very stable on the UP system, compared with the SMP  
Hyperthreaded one.

So I've been really happy with it now, I've been able to get the AP  
going using the newest wireless-testing using WPA-PSK encryption with  
the newest hostapd.  This was doing exactly what your reporting on my  
HP SMP system,and I duplicated the system when I setup the Compaq UP  
system, so nothing is different except hardware.

I have been watching this thread and as of last night since I got it  
working on the UP system, and today tweaked the system to match my  
Apple Airport Extreme AP (have 802.11n enabled, using the setting  
"ht_capab=[HT40+][SHORT-GI-40][DSSS_CCK-40]" which seems to have  
gotten it to be faster than the Apple AP for my Windows systems using  
Linksys WMP300N cards (the American ones with the Broadcom chips).   
Since today from this working now, and your newest email about using  
the exact same wireless-testing kernel I'm using (mine is newest git  
version as-of last night).

So my theory is that I was triggering the same problem using SMP/ 
Hyperthreading, I suspect your system is not a smaller single  
processor system?  That would at least support that theory.  I think  
the problem could be a spinlock locking problem in the interrupt  
handler, or something like that.  I've programmed drivers, wrote a  
good part of the IVTV driver, and this behavior reminds me of what I  
saw there at times during development  (it would be stable on UP  
systems, lock like this on SMP systems).  I have been peeking around  
in the code some to get familiar with it, but of course may take some  
time for me to start grasping what is going on to be able to  
contribute any help myself.  So this is just what I'm thinking, since  
freezes like this often seem to be something wrong in the interrupt  
handler for SMP/preemption configured systems.  If you are using SMP,  
maybe trying it with just one processor enabled would be a good test  
to see if this is really true there too.

Thanks,
Chris

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-20 23:08                     ` Chris Kennedy
@ 2009-01-21  7:54                       ` W. van den Akker
  2009-01-21 10:11                         ` W. van den Akker
  2009-01-21 10:17                         ` W. van den Akker
  0 siblings, 2 replies; 28+ messages in thread
From: W. van den Akker @ 2009-01-21  7:54 UTC (permalink / raw)
  To: ath9k-devel

>
> So my theory is that I was triggering the same problem using SMP/
> Hyperthreading, I suspect your system is not a smaller single
> processor system?  That would at least support that theory.  I think
> the problem could be a spinlock locking problem in the interrupt
> handler, or something like that.  I've programmed drivers, wrote a
> good part of the IVTV driver, and this behavior reminds me of what I
> saw there at times during development  (it would be stable on UP
> systems, lock like this on SMP systems).  I have been peeking around
> in the code some to get familiar with it, but of course may take some
> time for me to start grasping what is going on to be able to
> contribute any help myself.  So this is just what I'm thinking, since
> freezes like this often seem to be something wrong in the interrupt
> handler for SMP/preemption configured systems.  If you are using SMP,
> maybe trying it with just one processor enabled would be a good test
> to see if this is really true there too.
>

Hil Chris,

This is correct. Its a SMP system with HT. So I think your assumption is 
right. It must be in that direction.
 
I will try to test it with one CPU disabled.
I will report back.

gr,
Willem

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-21  7:54                       ` W. van den Akker
@ 2009-01-21 10:11                         ` W. van den Akker
  2009-01-21 12:35                           ` Jouni Malinen
  2009-01-21 10:17                         ` W. van den Akker
  1 sibling, 1 reply; 28+ messages in thread
From: W. van den Akker @ 2009-01-21 10:11 UTC (permalink / raw)
  To: ath9k-devel

On Wednesday 21 January 2009, W. van den Akker wrote:
> > So my theory is that I was triggering the same problem using SMP/
> > Hyperthreading, I suspect your system is not a smaller single
> > processor system?  That would at least support that theory.  I think
> > the problem could be a spinlock locking problem in the interrupt
> > handler, or something like that.  I've programmed drivers, wrote a
> > good part of the IVTV driver, and this behavior reminds me of what I
> > saw there at times during development  (it would be stable on UP
> > systems, lock like this on SMP systems).  I have been peeking around
> > in the code some to get familiar with it, but of course may take some
> > time for me to start grasping what is going on to be able to
> > contribute any help myself.  So this is just what I'm thinking, since
> > freezes like this often seem to be something wrong in the interrupt
> > handler for SMP/preemption configured systems.  If you are using SMP,
> > maybe trying it with just one processor enabled would be a good test
> > to see if this is really true there too.
>
> Hil Chris,
>
> This is correct. Its a SMP system with HT. So I think your assumption is
> right. It must be in that direction.
>
> I will try to test it with one CPU disabled.
> I will report back.
>

I have tested with 1 CPU disabled. Running now for about 2 hours without
hangups (I have several connection drops, but it will reconnect).

I have also tested with 2 CPU's and HT-disabled (noht parameter) but that
had no effect.

So SMP looks like the cause of the problem here.

I will be happy to do further testing.

gr,
Willem





-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 197 bytes
Desc: This is a digitally signed message part.
Url : http://lists.ath9k.org/pipermail/ath9k-devel/attachments/20090121/e24cd9bf/attachment.pgp 

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-21  7:54                       ` W. van den Akker
  2009-01-21 10:11                         ` W. van den Akker
@ 2009-01-21 10:17                         ` W. van den Akker
  1 sibling, 0 replies; 28+ messages in thread
From: W. van den Akker @ 2009-01-21 10:17 UTC (permalink / raw)
  To: ath9k-devel

On Wednesday 21 January 2009, W. van den Akker wrote:
> > So my theory is that I was triggering the same problem using SMP/
> > Hyperthreading, I suspect your system is not a smaller single
> > processor system?  That would at least support that theory.  I think
> > the problem could be a spinlock locking problem in the interrupt
> > handler, or something like that.  I've programmed drivers, wrote a
> > good part of the IVTV driver, and this behavior reminds me of what I
> > saw there at times during development  (it would be stable on UP
> > systems, lock like this on SMP systems).  I have been peeking around
> > in the code some to get familiar with it, but of course may take some
> > time for me to start grasping what is going on to be able to
> > contribute any help myself.  So this is just what I'm thinking, since
> > freezes like this often seem to be something wrong in the interrupt
> > handler for SMP/preemption configured systems.  If you are using SMP,
> > maybe trying it with just one processor enabled would be a good test
> > to see if this is really true there too.
>
> Hil Chris,
>
> This is correct. Its a SMP system with HT. So I think your assumption is
> right. It must be in that direction.
>
> I will try to test it with one CPU disabled.
> I will report back.
>

I have tested with 1 CPU disabled. Running now for about 2 hours without
hangups (I have several connection drops, but it will reconnect).

I have also tested with 2 CPU's and HT-disabled (noht parameter) but that
had no effect.

So SMP looks like the cause of the problem here.

I will be happy to do further testing.

gr,
Willem

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-21 10:11                         ` W. van den Akker
@ 2009-01-21 12:35                           ` Jouni Malinen
  2009-01-21 17:01                             ` [ath9k-devel] [PATCH] " Chris Kennedy
  2009-01-21 17:15                             ` [ath9k-devel] " W. van den Akker
  0 siblings, 2 replies; 28+ messages in thread
From: Jouni Malinen @ 2009-01-21 12:35 UTC (permalink / raw)
  To: ath9k-devel

On Wed, 2009-01-21 at 02:11 -0800, W. van den Akker wrote:
> I have tested with 1 CPU disabled. Running now for about 2 hours without
> hangups (I have several connection drops, but it will reconnect).
> 
> I have also tested with 2 CPU's and HT-disabled (noht parameter) but that
> had no effect.
> 
> So SMP looks like the cause of the problem here.

Thanks for testing this! I'm running most of my tests with a dual core
system, so SMP is being used, but with two cores, not two separate
chips. I don't think there should be much difference there, but
certainly our hardware configuration is different.

Could you please describe your hardware with more details so that we can
see whether we could find a similar system to try to reproduce this? Is
this the IBM 206 eserver with WMP300N you mentioned in an earlier
message in the thread? That seems to be (by default) a uni-processor
setup, so I would like to make sure we understand what is the exact
hardware used here since I do not think we have been able to reproduce
this type of issue so far in any dual core systems.

- Jouni

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] [PATCH] still same problem
  2009-01-21 12:35                           ` Jouni Malinen
@ 2009-01-21 17:01                             ` Chris Kennedy
  2009-01-21 21:05                               ` W. van den Akker
  2009-01-21 17:15                             ` [ath9k-devel] " W. van den Akker
  1 sibling, 1 reply; 28+ messages in thread
From: Chris Kennedy @ 2009-01-21 17:01 UTC (permalink / raw)
  To: ath9k-devel



On Jan 21, 2009, at 6:35 AM, Jouni Malinen wrote:

> On Wed, 2009-01-21 at 02:11 -0800, W. van den Akker wrote:
>> I have tested with 1 CPU disabled. Running now for about 2 hours  
>> without
>> hangups (I have several connection drops, but it will reconnect).
>>
>> I have also tested with 2 CPU's and HT-disabled (noht parameter)  
>> but that
>> had no effect.
>>
>> So SMP looks like the cause of the problem here.
>
> Thanks for testing this! I'm running most of my tests with a dual core
> system, so SMP is being used, but with two cores, not two separate
> chips. I don't think there should be much difference there, but
> certainly our hardware configuration is different.
>
> Could you please describe your hardware with more details so that  
> we can
> see whether we could find a similar system to try to reproduce  
> this? Is
> this the IBM 206 eserver with WMP300N you mentioned in an earlier
> message in the thread? That seems to be (by default) a uni-processor
> setup, so I would like to make sure we understand what is the exact
> hardware used here since I do not think we have been able to reproduce
> this type of issue so far in any dual core systems.
>
> - Jouni
>
>

This is a patch (against yesterdays current wireless-testing kernel)
  that shows basically what we did in the IVTV driver,
essentially holding a spin_lock() in the interrupt handler.  So it
isn't even used, since compiled out, if a system isn't SMP.  When the
system is SMP it prevents multiple instances of the interrupt handler
from happening.  This seemed to be the magic fix there, I'm not sure if
this is totally correct for this driver, but it's a patch to test  
(hopefully the
spinlock I used makes some sense, seems this also is good since
wouldn't want to reset the card and have interrupts happen while
that's going on?).  It's at least an example showing what I suspect  
could
fix it, and would be interesting if you had the ability to test it on  
your SMP
system to see if it is just the interrupt handler where it's happening.

Thanks,
Chris

-------------- next part --------------
A non-text attachment was scrubbed...
Name: irq_handler_spinlock.diff
Type: application/applefile
Size: 1435 bytes
Desc: not available
Url : http://lists.ath9k.org/pipermail/ath9k-devel/attachments/20090121/36ecdc7d/attachment.bin 
-------------- next part --------------
A non-text attachment was scrubbed...
Name: irq_handler_spinlock.diff
Type: application/octet-stream
Size: 1148 bytes
Desc: not available
Url : http://lists.ath9k.org/pipermail/ath9k-devel/attachments/20090121/36ecdc7d/attachment.obj 
-------------- next part --------------

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] still same problem
  2009-01-21 12:35                           ` Jouni Malinen
  2009-01-21 17:01                             ` [ath9k-devel] [PATCH] " Chris Kennedy
@ 2009-01-21 17:15                             ` W. van den Akker
  1 sibling, 0 replies; 28+ messages in thread
From: W. van den Akker @ 2009-01-21 17:15 UTC (permalink / raw)
  To: ath9k-devel

On Wednesday 21 January 2009, Jouni Malinen wrote:
> On Wed, 2009-01-21 at 02:11 -0800, W. van den Akker wrote:
> > I have tested with 1 CPU disabled. Running now for about 2 hours without
> > hangups (I have several connection drops, but it will reconnect).
> >
> > I have also tested with 2 CPU's and HT-disabled (noht parameter) but that
> > had no effect.
> >
> > So SMP looks like the cause of the problem here.
>
> Thanks for testing this! I'm running most of my tests with a dual core
> system, so SMP is being used, but with two cores, not two separate
> chips. I don't think there should be much difference there, but
> certainly our hardware configuration is different.
>
> Could you please describe your hardware with more details so that we can
> see whether we could find a similar system to try to reproduce this? Is
> this the IBM 206 eserver with WMP300N you mentioned in an earlier
> message in the thread? That seems to be (by default) a uni-processor
> setup, so I would like to make sure we understand what is the exact
> hardware used here since I do not think we have been able to reproduce
> this type of issue so far in any dual core systems.
>

Its indeed an IBM eServer x206 8482-2RX with a WMP300N (Linksys,  
Atheros Communications Inc. AR5416 802.11abgn Wireless PCI Adapter (rev 01).
The IBM is a dual-core system with 2 CPU's on one fysical CPU-chip (its boxed 
so I am not for sure).

/proc/cpuinfo shows (with one CPU disabled)
processor       : 0
vendor_id       : GenuineIntel
cpu family      : 15
model           : 3
model name      : Intel(R) Pentium(R) 4 CPU 3.00GHz
stepping        : 4
cpu MHz         : 3000.473
cache size      : 1024 KB
physical id     : 0
siblings        : 1
core id         : 0
cpu cores       : 1
apicid          : 0
initial apicid  : 0
fdiv_bug        : no
hlt_bug         : no
f00f_bug        : no
coma_bug        : no
fpu             : yes
fpu_exception   : yes
cpuid level     : 5
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca 
cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe constant_tsc 
up pebs bts pni dtes64 monitor ds_cpl cid xtpr
bogomips        : 6000.94
clflush size    : 64
power management:

lscpi -vvv for the adapter shows:
03:02.0 Network controller: Atheros Communications Inc. AR5416 802.11abgn 
Wireless PCI Adapter (rev 01)
        Subsystem: Linksys Device 0061
        Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr+ 
Stepping- SERR+ FastB2B- DisINTx-
        Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 168, Cache Line Size: 32 bytes
        Interrupt: pin A routed to IRQ 25
        Region 0: Memory@d0200000 (32-bit, non-prefetchable) [size=64K]
        Capabilities: [40] #80 [0000]
        Kernel driver in use: ath9k
        Kernel modules: ath9k

If more information is needed I am happy to provide.

Greetings,
Willem

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] [PATCH] still same problem
  2009-01-21 17:01                             ` [ath9k-devel] [PATCH] " Chris Kennedy
@ 2009-01-21 21:05                               ` W. van den Akker
  2009-01-27  9:30                                 ` W. van den Akker
  0 siblings, 1 reply; 28+ messages in thread
From: W. van den Akker @ 2009-01-21 21:05 UTC (permalink / raw)
  To: ath9k-devel

On Wednesday 21 January 2009, Chris Kennedy wrote:
> On Jan 21, 2009, at 6:35 AM, Jouni Malinen wrote:
> > On Wed, 2009-01-21 at 02:11 -0800, W. van den Akker wrote:
> >> I have tested with 1 CPU disabled. Running now for about 2 hours
> >> without
> >> hangups (I have several connection drops, but it will reconnect).
> >>
> >> I have also tested with 2 CPU's and HT-disabled (noht parameter)
> >> but that
> >> had no effect.
> >>
> >> So SMP looks like the cause of the problem here.
> >
> > Thanks for testing this! I'm running most of my tests with a dual core
> > system, so SMP is being used, but with two cores, not two separate
> > chips. I don't think there should be much difference there, but
> > certainly our hardware configuration is different.
> >
> > Could you please describe your hardware with more details so that
> > we can
> > see whether we could find a similar system to try to reproduce
> > this? Is
> > this the IBM 206 eserver with WMP300N you mentioned in an earlier
> > message in the thread? That seems to be (by default) a uni-processor
> > setup, so I would like to make sure we understand what is the exact
> > hardware used here since I do not think we have been able to reproduce
> > this type of issue so far in any dual core systems.
> >
> > - Jouni
>
> This is a patch (against yesterdays current wireless-testing kernel)
>   that shows basically what we did in the IVTV driver,
> essentially holding a spin_lock() in the interrupt handler.  So it
> isn't even used, since compiled out, if a system isn't SMP.  When the
> system is SMP it prevents multiple instances of the interrupt handler
> from happening.  This seemed to be the magic fix there, I'm not sure if
> this is totally correct for this driver, but it's a patch to test
> (hopefully the
> spinlock I used makes some sense, seems this also is good since
> wouldn't want to reset the card and have interrupts happen while
> that's going on?).  It's at least an example showing what I suspect
> could
> fix it, and would be interesting if you had the ability to test it on
> your SMP
> system to see if it is just the interrupt handler where it's happening.
>

Thanks Chris,

I applied the patch.... but no luck. Systems still hangs.

gr,
Willem

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] [PATCH] still same problem
  2009-01-21 21:05                               ` W. van den Akker
@ 2009-01-27  9:30                                 ` W. van den Akker
  2009-01-27 21:47                                   ` Chris Kennedy
  0 siblings, 1 reply; 28+ messages in thread
From: W. van den Akker @ 2009-01-27  9:30 UTC (permalink / raw)
  To: ath9k-devel

On Wednesday 21 January 2009, W. van den Akker wrote:
> On Wednesday 21 January 2009, Chris Kennedy wrote:
> > On Jan 21, 2009, at 6:35 AM, Jouni Malinen wrote:
> > > On Wed, 2009-01-21 at 02:11 -0800, W. van den Akker wrote:
> > >> I have tested with 1 CPU disabled. Running now for about 2 hours
> > >> without
> > >> hangups (I have several connection drops, but it will reconnect).
> > >>
> > >> I have also tested with 2 CPU's and HT-disabled (noht parameter)
> > >> but that
> > >> had no effect.
> > >>
> > >> So SMP looks like the cause of the problem here.
> > >
> > > Thanks for testing this! I'm running most of my tests with a dual core
> > > system, so SMP is being used, but with two cores, not two separate
> > > chips. I don't think there should be much difference there, but
> > > certainly our hardware configuration is different.
> > >
> > > Could you please describe your hardware with more details so that
> > > we can
> > > see whether we could find a similar system to try to reproduce
> > > this? Is
> > > this the IBM 206 eserver with WMP300N you mentioned in an earlier
> > > message in the thread? That seems to be (by default) a uni-processor
> > > setup, so I would like to make sure we understand what is the exact
> > > hardware used here since I do not think we have been able to reproduce
> > > this type of issue so far in any dual core systems.
> > >
> > > - Jouni
> >
> > This is a patch (against yesterdays current wireless-testing kernel)
> >   that shows basically what we did in the IVTV driver,
> > essentially holding a spin_lock() in the interrupt handler.  So it
> > isn't even used, since compiled out, if a system isn't SMP.  When the
> > system is SMP it prevents multiple instances of the interrupt handler
> > from happening.  This seemed to be the magic fix there, I'm not sure if
> > this is totally correct for this driver, but it's a patch to test
> > (hopefully the
> > spinlock I used makes some sense, seems this also is good since
> > wouldn't want to reset the card and have interrupts happen while
> > that's going on?).  It's at least an example showing what I suspect
> > could
> > fix it, and would be interesting if you had the ability to test it on
> > your SMP
> > system to see if it is just the interrupt handler where it's happening.
>
> Thanks Chris,
>
> I applied the patch.... but no luck. Systems still hangs.
>

Running for about a week with one CPU. The driver works ok and the 
connection is rock-solid.

Any progress on the dual-CPU lockup?

gr,

Willem

-- 
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.

^ permalink raw reply	[flat|nested] 28+ messages in thread

* [ath9k-devel] [PATCH] still same problem
  2009-01-27  9:30                                 ` W. van den Akker
@ 2009-01-27 21:47                                   ` Chris Kennedy
  0 siblings, 0 replies; 28+ messages in thread
From: Chris Kennedy @ 2009-01-27 21:47 UTC (permalink / raw)
  To: ath9k-devel


On Jan 27, 2009, at 3:30 AM, W. van den Akker wrote:

> On Wednesday 21 January 2009, W. van den Akker wrote:
>> On Wednesday 21 January 2009, Chris Kennedy wrote:
>>> On Jan 21, 2009, at 6:35 AM, Jouni Malinen wrote:
>>>> On Wed, 2009-01-21 at 02:11 -0800, W. van den Akker wrote:
>>>>> I have tested with 1 CPU disabled. Running now for about 2 hours
>>>>> without
>>>>> hangups (I have several connection drops, but it will reconnect).
>>>>>
>>>>> I have also tested with 2 CPU's and HT-disabled (noht parameter)
>>>>> but that
>>>>> had no effect.
>>>>>
>>>>> So SMP looks like the cause of the problem here.
>>>>
>>>> Thanks for testing this! I'm running most of my tests with a  
>>>> dual core
>>>> system, so SMP is being used, but with two cores, not two separate
>>>> chips. I don't think there should be much difference there, but
>>>> certainly our hardware configuration is different.
>>>>
>>>> Could you please describe your hardware with more details so that
>>>> we can
>>>> see whether we could find a similar system to try to reproduce
>>>> this? Is
>>>> this the IBM 206 eserver with WMP300N you mentioned in an earlier
>>>> message in the thread? That seems to be (by default) a uni- 
>>>> processor
>>>> setup, so I would like to make sure we understand what is the exact
>>>> hardware used here since I do not think we have been able to  
>>>> reproduce
>>>> this type of issue so far in any dual core systems.
>>>>
>>>> - Jouni
>>>
>>> This is a patch (against yesterdays current wireless-testing kernel)
>>>   that shows basically what we did in the IVTV driver,
>>> essentially holding a spin_lock() in the interrupt handler.  So it
>>> isn't even used, since compiled out, if a system isn't SMP.  When  
>>> the
>>> system is SMP it prevents multiple instances of the interrupt  
>>> handler
>>> from happening.  This seemed to be the magic fix there, I'm not  
>>> sure if
>>> this is totally correct for this driver, but it's a patch to test
>>> (hopefully the
>>> spinlock I used makes some sense, seems this also is good since
>>> wouldn't want to reset the card and have interrupts happen while
>>> that's going on?).  It's at least an example showing what I suspect
>>> could
>>> fix it, and would be interesting if you had the ability to test  
>>> it on
>>> your SMP
>>> system to see if it is just the interrupt handler where it's  
>>> happening.
>>
>> Thanks Chris,
>>
>> I applied the patch.... but no luck. Systems still hangs.
>>
>
> Running for about a week with one CPU. The driver works ok and the
> connection is rock-solid.
>
> Any progress on the dual-CPU lockup?

Same here, great on my one CPU system, minor kernel panic but that
only happened 2 times when changing settings and restarting things
so not triggered by normal use by me.  Unfortunately my SMP system
is too much 'in use' for me to test more on, and not wanting any risk  
of data loss.
Eventually would like to move the card into it of course since it's  
my main router system and
would be nice to have my wireless network routing directly into my main
router.  These SMP type system freezes seem hard to figure out, I'm not
even sure how to really get detailed data and pinpoint where the locking
problems would be.  It's just frozen and I'm not sure a serial  
console would
even show anything or not.  I think the usual procedure is to review  
the locking
and trial and error on an SMP system, guessing what could be wrong,  
doing it
by feel but basically blind about what your trying to fix.  This is  
probably one of the
hardest parts of kernel development, especially for someone who  
didn't write the code
or knows the code inside and out, and even then I'm suspecting it's  
very elusive and
has to do with both the locking and the way the hardware is being  
accessed.  Also
it's probaby rarer for people to be using a box as an AP that is SMP,  
since really
just need a low grade single processor system to do a dedicated  
routing system like
this.  So I'm not getting my hopes up for my SMP box to be able to  
have the AP
functionality directly on it anytime soon :(, although it would be  
nice I'm getting by
pretty good on my spare UP system.

Thanks,
Chris



>
> gr,
>
> Willem
>
> -- 
> This message has been scanned for viruses and
> dangerous content by MailScanner, and is
> believed to be clean.
>

^ permalink raw reply	[flat|nested] 28+ messages in thread

end of thread, other threads:[~2009-01-27 21:47 UTC | newest]

Thread overview: 28+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2009-01-04  1:26 [ath9k-devel] still same problem Dmitri Seletski
2009-01-04  1:53 ` Barry Green
2009-01-04  3:34   ` Luis R. Rodriguez
2009-01-04  9:58     ` Jens Ott - Opteamax
2009-01-05 16:09       ` Luis R. Rodriguez
2009-01-17 23:55       ` W. van den Akker
2009-01-18  1:41         ` Jens Ott - Opteamax
2009-01-18  9:14           ` W. van den Akker
2009-01-18  5:17         ` Dmitri Seletski
2009-01-18  9:34           ` W. van den Akker
2009-01-18 15:16             ` Luis R. Rodriguez
2009-01-19  8:35               ` W. van den Akker
2009-01-19 22:31               ` W. van den Akker
2009-01-20 17:48                 ` Luis R. Rodriguez
2009-01-20 18:32                   ` W. van den Akker
2009-01-20 22:22                   ` W. van den Akker
2009-01-20 23:08                     ` Chris Kennedy
2009-01-21  7:54                       ` W. van den Akker
2009-01-21 10:11                         ` W. van den Akker
2009-01-21 12:35                           ` Jouni Malinen
2009-01-21 17:01                             ` [ath9k-devel] [PATCH] " Chris Kennedy
2009-01-21 21:05                               ` W. van den Akker
2009-01-27  9:30                                 ` W. van den Akker
2009-01-27 21:47                                   ` Chris Kennedy
2009-01-21 17:15                             ` [ath9k-devel] " W. van den Akker
2009-01-21 10:17                         ` W. van den Akker
2009-01-04 10:33 ` W. van den Akker
2009-01-04 10:47   ` W. van den Akker

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.