* [PATCH] Unaligend accesses nulldevname
@ 2004-01-07 23:00 Kurt Garloff
2004-01-11 10:01 ` Harald Welte
0 siblings, 1 reply; 2+ messages in thread
From: Kurt Garloff @ 2004-01-07 23:00 UTC (permalink / raw)
To: Linux kernel list, netfilter-devel
[-- Attachment #1.1: Type: text/plain, Size: 1122 bytes --]
Hi,
I found an excessive amount of unaligned accesses on my AXP workstation
and tracked it down to ip_packet_match() in the ip_tables module.
indev and outdev are not properly aligned if set to nulldevname in
ipt_do_table().
This destroys the benefits of comparing names in units of (long) and
on architectures with expensive unaligned accesses (such as ia64 or
alpha), it even hurts a lot.
Find attached a patch against 2.6.0. A similar patch is needed for 2.4,
also attached.
Please consider merging them.
Looking at ip_packet_match(), I have two more thoughts:
* It should not be inlined. It's too large to benefit from inlining,
IMHO. (OTOH, it's only called from one place, so it does not
really matter.)
* There's a comment about the compiler being able to unroll the 2/4
(64/32bit) iter loop which is not completely appropriate: We don't
pass -funroll-loops, so gcc does not do it :-(
It would be beneficial though.
Regards,
--
Kurt Garloff <garloff@suse.de> Cologne, DE
SUSE LINUX AG, Nuernberg, DE SUSE Labs (Head)
[-- Attachment #1.2: iptables-nulldevname-unaligned26.diff --]
[-- Type: text/plain, Size: 445 bytes --]
--- linux-2.6.0.ix86/net/ipv4/netfilter/ip_tables.c.orig 2003-12-18 03:58:28.000000000 +0100
+++ linux-2.6.0.ix86/net/ipv4/netfilter/ip_tables.c 2004-01-07 23:49:29.000000000 +0100
@@ -260,7 +260,7 @@
struct ipt_table *table,
void *userdata)
{
- static const char nulldevname[IFNAMSIZ];
+ static const char nulldevname[IFNAMSIZ] __attribute__((aligned(sizeof(long))));
u_int16_t offset;
struct iphdr *ip;
u_int16_t datalen;
[-- Attachment #1.3: iptables-nulldevname-unaligned24.diff --]
[-- Type: text/plain, Size: 449 bytes --]
--- linux-2.4.19/net/ipv4/netfilter/ip_tables.c 2002-02-25 20:38:14.000000000 +0100
+++ linux-2.4.19.AXP/net/ipv4/netfilter/ip_tables.c 2004-01-09 01:55:01.000000000 +0100
@@ -259,7 +264,7 @@
struct ipt_table *table,
void *userdata)
{
- static const char nulldevname[IFNAMSIZ] = { 0 };
+ static const char nulldevname[IFNAMSIZ] __attribute__((aligned(sizeof(long)))) = { 0 };
u_int16_t offset;
struct iphdr *ip;
void *protohdr;
[-- Attachment #2: Type: application/pgp-signature, Size: 189 bytes --]
^ permalink raw reply [flat|nested] 2+ messages in thread
* Re: [PATCH] Unaligend accesses nulldevname
2004-01-07 23:00 [PATCH] Unaligend accesses nulldevname Kurt Garloff
@ 2004-01-11 10:01 ` Harald Welte
0 siblings, 0 replies; 2+ messages in thread
From: Harald Welte @ 2004-01-11 10:01 UTC (permalink / raw)
To: Kurt Garloff, Linux kernel list, netfilter-devel
[-- Attachment #1: Type: text/plain, Size: 2050 bytes --]
On Thu, Jan 08, 2004 at 12:00:11AM +0100, Kurt Garloff wrote:
> Hi,
>
> I found an excessive amount of unaligned accesses on my AXP workstation
> and tracked it down to ip_packet_match() in the ip_tables module.
> indev and outdev are not properly aligned if set to nulldevname in
> ipt_do_table().
> This destroys the benefits of comparing names in units of (long) and
> on architectures with expensive unaligned accesses (such as ia64 or
> alpha), it even hurts a lot.
Thanks for submitting your patch, this is the same bug report as
https://bugzilla.netfilter.org/cgi-bin/bugzilla/show_bug.cgi?id=84
> Find attached a patch against 2.6.0. A similar patch is needed for 2.4,
> also attached.
the fix is already in our patch-o-matic CVS tree, and I'm going to
submit this to davem together with some other fixes I have pending.
> Looking at ip_packet_match(), I have two more thoughts:
> * It should not be inlined. It's too large to benefit from inlining,
> IMHO. (OTOH, it's only called from one place, so it does not
> really matter.)
That is the idea. It's just split in two functions to make it more
readable. It's never intended to be called from somewhere else.
> * There's a comment about the compiler being able to unroll the 2/4
> (64/32bit) iter loop which is not completely appropriate: We don't
> pass -funroll-loops, so gcc does not do it :-(
> It would be beneficial though.
I'm not a compiler geek. Thanks for pointing this out, I will do some
testing and look at the compiler output to see what is happening.
> Regards,
> --
> Kurt Garloff <garloff@suse.de> Cologne, DE
--
- Harald Welte <laforge@netfilter.org> http://www.netfilter.org/
============================================================================
"Fragmentation is like classful addressing -- an interesting early
architectural error that shows how much experimentation was going
on while IP was being designed." -- Paul Vixie
[-- Attachment #2: Digital signature --]
[-- Type: application/pgp-signature, Size: 189 bytes --]
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2004-01-11 10:05 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2004-01-07 23:00 [PATCH] Unaligend accesses nulldevname Kurt Garloff
2004-01-11 10:01 ` Harald Welte
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).