* [PATCH v2 0/5] mm: modest useability enhancements for node sysfs attrs
@ 2009-10-22 4:15 ` Alex Chiang
0 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-22 4:15 UTC (permalink / raw)
To: akpm; +Cc: linux-mm, linux-kernel
This is v2 of the series.
The last patch in this series is dependent upon the documentation patch
series that I just sent out a few moments ago:
http://thread.gmane.org/gmane.linux.kernel/905018
Thanks,
/ac
v1 -> v2: http://thread.gmane.org/gmane.linux.kernel.mm/40084/
Address David Rientjes's comments
- check return value of sysfs_create_link in register_cpu_under_node
- do /not/ convert [un]register_cpu_under_node to return void, since
sparse starts whinging if you ignore sysfs_create_link()'s return
value and working around sparse makes the code ugly
- adjust documentation
Added S390 maintainers to cc: for patch [1/5] as per Kame-san's
suggestion. S390 may map a memory section to more than one node,
causing this series to break.
---
Alex Chiang (5):
mm: add numa node symlink for memory section in sysfs
mm: refactor register_cpu_under_node()
mm: refactor unregister_cpu_under_node()
mm: add numa node symlink for cpu devices in sysfs
Documentation: ABI: /sys/devices/system/cpu/cpu#/node
Documentation/ABI/testing/sysfs-devices-memory | 14 ++++-
Documentation/ABI/testing/sysfs-devices-system-cpu | 15 +++++
Documentation/memory-hotplug.txt | 11 ++--
drivers/base/node.c | 58 ++++++++++++++------
4 files changed, 77 insertions(+), 21 deletions(-)
^ permalink raw reply [flat|nested] 42+ messages in thread
* [PATCH v2 0/5] mm: modest useability enhancements for node sysfs attrs
@ 2009-10-22 4:15 ` Alex Chiang
0 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-22 4:15 UTC (permalink / raw)
To: akpm; +Cc: linux-mm, linux-kernel
This is v2 of the series.
The last patch in this series is dependent upon the documentation patch
series that I just sent out a few moments ago:
http://thread.gmane.org/gmane.linux.kernel/905018
Thanks,
/ac
v1 -> v2: http://thread.gmane.org/gmane.linux.kernel.mm/40084/
Address David Rientjes's comments
- check return value of sysfs_create_link in register_cpu_under_node
- do /not/ convert [un]register_cpu_under_node to return void, since
sparse starts whinging if you ignore sysfs_create_link()'s return
value and working around sparse makes the code ugly
- adjust documentation
Added S390 maintainers to cc: for patch [1/5] as per Kame-san's
suggestion. S390 may map a memory section to more than one node,
causing this series to break.
---
Alex Chiang (5):
mm: add numa node symlink for memory section in sysfs
mm: refactor register_cpu_under_node()
mm: refactor unregister_cpu_under_node()
mm: add numa node symlink for cpu devices in sysfs
Documentation: ABI: /sys/devices/system/cpu/cpu#/node
Documentation/ABI/testing/sysfs-devices-memory | 14 ++++-
Documentation/ABI/testing/sysfs-devices-system-cpu | 15 +++++
Documentation/memory-hotplug.txt | 11 ++--
drivers/base/node.c | 58 ++++++++++++++------
4 files changed, 77 insertions(+), 21 deletions(-)
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 42+ messages in thread
* [PATCH v2 1/5] mm: add numa node symlink for memory section in sysfs
2009-10-22 4:15 ` Alex Chiang
@ 2009-10-22 4:15 ` Alex Chiang
-1 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-22 4:15 UTC (permalink / raw)
To: akpm
Cc: Gary Hade, Heiko Carstens, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
Commit c04fc586c (mm: show node to memory section relationship with
symlinks in sysfs) created symlinks from nodes to memory sections, e.g.
/sys/devices/system/node/node1/memory135 -> ../../memory/memory135
If you're examining the memory section though and are wondering what
node it might belong to, you can find it by grovelling around in
sysfs, but it's a little cumbersome.
Add a reverse symlink for each memory section that points back to the
node to which it belongs.
Cc: Martin Schwidefsky <schwidefsky@de.ibm.com>
Cc: Heiko Carstens <heiko.carstens@de.ibm.com>
Cc: Gary Hade <garyhade@us.ibm.com>
Cc: Badari Pulavarty <pbadari@us.ibm.com>
Cc: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Alex Chiang <achiang@hp.com>
---
Documentation/ABI/testing/sysfs-devices-memory | 14 +++++++++++++-
Documentation/memory-hotplug.txt | 11 +++++++----
drivers/base/node.c | 11 ++++++++++-
3 files changed, 30 insertions(+), 6 deletions(-)
diff --git a/Documentation/ABI/testing/sysfs-devices-memory b/Documentation/ABI/testing/sysfs-devices-memory
index 9fe91c0..bf1627b 100644
--- a/Documentation/ABI/testing/sysfs-devices-memory
+++ b/Documentation/ABI/testing/sysfs-devices-memory
@@ -60,6 +60,19 @@ Description:
Users: hotplug memory remove tools
https://w3.opensource.ibm.com/projects/powerpc-utils/
+
+What: /sys/devices/system/memoryX/nodeY
+Date: October 2009
+Contact: Linux Memory Management list <linux-mm@kvack.org>
+Description:
+ When CONFIG_NUMA is enabled, a symbolic link that
+ points to the corresponding NUMA node directory.
+
+ For example, the following symbolic link is created for
+ memory section 9 on node0:
+ /sys/devices/system/memory/memory9/node0 -> ../../node/node0
+
+
What: /sys/devices/system/node/nodeX/memoryY
Date: September 2008
Contact: Gary Hade <garyhade@us.ibm.com>
@@ -70,4 +83,3 @@ Description:
memory section directory. For example, the following symbolic
link is created for memory section 9 on node0.
/sys/devices/system/node/node0/memory9 -> ../../memory/memory9
-
diff --git a/Documentation/memory-hotplug.txt b/Documentation/memory-hotplug.txt
index bbc8a6a..57e7e9c 100644
--- a/Documentation/memory-hotplug.txt
+++ b/Documentation/memory-hotplug.txt
@@ -160,12 +160,15 @@ Under each section, you can see 4 files.
NOTE:
These directories/files appear after physical memory hotplug phase.
-If CONFIG_NUMA is enabled the
-/sys/devices/system/memory/memoryXXX memory section
-directories can also be accessed via symbolic links located in
-the /sys/devices/system/node/node* directories. For example:
+If CONFIG_NUMA is enabled the memoryXXX/ directories can also be accessed
+via symbolic links located in the /sys/devices/system/node/node* directories.
+
+For example:
/sys/devices/system/node/node0/memory9 -> ../../memory/memory9
+A backlink will also be created:
+/sys/devices/system/memory/memory9/node0 -> ../../node/node0
+
--------------------------------
4. Physical memory hot-add phase
--------------------------------
diff --git a/drivers/base/node.c b/drivers/base/node.c
index 1fe5536..3108b21 100644
--- a/drivers/base/node.c
+++ b/drivers/base/node.c
@@ -268,6 +268,7 @@ static int get_nid_for_pfn(unsigned long pfn)
/* register memory section under specified node if it spans that node */
int register_mem_sect_under_node(struct memory_block *mem_blk, int nid)
{
+ int ret;
unsigned long pfn, sect_start_pfn, sect_end_pfn;
if (!mem_blk)
@@ -284,9 +285,15 @@ int register_mem_sect_under_node(struct memory_block *mem_blk, int nid)
continue;
if (page_nid != nid)
continue;
- return sysfs_create_link_nowarn(&node_devices[nid].sysdev.kobj,
+ ret = sysfs_create_link_nowarn(&node_devices[nid].sysdev.kobj,
&mem_blk->sysdev.kobj,
kobject_name(&mem_blk->sysdev.kobj));
+ if (ret)
+ return ret;
+
+ return sysfs_create_link_nowarn(&mem_blk->sysdev.kobj,
+ &node_devices[nid].sysdev.kobj,
+ kobject_name(&node_devices[nid].sysdev.kobj));
}
/* mem section does not span the specified node */
return 0;
@@ -315,6 +322,8 @@ int unregister_mem_sect_under_nodes(struct memory_block *mem_blk)
continue;
sysfs_remove_link(&node_devices[nid].sysdev.kobj,
kobject_name(&mem_blk->sysdev.kobj));
+ sysfs_remove_link(&mem_blk->sysdev.kobj,
+ kobject_name(&node_devices[nid].sysdev.kobj));
}
return 0;
}
^ permalink raw reply related [flat|nested] 42+ messages in thread
* [PATCH v2 1/5] mm: add numa node symlink for memory section in sysfs
@ 2009-10-22 4:15 ` Alex Chiang
0 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-22 4:15 UTC (permalink / raw)
To: akpm
Cc: Gary Hade, Heiko Carstens, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
Commit c04fc586c (mm: show node to memory section relationship with
symlinks in sysfs) created symlinks from nodes to memory sections, e.g.
/sys/devices/system/node/node1/memory135 -> ../../memory/memory135
If you're examining the memory section though and are wondering what
node it might belong to, you can find it by grovelling around in
sysfs, but it's a little cumbersome.
Add a reverse symlink for each memory section that points back to the
node to which it belongs.
Cc: Martin Schwidefsky <schwidefsky@de.ibm.com>
Cc: Heiko Carstens <heiko.carstens@de.ibm.com>
Cc: Gary Hade <garyhade@us.ibm.com>
Cc: Badari Pulavarty <pbadari@us.ibm.com>
Cc: Ingo Molnar <mingo@elte.hu>
Signed-off-by: Alex Chiang <achiang@hp.com>
---
Documentation/ABI/testing/sysfs-devices-memory | 14 +++++++++++++-
Documentation/memory-hotplug.txt | 11 +++++++----
drivers/base/node.c | 11 ++++++++++-
3 files changed, 30 insertions(+), 6 deletions(-)
diff --git a/Documentation/ABI/testing/sysfs-devices-memory b/Documentation/ABI/testing/sysfs-devices-memory
index 9fe91c0..bf1627b 100644
--- a/Documentation/ABI/testing/sysfs-devices-memory
+++ b/Documentation/ABI/testing/sysfs-devices-memory
@@ -60,6 +60,19 @@ Description:
Users: hotplug memory remove tools
https://w3.opensource.ibm.com/projects/powerpc-utils/
+
+What: /sys/devices/system/memoryX/nodeY
+Date: October 2009
+Contact: Linux Memory Management list <linux-mm@kvack.org>
+Description:
+ When CONFIG_NUMA is enabled, a symbolic link that
+ points to the corresponding NUMA node directory.
+
+ For example, the following symbolic link is created for
+ memory section 9 on node0:
+ /sys/devices/system/memory/memory9/node0 -> ../../node/node0
+
+
What: /sys/devices/system/node/nodeX/memoryY
Date: September 2008
Contact: Gary Hade <garyhade@us.ibm.com>
@@ -70,4 +83,3 @@ Description:
memory section directory. For example, the following symbolic
link is created for memory section 9 on node0.
/sys/devices/system/node/node0/memory9 -> ../../memory/memory9
-
diff --git a/Documentation/memory-hotplug.txt b/Documentation/memory-hotplug.txt
index bbc8a6a..57e7e9c 100644
--- a/Documentation/memory-hotplug.txt
+++ b/Documentation/memory-hotplug.txt
@@ -160,12 +160,15 @@ Under each section, you can see 4 files.
NOTE:
These directories/files appear after physical memory hotplug phase.
-If CONFIG_NUMA is enabled the
-/sys/devices/system/memory/memoryXXX memory section
-directories can also be accessed via symbolic links located in
-the /sys/devices/system/node/node* directories. For example:
+If CONFIG_NUMA is enabled the memoryXXX/ directories can also be accessed
+via symbolic links located in the /sys/devices/system/node/node* directories.
+
+For example:
/sys/devices/system/node/node0/memory9 -> ../../memory/memory9
+A backlink will also be created:
+/sys/devices/system/memory/memory9/node0 -> ../../node/node0
+
--------------------------------
4. Physical memory hot-add phase
--------------------------------
diff --git a/drivers/base/node.c b/drivers/base/node.c
index 1fe5536..3108b21 100644
--- a/drivers/base/node.c
+++ b/drivers/base/node.c
@@ -268,6 +268,7 @@ static int get_nid_for_pfn(unsigned long pfn)
/* register memory section under specified node if it spans that node */
int register_mem_sect_under_node(struct memory_block *mem_blk, int nid)
{
+ int ret;
unsigned long pfn, sect_start_pfn, sect_end_pfn;
if (!mem_blk)
@@ -284,9 +285,15 @@ int register_mem_sect_under_node(struct memory_block *mem_blk, int nid)
continue;
if (page_nid != nid)
continue;
- return sysfs_create_link_nowarn(&node_devices[nid].sysdev.kobj,
+ ret = sysfs_create_link_nowarn(&node_devices[nid].sysdev.kobj,
&mem_blk->sysdev.kobj,
kobject_name(&mem_blk->sysdev.kobj));
+ if (ret)
+ return ret;
+
+ return sysfs_create_link_nowarn(&mem_blk->sysdev.kobj,
+ &node_devices[nid].sysdev.kobj,
+ kobject_name(&node_devices[nid].sysdev.kobj));
}
/* mem section does not span the specified node */
return 0;
@@ -315,6 +322,8 @@ int unregister_mem_sect_under_nodes(struct memory_block *mem_blk)
continue;
sysfs_remove_link(&node_devices[nid].sysdev.kobj,
kobject_name(&mem_blk->sysdev.kobj));
+ sysfs_remove_link(&mem_blk->sysdev.kobj,
+ kobject_name(&node_devices[nid].sysdev.kobj));
}
return 0;
}
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply related [flat|nested] 42+ messages in thread
* [PATCH v2 2/5] mm: refactor register_cpu_under_node()
2009-10-22 4:15 ` Alex Chiang
@ 2009-10-22 4:15 ` Alex Chiang
-1 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-22 4:15 UTC (permalink / raw)
To: akpm; +Cc: linux-mm, linux-kernel
By returning early if the node is not online, we can unindent the
interesting code by one level.
No functional change.
Signed-off-by: Alex Chiang <achiang@hp.com>
---
drivers/base/node.c | 20 +++++++++++---------
1 files changed, 11 insertions(+), 9 deletions(-)
diff --git a/drivers/base/node.c b/drivers/base/node.c
index 3108b21..ef7dd22 100644
--- a/drivers/base/node.c
+++ b/drivers/base/node.c
@@ -227,16 +227,18 @@ struct node node_devices[MAX_NUMNODES];
*/
int register_cpu_under_node(unsigned int cpu, unsigned int nid)
{
- if (node_online(nid)) {
- struct sys_device *obj = get_cpu_sysdev(cpu);
- if (!obj)
- return 0;
- return sysfs_create_link(&node_devices[nid].sysdev.kobj,
- &obj->kobj,
- kobject_name(&obj->kobj));
- }
+ struct sys_device *obj;
- return 0;
+ if (!node_online(nid))
+ return 0;
+
+ obj = get_cpu_sysdev(cpu);
+ if (!obj)
+ return 0;
+
+ return sysfs_create_link(&node_devices[nid].sysdev.kobj,
+ &obj->kobj,
+ kobject_name(&obj->kobj));
}
int unregister_cpu_under_node(unsigned int cpu, unsigned int nid)
^ permalink raw reply related [flat|nested] 42+ messages in thread
* [PATCH v2 2/5] mm: refactor register_cpu_under_node()
@ 2009-10-22 4:15 ` Alex Chiang
0 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-22 4:15 UTC (permalink / raw)
To: akpm; +Cc: linux-mm, linux-kernel
By returning early if the node is not online, we can unindent the
interesting code by one level.
No functional change.
Signed-off-by: Alex Chiang <achiang@hp.com>
---
drivers/base/node.c | 20 +++++++++++---------
1 files changed, 11 insertions(+), 9 deletions(-)
diff --git a/drivers/base/node.c b/drivers/base/node.c
index 3108b21..ef7dd22 100644
--- a/drivers/base/node.c
+++ b/drivers/base/node.c
@@ -227,16 +227,18 @@ struct node node_devices[MAX_NUMNODES];
*/
int register_cpu_under_node(unsigned int cpu, unsigned int nid)
{
- if (node_online(nid)) {
- struct sys_device *obj = get_cpu_sysdev(cpu);
- if (!obj)
- return 0;
- return sysfs_create_link(&node_devices[nid].sysdev.kobj,
- &obj->kobj,
- kobject_name(&obj->kobj));
- }
+ struct sys_device *obj;
- return 0;
+ if (!node_online(nid))
+ return 0;
+
+ obj = get_cpu_sysdev(cpu);
+ if (!obj)
+ return 0;
+
+ return sysfs_create_link(&node_devices[nid].sysdev.kobj,
+ &obj->kobj,
+ kobject_name(&obj->kobj));
}
int unregister_cpu_under_node(unsigned int cpu, unsigned int nid)
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply related [flat|nested] 42+ messages in thread
* [PATCH v2 3/5] mm: refactor unregister_cpu_under_node()
2009-10-22 4:15 ` Alex Chiang
@ 2009-10-22 4:15 ` Alex Chiang
-1 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-22 4:15 UTC (permalink / raw)
To: akpm; +Cc: linux-mm, linux-kernel
By returning early if the node is not online, we can unindent the
interesting code by two levels.
No functional change.
Signed-off-by: Alex Chiang <achiang@hp.com>
---
drivers/base/node.c | 18 ++++++++++++------
1 files changed, 12 insertions(+), 6 deletions(-)
diff --git a/drivers/base/node.c b/drivers/base/node.c
index ef7dd22..ffda067 100644
--- a/drivers/base/node.c
+++ b/drivers/base/node.c
@@ -243,12 +243,18 @@ int register_cpu_under_node(unsigned int cpu, unsigned int nid)
int unregister_cpu_under_node(unsigned int cpu, unsigned int nid)
{
- if (node_online(nid)) {
- struct sys_device *obj = get_cpu_sysdev(cpu);
- if (obj)
- sysfs_remove_link(&node_devices[nid].sysdev.kobj,
- kobject_name(&obj->kobj));
- }
+ struct sys_device *obj;
+
+ if (!node_online(nid))
+ return 0;
+
+ obj = get_cpu_sysdev(cpu);
+ if (!obj)
+ return 0;
+
+ sysfs_remove_link(&node_devices[nid].sysdev.kobj,
+ kobject_name(&obj->kobj));
+
return 0;
}
^ permalink raw reply related [flat|nested] 42+ messages in thread
* [PATCH v2 3/5] mm: refactor unregister_cpu_under_node()
@ 2009-10-22 4:15 ` Alex Chiang
0 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-22 4:15 UTC (permalink / raw)
To: akpm; +Cc: linux-mm, linux-kernel
By returning early if the node is not online, we can unindent the
interesting code by two levels.
No functional change.
Signed-off-by: Alex Chiang <achiang@hp.com>
---
drivers/base/node.c | 18 ++++++++++++------
1 files changed, 12 insertions(+), 6 deletions(-)
diff --git a/drivers/base/node.c b/drivers/base/node.c
index ef7dd22..ffda067 100644
--- a/drivers/base/node.c
+++ b/drivers/base/node.c
@@ -243,12 +243,18 @@ int register_cpu_under_node(unsigned int cpu, unsigned int nid)
int unregister_cpu_under_node(unsigned int cpu, unsigned int nid)
{
- if (node_online(nid)) {
- struct sys_device *obj = get_cpu_sysdev(cpu);
- if (obj)
- sysfs_remove_link(&node_devices[nid].sysdev.kobj,
- kobject_name(&obj->kobj));
- }
+ struct sys_device *obj;
+
+ if (!node_online(nid))
+ return 0;
+
+ obj = get_cpu_sysdev(cpu);
+ if (!obj)
+ return 0;
+
+ sysfs_remove_link(&node_devices[nid].sysdev.kobj,
+ kobject_name(&obj->kobj));
+
return 0;
}
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply related [flat|nested] 42+ messages in thread
* [PATCH v2 4/5] mm: add numa node symlink for cpu devices in sysfs
2009-10-22 4:15 ` Alex Chiang
@ 2009-10-22 4:15 ` Alex Chiang
-1 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-22 4:15 UTC (permalink / raw)
To: akpm; +Cc: linux-mm, linux-kernel
You can discover which CPUs belong to a NUMA node by examining
/sys/devices/system/node/node#/
However, it's not convenient to go in the other direction, when looking at
/sys/devices/system/cpu/cpu#/
Yes, you can muck about in sysfs, but adding these symlinks makes
life a lot more convenient.
Signed-off-by: Alex Chiang <achiang@hp.com>
---
drivers/base/node.c | 11 ++++++++++-
1 files changed, 10 insertions(+), 1 deletions(-)
diff --git a/drivers/base/node.c b/drivers/base/node.c
index ffda067..24fa962 100644
--- a/drivers/base/node.c
+++ b/drivers/base/node.c
@@ -227,6 +227,7 @@ struct node node_devices[MAX_NUMNODES];
*/
int register_cpu_under_node(unsigned int cpu, unsigned int nid)
{
+ int ret;
struct sys_device *obj;
if (!node_online(nid))
@@ -236,9 +237,15 @@ int register_cpu_under_node(unsigned int cpu, unsigned int nid)
if (!obj)
return 0;
- return sysfs_create_link(&node_devices[nid].sysdev.kobj,
+ ret = sysfs_create_link(&node_devices[nid].sysdev.kobj,
&obj->kobj,
kobject_name(&obj->kobj));
+ if (ret)
+ return ret;
+
+ return sysfs_create_link(&obj->kobj,
+ &node_devices[nid].sysdev.kobj,
+ kobject_name(&node_devices[nid].sysdev.kobj));
}
int unregister_cpu_under_node(unsigned int cpu, unsigned int nid)
@@ -254,6 +261,8 @@ int unregister_cpu_under_node(unsigned int cpu, unsigned int nid)
sysfs_remove_link(&node_devices[nid].sysdev.kobj,
kobject_name(&obj->kobj));
+ sysfs_remove_link(&obj->kobj,
+ kobject_name(&node_devices[nid].sysdev.kobj));
return 0;
}
^ permalink raw reply related [flat|nested] 42+ messages in thread
* [PATCH v2 4/5] mm: add numa node symlink for cpu devices in sysfs
@ 2009-10-22 4:15 ` Alex Chiang
0 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-22 4:15 UTC (permalink / raw)
To: akpm; +Cc: linux-mm, linux-kernel
You can discover which CPUs belong to a NUMA node by examining
/sys/devices/system/node/node#/
However, it's not convenient to go in the other direction, when looking at
/sys/devices/system/cpu/cpu#/
Yes, you can muck about in sysfs, but adding these symlinks makes
life a lot more convenient.
Signed-off-by: Alex Chiang <achiang@hp.com>
---
drivers/base/node.c | 11 ++++++++++-
1 files changed, 10 insertions(+), 1 deletions(-)
diff --git a/drivers/base/node.c b/drivers/base/node.c
index ffda067..24fa962 100644
--- a/drivers/base/node.c
+++ b/drivers/base/node.c
@@ -227,6 +227,7 @@ struct node node_devices[MAX_NUMNODES];
*/
int register_cpu_under_node(unsigned int cpu, unsigned int nid)
{
+ int ret;
struct sys_device *obj;
if (!node_online(nid))
@@ -236,9 +237,15 @@ int register_cpu_under_node(unsigned int cpu, unsigned int nid)
if (!obj)
return 0;
- return sysfs_create_link(&node_devices[nid].sysdev.kobj,
+ ret = sysfs_create_link(&node_devices[nid].sysdev.kobj,
&obj->kobj,
kobject_name(&obj->kobj));
+ if (ret)
+ return ret;
+
+ return sysfs_create_link(&obj->kobj,
+ &node_devices[nid].sysdev.kobj,
+ kobject_name(&node_devices[nid].sysdev.kobj));
}
int unregister_cpu_under_node(unsigned int cpu, unsigned int nid)
@@ -254,6 +261,8 @@ int unregister_cpu_under_node(unsigned int cpu, unsigned int nid)
sysfs_remove_link(&node_devices[nid].sysdev.kobj,
kobject_name(&obj->kobj));
+ sysfs_remove_link(&obj->kobj,
+ kobject_name(&node_devices[nid].sysdev.kobj));
return 0;
}
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply related [flat|nested] 42+ messages in thread
* [PATCH v2 5/5] Documentation: ABI: /sys/devices/system/cpu/cpu#/node
2009-10-22 4:15 ` Alex Chiang
@ 2009-10-22 4:15 ` Alex Chiang
-1 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-22 4:15 UTC (permalink / raw)
To: akpm; +Cc: Randy Dunlap, linux-mm, linux-kernel, Greg KH
Describe NUMA node symlink created for CPUs when CONFIG_NUMA is set.
Cc: Greg KH <greg@kroah.com>
Cc: Randy Dunlap <randy.dunlap@oracle.com>
Signed-off-by: Alex Chiang <achiang@hp.com>
---
Documentation/ABI/testing/sysfs-devices-system-cpu | 15 +++++++++++++++
1 files changed, 15 insertions(+), 0 deletions(-)
diff --git a/Documentation/ABI/testing/sysfs-devices-system-cpu b/Documentation/ABI/testing/sysfs-devices-system-cpu
index b400c34..67813ae 100644
--- a/Documentation/ABI/testing/sysfs-devices-system-cpu
+++ b/Documentation/ABI/testing/sysfs-devices-system-cpu
@@ -79,6 +79,21 @@ Description: Discover and change the online state of a CPU.
For more information, please read Documentation/cpu-hotplug.txt
+
+What: /sys/devices/system/cpu/cpu#/node
+Date: October 2009
+Contact: Linux memory management mailing list <linux-mm@kvack.org>
+Description: Discover NUMA node a CPU belongs to
+
+ When CONFIG_NUMA is enabled, a symbolic link that points
+ to the corresponding NUMA node directory.
+
+ For example, the following symlink is created for cpu42
+ in NUMA node 2:
+
+ /sys/devices/system/cpu/cpu42/node2 -> ../../node/node2
+
+
What: /sys/devices/system/cpu/cpu#/topology/core_id
/sys/devices/system/cpu/cpu#/topology/core_siblings
/sys/devices/system/cpu/cpu#/topology/core_siblings_list
^ permalink raw reply related [flat|nested] 42+ messages in thread
* [PATCH v2 5/5] Documentation: ABI: /sys/devices/system/cpu/cpu#/node
@ 2009-10-22 4:15 ` Alex Chiang
0 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-22 4:15 UTC (permalink / raw)
To: akpm; +Cc: Randy Dunlap, linux-mm, linux-kernel, Greg KH
Describe NUMA node symlink created for CPUs when CONFIG_NUMA is set.
Cc: Greg KH <greg@kroah.com>
Cc: Randy Dunlap <randy.dunlap@oracle.com>
Signed-off-by: Alex Chiang <achiang@hp.com>
---
Documentation/ABI/testing/sysfs-devices-system-cpu | 15 +++++++++++++++
1 files changed, 15 insertions(+), 0 deletions(-)
diff --git a/Documentation/ABI/testing/sysfs-devices-system-cpu b/Documentation/ABI/testing/sysfs-devices-system-cpu
index b400c34..67813ae 100644
--- a/Documentation/ABI/testing/sysfs-devices-system-cpu
+++ b/Documentation/ABI/testing/sysfs-devices-system-cpu
@@ -79,6 +79,21 @@ Description: Discover and change the online state of a CPU.
For more information, please read Documentation/cpu-hotplug.txt
+
+What: /sys/devices/system/cpu/cpu#/node
+Date: October 2009
+Contact: Linux memory management mailing list <linux-mm@kvack.org>
+Description: Discover NUMA node a CPU belongs to
+
+ When CONFIG_NUMA is enabled, a symbolic link that points
+ to the corresponding NUMA node directory.
+
+ For example, the following symlink is created for cpu42
+ in NUMA node 2:
+
+ /sys/devices/system/cpu/cpu42/node2 -> ../../node/node2
+
+
What: /sys/devices/system/cpu/cpu#/topology/core_id
/sys/devices/system/cpu/cpu#/topology/core_siblings
/sys/devices/system/cpu/cpu#/topology/core_siblings_list
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply related [flat|nested] 42+ messages in thread
* Re: [PATCH v2 1/5] mm: add numa node symlink for memory section in sysfs
2009-10-22 4:15 ` Alex Chiang
@ 2009-10-22 19:51 ` David Rientjes
-1 siblings, 0 replies; 42+ messages in thread
From: David Rientjes @ 2009-10-22 19:51 UTC (permalink / raw)
To: Alex Chiang
Cc: akpm, Gary Hade, Heiko Carstens, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
On Wed, 21 Oct 2009, Alex Chiang wrote:
> Commit c04fc586c (mm: show node to memory section relationship with
> symlinks in sysfs) created symlinks from nodes to memory sections, e.g.
>
> /sys/devices/system/node/node1/memory135 -> ../../memory/memory135
>
> If you're examining the memory section though and are wondering what
> node it might belong to, you can find it by grovelling around in
> sysfs, but it's a little cumbersome.
>
> Add a reverse symlink for each memory section that points back to the
> node to which it belongs.
>
> Cc: Martin Schwidefsky <schwidefsky@de.ibm.com>
> Cc: Heiko Carstens <heiko.carstens@de.ibm.com>
> Cc: Gary Hade <garyhade@us.ibm.com>
> Cc: Badari Pulavarty <pbadari@us.ibm.com>
> Cc: Ingo Molnar <mingo@elte.hu>
> Signed-off-by: Alex Chiang <achiang@hp.com>
Acked-by: David Rientjes <rientjes@google.com>
Very helpful backlinks to memory section nodes even though I have lots of
memory directories on some of my test machines :)
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [PATCH v2 1/5] mm: add numa node symlink for memory section in sysfs
@ 2009-10-22 19:51 ` David Rientjes
0 siblings, 0 replies; 42+ messages in thread
From: David Rientjes @ 2009-10-22 19:51 UTC (permalink / raw)
To: Alex Chiang
Cc: akpm, Gary Hade, Heiko Carstens, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
On Wed, 21 Oct 2009, Alex Chiang wrote:
> Commit c04fc586c (mm: show node to memory section relationship with
> symlinks in sysfs) created symlinks from nodes to memory sections, e.g.
>
> /sys/devices/system/node/node1/memory135 -> ../../memory/memory135
>
> If you're examining the memory section though and are wondering what
> node it might belong to, you can find it by grovelling around in
> sysfs, but it's a little cumbersome.
>
> Add a reverse symlink for each memory section that points back to the
> node to which it belongs.
>
> Cc: Martin Schwidefsky <schwidefsky@de.ibm.com>
> Cc: Heiko Carstens <heiko.carstens@de.ibm.com>
> Cc: Gary Hade <garyhade@us.ibm.com>
> Cc: Badari Pulavarty <pbadari@us.ibm.com>
> Cc: Ingo Molnar <mingo@elte.hu>
> Signed-off-by: Alex Chiang <achiang@hp.com>
Acked-by: David Rientjes <rientjes@google.com>
Very helpful backlinks to memory section nodes even though I have lots of
memory directories on some of my test machines :)
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [PATCH v2 4/5] mm: add numa node symlink for cpu devices in sysfs
2009-10-22 4:15 ` Alex Chiang
@ 2009-10-22 19:52 ` David Rientjes
-1 siblings, 0 replies; 42+ messages in thread
From: David Rientjes @ 2009-10-22 19:52 UTC (permalink / raw)
To: Alex Chiang; +Cc: akpm, linux-mm, linux-kernel
On Wed, 21 Oct 2009, Alex Chiang wrote:
> You can discover which CPUs belong to a NUMA node by examining
> /sys/devices/system/node/node#/
>
> However, it's not convenient to go in the other direction, when looking at
> /sys/devices/system/cpu/cpu#/
>
> Yes, you can muck about in sysfs, but adding these symlinks makes
> life a lot more convenient.
>
> Signed-off-by: Alex Chiang <achiang@hp.com>
Acked-by: David Rientjes <rientjes@google.com>
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [PATCH v2 4/5] mm: add numa node symlink for cpu devices in sysfs
@ 2009-10-22 19:52 ` David Rientjes
0 siblings, 0 replies; 42+ messages in thread
From: David Rientjes @ 2009-10-22 19:52 UTC (permalink / raw)
To: Alex Chiang; +Cc: akpm, linux-mm, linux-kernel
On Wed, 21 Oct 2009, Alex Chiang wrote:
> You can discover which CPUs belong to a NUMA node by examining
> /sys/devices/system/node/node#/
>
> However, it's not convenient to go in the other direction, when looking at
> /sys/devices/system/cpu/cpu#/
>
> Yes, you can muck about in sysfs, but adding these symlinks makes
> life a lot more convenient.
>
> Signed-off-by: Alex Chiang <achiang@hp.com>
Acked-by: David Rientjes <rientjes@google.com>
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [PATCH v2 1/5] mm: add numa node symlink for memory section in sysfs
2009-10-22 19:51 ` David Rientjes
@ 2009-10-27 19:59 ` Alex Chiang
-1 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-27 19:59 UTC (permalink / raw)
To: David Rientjes
Cc: akpm, Gary Hade, Heiko Carstens, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
Thank you for ACKing, David.
S390 guys, I cc'ed you on this patch because I heard a rumour
that your memory sections may belong to more than one NUMA node?
Is that true? If so, how would you like me to handle that
situation?
Any comments on this patch series would be appreciated.
Thanks.
/ac
* David Rientjes <rientjes@google.com>:
> On Wed, 21 Oct 2009, Alex Chiang wrote:
>
> > Commit c04fc586c (mm: show node to memory section relationship with
> > symlinks in sysfs) created symlinks from nodes to memory sections, e.g.
> >
> > /sys/devices/system/node/node1/memory135 -> ../../memory/memory135
> >
> > If you're examining the memory section though and are wondering what
> > node it might belong to, you can find it by grovelling around in
> > sysfs, but it's a little cumbersome.
> >
> > Add a reverse symlink for each memory section that points back to the
> > node to which it belongs.
> >
> > Cc: Martin Schwidefsky <schwidefsky@de.ibm.com>
> > Cc: Heiko Carstens <heiko.carstens@de.ibm.com>
> > Cc: Gary Hade <garyhade@us.ibm.com>
> > Cc: Badari Pulavarty <pbadari@us.ibm.com>
> > Cc: Ingo Molnar <mingo@elte.hu>
> > Signed-off-by: Alex Chiang <achiang@hp.com>
>
> Acked-by: David Rientjes <rientjes@google.com>
>
> Very helpful backlinks to memory section nodes even though I have lots of
> memory directories on some of my test machines :)
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [PATCH v2 1/5] mm: add numa node symlink for memory section in sysfs
@ 2009-10-27 19:59 ` Alex Chiang
0 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-27 19:59 UTC (permalink / raw)
To: David Rientjes
Cc: akpm, Gary Hade, Heiko Carstens, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
Thank you for ACKing, David.
S390 guys, I cc'ed you on this patch because I heard a rumour
that your memory sections may belong to more than one NUMA node?
Is that true? If so, how would you like me to handle that
situation?
Any comments on this patch series would be appreciated.
Thanks.
/ac
* David Rientjes <rientjes@google.com>:
> On Wed, 21 Oct 2009, Alex Chiang wrote:
>
> > Commit c04fc586c (mm: show node to memory section relationship with
> > symlinks in sysfs) created symlinks from nodes to memory sections, e.g.
> >
> > /sys/devices/system/node/node1/memory135 -> ../../memory/memory135
> >
> > If you're examining the memory section though and are wondering what
> > node it might belong to, you can find it by grovelling around in
> > sysfs, but it's a little cumbersome.
> >
> > Add a reverse symlink for each memory section that points back to the
> > node to which it belongs.
> >
> > Cc: Martin Schwidefsky <schwidefsky@de.ibm.com>
> > Cc: Heiko Carstens <heiko.carstens@de.ibm.com>
> > Cc: Gary Hade <garyhade@us.ibm.com>
> > Cc: Badari Pulavarty <pbadari@us.ibm.com>
> > Cc: Ingo Molnar <mingo@elte.hu>
> > Signed-off-by: Alex Chiang <achiang@hp.com>
>
> Acked-by: David Rientjes <rientjes@google.com>
>
> Very helpful backlinks to memory section nodes even though I have lots of
> memory directories on some of my test machines :)
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [PATCH v2 1/5] mm: add numa node symlink for memory section in sysfs
2009-10-27 19:59 ` Alex Chiang
@ 2009-10-27 21:27 ` David Rientjes
-1 siblings, 0 replies; 42+ messages in thread
From: David Rientjes @ 2009-10-27 21:27 UTC (permalink / raw)
To: Alex Chiang
Cc: Andrew Morton, Gary Hade, Heiko Carstens, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
On Tue, 27 Oct 2009, Alex Chiang wrote:
> Thank you for ACKing, David.
>
> S390 guys, I cc'ed you on this patch because I heard a rumour
> that your memory sections may belong to more than one NUMA node?
> Is that true? If so, how would you like me to handle that
> situation?
>
You're referring to how unregister_mem_sect_under_nodes() should be
handled, right? register_mem_sect_under_node() already looks supported by
your patch.
Since the unregister function includes a plural "nodes," I assume that
it's possible for hotplug to register a memory section to more than one
node. That's probably lacking on x86 currently, however, because we lack
node hotplug.
I'd suggest a similiar iteration through pfn's that the register function
does checking for multiple nodes and then removing the link from all
applicable node_devices kobj when unregistering.
Maybe one of the s390 maintainers will test that?
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [PATCH v2 1/5] mm: add numa node symlink for memory section in sysfs
@ 2009-10-27 21:27 ` David Rientjes
0 siblings, 0 replies; 42+ messages in thread
From: David Rientjes @ 2009-10-27 21:27 UTC (permalink / raw)
To: Alex Chiang
Cc: Andrew Morton, Gary Hade, Heiko Carstens, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
On Tue, 27 Oct 2009, Alex Chiang wrote:
> Thank you for ACKing, David.
>
> S390 guys, I cc'ed you on this patch because I heard a rumour
> that your memory sections may belong to more than one NUMA node?
> Is that true? If so, how would you like me to handle that
> situation?
>
You're referring to how unregister_mem_sect_under_nodes() should be
handled, right? register_mem_sect_under_node() already looks supported by
your patch.
Since the unregister function includes a plural "nodes," I assume that
it's possible for hotplug to register a memory section to more than one
node. That's probably lacking on x86 currently, however, because we lack
node hotplug.
I'd suggest a similiar iteration through pfn's that the register function
does checking for multiple nodes and then removing the link from all
applicable node_devices kobj when unregistering.
Maybe one of the s390 maintainers will test that?
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [PATCH v2 1/5] mm: add numa node symlink for memory section in sysfs
2009-10-27 21:27 ` David Rientjes
@ 2009-10-28 8:31 ` Heiko Carstens
-1 siblings, 0 replies; 42+ messages in thread
From: Heiko Carstens @ 2009-10-28 8:31 UTC (permalink / raw)
To: David Rientjes
Cc: Alex Chiang, Andrew Morton, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
On Tue, Oct 27, 2009 at 02:27:56PM -0700, David Rientjes wrote:
> On Tue, 27 Oct 2009, Alex Chiang wrote:
>
> > Thank you for ACKing, David.
> >
> > S390 guys, I cc'ed you on this patch because I heard a rumour
> > that your memory sections may belong to more than one NUMA node?
> > Is that true? If so, how would you like me to handle that
> > situation?
> >
>
> You're referring to how unregister_mem_sect_under_nodes() should be
> handled, right? register_mem_sect_under_node() already looks supported by
> your patch.
>
> Since the unregister function includes a plural "nodes," I assume that
> it's possible for hotplug to register a memory section to more than one
> node. That's probably lacking on x86 currently, however, because we lack
> node hotplug.
>
> I'd suggest a similiar iteration through pfn's that the register function
> does checking for multiple nodes and then removing the link from all
> applicable node_devices kobj when unregistering.
>
> Maybe one of the s390 maintainers will test that?
The short answer is: s390 doesn't support NUMA, because the hardware doesn't
tell us to which node (book in s390 terms) a memory range belongs to.
Memory layout for a logical partition is striped: first x mbyte belong to
node 0, next x mbyte belong to node 1, etc...
Also, since there is always a hypervisor running below Linux I don't think
it would make too much sense if we would know to which node a piece of
memory belongs to: if the hypervisor decides to schedule a virtual cpu of
a logical partition to a different node then what?
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [PATCH v2 1/5] mm: add numa node symlink for memory section in sysfs
@ 2009-10-28 8:31 ` Heiko Carstens
0 siblings, 0 replies; 42+ messages in thread
From: Heiko Carstens @ 2009-10-28 8:31 UTC (permalink / raw)
To: David Rientjes
Cc: Alex Chiang, Andrew Morton, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
On Tue, Oct 27, 2009 at 02:27:56PM -0700, David Rientjes wrote:
> On Tue, 27 Oct 2009, Alex Chiang wrote:
>
> > Thank you for ACKing, David.
> >
> > S390 guys, I cc'ed you on this patch because I heard a rumour
> > that your memory sections may belong to more than one NUMA node?
> > Is that true? If so, how would you like me to handle that
> > situation?
> >
>
> You're referring to how unregister_mem_sect_under_nodes() should be
> handled, right? register_mem_sect_under_node() already looks supported by
> your patch.
>
> Since the unregister function includes a plural "nodes," I assume that
> it's possible for hotplug to register a memory section to more than one
> node. That's probably lacking on x86 currently, however, because we lack
> node hotplug.
>
> I'd suggest a similiar iteration through pfn's that the register function
> does checking for multiple nodes and then removing the link from all
> applicable node_devices kobj when unregistering.
>
> Maybe one of the s390 maintainers will test that?
The short answer is: s390 doesn't support NUMA, because the hardware doesn't
tell us to which node (book in s390 terms) a memory range belongs to.
Memory layout for a logical partition is striped: first x mbyte belong to
node 0, next x mbyte belong to node 1, etc...
Also, since there is always a hypervisor running below Linux I don't think
it would make too much sense if we would know to which node a piece of
memory belongs to: if the hypervisor decides to schedule a virtual cpu of
a logical partition to a different node then what?
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [PATCH v2 1/5] mm: add numa node symlink for memory section in sysfs
2009-10-28 8:31 ` Heiko Carstens
@ 2009-10-28 9:03 ` David Rientjes
-1 siblings, 0 replies; 42+ messages in thread
From: David Rientjes @ 2009-10-28 9:03 UTC (permalink / raw)
To: Heiko Carstens
Cc: Alex Chiang, Andrew Morton, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
On Wed, 28 Oct 2009, Heiko Carstens wrote:
> The short answer is: s390 doesn't support NUMA, because the hardware doesn't
> tell us to which node (book in s390 terms) a memory range belongs to.
>
> Memory layout for a logical partition is striped: first x mbyte belong to
> node 0, next x mbyte belong to node 1, etc...
>
> Also, since there is always a hypervisor running below Linux I don't think
> it would make too much sense if we would know to which node a piece of
> memory belongs to: if the hypervisor decides to schedule a virtual cpu of
> a logical partition to a different node then what?
>
Ok, so the patchset is a no-op for s390 since it only utilizes the
!CONFIG_NUMA code.
Alex, I think the safest thing to do in unregister_mem_sect_under_nodes()
is to iterate though the section pfns and remove links to the node_device
kobjs for all the distinct pfn_to_nid()'s that it encounters.
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [PATCH v2 1/5] mm: add numa node symlink for memory section in sysfs
@ 2009-10-28 9:03 ` David Rientjes
0 siblings, 0 replies; 42+ messages in thread
From: David Rientjes @ 2009-10-28 9:03 UTC (permalink / raw)
To: Heiko Carstens
Cc: Alex Chiang, Andrew Morton, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
On Wed, 28 Oct 2009, Heiko Carstens wrote:
> The short answer is: s390 doesn't support NUMA, because the hardware doesn't
> tell us to which node (book in s390 terms) a memory range belongs to.
>
> Memory layout for a logical partition is striped: first x mbyte belong to
> node 0, next x mbyte belong to node 1, etc...
>
> Also, since there is always a hypervisor running below Linux I don't think
> it would make too much sense if we would know to which node a piece of
> memory belongs to: if the hypervisor decides to schedule a virtual cpu of
> a logical partition to a different node then what?
>
Ok, so the patchset is a no-op for s390 since it only utilizes the
!CONFIG_NUMA code.
Alex, I think the safest thing to do in unregister_mem_sect_under_nodes()
is to iterate though the section pfns and remove links to the node_device
kobjs for all the distinct pfn_to_nid()'s that it encounters.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [PATCH v2 1/5] mm: add numa node symlink for memory section in sysfs
2009-10-28 9:03 ` David Rientjes
@ 2009-10-28 17:15 ` Alex Chiang
-1 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-28 17:15 UTC (permalink / raw)
To: David Rientjes
Cc: Heiko Carstens, Andrew Morton, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
* David Rientjes <rientjes@google.com>:
> On Wed, 28 Oct 2009, Heiko Carstens wrote:
>
> > The short answer is: s390 doesn't support NUMA, because the hardware doesn't
> > tell us to which node (book in s390 terms) a memory range belongs to.
> >
> > Memory layout for a logical partition is striped: first x mbyte belong to
> > node 0, next x mbyte belong to node 1, etc...
> >
> > Also, since there is always a hypervisor running below Linux I don't think
> > it would make too much sense if we would know to which node a piece of
> > memory belongs to: if the hypervisor decides to schedule a virtual cpu of
> > a logical partition to a different node then what?
> >
>
> Ok, so the patchset is a no-op for s390 since it only utilizes the
> !CONFIG_NUMA code.
Sounds good.
> Alex, I think the safest thing to do in unregister_mem_sect_under_nodes()
> is to iterate though the section pfns and remove links to the node_device
> kobjs for all the distinct pfn_to_nid()'s that it encounters.
Ok, I will respin.
Thanks!
/ac
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [PATCH v2 1/5] mm: add numa node symlink for memory section in sysfs
@ 2009-10-28 17:15 ` Alex Chiang
0 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-28 17:15 UTC (permalink / raw)
To: David Rientjes
Cc: Heiko Carstens, Andrew Morton, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
* David Rientjes <rientjes@google.com>:
> On Wed, 28 Oct 2009, Heiko Carstens wrote:
>
> > The short answer is: s390 doesn't support NUMA, because the hardware doesn't
> > tell us to which node (book in s390 terms) a memory range belongs to.
> >
> > Memory layout for a logical partition is striped: first x mbyte belong to
> > node 0, next x mbyte belong to node 1, etc...
> >
> > Also, since there is always a hypervisor running below Linux I don't think
> > it would make too much sense if we would know to which node a piece of
> > memory belongs to: if the hypervisor decides to schedule a virtual cpu of
> > a logical partition to a different node then what?
> >
>
> Ok, so the patchset is a no-op for s390 since it only utilizes the
> !CONFIG_NUMA code.
Sounds good.
> Alex, I think the safest thing to do in unregister_mem_sect_under_nodes()
> is to iterate though the section pfns and remove links to the node_device
> kobjs for all the distinct pfn_to_nid()'s that it encounters.
Ok, I will respin.
Thanks!
/ac
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [PATCH v2 1/5] mm: add numa node symlink for memory section in sysfs
2009-10-28 9:03 ` David Rientjes
@ 2009-10-28 18:39 ` Alex Chiang
-1 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-28 18:39 UTC (permalink / raw)
To: David Rientjes
Cc: Heiko Carstens, Andrew Morton, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
* David Rientjes <rientjes@google.com>:
>
> Alex, I think the safest thing to do in unregister_mem_sect_under_nodes()
> is to iterate though the section pfns and remove links to the node_device
> kobjs for all the distinct pfn_to_nid()'s that it encounters.
Am I not understanding the code? It looks like we do this
already...
/* unregister memory section under all nodes that it spans */
int unregister_mem_sect_under_nodes(struct memory_block *mem_blk)
{
nodemask_t unlinked_nodes;
unsigned long pfn, sect_start_pfn, sect_end_pfn;
if (!mem_blk)
return -EFAULT;
nodes_clear(unlinked_nodes);
sect_start_pfn = section_nr_to_pfn(mem_blk->phys_index);
sect_end_pfn = sect_start_pfn + PAGES_PER_SECTION - 1;
for (pfn = sect_start_pfn; pfn <= sect_end_pfn; pfn++) {
int nid;
nid = get_nid_for_pfn(pfn);
if (nid < 0)
continue;
if (!node_online(nid))
continue;
if (node_test_and_set(nid, unlinked_nodes))
continue;
sysfs_remove_link(&node_devices[nid].sysdev.kobj,
kobject_name(&mem_blk->sysdev.kobj));
sysfs_remove_link(&mem_blk->sysdev.kobj,
kobject_name(&node_devices[nid].sysdev.kobj));
}
return 0;
}
Thanks,
/ac
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [PATCH v2 1/5] mm: add numa node symlink for memory section in sysfs
@ 2009-10-28 18:39 ` Alex Chiang
0 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-10-28 18:39 UTC (permalink / raw)
To: David Rientjes
Cc: Heiko Carstens, Andrew Morton, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
* David Rientjes <rientjes@google.com>:
>
> Alex, I think the safest thing to do in unregister_mem_sect_under_nodes()
> is to iterate though the section pfns and remove links to the node_device
> kobjs for all the distinct pfn_to_nid()'s that it encounters.
Am I not understanding the code? It looks like we do this
already...
/* unregister memory section under all nodes that it spans */
int unregister_mem_sect_under_nodes(struct memory_block *mem_blk)
{
nodemask_t unlinked_nodes;
unsigned long pfn, sect_start_pfn, sect_end_pfn;
if (!mem_blk)
return -EFAULT;
nodes_clear(unlinked_nodes);
sect_start_pfn = section_nr_to_pfn(mem_blk->phys_index);
sect_end_pfn = sect_start_pfn + PAGES_PER_SECTION - 1;
for (pfn = sect_start_pfn; pfn <= sect_end_pfn; pfn++) {
int nid;
nid = get_nid_for_pfn(pfn);
if (nid < 0)
continue;
if (!node_online(nid))
continue;
if (node_test_and_set(nid, unlinked_nodes))
continue;
sysfs_remove_link(&node_devices[nid].sysdev.kobj,
kobject_name(&mem_blk->sysdev.kobj));
sysfs_remove_link(&mem_blk->sysdev.kobj,
kobject_name(&node_devices[nid].sysdev.kobj));
}
return 0;
}
Thanks,
/ac
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 42+ messages in thread
* [patch -mm] mm: slab allocate memory section nodemask for large systems
2009-10-28 18:39 ` Alex Chiang
@ 2009-10-28 20:43 ` David Rientjes
-1 siblings, 0 replies; 42+ messages in thread
From: David Rientjes @ 2009-10-28 20:43 UTC (permalink / raw)
To: Andrew Morton
Cc: Heiko Carstens, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar, Alex Chiang
On Wed, 28 Oct 2009, Alex Chiang wrote:
> Am I not understanding the code? It looks like we do this
> already...
>
> /* unregister memory section under all nodes that it spans */
> int unregister_mem_sect_under_nodes(struct memory_block *mem_blk)
> {
> nodemask_t unlinked_nodes;
> unsigned long pfn, sect_start_pfn, sect_end_pfn;
>
> if (!mem_blk)
> return -EFAULT;
> nodes_clear(unlinked_nodes);
> sect_start_pfn = section_nr_to_pfn(mem_blk->phys_index);
> sect_end_pfn = sect_start_pfn + PAGES_PER_SECTION - 1;
> for (pfn = sect_start_pfn; pfn <= sect_end_pfn; pfn++) {
> int nid;
>
> nid = get_nid_for_pfn(pfn);
> if (nid < 0)
> continue;
> if (!node_online(nid))
> continue;
> if (node_test_and_set(nid, unlinked_nodes))
> continue;
> sysfs_remove_link(&node_devices[nid].sysdev.kobj,
> kobject_name(&mem_blk->sysdev.kobj));
> sysfs_remove_link(&mem_blk->sysdev.kobj,
> kobject_name(&node_devices[nid].sysdev.kobj));
> }
> return 0;
> }
>
That shound be sufficient with the exception that allocating nodemask_t
on the stack is usually dangerous because it can be extremely large; we
typically use NODEMASK_ALLOC() for such code. It's had some changes in
-mm, but since this patchset will likely be going through that tree anyway
we can fix it now with the patch below.
Otherwise, it looks like the iteration is already there and will remove
links for memory sections bound to multiple nodes if they exist through
hotplug.
mm: slab allocate memory section nodemask for large systems
Nodemasks should not be allocated on the stack for large systems (when it
is larger than 256 bytes) since there is a threat of overflow.
This patch causes the unregister_mem_sect_under_nodes() nodemask to be
allocated on the stack for smaller systems and be allocated by slab for
larger systems.
GFP_KERNEL is used since remove_memory_block() can block.
Cc: Gary Hade <garyhade@us.ibm.com>
Cc: Badari Pulavarty <pbadari@us.ibm.com>
Signed-off-by: David Rientjes <rientjes@google.com>
---
Depends on NODEMASK_ALLOC() changes currently present only in -mm.
drivers/base/node.c | 11 +++++++----
1 files changed, 7 insertions(+), 4 deletions(-)
diff --git a/drivers/base/node.c b/drivers/base/node.c
--- a/drivers/base/node.c
+++ b/drivers/base/node.c
@@ -363,12 +363,14 @@ int register_mem_sect_under_node(struct memory_block *mem_blk, int nid)
/* unregister memory section under all nodes that it spans */
int unregister_mem_sect_under_nodes(struct memory_block *mem_blk)
{
- nodemask_t unlinked_nodes;
+ NODEMASK_ALLOC(nodemask_t, unlinked_nodes, GFP_KERNEL);
unsigned long pfn, sect_start_pfn, sect_end_pfn;
- if (!mem_blk)
+ if (!mem_blk) {
+ NODEMASK_FREE(unlinked_nodes);
return -EFAULT;
- nodes_clear(unlinked_nodes);
+ }
+ nodes_clear(*unlinked_nodes);
sect_start_pfn = section_nr_to_pfn(mem_blk->phys_index);
sect_end_pfn = sect_start_pfn + PAGES_PER_SECTION - 1;
for (pfn = sect_start_pfn; pfn <= sect_end_pfn; pfn++) {
@@ -379,13 +381,14 @@ int unregister_mem_sect_under_nodes(struct memory_block *mem_blk)
continue;
if (!node_online(nid))
continue;
- if (node_test_and_set(nid, unlinked_nodes))
+ if (node_test_and_set(nid, *unlinked_nodes))
continue;
sysfs_remove_link(&node_devices[nid].sysdev.kobj,
kobject_name(&mem_blk->sysdev.kobj));
sysfs_remove_link(&mem_blk->sysdev.kobj,
kobject_name(&node_devices[nid].sysdev.kobj));
}
+ NODEMASK_FREE(unlinked_nodes);
return 0;
}
^ permalink raw reply [flat|nested] 42+ messages in thread
* [patch -mm] mm: slab allocate memory section nodemask for large systems
@ 2009-10-28 20:43 ` David Rientjes
0 siblings, 0 replies; 42+ messages in thread
From: David Rientjes @ 2009-10-28 20:43 UTC (permalink / raw)
To: Andrew Morton
Cc: Heiko Carstens, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar, Alex Chiang
On Wed, 28 Oct 2009, Alex Chiang wrote:
> Am I not understanding the code? It looks like we do this
> already...
>
> /* unregister memory section under all nodes that it spans */
> int unregister_mem_sect_under_nodes(struct memory_block *mem_blk)
> {
> nodemask_t unlinked_nodes;
> unsigned long pfn, sect_start_pfn, sect_end_pfn;
>
> if (!mem_blk)
> return -EFAULT;
> nodes_clear(unlinked_nodes);
> sect_start_pfn = section_nr_to_pfn(mem_blk->phys_index);
> sect_end_pfn = sect_start_pfn + PAGES_PER_SECTION - 1;
> for (pfn = sect_start_pfn; pfn <= sect_end_pfn; pfn++) {
> int nid;
>
> nid = get_nid_for_pfn(pfn);
> if (nid < 0)
> continue;
> if (!node_online(nid))
> continue;
> if (node_test_and_set(nid, unlinked_nodes))
> continue;
> sysfs_remove_link(&node_devices[nid].sysdev.kobj,
> kobject_name(&mem_blk->sysdev.kobj));
> sysfs_remove_link(&mem_blk->sysdev.kobj,
> kobject_name(&node_devices[nid].sysdev.kobj));
> }
> return 0;
> }
>
That shound be sufficient with the exception that allocating nodemask_t
on the stack is usually dangerous because it can be extremely large; we
typically use NODEMASK_ALLOC() for such code. It's had some changes in
-mm, but since this patchset will likely be going through that tree anyway
we can fix it now with the patch below.
Otherwise, it looks like the iteration is already there and will remove
links for memory sections bound to multiple nodes if they exist through
hotplug.
mm: slab allocate memory section nodemask for large systems
Nodemasks should not be allocated on the stack for large systems (when it
is larger than 256 bytes) since there is a threat of overflow.
This patch causes the unregister_mem_sect_under_nodes() nodemask to be
allocated on the stack for smaller systems and be allocated by slab for
larger systems.
GFP_KERNEL is used since remove_memory_block() can block.
Cc: Gary Hade <garyhade@us.ibm.com>
Cc: Badari Pulavarty <pbadari@us.ibm.com>
Signed-off-by: David Rientjes <rientjes@google.com>
---
Depends on NODEMASK_ALLOC() changes currently present only in -mm.
drivers/base/node.c | 11 +++++++----
1 files changed, 7 insertions(+), 4 deletions(-)
diff --git a/drivers/base/node.c b/drivers/base/node.c
--- a/drivers/base/node.c
+++ b/drivers/base/node.c
@@ -363,12 +363,14 @@ int register_mem_sect_under_node(struct memory_block *mem_blk, int nid)
/* unregister memory section under all nodes that it spans */
int unregister_mem_sect_under_nodes(struct memory_block *mem_blk)
{
- nodemask_t unlinked_nodes;
+ NODEMASK_ALLOC(nodemask_t, unlinked_nodes, GFP_KERNEL);
unsigned long pfn, sect_start_pfn, sect_end_pfn;
- if (!mem_blk)
+ if (!mem_blk) {
+ NODEMASK_FREE(unlinked_nodes);
return -EFAULT;
- nodes_clear(unlinked_nodes);
+ }
+ nodes_clear(*unlinked_nodes);
sect_start_pfn = section_nr_to_pfn(mem_blk->phys_index);
sect_end_pfn = sect_start_pfn + PAGES_PER_SECTION - 1;
for (pfn = sect_start_pfn; pfn <= sect_end_pfn; pfn++) {
@@ -379,13 +381,14 @@ int unregister_mem_sect_under_nodes(struct memory_block *mem_blk)
continue;
if (!node_online(nid))
continue;
- if (node_test_and_set(nid, unlinked_nodes))
+ if (node_test_and_set(nid, *unlinked_nodes))
continue;
sysfs_remove_link(&node_devices[nid].sysdev.kobj,
kobject_name(&mem_blk->sysdev.kobj));
sysfs_remove_link(&mem_blk->sysdev.kobj,
kobject_name(&node_devices[nid].sysdev.kobj));
}
+ NODEMASK_FREE(unlinked_nodes);
return 0;
}
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [patch -mm] mm: slab allocate memory section nodemask for large systems
2009-10-28 20:43 ` David Rientjes
@ 2009-11-02 20:47 ` Alex Chiang
-1 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-11-02 20:47 UTC (permalink / raw)
To: David Rientjes
Cc: Andrew Morton, Heiko Carstens, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
Hi Andrew,
* David Rientjes <rientjes@google.com>:
> On Wed, 28 Oct 2009, Alex Chiang wrote:
>
> > Am I not understanding the code? It looks like we do this
> > already...
> >
> > /* unregister memory section under all nodes that it spans */
> > int unregister_mem_sect_under_nodes(struct memory_block *mem_blk)
> > {
> > nodemask_t unlinked_nodes;
> > unsigned long pfn, sect_start_pfn, sect_end_pfn;
> >
> > if (!mem_blk)
> > return -EFAULT;
> > nodes_clear(unlinked_nodes);
> > sect_start_pfn = section_nr_to_pfn(mem_blk->phys_index);
> > sect_end_pfn = sect_start_pfn + PAGES_PER_SECTION - 1;
> > for (pfn = sect_start_pfn; pfn <= sect_end_pfn; pfn++) {
> > int nid;
> >
> > nid = get_nid_for_pfn(pfn);
> > if (nid < 0)
> > continue;
> > if (!node_online(nid))
> > continue;
> > if (node_test_and_set(nid, unlinked_nodes))
> > continue;
> > sysfs_remove_link(&node_devices[nid].sysdev.kobj,
> > kobject_name(&mem_blk->sysdev.kobj));
> > sysfs_remove_link(&mem_blk->sysdev.kobj,
> > kobject_name(&node_devices[nid].sysdev.kobj));
> > }
> > return 0;
> > }
> >
>
> That shound be sufficient with the exception that allocating nodemask_t
> on the stack is usually dangerous because it can be extremely large; we
> typically use NODEMASK_ALLOC() for such code. It's had some changes in
> -mm, but since this patchset will likely be going through that tree anyway
> we can fix it now with the patch below.
>
> Otherwise, it looks like the iteration is already there and will remove
> links for memory sections bound to multiple nodes if they exist through
> hotplug.
Any comments on this patch series?
Turns out that Kame-san's fear about a memory section spanning
several nodes on certain architectures (S390) isn't really
applicable and even if it were, we have code to handle situation
anyway.
Kame-san was generally supportive of these convenience symlinks
although he did not give a formal ACK.
David has given an ACK on the two patches that do real work, as
well as supplied the below patch.
I can respin this series once more, including David's Acked-by:
and adding his patch if that makes life easier for you.
Thanks,
/ac
> mm: slab allocate memory section nodemask for large systems
>
> Nodemasks should not be allocated on the stack for large systems (when it
> is larger than 256 bytes) since there is a threat of overflow.
>
> This patch causes the unregister_mem_sect_under_nodes() nodemask to be
> allocated on the stack for smaller systems and be allocated by slab for
> larger systems.
>
> GFP_KERNEL is used since remove_memory_block() can block.
>
> Cc: Gary Hade <garyhade@us.ibm.com>
> Cc: Badari Pulavarty <pbadari@us.ibm.com>
> Signed-off-by: David Rientjes <rientjes@google.com>
> ---
> Depends on NODEMASK_ALLOC() changes currently present only in -mm.
>
> drivers/base/node.c | 11 +++++++----
> 1 files changed, 7 insertions(+), 4 deletions(-)
>
> diff --git a/drivers/base/node.c b/drivers/base/node.c
> --- a/drivers/base/node.c
> +++ b/drivers/base/node.c
> @@ -363,12 +363,14 @@ int register_mem_sect_under_node(struct memory_block *mem_blk, int nid)
> /* unregister memory section under all nodes that it spans */
> int unregister_mem_sect_under_nodes(struct memory_block *mem_blk)
> {
> - nodemask_t unlinked_nodes;
> + NODEMASK_ALLOC(nodemask_t, unlinked_nodes, GFP_KERNEL);
> unsigned long pfn, sect_start_pfn, sect_end_pfn;
>
> - if (!mem_blk)
> + if (!mem_blk) {
> + NODEMASK_FREE(unlinked_nodes);
> return -EFAULT;
> - nodes_clear(unlinked_nodes);
> + }
> + nodes_clear(*unlinked_nodes);
> sect_start_pfn = section_nr_to_pfn(mem_blk->phys_index);
> sect_end_pfn = sect_start_pfn + PAGES_PER_SECTION - 1;
> for (pfn = sect_start_pfn; pfn <= sect_end_pfn; pfn++) {
> @@ -379,13 +381,14 @@ int unregister_mem_sect_under_nodes(struct memory_block *mem_blk)
> continue;
> if (!node_online(nid))
> continue;
> - if (node_test_and_set(nid, unlinked_nodes))
> + if (node_test_and_set(nid, *unlinked_nodes))
> continue;
> sysfs_remove_link(&node_devices[nid].sysdev.kobj,
> kobject_name(&mem_blk->sysdev.kobj));
> sysfs_remove_link(&mem_blk->sysdev.kobj,
> kobject_name(&node_devices[nid].sysdev.kobj));
> }
> + NODEMASK_FREE(unlinked_nodes);
> return 0;
> }
>
>
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [patch -mm] mm: slab allocate memory section nodemask for large systems
@ 2009-11-02 20:47 ` Alex Chiang
0 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-11-02 20:47 UTC (permalink / raw)
To: David Rientjes
Cc: Andrew Morton, Heiko Carstens, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
Hi Andrew,
* David Rientjes <rientjes@google.com>:
> On Wed, 28 Oct 2009, Alex Chiang wrote:
>
> > Am I not understanding the code? It looks like we do this
> > already...
> >
> > /* unregister memory section under all nodes that it spans */
> > int unregister_mem_sect_under_nodes(struct memory_block *mem_blk)
> > {
> > nodemask_t unlinked_nodes;
> > unsigned long pfn, sect_start_pfn, sect_end_pfn;
> >
> > if (!mem_blk)
> > return -EFAULT;
> > nodes_clear(unlinked_nodes);
> > sect_start_pfn = section_nr_to_pfn(mem_blk->phys_index);
> > sect_end_pfn = sect_start_pfn + PAGES_PER_SECTION - 1;
> > for (pfn = sect_start_pfn; pfn <= sect_end_pfn; pfn++) {
> > int nid;
> >
> > nid = get_nid_for_pfn(pfn);
> > if (nid < 0)
> > continue;
> > if (!node_online(nid))
> > continue;
> > if (node_test_and_set(nid, unlinked_nodes))
> > continue;
> > sysfs_remove_link(&node_devices[nid].sysdev.kobj,
> > kobject_name(&mem_blk->sysdev.kobj));
> > sysfs_remove_link(&mem_blk->sysdev.kobj,
> > kobject_name(&node_devices[nid].sysdev.kobj));
> > }
> > return 0;
> > }
> >
>
> That shound be sufficient with the exception that allocating nodemask_t
> on the stack is usually dangerous because it can be extremely large; we
> typically use NODEMASK_ALLOC() for such code. It's had some changes in
> -mm, but since this patchset will likely be going through that tree anyway
> we can fix it now with the patch below.
>
> Otherwise, it looks like the iteration is already there and will remove
> links for memory sections bound to multiple nodes if they exist through
> hotplug.
Any comments on this patch series?
Turns out that Kame-san's fear about a memory section spanning
several nodes on certain architectures (S390) isn't really
applicable and even if it were, we have code to handle situation
anyway.
Kame-san was generally supportive of these convenience symlinks
although he did not give a formal ACK.
David has given an ACK on the two patches that do real work, as
well as supplied the below patch.
I can respin this series once more, including David's Acked-by:
and adding his patch if that makes life easier for you.
Thanks,
/ac
> mm: slab allocate memory section nodemask for large systems
>
> Nodemasks should not be allocated on the stack for large systems (when it
> is larger than 256 bytes) since there is a threat of overflow.
>
> This patch causes the unregister_mem_sect_under_nodes() nodemask to be
> allocated on the stack for smaller systems and be allocated by slab for
> larger systems.
>
> GFP_KERNEL is used since remove_memory_block() can block.
>
> Cc: Gary Hade <garyhade@us.ibm.com>
> Cc: Badari Pulavarty <pbadari@us.ibm.com>
> Signed-off-by: David Rientjes <rientjes@google.com>
> ---
> Depends on NODEMASK_ALLOC() changes currently present only in -mm.
>
> drivers/base/node.c | 11 +++++++----
> 1 files changed, 7 insertions(+), 4 deletions(-)
>
> diff --git a/drivers/base/node.c b/drivers/base/node.c
> --- a/drivers/base/node.c
> +++ b/drivers/base/node.c
> @@ -363,12 +363,14 @@ int register_mem_sect_under_node(struct memory_block *mem_blk, int nid)
> /* unregister memory section under all nodes that it spans */
> int unregister_mem_sect_under_nodes(struct memory_block *mem_blk)
> {
> - nodemask_t unlinked_nodes;
> + NODEMASK_ALLOC(nodemask_t, unlinked_nodes, GFP_KERNEL);
> unsigned long pfn, sect_start_pfn, sect_end_pfn;
>
> - if (!mem_blk)
> + if (!mem_blk) {
> + NODEMASK_FREE(unlinked_nodes);
> return -EFAULT;
> - nodes_clear(unlinked_nodes);
> + }
> + nodes_clear(*unlinked_nodes);
> sect_start_pfn = section_nr_to_pfn(mem_blk->phys_index);
> sect_end_pfn = sect_start_pfn + PAGES_PER_SECTION - 1;
> for (pfn = sect_start_pfn; pfn <= sect_end_pfn; pfn++) {
> @@ -379,13 +381,14 @@ int unregister_mem_sect_under_nodes(struct memory_block *mem_blk)
> continue;
> if (!node_online(nid))
> continue;
> - if (node_test_and_set(nid, unlinked_nodes))
> + if (node_test_and_set(nid, *unlinked_nodes))
> continue;
> sysfs_remove_link(&node_devices[nid].sysdev.kobj,
> kobject_name(&mem_blk->sysdev.kobj));
> sysfs_remove_link(&mem_blk->sysdev.kobj,
> kobject_name(&node_devices[nid].sysdev.kobj));
> }
> + NODEMASK_FREE(unlinked_nodes);
> return 0;
> }
>
>
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [patch -mm] mm: slab allocate memory section nodemask for large systems
2009-11-02 20:47 ` Alex Chiang
@ 2009-11-04 2:00 ` David Rientjes
-1 siblings, 0 replies; 42+ messages in thread
From: David Rientjes @ 2009-11-04 2:00 UTC (permalink / raw)
To: Alex Chiang
Cc: Andrew Morton, Heiko Carstens, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
On Mon, 2 Nov 2009, Alex Chiang wrote:
> Any comments on this patch series?
>
> Turns out that Kame-san's fear about a memory section spanning
> several nodes on certain architectures (S390) isn't really
> applicable and even if it were, we have code to handle situation
> anyway.
>
> Kame-san was generally supportive of these convenience symlinks
> although he did not give a formal ACK.
>
> David has given an ACK on the two patches that do real work, as
> well as supplied the below patch.
>
> I can respin this series once more, including David's Acked-by:
> and adding his patch if that makes life easier for you.
>
It's probably in Andrew's queue after getting back from the kernel summit,
it would be best to wait a week or so.
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [patch -mm] mm: slab allocate memory section nodemask for large systems
@ 2009-11-04 2:00 ` David Rientjes
0 siblings, 0 replies; 42+ messages in thread
From: David Rientjes @ 2009-11-04 2:00 UTC (permalink / raw)
To: Alex Chiang
Cc: Andrew Morton, Heiko Carstens, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
On Mon, 2 Nov 2009, Alex Chiang wrote:
> Any comments on this patch series?
>
> Turns out that Kame-san's fear about a memory section spanning
> several nodes on certain architectures (S390) isn't really
> applicable and even if it were, we have code to handle situation
> anyway.
>
> Kame-san was generally supportive of these convenience symlinks
> although he did not give a formal ACK.
>
> David has given an ACK on the two patches that do real work, as
> well as supplied the below patch.
>
> I can respin this series once more, including David's Acked-by:
> and adding his patch if that makes life easier for you.
>
It's probably in Andrew's queue after getting back from the kernel summit,
it would be best to wait a week or so.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [patch -mm] mm: slab allocate memory section nodemask for large systems
2009-11-02 20:47 ` Alex Chiang
@ 2009-11-10 20:51 ` Andrew Morton
-1 siblings, 0 replies; 42+ messages in thread
From: Andrew Morton @ 2009-11-10 20:51 UTC (permalink / raw)
To: Alex Chiang
Cc: David Rientjes, Heiko Carstens, Gary Hade, linux-kernel,
linux-mm, Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
On Mon, 2 Nov 2009 13:47:26 -0700
Alex Chiang <achiang@hp.com> wrote:
> I can respin this series once more, including David's Acked-by:
> and adding his patch if that makes life easier for you.
Yes, please redo and resend.
The prerequisite Documentation/ patches are a bit of a mess - some have
been cherrypicked into Greg's tree I believe and some haven't. So
please also send out whatever is needed to bring linux-next up to date.
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [patch -mm] mm: slab allocate memory section nodemask for large systems
@ 2009-11-10 20:51 ` Andrew Morton
0 siblings, 0 replies; 42+ messages in thread
From: Andrew Morton @ 2009-11-10 20:51 UTC (permalink / raw)
To: Alex Chiang
Cc: David Rientjes, Heiko Carstens, Gary Hade, linux-kernel,
linux-mm, Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
On Mon, 2 Nov 2009 13:47:26 -0700
Alex Chiang <achiang@hp.com> wrote:
> I can respin this series once more, including David's Acked-by:
> and adding his patch if that makes life easier for you.
Yes, please redo and resend.
The prerequisite Documentation/ patches are a bit of a mess - some have
been cherrypicked into Greg's tree I believe and some haven't. So
please also send out whatever is needed to bring linux-next up to date.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [patch -mm] mm: slab allocate memory section nodemask for large systems
2009-11-10 20:51 ` Andrew Morton
@ 2009-11-10 20:55 ` David Rientjes
-1 siblings, 0 replies; 42+ messages in thread
From: David Rientjes @ 2009-11-10 20:55 UTC (permalink / raw)
To: Andrew Morton
Cc: Alex Chiang, Heiko Carstens, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
On Tue, 10 Nov 2009, Andrew Morton wrote:
> The prerequisite Documentation/ patches are a bit of a mess - some have
> been cherrypicked into Greg's tree I believe and some haven't. So
> please also send out whatever is needed to bring linux-next up to date.
>
I'm not aware of any prerequisites for this patchset, Alex's documentation
changes have already been merged by Linus.
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [patch -mm] mm: slab allocate memory section nodemask for large systems
@ 2009-11-10 20:55 ` David Rientjes
0 siblings, 0 replies; 42+ messages in thread
From: David Rientjes @ 2009-11-10 20:55 UTC (permalink / raw)
To: Andrew Morton
Cc: Alex Chiang, Heiko Carstens, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
On Tue, 10 Nov 2009, Andrew Morton wrote:
> The prerequisite Documentation/ patches are a bit of a mess - some have
> been cherrypicked into Greg's tree I believe and some haven't. So
> please also send out whatever is needed to bring linux-next up to date.
>
I'm not aware of any prerequisites for this patchset, Alex's documentation
changes have already been merged by Linus.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [patch -mm] mm: slab allocate memory section nodemask for large systems
2009-11-10 20:55 ` David Rientjes
@ 2009-11-10 21:26 ` Alex Chiang
-1 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-11-10 21:26 UTC (permalink / raw)
To: David Rientjes
Cc: Andrew Morton, Heiko Carstens, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
* David Rientjes <rientjes@google.com>:
> On Tue, 10 Nov 2009, Andrew Morton wrote:
>
> > The prerequisite Documentation/ patches are a bit of a mess - some have
> > been cherrypicked into Greg's tree I believe and some haven't. So
> > please also send out whatever is needed to bring linux-next up to date.
Thanks, I'll respin.
> I'm not aware of any prerequisites for this patchset, Alex's documentation
> changes have already been merged by Linus.
Correct. So I'll respin this series against... Linus's tree? Or
maybe mmotm? Please advise.
Thanks,
/ac
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [patch -mm] mm: slab allocate memory section nodemask for large systems
@ 2009-11-10 21:26 ` Alex Chiang
0 siblings, 0 replies; 42+ messages in thread
From: Alex Chiang @ 2009-11-10 21:26 UTC (permalink / raw)
To: David Rientjes
Cc: Andrew Morton, Heiko Carstens, Gary Hade, linux-kernel, linux-mm,
Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
* David Rientjes <rientjes@google.com>:
> On Tue, 10 Nov 2009, Andrew Morton wrote:
>
> > The prerequisite Documentation/ patches are a bit of a mess - some have
> > been cherrypicked into Greg's tree I believe and some haven't. So
> > please also send out whatever is needed to bring linux-next up to date.
Thanks, I'll respin.
> I'm not aware of any prerequisites for this patchset, Alex's documentation
> changes have already been merged by Linus.
Correct. So I'll respin this series against... Linus's tree? Or
maybe mmotm? Please advise.
Thanks,
/ac
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [patch -mm] mm: slab allocate memory section nodemask for large systems
2009-11-10 21:26 ` Alex Chiang
@ 2009-11-10 21:38 ` Andrew Morton
-1 siblings, 0 replies; 42+ messages in thread
From: Andrew Morton @ 2009-11-10 21:38 UTC (permalink / raw)
To: Alex Chiang
Cc: David Rientjes, Heiko Carstens, Gary Hade, linux-kernel,
linux-mm, Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
On Tue, 10 Nov 2009 14:26:29 -0700
Alex Chiang <achiang@hp.com> wrote:
> > I'm not aware of any prerequisites for this patchset, Alex's documentation
> > changes have already been merged by Linus.
>
> Correct. So I'll respin this series against... Linus's tree? Or
> maybe mmotm? Please advise.
It appears that Linus's tree will be an OK base.
^ permalink raw reply [flat|nested] 42+ messages in thread
* Re: [patch -mm] mm: slab allocate memory section nodemask for large systems
@ 2009-11-10 21:38 ` Andrew Morton
0 siblings, 0 replies; 42+ messages in thread
From: Andrew Morton @ 2009-11-10 21:38 UTC (permalink / raw)
To: Alex Chiang
Cc: David Rientjes, Heiko Carstens, Gary Hade, linux-kernel,
linux-mm, Badari Pulavarty, Martin Schwidefsky, Ingo Molnar
On Tue, 10 Nov 2009 14:26:29 -0700
Alex Chiang <achiang@hp.com> wrote:
> > I'm not aware of any prerequisites for this patchset, Alex's documentation
> > changes have already been merged by Linus.
>
> Correct. So I'll respin this series against... Linus's tree? Or
> maybe mmotm? Please advise.
It appears that Linus's tree will be an OK base.
--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@kvack.org"> email@kvack.org </a>
^ permalink raw reply [flat|nested] 42+ messages in thread
end of thread, other threads:[~2009-11-10 21:39 UTC | newest]
Thread overview: 42+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2009-10-22 4:15 [PATCH v2 0/5] mm: modest useability enhancements for node sysfs attrs Alex Chiang
2009-10-22 4:15 ` Alex Chiang
2009-10-22 4:15 ` [PATCH v2 1/5] mm: add numa node symlink for memory section in sysfs Alex Chiang
2009-10-22 4:15 ` Alex Chiang
2009-10-22 19:51 ` David Rientjes
2009-10-22 19:51 ` David Rientjes
2009-10-27 19:59 ` Alex Chiang
2009-10-27 19:59 ` Alex Chiang
2009-10-27 21:27 ` David Rientjes
2009-10-27 21:27 ` David Rientjes
2009-10-28 8:31 ` Heiko Carstens
2009-10-28 8:31 ` Heiko Carstens
2009-10-28 9:03 ` David Rientjes
2009-10-28 9:03 ` David Rientjes
2009-10-28 17:15 ` Alex Chiang
2009-10-28 17:15 ` Alex Chiang
2009-10-28 18:39 ` Alex Chiang
2009-10-28 18:39 ` Alex Chiang
2009-10-28 20:43 ` [patch -mm] mm: slab allocate memory section nodemask for large systems David Rientjes
2009-10-28 20:43 ` David Rientjes
2009-11-02 20:47 ` Alex Chiang
2009-11-02 20:47 ` Alex Chiang
2009-11-04 2:00 ` David Rientjes
2009-11-04 2:00 ` David Rientjes
2009-11-10 20:51 ` Andrew Morton
2009-11-10 20:51 ` Andrew Morton
2009-11-10 20:55 ` David Rientjes
2009-11-10 20:55 ` David Rientjes
2009-11-10 21:26 ` Alex Chiang
2009-11-10 21:26 ` Alex Chiang
2009-11-10 21:38 ` Andrew Morton
2009-11-10 21:38 ` Andrew Morton
2009-10-22 4:15 ` [PATCH v2 2/5] mm: refactor register_cpu_under_node() Alex Chiang
2009-10-22 4:15 ` Alex Chiang
2009-10-22 4:15 ` [PATCH v2 3/5] mm: refactor unregister_cpu_under_node() Alex Chiang
2009-10-22 4:15 ` Alex Chiang
2009-10-22 4:15 ` [PATCH v2 4/5] mm: add numa node symlink for cpu devices in sysfs Alex Chiang
2009-10-22 4:15 ` Alex Chiang
2009-10-22 19:52 ` David Rientjes
2009-10-22 19:52 ` David Rientjes
2009-10-22 4:15 ` [PATCH v2 5/5] Documentation: ABI: /sys/devices/system/cpu/cpu#/node Alex Chiang
2009-10-22 4:15 ` Alex Chiang
This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.