<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>
<channel>
	<title>Comments on: Opteron NUMA-effects</title>
	<atom:link href="http://www.thinkingparallel.com/2006/07/30/opteron-numa-effects/feed/" rel="self" type="application/rss+xml" />
	<link>http://www.thinkingparallel.com/2006/07/30/opteron-numa-effects/</link>
	<description>A Blog on Parallel Programming and Concurrency by Michael Suess</description>
	<pubDate>Sat, 04 Feb 2012 06:11:44 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Michael Suess</title>
		<link>http://www.thinkingparallel.com/2006/07/30/opteron-numa-effects/comment-page-1/#comment-20</link>
		<dc:creator>Michael Suess</dc:creator>
		<pubDate>Wed, 23 Aug 2006 20:33:26 +0000</pubDate>
		<guid isPermaLink="false">http://www.thinkingparallel.com/2006/07/30/opteron-numa-effects/#comment-20</guid>
		<description>Dieter,
thanks for your comment. I should have probably mentioned in this article that this is not a comprehensive evaluation of NUMA-effects on the Opteron. If it were, I would not have published it here but in a research paper :-), like e.g. &lt;a href="http://www.nathanr.net/honours/ICCSnuma.pdf" rel="nofollow" rel="nofollow" rel="nofollow"&gt;these guys&lt;/a&gt; (a good read by the way). I have only investigated the part of the effects related to memory latency and left out the part about memory bandwidth (mainly, because I do not have any memory-bound applications and therefore this case is not so interesting for me). Reading through the article again, I should have made that more clear, as the last sentence is probably not enough of a warning. Anyways, I will not change the article now, as the correction can be seen in the comments section.</description>
		<content:encoded><![CDATA[<p>Dieter,<br />
thanks for your comment. I should have probably mentioned in this article that this is not a comprehensive evaluation of NUMA-effects on the Opteron. If it were, I would not have published it here but in a research paper :-), like e.g. <a href="http://www.nathanr.net/honours/ICCSnuma.pdf" rel="nofollow" rel="nofollow" rel="nofollow">these guys</a> (a good read by the way). I have only investigated the part of the effects related to memory latency and left out the part about memory bandwidth (mainly, because I do not have any memory-bound applications and therefore this case is not so interesting for me). Reading through the article again, I should have made that more clear, as the last sentence is probably not enough of a warning. Anyways, I will not change the article now, as the correction can be seen in the comments section.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dieter</title>
		<link>http://www.thinkingparallel.com/2006/07/30/opteron-numa-effects/comment-page-1/#comment-18</link>
		<dc:creator>Dieter</dc:creator>
		<pubDate>Wed, 23 Aug 2006 16:22:23 +0000</pubDate>
		<guid isPermaLink="false">http://www.thinkingparallel.com/2006/07/30/opteron-numa-effects/#comment-18</guid>
		<description>We typically observer ccNUMA effects on the Opteron when using more than 2 threads. 
It seems that the memory architecture can sufficiently support 2 threads, whereever they are running.
But with more than 2 threads things are quite different!!

It may get worse with memory benchmarks like stream. THere you can see the difference between good and bad memory placement with 2 threads already

See
http://www.rz.rwth-aachen.de/computing/events/2005/parco05/drops_slides.pdf</description>
		<content:encoded><![CDATA[<p>We typically observer ccNUMA effects on the Opteron when using more than 2 threads.<br />
It seems that the memory architecture can sufficiently support 2 threads, whereever they are running.<br />
But with more than 2 threads things are quite different!!</p>
<p>It may get worse with memory benchmarks like stream. THere you can see the difference between good and bad memory placement with 2 threads already</p>
<p>See<br />
<a href="http://www.rz.rwth-aachen.de/computing/events/2005/parco05/drops_slides.pdf" rel="nofollow">http://www.rz.rwth-aachen.de/computing/events/2005/parco05/drops_slides.pdf</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Thinking Parallel &#187; Blog Archive &#187; More information on pthread_setaffinity_np and sched_setaffinity</title>
		<link>http://www.thinkingparallel.com/2006/07/30/opteron-numa-effects/comment-page-1/#comment-15</link>
		<dc:creator>Thinking Parallel &#187; Blog Archive &#187; More information on pthread_setaffinity_np and sched_setaffinity</dc:creator>
		<pubDate>Fri, 18 Aug 2006 16:01:56 +0000</pubDate>
		<guid isPermaLink="false">http://www.thinkingparallel.com/2006/07/30/opteron-numa-effects/#comment-15</guid>
		<description>[...] Skimming through the activity logs of this blog, I can see that many people come here looking for information about pthread_setaffinity_np. I mentioned it briefly in my article about Opteron NUMA-effects, but barely touched it because I had found a more satisfying solution for my personal use (taskset). And while I do not have in depth knowledge of the function, maybe the test programs I wrote will be of some help to someone to understand the function better. I will also post my test program for sched_setaffinity here while I am at it, simply because the two offer similar functionality. [...]</description>
		<content:encoded><![CDATA[<p>[...] Skimming through the activity logs of this blog, I can see that many people come here looking for information about pthread_setaffinity_np. I mentioned it briefly in my article about Opteron NUMA-effects, but barely touched it because I had found a more satisfying solution for my personal use (taskset). And while I do not have in depth knowledge of the function, maybe the test programs I wrote will be of some help to someone to understand the function better. I will also post my test program for sched_setaffinity here while I am at it, simply because the two offer similar functionality. [...]</p>
]]></content:encoded>
	</item>
</channel>
</rss>

