<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
		>
<channel>
	<title>Comments for Contour Line</title>
	<atom:link href="http://contourline.wordpress.com/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://contourline.wordpress.com</link>
	<description>Surround and define the edges of a subject, giving it shape and volume</description>
	<lastBuildDate>Tue, 27 Oct 2009 20:45:45 +0000</lastBuildDate>
	<generator>http://wordpress.com/</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>Comment on Barbara Tomblin is getting interviewed on Tavis Smiley show! by jmarca</title>
		<link>http://contourline.wordpress.com/2009/09/29/barbara-tomblin-is-getting-interviewed-on-tavis-smiley-show/#comment-133</link>
		<dc:creator>jmarca</dc:creator>
		<pubDate>Tue, 27 Oct 2009 20:45:45 +0000</pubDate>
		<guid isPermaLink="false">http://contourline.wordpress.com/?p=194#comment-133</guid>
		<description>Laura,

Barbara &lt;strong&gt;was&lt;/strong&gt; interviewed, and the podcast will be up eventually, but so far we&#039;ve heard nothing.  I will let her know that you want more information on her book.  But, that said, I&#039;m pretty certain that she&#039;s already dropped a copy off for &lt;a href=&quot;http://www.scrippscollege.edu/about/campus-guide/denison-library.php&quot; rel=&quot;nofollow&quot;&gt;Dennison Library&#039;s&lt;/a&gt; collection, and I also found it in Honnold&#039;s on-line catalog &lt;a href=&quot;http://blais.claremont.edu/search~S0?/aTomblin+Barbara/atomblin+barbara/1%2C1%2C2%2CB/frameset&amp;FF=atomblin+barbara&amp;1%2C%2C2&quot; rel=&quot;nofollow&quot;&gt;here&lt;/a&gt;.  You could have a student of military history review the book for your magazine, or read the book yourself, or else just quote from the book jacket!</description>
		<content:encoded><![CDATA[<p>Laura,</p>
<p>Barbara <strong>was</strong> interviewed, and the podcast will be up eventually, but so far we&#8217;ve heard nothing.  I will let her know that you want more information on her book.  But, that said, I&#8217;m pretty certain that she&#8217;s already dropped a copy off for <a href="http://www.scrippscollege.edu/about/campus-guide/denison-library.php" rel="nofollow">Dennison Library&#8217;s</a> collection, and I also found it in Honnold&#8217;s on-line catalog <a href="http://blais.claremont.edu/search~S0?/aTomblin+Barbara/atomblin+barbara/1%2C1%2C2%2CB/frameset&amp;FF=atomblin+barbara&amp;1%2C%2C2" rel="nofollow">here</a>.  You could have a student of military history review the book for your magazine, or read the book yourself, or else just quote from the book jacket!</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Barbara Tomblin is getting interviewed on Tavis Smiley show! by laura</title>
		<link>http://contourline.wordpress.com/2009/09/29/barbara-tomblin-is-getting-interviewed-on-tavis-smiley-show/#comment-132</link>
		<dc:creator>laura</dc:creator>
		<pubDate>Tue, 27 Oct 2009 20:24:01 +0000</pubDate>
		<guid isPermaLink="false">http://contourline.wordpress.com/?p=194#comment-132</guid>
		<description>hello! I work for the Scripps College Office of Public Relations, and am trying to find out more information about Barbara&#039;s book so we can include a segment on it in the magazine potentially. I was unable to find the podcast online, do you have a link potentially to it? Or, a short description about the book from her that might be a little more specific?
Thank you!
Best,
Laura</description>
		<content:encoded><![CDATA[<p>hello! I work for the Scripps College Office of Public Relations, and am trying to find out more information about Barbara&#8217;s book so we can include a segment on it in the magazine potentially. I was unable to find the podcast online, do you have a link potentially to it? Or, a short description about the book from her that might be a little more specific?<br />
Thank you!<br />
Best,<br />
Laura</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on my first non-trival reduce for couchdb by links for 2009-09-30 &#171; Concurrent</title>
		<link>http://contourline.wordpress.com/2009/01/14/my-first-non-trival-reduce-for-couchdb/#comment-123</link>
		<dc:creator>links for 2009-09-30 &#171; Concurrent</dc:creator>
		<pubDate>Thu, 01 Oct 2009 04:03:04 +0000</pubDate>
		<guid isPermaLink="false">http://contourline.wordpress.com/?p=137#comment-123</guid>
		<description>[...] my first non-trival reduce for couchdb « Contour Line (tags: mapreduce , couchdb) [...]</description>
		<content:encoded><![CDATA[<p>[...] my first non-trival reduce for couchdb « Contour Line (tags: mapreduce , couchdb) [...]</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Close still doesn&#8217;t count &#8230; by jmarca</title>
		<link>http://contourline.wordpress.com/2009/04/10/close-still-doesnt-count/#comment-65</link>
		<dc:creator>jmarca</dc:creator>
		<pubDate>Sat, 11 Apr 2009 04:43:56 +0000</pubDate>
		<guid isPermaLink="false">http://contourline.wordpress.com/?p=166#comment-65</guid>
		<description>Hi thanks for the comment.

Actually, I want exactly one value!   I don&#039;t really want a multi-get, I want a multi-reduce.  I want to take those m documents I specify, including the duplicates, and crunch them all through my reduce function, which computes the statistic  of interest on the input.  I was hoping to be able to use multi-get in this way, since for this application, mapping without reducing isn&#039;t so helpful.

In really simple terms, bootstrapping is sampling with replacement.  So for any m randomly chosen values from a set of n, there is a finite chance that some of those choices will be duplicates.  I gather that&#039;s the whole point of the bootstrap method.  And that is the part I&#039;m having the most difficulty with in CouchDB&#039; map/reduce.

Why is bootstrapping useful?  Well, given my data, I can&#039;t justify computing things like mean and variance in the usual way.  The problem is that it is quite likely that I have lots of extreme outliers, and I&#039;m also pretty sure my data is truncated at zero and has a really fat tail, but that&#039;s about it.  With bootstrap methods, you can get away with knowing nothing, but the price is that the method is often computer-intesive.  So instead of computing the mean and variance once on the entire set once, I have to sample from the data set hundreds or thousands of times, with replacement, and compute the mean and variance for each sample, then use those to compute the bootstrap estimate of the mean and variance.  

As to your second suggestion, that just might be a pretty clever idea for tackling the balanced bootstrap, where you just do B copies of the data, randomize the lot, then split it into B subsamples of size n (if I remember correctly).   I&#039;m pretty sure as soon as you emit something random, you violate that clause in the view definition that says the map must produce the same outputs given the same inputs.   But if I can get around that, say by setting the seed of random each time, that might work pretty well.  It also might work well in a temporary view, and frankly given the randomization and multiple sample requirements, it probably doesn&#039;t help at all to have the view cached.  But I also don&#039;t want to break my database because some implementation detail relies on that same input same output rule.  So I&#039;ll have to test that carefully.</description>
		<content:encoded><![CDATA[<p>Hi thanks for the comment.</p>
<p>Actually, I want exactly one value!   I don&#8217;t really want a multi-get, I want a multi-reduce.  I want to take those m documents I specify, including the duplicates, and crunch them all through my reduce function, which computes the statistic  of interest on the input.  I was hoping to be able to use multi-get in this way, since for this application, mapping without reducing isn&#8217;t so helpful.</p>
<p>In really simple terms, bootstrapping is sampling with replacement.  So for any m randomly chosen values from a set of n, there is a finite chance that some of those choices will be duplicates.  I gather that&#8217;s the whole point of the bootstrap method.  And that is the part I&#8217;m having the most difficulty with in CouchDB&#8217; map/reduce.</p>
<p>Why is bootstrapping useful?  Well, given my data, I can&#8217;t justify computing things like mean and variance in the usual way.  The problem is that it is quite likely that I have lots of extreme outliers, and I&#8217;m also pretty sure my data is truncated at zero and has a really fat tail, but that&#8217;s about it.  With bootstrap methods, you can get away with knowing nothing, but the price is that the method is often computer-intesive.  So instead of computing the mean and variance once on the entire set once, I have to sample from the data set hundreds or thousands of times, with replacement, and compute the mean and variance for each sample, then use those to compute the bootstrap estimate of the mean and variance.  </p>
<p>As to your second suggestion, that just might be a pretty clever idea for tackling the balanced bootstrap, where you just do B copies of the data, randomize the lot, then split it into B subsamples of size n (if I remember correctly).   I&#8217;m pretty sure as soon as you emit something random, you violate that clause in the view definition that says the map must produce the same outputs given the same inputs.   But if I can get around that, say by setting the seed of random each time, that might work pretty well.  It also might work well in a temporary view, and frankly given the randomization and multiple sample requirements, it probably doesn&#8217;t help at all to have the view cached.  But I also don&#8217;t want to break my database because some implementation detail relies on that same input same output rule.  So I&#8217;ll have to test that carefully.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Close still doesn&#8217;t count &#8230; by Paul J. Davis</title>
		<link>http://contourline.wordpress.com/2009/04/10/close-still-doesnt-count/#comment-64</link>
		<dc:creator>Paul J. Davis</dc:creator>
		<pubDate>Sat, 11 Apr 2009 02:39:03 +0000</pubDate>
		<guid isPermaLink="false">http://contourline.wordpress.com/?p=166#comment-64</guid>
		<description>Remember that by default a reduce wants to return you a single aggregate value for the entire view. So without group=true, multi-get doesn&#039;t really make sense as there&#039;s only a single value.

Also, if I gather right, you could try adding a random number in the range [0, 1] and then in have a map function of emit(doc.random_field * M % N, doc.float_value_of_interest) (You pick M and N to give the approximate number of values per group) and then do your statistics on those collections. I have no idea if the sampling math works there or not, but I don&#039;t know that it doesn&#039;t. :)</description>
		<content:encoded><![CDATA[<p>Remember that by default a reduce wants to return you a single aggregate value for the entire view. So without group=true, multi-get doesn&#8217;t really make sense as there&#8217;s only a single value.</p>
<p>Also, if I gather right, you could try adding a random number in the range [0, 1] and then in have a map function of emit(doc.random_field * M % N, doc.float_value_of_interest) (You pick M and N to give the approximate number of values per group) and then do your statistics on those collections. I have no idea if the sampling math works there or not, but I don&#8217;t know that it doesn&#8217;t. :)</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on A lot of data is a lot of data by jmarca</title>
		<link>http://contourline.wordpress.com/2009/03/06/a-lot-of-data-is-a-lot-of-data/#comment-51</link>
		<dc:creator>jmarca</dc:creator>
		<pubDate>Tue, 10 Mar 2009 23:03:21 +0000</pubDate>
		<guid isPermaLink="false">http://contourline.wordpress.com/?p=150#comment-51</guid>
		<description>Cool, thanks for the pointer.  I will try it.  I&#039;m also going to throw more CPU at the problem in a few weeks.</description>
		<content:encoded><![CDATA[<p>Cool, thanks for the pointer.  I will try it.  I&#8217;m also going to throw more CPU at the problem in a few weeks.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Time and space by J Chris A</title>
		<link>http://contourline.wordpress.com/2009/03/10/time-and-space/#comment-50</link>
		<dc:creator>J Chris A</dc:creator>
		<pubDate>Tue, 10 Mar 2009 22:46:46 +0000</pubDate>
		<guid isPermaLink="false">http://contourline.wordpress.com/?p=152#comment-50</guid>
		<description>You could periodically query the reduce view with various group-levels and store the rows as documents in another db, then you can do quick queries on the reduced data. Also you can thow out the source data and just keep the reductions if space is an issue.</description>
		<content:encoded><![CDATA[<p>You could periodically query the reduce view with various group-levels and store the rows as documents in another db, then you can do quick queries on the reduced data. Also you can thow out the source data and just keep the reductions if space is an issue.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on A lot of data is a lot of data by J Chris A</title>
		<link>http://contourline.wordpress.com/2009/03/06/a-lot-of-data-is-a-lot-of-data/#comment-49</link>
		<dc:creator>J Chris A</dc:creator>
		<pubDate>Tue, 10 Mar 2009 22:45:02 +0000</pubDate>
		<guid isPermaLink="false">http://contourline.wordpress.com/?p=150#comment-49</guid>
		<description>If you want a faster view server, try this Erlang one. It avoids the IO and JSON conversion that will happen with all non-Erlang servers. It is still fresh, you might need to hack it a bit.

http://github.com/mmcdanie/erlview/tree/master</description>
		<content:encoded><![CDATA[<p>If you want a faster view server, try this Erlang one. It avoids the IO and JSON conversion that will happen with all non-Erlang servers. It is still fresh, you might need to hack it a bit.</p>
<p><a href="http://github.com/mmcdanie/erlview/tree/master" rel="nofollow">http://github.com/mmcdanie/erlview/tree/master</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on my first non-trival reduce for couchdb by jmarca</title>
		<link>http://contourline.wordpress.com/2009/01/14/my-first-non-trival-reduce-for-couchdb/#comment-43</link>
		<dc:creator>jmarca</dc:creator>
		<pubDate>Thu, 15 Jan 2009 15:00:49 +0000</pubDate>
		<guid isPermaLink="false">http://contourline.wordpress.com/?p=137#comment-43</guid>
		<description>Sure, I&#039;d be happy to.  I&#039;ll tidy up the code and post it today.</description>
		<content:encoded><![CDATA[<p>Sure, I&#8217;d be happy to.  I&#8217;ll tidy up the code and post it today.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on my first non-trival reduce for couchdb by Jan</title>
		<link>http://contourline.wordpress.com/2009/01/14/my-first-non-trival-reduce-for-couchdb/#comment-42</link>
		<dc:creator>Jan</dc:creator>
		<pubDate>Thu, 15 Jan 2009 10:46:06 +0000</pubDate>
		<guid isPermaLink="false">http://contourline.wordpress.com/?p=137#comment-42</guid>
		<description>Heya, 

would you mind adding this to http://wiki.apache.org/couchdb/View_Snippets? It might be useful to others :)

Thanks for sharing here in any case!

Cheers
Jan
--</description>
		<content:encoded><![CDATA[<p>Heya, </p>
<p>would you mind adding this to <a href="http://wiki.apache.org/couchdb/View_Snippets?" rel="nofollow">http://wiki.apache.org/couchdb/View_Snippets?</a> It might be useful to others :)</p>
<p>Thanks for sharing here in any case!</p>
<p>Cheers<br />
Jan<br />
&#8211;</p>
]]></content:encoded>
	</item>
</channel>
</rss>
