<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	xmlns:georss="http://www.georss.org/georss" xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#" xmlns:media="http://search.yahoo.com/mrss/"
	>

<channel>
	<title>Confounded by Confounding</title>
	<atom:link href="http://confounding.net/feed/" rel="self" type="application/rss+xml" />
	<link>http://confounding.net</link>
	<description>Just another WordPress.com weblog</description>
	<lastBuildDate>Fri, 17 May 2013 05:06:17 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.com/</generator>
<cloud domain='confounding.net' port='80' path='/?rsscloud=notify' registerProcedure='' protocol='http-post' />
<image>
		<url>http://s2.wp.com/i/buttonw-com.png</url>
		<title>Confounded by Confounding</title>
		<link>http://confounding.net</link>
	</image>
	<atom:link rel="search" type="application/opensearchdescription+xml" href="http://confounding.net/osd.xml" title="Confounded by Confounding" />
	<atom:link rel='hub' href='http://confounding.net/?pushpress=hub'/>
		<item>
		<title>Crowdfunding Science: An Idea I Want To Love</title>
		<link>http://confounding.net/2013/02/26/crowdfunding-science-an-idea-i-want-to-love/</link>
		<comments>http://confounding.net/2013/02/26/crowdfunding-science-an-idea-i-want-to-love/#comments</comments>
		<pubDate>Tue, 26 Feb 2013 16:35:22 +0000</pubDate>
		<dc:creator>EpiGrad</dc:creator>
				<category><![CDATA[Epidemiology]]></category>
		<category><![CDATA[Grad School Life]]></category>
		<category><![CDATA[Soapbox]]></category>

		<guid isPermaLink="false">http://confounding.net/?p=326</guid>
		<description><![CDATA[The blog has gone a little quiet as I converge on what is hopefully The End of dissertating, which has been taking up an awful lot of time. But a recent post by DrugMonkey on their perceptions of the hurdles crowdfunding science faces inspired me to write a post. Because, while I disagree with some of [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=326&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>The blog has gone a little quiet as I converge on what is hopefully The End of dissertating, which has been taking up an awful lot of time. But a recent post by <a href="http://scientopia.org/blogs/drugmonkey/2013/02/14/hurdles-for-the-crowdfunding-science-wackanuts-to-overcome/">DrugMonkey</a> on their perceptions of the hurdles crowdfunding science faces inspired me to write a post. Because, while I disagree with some of the specific points DM makes, I tend to agree &#8211; crowdfunding&#8217;s time has not yet come.</p>
<p>This makes me really, really sad. More after the jump.</p>
<p><span id="more-326"></span></p>
<p>For the purposes of this post, let me be clear about something: I&#8217;m not talking about &#8220;crowdfunded&#8221; projects that essentially involve people paying to be a data point in a study. I&#8217;m talking about using small donations from interested people to fund research that takes place in universities, research institutions and the like, and would otherwise be something that <em>could</em> be funded by traditional sources.</p>
<p>Maybe this would come in the form of something like <a href="https://www.gittip.com">Gittip</a> to say &#8220;Yeah, your cool science is worth a couple bucks a week to me&#8221;. Maybe its in a more formalized form, like the folks at <a href="http://scifundchallenge.org">SciFund Challenge</a> or the handful of startups chasing the idea.</p>
<p>Regardless, it&#8217;s an idea I would really like to see exist. It&#8217;s right up there with some benevolent soul dropping such a staggering pile of cash in my lap that I can just conduct research off the interest. Some of the reasons I in particular have a fondness for the concept:</p>
<ul>
<li><span style="line-height:13px;">It allows someone to participate, even indirectly, in the doing of science. Which is neat &#8211; Kickstarter lets people fund random ideas in technology, board games and performance art, why not science?</span></li>
<li>It&#8217;s another funding stream &#8211; and really, we could use another one.</li>
<li>It aligns particularly well with some of the work I do &#8211; crowdfunding can act as a good way to get equipment and the like that it&#8217;s normally somewhat hard to get grant funding for. Computer hardware, I&#8217;m looking at you. Computer hardware is, I&#8217;m pretty sure, the great funding frustration of theoretical labs everywhere. Thousands of dollars in Amazon AWS fees? No biggie. A $500 box to sit in the corner and chug through scripts night and day? Out of the question.</li>
<li>It aligns the interests of the public and the interests of scientists in a very direct way. I&#8217;ve been lamenting the lack of motivation to write really good code in science &#8211; as long as your code can run after the postdoc who wrote it leaves, what does it matter if it&#8217;s poorly commented, slow, and uses a ton of opaque variables?  Code, and reproducible data, and all the other things we say scientists should strive for aren&#8217;t the things scientists are <em>paid</em> for. What if they were?</li>
</ul>
<p>Like the title says, crowdfunding for science is an idea I&#8217;d love to see come to fruition. I&#8217;ve gone as far as gearing up for the last round of the SciFund challenge, putting together an application&#8230;and then withdrawing, having decided it was more trouble than it was worth. Here are my reasons why:</p>
<p><span style="line-height:13px;"><strong>Science is expensive</strong>. This is the one I&#8217;m having a really hard time getting over. Lets look at the last SciFund challenge:</span></p>
<p>45% of its projects met their funding goals. That&#8217;s a decent payline as far as these things go (better than the 10% some government grants are seeing these days), but its far from a sure thing. But beyond that, how much money were these goals for?</p>
<p>The median payout was $1440 in the last round, the mean a little under $2200. And as much as I hate to say it, that&#8217;s a drop in the bucket. That &#8211; and another $700 &#8211; would get wiped out with a single submission to an open access journal like PLoS Biology. Or a single bit of conference travel. Or an alarmingly small amount of computing time.</p>
<p>That amount is about the same as a pilot funding grant at my institution, one I&#8217;ve applied for and won, and I can tell you $2000 just doesn&#8217;t go very far. It&#8217;s good for establishing a very basic proof of concept for a study. Maybe pay for a single spurt of field work to establish the all important &#8220;Preliminary Findings&#8221;. Get a graduate somewhere for a week to get some face-time with a piece of equipment/person/animal, etc. The term I&#8217;ve taken to refer to this is &#8220;grant spackle&#8221; (as in, to fill in small holes&#8230;). Another term could be &#8220;found money&#8221;. And like $5 in your pocket that you didn&#8217;t know was there, found money is <em>awesome</em>.</p>
<p>But you don&#8217;t plan on found money. You don&#8217;t hire grad students on found money. You don&#8217;t establish a productive lab on found money.</p>
<p><strong>Time is Money</strong>. DrugMonkey points to institutional overhead as a problem &#8211; for readers who don&#8217;t know what that is, often a large portion of a grant (often upwards of 50%) goes to the institution for &#8220;indirect costs&#8221; &#8211; things like keeping the lights on and the copy machines full of toner, and other ancillary costs of doing research. At least in theory. I actually don&#8217;t think this is as big a problem as DM does &#8211; my limited foray into crowdfunding was met with &#8220;Meh, its not enough to worry about&#8221;.</p>
<p>My problem is time. The aforementioned pilot grant took an evening to fill out, and requires a single web-form report. Crowdfunded projects require videos, incentives, blog posts, Twitter and Facebook activity&#8230;and navigating ones way through the byzantine halls of the university funding office, which is set up for much larger grants, and likely hasn&#8217;t heard of your crowdfunding source before. Spending a week playing phone tag isn&#8217;t much of a problem if you&#8217;re talking about a quarter of a million dollars. For $2000? You&#8217;re going to eat through that money in a graduate student&#8217;s time rather quickly. This was really what killed it for me &#8211; the effort to reward ratio was way off.</p>
<p><strong>Is It Repeatable?</strong> Even if you had a friendly grants office, or had figured out some clever way like a separately held foundation to handle the money in a fast and streamlined fashion, and even if we were talking about considerably more money, I have concerns about repeatability. There&#8217;s been a lot of talk about the importance of social media in crowdfunding efforts for science, and that&#8217;s where my concern lies. Is there a pool of sustainable, repeatable donors &#8211; donors who don&#8217;t know the person trying to get funding? Or is this essentially passing the hat around a potential fundee&#8217;s social network, and banking on the novelty of the idea to generate a little cash? If its the latter, there&#8217;s only so many times that can be done before it runs out of steam. You might fund one project a year &#8211; will you fund two? Seven? What if they don&#8217;t pan out &#8211; we&#8217;ve seen anger centered around Kickstarter projects that have failed, what about SciFund projects that generate a whole lot of null results (something science does on occasion)? Again, that dumps crowdfunding into the realm of &#8220;found money&#8221; and something you might do in your spare time, or if you <em>really</em> need to fund that field work in Montana or wherever and can&#8217;t scrape together the cash for the ticket and food.</p>
<p>Which are noble goals, but not something I think will change the face of science. But I could be wrong &#8211; that&#8217;s what the comments section is for.</p>
<p>&nbsp;</p>
<br />Filed under: <a href='http://confounding.net/category/epidemiology/'>Epidemiology</a>, <a href='http://confounding.net/category/grad-school-life/'>Grad School Life</a>, <a href='http://confounding.net/category/soapbox/'>Soapbox</a>  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/confoundingblog.wordpress.com/326/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/confoundingblog.wordpress.com/326/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=326&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://confounding.net/2013/02/26/crowdfunding-science-an-idea-i-want-to-love/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/3db1b0de2f7cc078cb1f08cbfa1da35e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">epigrad</media:title>
		</media:content>
	</item>
		<item>
		<title>R in 2012: Shamelessly Stealing the New Years&#8217; Python Meme</title>
		<link>http://confounding.net/2013/01/04/r-in-2012-shamelessly-stealing-the-new-years-python-meme/</link>
		<comments>http://confounding.net/2013/01/04/r-in-2012-shamelessly-stealing-the-new-years-python-meme/#comments</comments>
		<pubDate>Fri, 04 Jan 2013 05:22:26 +0000</pubDate>
		<dc:creator>EpiGrad</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[R]]></category>

		<guid isPermaLink="false">http://confounding.net/?p=322</guid>
		<description><![CDATA[In a previous post I reviewed my interactions with Python the programming language and the community following the New Years&#8217; Python Meme that&#8217;s making the rounds on Twitter and the like. And now, I&#8217;m shamelessly stealing it to look at how I used R in 2012. I figure I should probably let the #2012pythonmeme stay Python-only, [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=322&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>In a <a href="http://confounding.net/2013/01/04/new-years-python-meme-python-newbie-edition/">previous post</a> I reviewed my interactions with Python the programming language and the community following the New Years&#8217; Python Meme that&#8217;s making the rounds on Twitter and the like.</p>
<p>And now, I&#8217;m shamelessly stealing it to look at how I used R in 2012. I figure I should probably let the #2012pythonmeme stay Python-only, so I&#8217;m going to try #Rin2012.</p>
<p id="p1"><strong>1. What’s the coolest R application, package, or library you have discovered in 2012?</strong></p>
<p id="p2">This one is like choosing between your favorite children: RStudio and plyr are both new discoveries to me that have <strong>massively</strong> altered the way I work with R, for the better. If I had to pick one, I&#8217;d probably go with RStudio just because of the amount of work that got done this year thanks to a R-specific IDE. Pretty excited about RStudio Server too, I just need a system to run it on. Anyone want to donate one?</p>
<p id="p3"><strong>2. What new programming technique did you learn in 2012</strong></p>
<p id="p4">I&#8217;m going to cheat, and say since plyr didn&#8217;t get the coolest R package award, we&#8217;re going to give it credit here. Because parallelism in R was the new programming technique I learned this year, and for simple &#8220;Apply this function by a grouping variable&#8221; tasks, plyr and it&#8217;s connection to doMC saved some serious time. And it&#8217;s always gratifying to see all the cores on your machine go to 100%.</p>
<p id="p6"><strong>3. Which open source project did you contribute to the most in 2012? What did you do?</strong></p>
<p id="p7">I didn&#8217;t specifically contribute to any open source projects. However, I did publish the code needed for a workshop I taught on mathematical epidemiology on GitHub, and one of the papers I published has a freely available electronic appendix hosted there as well. I&#8217;ve got some plans for next year &#8211; stay tuned.</p>
<p id="p9"><strong>4. Which R blog or website did you read the most in 2012</strong></p>
<p id="p10">CrossValidated and StackOverflow for websites. The amazingly useful <a href="http://www.r-bloggers.com">R-Bloggers</a> lets me cheat in terms of blogs and say &#8220;All of them&#8221;.</p>
<p id="p11"><strong>5. What are the top things you want to learn in 2013</strong></p>
<p id="p12">I&#8217;m interested in doing some social network analysis with R, and there&#8217;s a project that I&#8217;ve got an extensive codebase for in SAS that I want to translate over to R, but haven&#8217;t the faintest clue on how to get started.</p>
<p id="p13"><strong>6. What is the top software, application, or library you wish someone would write in 2013</strong></p>
<p id="p14">Besides the magical R faeries leaving me a copy of the above mentioned SAS code in R? I&#8217;d really like to see a replacement for GillespieSSA designed for intensive, research grade projects.</p>
<p>Want to do your own list? here’s how:</p>
<ul>
<li>copy-paste the questions and answer them in your blog</li>
<li>tweet it with #Rin2012 hashtag</li>
</ul>
<br />Filed under: <a href='http://confounding.net/category/general/'>General</a>, <a href='http://confounding.net/category/r/'>R</a>  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/confoundingblog.wordpress.com/322/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/confoundingblog.wordpress.com/322/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=322&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://confounding.net/2013/01/04/r-in-2012-shamelessly-stealing-the-new-years-python-meme/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/3db1b0de2f7cc078cb1f08cbfa1da35e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">epigrad</media:title>
		</media:content>
	</item>
		<item>
		<title>New Years&#8217; Python Meme &#8211; Python Newbie Edition</title>
		<link>http://confounding.net/2013/01/04/new-years-python-meme-python-newbie-edition/</link>
		<comments>http://confounding.net/2013/01/04/new-years-python-meme-python-newbie-edition/#comments</comments>
		<pubDate>Fri, 04 Jan 2013 05:07:31 +0000</pubDate>
		<dc:creator>EpiGrad</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[Python]]></category>
		<category><![CDATA[Simulation]]></category>

		<guid isPermaLink="false">http://confounding.net/?p=319</guid>
		<description><![CDATA[Ran across this on a number of different sites, just going over this year in terms of my interaction with the Python programming language, the open source community, etc. In a slight twist, I&#8217;m going to do the same thing for R in a second post. 1. What&#8217;s the coolest Python application, framework, or library [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=319&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Ran across this on a number of different sites, just going over this year in terms of my interaction with the Python programming language, the open source community, etc. In a slight twist, I&#8217;m going to do the same thing for R in a second post.</p>
<p id="p1"><strong>1. What&#8217;s the coolest Python application, framework, or library you have discovered in 2012?</strong></p>
<p id="p2"><a href="http://www.picloud.com">PiCloud.</a> I&#8217;ve been following their growth and expansion, and while for the moment I&#8217;ve used them mostly for toy examples and experiments, I think they represent a really promising platform for the kind of on-demand, embarrassingly parallel simulation work I&#8217;ve been doing recently. Some of my Python programming got shelved this year while other stuff got done, so I&#8217;m looking forward to revisiting the platform and seeing how its developed. The PiCloud Notebook has me pretty excited.</p>
<p id="p3"><strong>2. What new programming technique did you learn in 2012</strong></p>
<p id="p4">Honestly, 2012 was the first year I really pushed myself in terms of programming, so I could probably say &#8220;All of them&#8221; and be right. Generally speaking, just thinking &#8220;I could write a function for that&#8221; was a huge step.</p>
<p id="p6"><strong>3. Which open source project did you contribute to the most in 2012? What did you do?</strong></p>
<p id="p7">Nothing really. That isn&#8217;t strictly true for R (see the post to follow this one), but my Python skills are pretty dubious. I do have some ideas bouncing around in my head however, so hopefully 2013 will be the year of a contribution or two.</p>
<p id="p9"><strong>4. Which Python blog or website did you read the most in 2012</strong></p>
<p id="p10">StackOverflow and Python Weekly.</p>
<p id="p11"><strong>5. What are the top things you want to learn in 2013</strong></p>
<p id="p12">I&#8217;d like to push myself a little more in terms of object-oriented programming. It&#8217;s something that I can see the utility of, and would make some of the &#8220;book keeping&#8221; tasks I need to do in a project or two vastly easier, but it&#8217;s still not a technique I&#8217;m comfortable with.</p>
<p>I&#8217;d also like to dabble in Django, moving from &#8220;This result comes out on the command line&#8221; into something useful to the world at large.</p>
<p id="p13"><strong>6. What is the top software, application, or library you wish someone would write in 2013</strong></p>
<p id="p14">What&#8217;s left of my dissertation. Hopefully that someone is me, or I&#8217;m screwed.</p>
<p>Want to do your own list? here&#8217;s how:</p>
<ul>
<li>copy-paste the questions and answer them in your blog</li>
<li>tweet it with #2012pythonmeme hashtag</li>
</ul>
<br />Filed under: <a href='http://confounding.net/category/general/'>General</a>, <a href='http://confounding.net/category/python/'>Python</a>, <a href='http://confounding.net/category/simulation/'>Simulation</a>  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/confoundingblog.wordpress.com/319/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/confoundingblog.wordpress.com/319/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=319&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://confounding.net/2013/01/04/new-years-python-meme-python-newbie-edition/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/3db1b0de2f7cc078cb1f08cbfa1da35e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">epigrad</media:title>
		</media:content>
	</item>
		<item>
		<title>Elsevier Hacked: Can we get some basic security for journal editorial systems?</title>
		<link>http://confounding.net/2012/12/17/elsevier-hacked-can-we-get-some-basic-security-for-journal-editorial-systems/</link>
		<comments>http://confounding.net/2012/12/17/elsevier-hacked-can-we-get-some-basic-security-for-journal-editorial-systems/#comments</comments>
		<pubDate>Mon, 17 Dec 2012 19:10:23 +0000</pubDate>
		<dc:creator>EpiGrad</dc:creator>
				<category><![CDATA[Soapbox]]></category>

		<guid isPermaLink="false">http://confounding.net/?p=304</guid>
		<description><![CDATA[Retraction Watch has a post on the Elsevier Editorial System (ESS) being hacked at some point in the last month, and generating some paper withdrawls because the reviews for it were faked. Sadly, I am not surprised &#8211; some of the security measures taken by journals are a touch out-of-date. The text of one of [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=304&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><a href="http://retractionwatch.wordpress.com/2012/12/11/elsevier-editorial-system-hacked-reviews-faked-11-retractions-follow/">Retraction Watch has a post</a> on the Elsevier Editorial System (ESS) being hacked at some point in the last month, and generating some paper withdrawls because the reviews for it were faked. Sadly, I am not surprised &#8211; some of the security measures taken by journals are a touch out-of-date.</p>
<p><span id="more-304"></span></p>
<p>The text of one of the retractions reads as follows:</p>
<blockquote>
<p id="sp0005">This article has been retracted: please see Elsevier Policy on Article Withdrawal (<a href="http://www.sciencedirect.com/science?_ob=RedirectURL&amp;_method=externObjLink&amp;_locator=url&amp;_issn=00303992&amp;_origin=article&amp;_zone=art_page&amp;_plusSign=%2B&amp;_targetURL=http%253A%252F%252Fwww.elsevier.com%252Flocate%252Fwithdrawalpolicy" target="externObjLink">http://www.elsevier.com/locate/withdrawalpolicy</a>).</p>
<p id="sp0010">This article has been retracted at the request of the Editor-in-Chief.</p>
<p id="sp0015">A referee’s report on which the editorial decision was made was found to be falsified. The referee’s report was submitted under the name of an established scientist who was not aware of the paper or the report, via a fictitious EES account. Because of the submission of a fake, but well-written and positive referee’s report, the Editor was misled into accepting the paper based upon the positive advice of what he assumed was a well-known expert in the field. This represents a clear violation of the fundamentals of the peer-review process, our publishing policies, and publishing ethics standards. The authors of this paper have been offered the option to re-submit their paper for legitimate peer review.</p>
</blockquote>
<p>While we live in an age where all websites are vulnerable, I&#8217;ve found journal submission/editorial/review sites to have a uniquely lax approach to security. As just one example, I&#8217;ve had my password sent to me, in unencrypted plain text by more than one journal&#8217;s system.</p>
<p>Plain text. That&#8217;s&#8230;pretty basic security fail. Right up there with your administrator password being &#8220;password&#8221;.</p>
<p>I assume it hasn&#8217;t been a bigger problem because peer reviewed journals are kind of obscure targets. But apparently not obscure enough.</p>
<br />Filed under: <a href='http://confounding.net/category/soapbox/'>Soapbox</a>  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/confoundingblog.wordpress.com/304/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/confoundingblog.wordpress.com/304/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=304&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://confounding.net/2012/12/17/elsevier-hacked-can-we-get-some-basic-security-for-journal-editorial-systems/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/3db1b0de2f7cc078cb1f08cbfa1da35e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">epigrad</media:title>
		</media:content>
	</item>
		<item>
		<title>End Radio Silence</title>
		<link>http://confounding.net/2012/11/14/end-radio-silence/</link>
		<comments>http://confounding.net/2012/11/14/end-radio-silence/#comments</comments>
		<pubDate>Wed, 14 Nov 2012 14:30:39 +0000</pubDate>
		<dc:creator>EpiGrad</dc:creator>
				<category><![CDATA[Grad School Life]]></category>

		<guid isPermaLink="false">https://confoundingblog.wordpress.com/?p=299</guid>
		<description><![CDATA[As you may (or may not) have noticed, this blog has been fairly quiet recently &#8211; I had entered a period of what a family member referred to as &#8220;Radio Silence&#8221;. A combination of deadlines, trying to get projects out the door, some life-related stress and two major conferences in a month will do that. [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=299&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>As you may (or may not) have noticed, this blog has been fairly quiet recently &#8211; I had entered a period of what a family member referred to as &#8220;Radio Silence&#8221;. A combination of deadlines, trying to get projects out the door, some life-related stress and two major conferences in a month will do that.</p>
<p>I&#8217;m back, hopefully, and should be updating content again regularly soon &#8211; hoping to post some more coding-related posts, some science posts, and I&#8217;ve joined the O&#8217;Reilly Books review network, so there&#8217;s probably some reviews of their books from the perspective of an Epidemiologist coming down the pipeline.</p>
<p>Or at least that&#8217;s the plan.</p>
<br />Filed under: <a href='http://confounding.net/category/grad-school-life/'>Grad School Life</a>  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/confoundingblog.wordpress.com/299/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/confoundingblog.wordpress.com/299/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=299&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://confounding.net/2012/11/14/end-radio-silence/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/3db1b0de2f7cc078cb1f08cbfa1da35e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">epigrad</media:title>
		</media:content>
	</item>
		<item>
		<title>Repeatability, Replication and the Reproducibility Initiative</title>
		<link>http://confounding.net/2012/08/26/repeatability-replication-and-the-reproducibility-initiative/</link>
		<comments>http://confounding.net/2012/08/26/repeatability-replication-and-the-reproducibility-initiative/#comments</comments>
		<pubDate>Mon, 27 Aug 2012 00:38:27 +0000</pubDate>
		<dc:creator>EpiGrad</dc:creator>
				<category><![CDATA[Epidemiology]]></category>
		<category><![CDATA[Soapbox]]></category>

		<guid isPermaLink="false">http://confounding.net/?p=297</guid>
		<description><![CDATA[Almost no one will contest that being able to reproduce the findings from scientific studies is key to advancing science – I say almost no one because in my experience you can always find one person to disagree with anything if you look hard enough. We all acknowledge its possible, have entire sessions devoted to [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=297&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Almost no one will contest that being able to reproduce the findings from scientific studies is key to advancing science – I say almost no one because in my experience you can always find one person to disagree with anything if you look hard enough. We all acknowledge its possible, have entire sessions devoted to fretting about John Ioannidis’ paper (which has, ironically, gotten extended past the actual support in the paper in my opinion), and node sagely when people talk about making code available, writing clear methods sections, etc.</p>
<p>So when press releases and news reports about the <a href="https://www.scienceexchange.com/reproducibility">Reproducibility Initiative </a>started making the rounds on various blogs I read, I looked it over with interest. The concept is simple: Reproducible results are good, and should be rewarded. Validate your study through the initiative, and you&#8217;ll not only get a &#8216;Certificate of Reproducibility&#8217; (of whatever worth that might be to you) and more importantly for most career scientists, the replicated results can be published as an independent paper in the PLOS Reproducibility Collection, and the original study will be marked as reproduced in the parent journal if it&#8217;s one of the Initiative&#8217;s supporters.</p>
<p>That all sounds great&#8230;but as with all things, there&#8217;s a &#8220;but&#8230;&#8221; coming. Or, to my mind, several. More after the jump.</p>
<p><span id="more-297"></span></p>
<p><strong>Cost</strong>: How the Reproducibility Initiative (which I&#8217;m going to shorten to RI from here on out) works as I understand it is you submit your study and some supporting information to RI &#8211; which is an offshoot of the commercial <a href="https://www.scienceexchange.com">Science Exchange</a> linking up providers of expertise with those who need that expertise &#8211; and they hand it off to a blindly matched provider who can validate your results. There&#8217;s a key phrase on their site that&#8217;s important: &#8220;You’ll be responsible for payment of services rendered unless sponsored by a partner organization.&#8221;</p>
<p>That means you&#8217;re footing the bill, or your grant is. That means you&#8217;re essentially paying for a second publication for your work (who knows how valuable those will end up being on a CV) out of progressively more and more stretched research funds. For a big lab this might not be a big thing, but for a small early-stage investigator lab running on the skin of its teeth, or a doctoral student&#8217;s project that managed to swing some funding? Yeah right. Without a funding mechanism, all the RI does is look at a small subset of science &#8211; both &#8220;well heeled&#8221; and &#8220;reproducible&#8221;.</p>
<p><strong>Feasibility</strong>: This is the one that actually worries me the most. What, exactly, can Science Exchange reproduce? For the most part, it appears their strength is in wet-lab life science work. Searches for &#8216;Biostatistics&#8217; and &#8216;High performance computing&#8217; yielded at best one or two relevant results. &#8216;Mathematical modeling&#8217; produced none.</p>
<p>For many laboratory-based publications, this might work swimmingly. But how do you propose replicating a mathematical model? The best I can envision is what I&#8217;ve termed &#8220;Click Run&#8221; reproducibility, which means your code works the way you say it works and puts out what you said it puts out. I&#8217;ve discussed this a bit with a question on <a href="http://stats.stackexchange.com/questions/14999/how-are-we-defining-reproducible-research">CrossValidated</a>, but as far as I&#8217;m concerned, that level of &#8220;does your code do what we think it should&#8221; reproducibility is a low threshold.</p>
<p>If we&#8217;re talking about what you need to really &#8220;reproduce&#8221; an epidemiology study? It&#8217;s not working off the same cohort data. It&#8217;s not rerunning the code. It&#8217;s conducting <em>an entirely new study</em> of the same effect on a different population. That is what&#8217;s needed in the field, it&#8217;s what&#8217;s needed to address the concerns of someone like Ioannidis and it&#8217;s entirely outside the scope of what RI can do.</p>
<p>It seems that science often struggles once it steps away from the bench, and two-group controlled experiments. Statistics becomes murky, causation turns ephemeral, and when people talk about reproducibility, you have to start outlining exactly what you mean. I might have a post on that last one in a few days. In my mind, the goals of the RI are laudable, but between not being able to serve small labs out of budget, and entire fields out of practicality, I wonder if it will do more than double the paper count of a few select labs. That would be a shame.</p>
<p>&nbsp;</p>
<br />Filed under: <a href='http://confounding.net/category/epidemiology/'>Epidemiology</a>, <a href='http://confounding.net/category/soapbox/'>Soapbox</a>  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/confoundingblog.wordpress.com/297/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/confoundingblog.wordpress.com/297/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=297&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://confounding.net/2012/08/26/repeatability-replication-and-the-reproducibility-initiative/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/3db1b0de2f7cc078cb1f08cbfa1da35e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">epigrad</media:title>
		</media:content>
	</item>
		<item>
		<title>That&#8217;s Not How the &#8220;Law of Large Numbers&#8221; Works</title>
		<link>http://confounding.net/2012/03/12/thats-not-how-the-law-of-large-numbers-works/</link>
		<comments>http://confounding.net/2012/03/12/thats-not-how-the-law-of-large-numbers-works/#comments</comments>
		<pubDate>Mon, 12 Mar 2012 20:41:46 +0000</pubDate>
		<dc:creator>EpiGrad</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[Simulation]]></category>
		<category><![CDATA[Soapbox]]></category>

		<guid isPermaLink="false">http://confounding.net/?p=290</guid>
		<description><![CDATA[Breaking my dissertation and administrata induced silence for a small rant combining two of my favorite things &#8211; Apple Computer Inc, and simulation. Recently, the New York Times featured the article &#8216;Apple Confronts the Law of Large Numbers&#8216;. The fundamental assertion? That the earnings growth and stock price of Apple cannot continue its rapid rise. [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=290&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>Breaking my dissertation and administrata induced silence for a small rant combining two of my favorite things &#8211; Apple Computer Inc, and simulation. Recently, the New York Times featured the article &#8216;<a href="http://www.nytimes.com/2012/02/25/business/apple-confronts-the-law-of-large-numbers-common-sense.html?pagewanted=all">Apple Confronts the Law of Large Numbers</a>&#8216;. The fundamental assertion? That the earnings growth and stock price of Apple cannot continue its rapid rise. The justification? The Law of Large Numbers, and the idea that as Apple grows larger, each additional % increase in earnings, profit, etc. represents a bigger and bigger step in terms of the absolute dollar amount.</p>
<p>One problem: That&#8217;s not how the Law of Large Numbers works. More after the jump.</p>
<p><span id="more-290"></span></p>
<p>First, a definition of the Law of Large Numbers. From the article itself:</p>
<blockquote><p>the law states that a variable will revert to a mean over a large sample of results.</p></blockquote>
<p>From Wolfram Alpha:</p>
<blockquote><p>A &#8220;law of large numbers&#8221; is one of several theorems expressing the idea that as the number of trials of a random process increases, the percentage difference between the expected and actual values goes to zero.</p></blockquote>
<p>So both definitions are in agreement. So it&#8217;s not the definition of the Law that the article has gotten wrong, just its application. The most useful way (for me at least) to see the Law in action is to visualize it. Lets use a very simple simulation of a fair coin flip. We know that the average of a fair coin should be 0.50. Using some simple R code, we can simulate flipping a coin 10,000 times:</p>
<blockquote><p>set.seed(807060)<br />
x &lt;- sample(0:1, 10000, repl=T)<br />
s &lt;- cumsum(x); r &lt;- s/(1:n)<br />
plot(r, ylim=c(.01, .60), type=&#8221;l&#8221;)<br />
lines(c(0,n), c(.50,.50),col=&#8221;red&#8221;)<br />
round(cbind(x,s,r), 5)[1:10,]; r[n]</p></blockquote>
<p>This yields a plot that looks like this:</p>
<p><a href="http://confoundingblog.files.wordpress.com/2012/03/coin-toss.png"><img class="alignnone size-full wp-image-293" title="Coin Toss" alt="Results of a simulated coin toss" src="http://confoundingblog.files.wordpress.com/2012/03/coin-toss.png?w=500&#038;h=500" height="500" width="500" /></a></p>
<p>The black line is the cumulative average, with the red line being the known mean the simulation should converge to over time. Notice how after 1, 2, 10 or even 1000 coin tosses, the average <em>isn&#8217;t</em> 0.5. After 10,000, it&#8217;s considerably closer, but still not necessarily there. After many, many more? That is how the Law of Large Numbers works, and its the underpinning behind most simulation work &#8211; that with enough simulations, you should converge on the actual expected value, even if it isn&#8217;t known &#8211; or (unlike our coin toss example) its hard or impossible to arrive at analytically.</p>
<p>But this has nothing to do with Apple&#8217;s performance, past, present, or future. The company&#8217;s earnings, share price, or profits aren&#8217;t a random process. That it becomes progressively harder to deliver the same % increase as a company grows larger isn&#8217;t wrong, but it isn&#8217;t the Law of Large Numbers. Just because 500 billion <em>is</em> a large number (and in dollar terms, it is a very large number) doesn&#8217;t mean that&#8217;s what operating. There are many, many reasons to wonder if Apple&#8217;s trajectory can continue. Has it lost its small, lean operating patterns now that it&#8217;s the largest company in the U.S.? Can it continue without Steve Job&#8217;s driving personality? Can it keep its track record going? After all, eggs aren&#8217;t the only thing that comes out of a golden goose.</p>
<p>But it&#8217;s not that random processes converge toward the expectation after a progressively larger number of trials. I&#8217;d also like to note that, years ago, when Mr. Dell was calling for Apple to be scrapped and the proceeds given to shareholders, and it was trading for very low double digits instead of middling-high triple digits, noone was going &#8220;You know, the Law of Large Numbers will eventually drag Apple up. It&#8217;s a sure thing!&#8221;</p>
<p>Disclosure: I own stock in Apple. And am rather fond of simulation.</p>
<br />Filed under: <a href='http://confounding.net/category/general/'>General</a>, <a href='http://confounding.net/category/r/'>R</a>, <a href='http://confounding.net/category/simulation/'>Simulation</a>, <a href='http://confounding.net/category/soapbox/'>Soapbox</a>  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/confoundingblog.wordpress.com/290/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/confoundingblog.wordpress.com/290/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=290&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://confounding.net/2012/03/12/thats-not-how-the-law-of-large-numbers-works/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/3db1b0de2f7cc078cb1f08cbfa1da35e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">epigrad</media:title>
		</media:content>

		<media:content url="http://confoundingblog.files.wordpress.com/2012/03/coin-toss.png" medium="image">
			<media:title type="html">Coin Toss</media:title>
		</media:content>
	</item>
		<item>
		<title>My Top 5 Mobile Apps for Scientists</title>
		<link>http://confounding.net/2012/02/16/my-top-5-mobile-apps-for-scientists/</link>
		<comments>http://confounding.net/2012/02/16/my-top-5-mobile-apps-for-scientists/#comments</comments>
		<pubDate>Thu, 16 Feb 2012 07:59:41 +0000</pubDate>
		<dc:creator>EpiGrad</dc:creator>
				<category><![CDATA[Grad School Life]]></category>
		<category><![CDATA[Macs in Research]]></category>
		<category><![CDATA[Reviews]]></category>

		<guid isPermaLink="false">http://confounding.net/?p=285</guid>
		<description><![CDATA[So Labguru recently had a blog post entitled 5 Best Mobile Apps for Research Scientists. It&#8217;s a decent list, though it&#8217;s actually the four best, since your brand new iPad app isn&#8217;t something I&#8217;m sure you can actually count in an impartial list, though it does look cool. It&#8217;s actually a better list than most. [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=285&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>So Labguru recently had a blog post entitled <a href="http://blog.labguru.com/5-best-mobile-apps-for-research-scientists">5 Best Mobile Apps for Research Scientists</a>. It&#8217;s a decent list, though it&#8217;s actually the four best, since your brand new iPad app isn&#8217;t something I&#8217;m sure you can actually count in an impartial list, though it does look cool.</p>
<p>It&#8217;s actually a better list than most. I find myself getting irked when &#8220;Science&#8221; is taken to invariably mean either &#8220;Physics&#8221; or more commonly in life science blogs and the like I read, wet-lab biology/biochemistry. What about us poor theorists? Or population-level empiricists? Do we really need a list dominated by timers to make sure you take your samples out of the water bath in time?</p>
<p>After the jump are my Top 5 apps, hopefully not terribly biased toward my own research. And absolutely not featuring my own (nonexistent) app.</p>
<p><span id="more-285"></span></p>
<p><strong><br />
</strong>For the record, there aren&#8217;t in any particular order.</p>
<p><strong>#1: <a href="http://db.tt/oyIDzG8">Dropbox</a></strong>. Dropbox is seriously amazing. It&#8217;s essentially a shared, virtual drive sitting out in cyberspace, accessible from your desktop, another computer if you need it, your iPad, iPhone, Android devices, the internet&#8230;and sync&#8217;d between them all. It&#8217;s essentially replaced flash drives for me, which are both unreliable and prone to wander. It&#8217;s got a second copy of all my important files. I have it automatically set to be <a href="http://confounding.net/2011/10/03/os-x-terminal-for-fun-and-profit/">symbolically linked</a> with certain work folders on my desktop so anything I put in those folders will be automatically added to Dropbox &#8211; and any files I make on my iPad will show up on my desktop as well. And it&#8217;s free for students, and they give you a little extra space (and double the bonus space for referrals).</p>
<p><strong>#2: <a href="http://www.mekentosj.com/papers/">Papers</a></strong><a href="http://www.mekentosj.com/papers/">.</a> This one started as a Mac-only app, but has migrated to iOS, and now it&#8217;s on Windows as well. It&#8217;s the single most pleasant &#8220;paper repository&#8221; software I&#8217;ve ever used. The citation manager features don&#8217;t quite seem ready for primetime (I tried it with my proposal and they required some editing), but in terms of organizing your papers, keeping PDF files corralled, sharing and searching them? Second to none. And with the iOS application, you can keep your libraries sync&#8217;d, so as long as you have your phone or iPad, you&#8217;re never without that one paper you really need.</p>
<p><strong>#3. <a href="http://www.zinger-soft.com/iSSH_features.html">iSSH</a></strong>. iSSH may be the data science equivalent of the &#8220;time your experiments&#8221; app for the wet-lab folks. Simply put, its an easy to use, remarkably clean SSH terminal application for your iPad/iPhone. Which means if you&#8217;ve got a PC set up at home that you can SSH into, or access to the friendly local university cluster, etc. you can do it from an amazingly portable device. It extends the capabilities of the iPad and keyboard combination massively &#8211; you can now program in R, maraud around Python, even compile and run C applications remotely. Or remotely administer a server. Or check on a long-running job to see if you can come back from your coffee break.</p>
<p>My most dubious use of it was when the East Coast got hit by the earthquake/hurricane combination and I was out of town, SSHing into my machine and assuming that if I could still connect, the damage couldn&#8217;t be <em>that</em> bad. I&#8217;ve also done far more productive things on it.</p>
<p><strong>#4</strong>. <strong><a href="http://products.wolframalpha.com/">Wolfram</a></strong><a href="http://products.wolframalpha.com/"> Alpha</a>. Wolfram Alpha is just a general purpose utility &#8220;I don&#8217;t know something and I need to&#8221; program. Like Google, except it knows you&#8217;re using this for science. A search for &#8220;Clostridium&#8221; gives me the taxonomy of that genus. And other related members of the family. And members of that genus. And its taxonomic network. And it shows its sources.</p>
<p>&#8220;San Francisco&#8221; gets you a map. The current local time. And weather. And weather history. And the cost of living index, and median home price and unemployment rate and nearby airports and hospitals, etc. etc.</p>
<p>&#8220;Odds Ratio&#8221; gets you the formula for&#8230;you guessed it. And a quick calculator.</p>
<p>&#8220;Ethanol&#8221; gives you the formula, compound formula and structure diagram for ethanol. And the molecular weight, melting point, boiling point, density, solubility&#8230;are we getting the idea?</p>
<p>It&#8217;ll even do math for you. Math that&#8217;s <em>way</em> more complicated than most calculator programs.</p>
<p><strong>#5. Some Sort of Text Editor</strong>. This is a matter of personal preference. I&#8217;ve got a copy of Pages, Apple&#8217;s word processing app, and while it&#8217;s powerful, there continue to be issues hopping back and forth between Pages and Word when you have EndNote citations, and it proved to be more trouble than it&#8217;s worth. But some people will want clear, plain, &#8220;White screen, cursor, and my words&#8221; UIs, while others will prefer some of the more familiar text editor features.</p>
<p>Personally, I use &#8220;<a href="http://www.textasticapp.com/">Textastic</a>&#8220;, which has word and character counts, syntax highlighting for programming languages, and can either send files via email or (among other destinations) save directly to DropBox.</p>
<p>So there you have it. My top 5 must have iPad/iPhone apps.</p>
<br />Filed under: <a href='http://confounding.net/category/grad-school-life/'>Grad School Life</a>, <a href='http://confounding.net/category/macs-in-research/'>Macs in Research</a>, <a href='http://confounding.net/category/reviews/'>Reviews</a>  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/confoundingblog.wordpress.com/285/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/confoundingblog.wordpress.com/285/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=285&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://confounding.net/2012/02/16/my-top-5-mobile-apps-for-scientists/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/3db1b0de2f7cc078cb1f08cbfa1da35e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">epigrad</media:title>
		</media:content>
	</item>
		<item>
		<title>Revolution R and Fedora: Revisited</title>
		<link>http://confounding.net/2012/02/11/revolution-r-and-fedora-revisited/</link>
		<comments>http://confounding.net/2012/02/11/revolution-r-and-fedora-revisited/#comments</comments>
		<pubDate>Sat, 11 Feb 2012 04:44:09 +0000</pubDate>
		<dc:creator>EpiGrad</dc:creator>
				<category><![CDATA[R]]></category>

		<guid isPermaLink="false">http://confounding.net/?p=280</guid>
		<description><![CDATA[A previous post of mine had suggested that, despite them being extremely similar operating systems, and really there being no clear reason why, Revolution R 5.0, which does support Red Hat Enterprise Linux, refused to work on Fedora 16. The installation failed, dependencies could not be installed, tech support was singularly unhelpful because I wasn&#8217;t [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=280&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p>A <a href="http://confounding.net/2012/01/24/cbc-reviews-revolution-r-in-which-this-doesnt-go-well/">previous post</a> of mine had suggested that, despite them being extremely similar operating systems, and really there being no clear reason why, Revolution R 5.0, which does support Red Hat Enterprise Linux, refused to work on Fedora 16. The installation failed, dependencies could not be installed, tech support was singularly unhelpful because I wasn&#8217;t using RHEL 5.0, and I essentially said &#8220;nuts to this&#8221; and went back to my trusty, working, free installation of R sitting in my beloved native OS X. But today, the plot thickened&#8230;</p>
<p><span id="more-280"></span></p>
<p>Since that time, Revolution has added support for RHEL 6.0. I figured what the heck, why not give it another go, see if things will be more cooperative this time. Lo and behold, they work! That&#8217;s right, it appears that the newly updated version of Revolution R &#8211; at least the academic version which I was using &#8211; will indeed install and work on the latest version of Fedora.</p>
<p>Now, some caveats:</p>
<ul>
<li>I have no idea if it will work for everyone. I&#8217;m running a brand new download of Fedora 16 via Parallels on my Mac, installed with essentially every developer related package I could find in hopes of covering all my bases. And <strong>only the 64-bit version of Fedora</strong> works. Tried it on the 32-bit version, and it failed spectacularly.</li>
<li>It could break at any time. What with Fedora not being supported.</li>
<li>If it does break, you&#8217;re SOL. My experience with Revolution&#8217;s customer service/tech support folks to date suggests that if you&#8217;re not running RHEL (or maybe CentOS), you&#8217;re SOL if something stops working. So I&#8217;d still be a touch skittish about putting any major analysis into Revolution R running on a Fedora machine, or if you do and get it working, <em>touch nothing.</em></li>
<li>I don&#8217;t actually know that it works. I mean, it installs. &#8216;$ Revo64&#8242; works as a command. It comes up, and I can do my usual &#8220;R, are you working?&#8221; procedure, which involves nothing more than creating two vectors of random numbers, plotting them against each other, and running a quick and dirty lm(). All of that worked, but I sincerely doubt that really tested the guts of Revolution&#8217;s distribution of R. If someone wants to give me some heavier code to put it through its paces and see if it crashes, please feel free to drop me a line or comment.</li>
</ul>
<p>But there you have it. Hope for the future.</p>
<p>Now Revolution, if you&#8217;re listening? My kingdom for an OS X native version.</p>
<br />Filed under: <a href='http://confounding.net/category/r/'>R</a>  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/confoundingblog.wordpress.com/280/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/confoundingblog.wordpress.com/280/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=280&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://confounding.net/2012/02/11/revolution-r-and-fedora-revisited/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/3db1b0de2f7cc078cb1f08cbfa1da35e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">epigrad</media:title>
		</media:content>
	</item>
		<item>
		<title>On Unpublished Software</title>
		<link>http://confounding.net/2012/02/09/on-unpublished-software/</link>
		<comments>http://confounding.net/2012/02/09/on-unpublished-software/#comments</comments>
		<pubDate>Thu, 09 Feb 2012 22:01:16 +0000</pubDate>
		<dc:creator>EpiGrad</dc:creator>
				<category><![CDATA[General]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[SAS]]></category>
		<category><![CDATA[Simulation]]></category>

		<guid isPermaLink="false">http://confounding.net/?p=275</guid>
		<description><![CDATA[sciseekclaimtoken-4f343317d3d60 I ran across this post at The Tree of Life entitled &#8216;Interesting new metagenomics paper w/ one big big big caveat &#8211; critical software not available&#8221;. The long and short of it? Paper appears in Science, has fancy new methodology, lacks the software for someone else to use their methodology. Blog author understandably annoyed. But I [&#8230;]<img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=275&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></description>
				<content:encoded><![CDATA[<p><span style="display:none;">sciseekclaimtoken-4f343317d3d60</span></p>
<p>I ran across <a href="http://phylogenomics.blogspot.com/2012/02/interesting-new-metagenomics-paper-w.html">this post</a> at <em>The Tree of Life</em> entitled &#8216;Interesting new metagenomics paper w/ one big big big caveat &#8211; critical software not available&#8221;.</p>
<p>The long and short of it? Paper appears in Science, has fancy new methodology, lacks the software for someone else to <em>use</em> their methodology. Blog author understandably annoyed. But I have some sympathy with the authors of the paper itself, as much as I prefer the code for an analysis to be available for publication. My thoughts after the jump.</p>
<p><span id="more-275"></span></p>
<p>First, this post was born as a comment on the OP blog. For some reason, blogger hates my WordPress login.</p>
<p>Second, I understand the spirit of the post. In an ideal world, the software would be available. From a stable URL. For every single platform under the sun. Or maybe even a web interface, if the project was feeling particularly fancy. This is How It Should Be &#8482;.</p>
<p>That being said, there are reasons that isn&#8217;t true. I&#8217;ve had one collaborator essentially decide not to give out code because further analysis &#8211; that would appear in future papers &#8211; was already baked in, and they weren&#8217;t going to go through their code line by line to make sure some poor grad student&#8217;s project didn&#8217;t get scooped by someone reading the code carefully, or that in removing that stuff, they didn&#8217;t manage to otherwise break the software.</p>
<p>But lets leave that aside for the moment, and say &#8211; as the original published paper did &#8211; that the purpose of all this is a new technique, with new software, that we&#8217;re hoping people use. There&#8217;s still reasons for the paper to come out and the software not be available yet &#8211; legitimate reasons. The development of software and its use in science, while very closely linked, are actually disjoint processes that need not progress at the same pace. Some issues that have happened to me:</p>
<p>Beautiful software, ready to go, sitting idle for months waiting for the right numbers to come in to make it usable.</p>
<p>Output from software that is interesting science on its own, but the software isn&#8217;t ready for primetime. Maybe it&#8217;s got a hideous command line interface with a dozen opaque arguments that appear in no logical order. Maybe the quick and dirty solution to something that produced interesting results for several datasets is inefficient enough that it needs more memory than something like it should. Maybe the documentation is a series of ad-hoc scribbles on a white board. Or maybe it works on your machine, works on a student&#8217;s machine, but utterly breaks the first time you try it on a colleagues. Or perhaps you simply want to make it <em>better</em>, and while the science is ready to go and won&#8217;t improve by festering for six months, in six months you could have a GUI. Or better performance. Or cross-platform software. Or all of the above.</p>
<p>I can understand the bloggers frustration. And papers that do this should <em>absolutely</em> both provide enough methods detail that you could write your own software if you had the inclination, and focus on those methods, not mysterious code you don&#8217;t have access to. But when I read a paper where there&#8217;s clearly software to be had, but it&#8217;s not available yet, my first thought is &#8220;What went wrong?&#8221;</p>
<br />Filed under: <a href='http://confounding.net/category/general/'>General</a>, <a href='http://confounding.net/category/r/'>R</a>, <a href='http://confounding.net/category/sas/'>SAS</a>, <a href='http://confounding.net/category/simulation/'>Simulation</a>  <a rel="nofollow" href="http://feeds.wordpress.com/1.0/gocomments/confoundingblog.wordpress.com/275/"><img alt="" border="0" src="http://feeds.wordpress.com/1.0/comments/confoundingblog.wordpress.com/275/" /></a> <img alt="" border="0" src="http://stats.wordpress.com/b.gif?host=confounding.net&#038;blog=4193361&#038;post=275&#038;subd=confoundingblog&#038;ref=&#038;feed=1" width="1" height="1" />]]></content:encoded>
			<wfw:commentRss>http://confounding.net/2012/02/09/on-unpublished-software/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
	
		<media:content url="http://0.gravatar.com/avatar/3db1b0de2f7cc078cb1f08cbfa1da35e?s=96&#38;d=identicon&#38;r=G" medium="image">
			<media:title type="html">epigrad</media:title>
		</media:content>
	</item>
	</channel>
</rss>
