<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>GenomeQuest Industry &#187; SDM</title>
	<atom:link href="http://blog.genomequest.com/category/sdm/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.genomequest.com</link>
	<description>Conversations on the convergence of SDM, cloud computing, and applications to personalized medicine</description>
	<lastBuildDate>Thu, 12 Jan 2012 23:33:19 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Analysis can keep pace with sequencing</title>
		<link>http://blog.genomequest.com/2010/11/analysis-can-keep-pace-with-sequencing/</link>
		<comments>http://blog.genomequest.com/2010/11/analysis-can-keep-pace-with-sequencing/#comments</comments>
		<pubDate>Wed, 10 Nov 2010 22:47:58 +0000</pubDate>
		<dc:creator>Richard Resnick</dc:creator>
				<category><![CDATA[GenomeQuest]]></category>
		<category><![CDATA[Informatics Industry]]></category>
		<category><![CDATA[Message from CEO]]></category>
		<category><![CDATA[Personalized Medicine]]></category>
		<category><![CDATA[SDM]]></category>

		<guid isPermaLink="false">http://blog.genomequest.com/?p=273</guid>
		<description><![CDATA[Earlier in the year I wrote an <a href="http://blog.genomequest.com/2010/07/implications-of-exponential-growth-of-global-whole-genome-sequencing-capacity/" target="_blank">article about the growth curve of worldwide sequencing capacity</a> based on current and expected placements of next-generation sequencing instruments. And while worldwide capacity increases at least at a doubling every year for the next five years, I am equally excited about the progress that has been [...]]]></description>
			<content:encoded><![CDATA[<p>Earlier in the year I wrote an <a href="http://blog.genomequest.com/2010/07/implications-of-exponential-growth-of-global-whole-genome-sequencing-capacity/" target="_blank">article about the growth curve of worldwide sequencing capacity</a> based on current and expected placements of next-generation sequencing instruments. And while worldwide capacity increases at least at a doubling every year for the next five years, I am equally excited about the progress that has been made on the analysis side of the industry.</p>
<p>Everyone from Eric Green at the NHGRI to <a href="http://www.politigenomics.com/" target="_blank">David Dooling</a> at Washington University&#8217;s Genome Center to <a href="http://www.bio-itworld.com/2010/issues/sept-oct/physicans.html" target="_blank">Kevin Davies</a> at Bio-IT World rightly acknowledges that the $1,000 genome is a misnomer if you decide to include the cost of computing, analyzing, storing, and querying the data. And if you don&#8217;t &#8211; well, you&#8217;ll be a touch over budget.</p>
<p>Nevertheless, recent innovations have changed the playing field. Just two years ago the debate was whether MAQ, the fastest algorithm at the time for mapping reads, was accurate enough based on its gapless strategy. Today, there are some electrifying algorithms out there based on the Burrows Wheeler transform that move 10 times the number of reads through the same compute pipelines. And where Burrows Wheeler doesn&#8217;t work &#8211; long reads like the kinds we&#8217;re seeing more and more &#8211; there are fantastic new approaches like <a href="http://bioinformatics.oxfordjournals.org/content/26/20/2534.abstract?ijkey=f5zH80QsuCqixRH&amp;keytype=ref" target="_blank">GASSST</a> (which we use to get a 7x increase in mapping speeds with arbitrarily long reads and arbitrarily large gaps).</p>
<p>I gave a talk at ABRF this year entitled, &#8220;The Bioinformatics Bottleneck,&#8221; where I challenged an audience full of bioinformaticians that if we keep getting stuck arguing about the fastest algorithm to map reads and whether to store image files, the industry would simply move on without us. (No, Steve Lincoln, I didn&#8217;t out-Steve you.) Illumina, Life Tech, Roche, and now PacBio are going to keep making machines. Beijing Genomics is going to keep buying them. In the heat of the moment I think I said something about not only throwing out the image files, but also throwing out the reads themselves &#8211; after all it&#8217;s the representation of variation, not the reads, that researchers and clinicians will ultimately care about. As late as March 2010 this was met with shock and even disdain; but today great companies like Complete Genomics and important projects like the Personal Genome Project are doing just that &#8211; providing us with the variant calls only; not the reads. Unless of course you want them &#8211; your choice, by the way, and may your disk drives spin forever.</p>
<p>Now that&#8217;s not to say that we don&#8217;t need to keep mapping. And so we do. GenomeQuest is on track to have mapped 100 billion reads in 2010 at year end &#8211; about 11 million an hour. Of course if our systems were 100% utilized &#8211; an impossible dream for any data center &#8211; the number would be far higher. And next year we expect it will be five times that at least. Mapping goes on, powered by innovations in software, and innovations in hardware as well. Earlier this year we announced <a href="http://www.genomequest.com/GenomeQuest-SGI-WGA-Services.xhtml" target="_blank">a partnership with SGI</a> that enabled us to build a purpose-designed architecture designed specifically for mapping. By ensuring reference data availability on every compute node, we minimize network traffic; through significant redundancy in both compute and head nodes we can ensure quality of service. By the end of 2011 we expect to be able to map 300 whole human genomes at 30x coverage per month, or 6,000 exomes per month.</p>
<p>The idea that so many genomes can move through a uniform, industry tested workflow is enchanting. And it sets our sights on the next realm of opportunity: analysis of thousands of genomes at the same time. Projects such as the 1,000 Genome Project (actually planning more like 2,000 &#8211; and do you really think Durbin and Altschuler will stop there?) are troves of undiscovered knowledge on the basis of disease. Being able to overlay that on top of your own 100 exome project provides critical information on background. So using the same technology, GenomeQuest has an exciting new suite of products for multi-genome analysis that are currently available for early access. As the year rolls on into 2011, these workflows for computing allelic frequency, performing large-scale tumor/normal studies, and asking population-scale questions across thousands of genomes will be further enhanced in close collaboration with our users.</p>
<p>The end game for healthcare is genomic medicine, enabled by the sequencing of individual patients and the large-scale comparison against populations &#8211; from phenotype to clinical presentation to genotype to treatment to response. That&#8217;s why <a href="http://www.genomequest.com/beth-israel-GQ-personalized-health-care.xhtml" target="_blank">we&#8217;ve partnered with Beth Israel Deaconess Medical Center</a> &#8211; to work closely on the development of whole genome reports that can actually be usable by pathologists, like any other lab test. The idea is to map clinical actions to specific variations and present them in a way that a trained genomics physician can ultimately guide the course of treatment. There is a long road ahead and plenty of stakeholders involved, but being in the clinic in 2010 drives our thinking about the research and clinical development applications that exist today in pharma and academia.</p>
<p>Genomics is still a bit of the wild west &#8211; in a single day I can have a meeting with a pharma executive, a seminar on an FDA position-paper, a discussion with agbio researchers on the genomics of polyploid organisms, and a deep dive with a team of bioinformatics talking about word-based hashing algorithms. When genomics is in the clinic this&#8217;ll have to stop.</p>
<p>But in the meantime…</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.genomequest.com/2010/11/analysis-can-keep-pace-with-sequencing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Research Using Next Generation Sequencing: Moving from &#8220;Information Bottleneck&#8221; to &#8220;Information Renaissance&#8221;</title>
		<link>http://blog.genomequest.com/2010/06/research-using-next-generation-sequencing-moving-from-information-bottleneck-to-information-renaissance/</link>
		<comments>http://blog.genomequest.com/2010/06/research-using-next-generation-sequencing-moving-from-information-bottleneck-to-information-renaissance/#comments</comments>
		<pubDate>Fri, 04 Jun 2010 19:39:07 +0000</pubDate>
		<dc:creator>Tony Flynn</dc:creator>
				<category><![CDATA[Informatics Industry]]></category>
		<category><![CDATA[Personalized Medicine]]></category>
		<category><![CDATA[SDM]]></category>

		<guid isPermaLink="false">http://blog.genomequest.com/?p=220</guid>
		<description><![CDATA[This week, I attended the <a href="http://www.consumergeneticsshow.com/">Consumer Genomics Show</a> (more so a “clinical genomics&#8221; show). In a session on Next Generation Sequencing led by five industry experts, a recurring observation was that, on the one hand, steady investments in sequencing technology was indeed delivering breathtaking value and driving down sequencing costs – bravo indeed and [...]]]></description>
			<content:encoded><![CDATA[<p>This week, I attended the <a href="http://www.consumergeneticsshow.com/">Consumer Genomics Show</a> (more so a “clinical genomics&#8221; show). In a session on Next Generation Sequencing led by five industry experts, a recurring observation was that, on the one hand, steady investments in sequencing technology was indeed delivering breathtaking value and driving down sequencing costs – bravo indeed and a new generation of discoveries are upon us!</p>
<p>However, on the other hand, without broad use of new software to analyze this massive volume of data, this group reported an “information bottleneck” for researchers &#8212; exacerbated by a culture of free, fragmented software.</p>
<p>So what to do?</p>
<p>I, for one, am quite optimistic on the topic.</p>
<p>I think that <a href="http://nih.gov/">NIH</a> and market forces will resoundingly address the challenge.  I heard from Eric Green, director of <a href="http://www.genome.gov/">NHGRI</a>, that his team and the NIH see the bottleneck as a critical issue and they are taking steps to adjust their priorities accordingly.</p>
<p>And, while researchers in the past, may have made significant discoveries with free and varied software, I contend that the following market forces are encouraging the best of them and their organizations to substantially and positively raise their software aspirations:</p>
<ul>
<li>the data volumes in whole-genome projects simply demands a new generation of analysis methods, tools, and platforms &#8212; and providers are responding.</li>
<li>more and more research happens in partnership, requiring collaboration software to share data, intermediate findings, and discoveries &#8212; again, solutions exist.</li>
<li>more and more organizations will want to inject public studies directly into their research, remove silos, and create a “continuous/growing/queryable database of variant information” &#8212; available today.</li>
<li>more and more research will be aimed at personalized medicine, diagnostic testing, and clinical applications – areas that call for strict and documented information exchange.</li>
<li>bioinformatics teams and companies will utilize emerging sequence database systems to arm researchers with powerful new discovery applications. (In a conversation yesterday with George Church, he commented: &#8220;how could it be any other way&#8221; when I asked if he envisioned the industry moving to such systems.)</li>
</ul>
<p>All told, motivated by these opportunities, I believe that we are entering an “Information Renaissance” &#8212; powered by  sequencing + software technology!</p>
<p>Keep in mind that the answers lie in the sequence data &#8212; and researchers will not stop until they get them.  And if high-quality, fair-priced software is required to get at them, so be it. Increasingly, great software is becoming a “must have” with “must have” rewards.</p>
<p>I’ve seen this happen in other professions as they turned to computers. Electrical engineers started with free or home-grown software to mostly capture schematic drawings.  Then chip manufacturing technology exploded and a $5B electronic design automation (EDA) software industry was born.  Soon software was synthesizing whole circuitry from simple specs, placing and routing millions of transistors on chips, simulating whole systems before manufacturing, even telling you how to test the chip after manufacturing – all “must have’s” that have co-powered Moore’s Law for over 20 years.</p>
<p>So, as exciting as sequencing performance has made Life Science research in the past decade, I suggest that a co-powered &#8220;Information Renaissance&#8221; will take us to new levels of discovery.  And we all very much look forward to it!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.genomequest.com/2010/06/research-using-next-generation-sequencing-moving-from-information-bottleneck-to-information-renaissance/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>OK to move the data 1 time</title>
		<link>http://blog.genomequest.com/2010/05/ok-to-move-the-data-1-time/</link>
		<comments>http://blog.genomequest.com/2010/05/ok-to-move-the-data-1-time/#comments</comments>
		<pubDate>Mon, 17 May 2010 19:57:53 +0000</pubDate>
		<dc:creator>Richard Resnick</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Informatics Industry]]></category>
		<category><![CDATA[SDM]]></category>

		<guid isPermaLink="false">http://blog.genomequest.com/?p=211</guid>
		<description><![CDATA[<a href="http://www.oicr.on.ca/research/stein.htm">Lincoln Stein</a> lays out &#8220;<a href="http://genomebiology.com/2010/11/5/207">The case for cloud computing in genome informatics</a>&#8221; pretty nicely. The article describes the inflection point of sequencing technology. That is from 1990 to 2004 &#8216;base-pair/$&#8217; doubled every 19 months versus a doubling every 5 months since 2004 to present. There is no end in sight.
Moving data to the [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.oicr.on.ca/research/stein.htm">Lincoln Stein</a> lays out &#8220;<a href="http://genomebiology.com/2010/11/5/207">The case for cloud computing in genome informatics</a>&#8221; pretty nicely. The article describes the inflection point of sequencing technology. That is from 1990 to 2004 &#8216;base-pair/$&#8217; doubled every 19 months versus a doubling every 5 months since 2004 to present. There is no end in sight.</p>
<p>Moving data to the cloud remains the biggest obstacle to cloud adoption. The article makes the <a href="http://www.genomeweb.com/node/940668/?hq_e=el&amp;hq_m=717858&amp;hq_l=13&amp;hq_v=c092034955">case for moving the computation to the data</a> instead of vice versa. Presupposing however that people will be willing to move their data at least 1 time. Otherwise, &#8220;moving the computation to the data&#8221; is an argument for building and maintaining a local compute cluster, nearby the sequencing instrument.</p>
<p>At GQ we realize there is no &#8220;one size fits all&#8221;. We support a hosted, cloud based solution for customers with limited IT expertise or inclination. Or, for lab operations running multiple sequencing instruments, we can install the cloud locally so you benefit from the bandwidth of the Local Area Network connecting the instrument to computing. Either way, data moves only 1 time and GQ solves for scalability with its algorithms and data model.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.genomequest.com/2010/05/ok-to-move-the-data-1-time/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Industry confusing &#8220;Cloud&#8221; with &#8220;Infrastructure&#8221;</title>
		<link>http://blog.genomequest.com/2010/04/industry-confusing-cloud-with-infrastructure/</link>
		<comments>http://blog.genomequest.com/2010/04/industry-confusing-cloud-with-infrastructure/#comments</comments>
		<pubDate>Sun, 04 Apr 2010 23:42:20 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[SDM]]></category>

		<guid isPermaLink="false">http://blog.genomequest.com/?p=181</guid>
		<description><![CDATA[Earlier I <a href="http://blog.genomequest.com/?p=154">blogged</a> on the distinctions between Infrastructure, Platform, and Software-as-a-Service offerings. The message was that &#8220;cloud&#8221; is an overloaded word and takes many forms and has different customer value propositions.
A recent commentary in GenomeWeb &#8220;<a href="http://www.genomeweb.com/blog/considering-cloud-cost-isnt-everything">Considering a Cloud? Cost isn&#8217;t everything&#8230;</a>&#8221; citing the paper &#8220;<a href="http://www.genomeweb.com/sites/default/files/walker.pdf">The Real  Cost of a CPU Hour</a>&#8221; [...]]]></description>
			<content:encoded><![CDATA[<p>Earlier I <a href="http://blog.genomequest.com/?p=154">blogged</a> on the distinctions between Infrastructure, Platform, and Software-as-a-Service offerings. The message was that &#8220;cloud&#8221; is an overloaded word and takes many forms and has different customer value propositions.</p>
<p>A recent commentary in GenomeWeb &#8220;<a href="http://www.genomeweb.com/blog/considering-cloud-cost-isnt-everything">Considering a Cloud? Cost isn&#8217;t everything&#8230;</a>&#8221; citing the paper &#8220;<a href="http://www.genomeweb.com/sites/default/files/walker.pdf">The Real  Cost of a CPU Hour</a>&#8221; illustrates the confusion. The paper benchmarks <a href="http://en.wikipedia.org/wiki/High-performance_computing">HPC</a> on a dedicated cluster versus bare metal resources on Amazon EC2. The conclusion is foregone since commodity EC2 network can&#8217;t keep up with a tuned applications running on a compute cluster with a high-speed network.</p>
<p>A more informative title for the blogger and the article would be: &#8220;Considering  Infrastructure-As-A-Service? Beware if your application requires Message  Passing Interface and therefore High-Speed Network&#8221;.</p>
<p>Fortunately for large scale Sequence Data Management the predominant application mode is &#8220;embarrassingly parallel&#8221; and therefore HPC are not needed, except maybe on the Web Server where response time is critical. The statistics in the article show that  Embarrassingly Parallel applications run acceptably well on Amazon EC2, adding less than 5% overhead to the computations.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.genomequest.com/2010/04/industry-confusing-cloud-with-infrastructure/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Small step for Personalized Medicine</title>
		<link>http://blog.genomequest.com/2010/03/small-step-for-personalized-medicine/</link>
		<comments>http://blog.genomequest.com/2010/03/small-step-for-personalized-medicine/#comments</comments>
		<pubDate>Fri, 12 Mar 2010 01:01:40 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[GenomeQuest]]></category>
		<category><![CDATA[Informatics Industry]]></category>
		<category><![CDATA[SDM]]></category>

		<guid isPermaLink="false">http://blog.genomequest.com/?p=167</guid>
		<description><![CDATA[An article by Nicholas Wade, science writer for the New York Times <a href="http://www.nytimes.com/2010/03/11/health/research/11gene.html?partner=rss&#38;emc=rss">Disease  Cause Is Pinpointed With Genome</a> describes two research teams who have independently sequenced the entire genome of  patients to find the exact genetic cause of their diseases.
A fantastic research blog post at <a href="http://scienceblogs.com/geneticfuture/">Genetic Future</a> titled <a href="http://scienceblogs.com/geneticfuture/2010/03/whole_genome_sequences_dont_al.php?utm_source=feedburner&#38;utm_medium=feed&#38;utm_campaign=Feed%3A+scienceblogs%2Fgeneticfuture+%28Genetic+Future%29">Disease hunting [...]]]></description>
			<content:encoded><![CDATA[<p>An article by Nicholas Wade, science writer for the New York Times <a href="http://www.nytimes.com/2010/03/11/health/research/11gene.html?partner=rss&amp;emc=rss">Disease  Cause Is Pinpointed With Genome</a> describes two research teams who have independently sequenced the entire genome of  patients to find the exact genetic cause of their diseases.</p>
<p>A fantastic research blog post at <a href="http://scienceblogs.com/geneticfuture/">Genetic Future</a> titled <a href="http://scienceblogs.com/geneticfuture/2010/03/whole_genome_sequences_dont_al.php?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed%3A+scienceblogs%2Fgeneticfuture+%28Genetic+Future%29">Disease hunting with whole genome sequences: the good news; the bad news</a> offers readers the scientist take on the studies that motivated Wade&#8217;s article. The author at once celebrates the technical achievements of the papers and tempers our enthusiasm citing the inherent complexity of the genome when dealing with more complex diseases.</p>
<p>Some things caught my attention about the Wade article:</p>
<ul>
<li><em>&#8220;The finding implies that common diseases, surprisingly, are caused by  rare, not common, mutations. In the last few months, researchers have  begun to conclude that a new approach is needed, one based on decoding  the entire genome of patients.&#8221;</em></li>
<li><em>&#8220;The new reports, though involving only single-gene diseases, suggest  that the whole-genome approach can be developed into a way of exploring  the roots of the common multigene diseases.&#8221;</em></li>
<li><em>“We need a way of assessing rare variants better than the genomewide  association studies can do, and whole-genome sequencing is the only way  to do that,” Dr. Lupski said.&#8221;</em></li>
</ul>
<p>We are working as hard as we can to:</p>
<ol>
<li>Aggregate as much public sequence data as possible, including 1K Genomes, PGP, and cancer genome projects</li>
<li>Provide easy ways to combine this data with new sequence data</li>
<li>Provide powerful facilities for grouping and counting on genotype-phenotype properties, <span style="text-decoration: underline;">on a very large scale</span></li>
</ol>
<p>The news is just a landmark on for the progress the industry is making to eliminate barriers and move Personalized Medicine forward: the cost of sequencing many patients is coming down, cloud computing is making unlimited computing feasible, and GenomeQuest is making it tractable to work with this data.</p>
<div><!-- This is a linked template. DO NOT EDIT IT. --></div>
]]></content:encoded>
			<wfw:commentRss>http://blog.genomequest.com/2010/03/small-step-for-personalized-medicine/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Developers Wanted</title>
		<link>http://blog.genomequest.com/2010/03/bioinformatics-and-compuational-biologists-wanted/</link>
		<comments>http://blog.genomequest.com/2010/03/bioinformatics-and-compuational-biologists-wanted/#comments</comments>
		<pubDate>Wed, 10 Mar 2010 23:19:08 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[GenomeQuest]]></category>
		<category><![CDATA[SDM]]></category>

		<guid isPermaLink="false">http://blog.genomequest.com/?p=159</guid>
		<description><![CDATA[Today, we launched our <a href="http://www.genomequest.com/GenomeQuest-opens-API-for-SDM.xhtml">API’s for Sequence Data Management</a> on the cloud.
So what?
GenomeQuest is now for bioinformatics and computational biologists (we call them developers for short). These are people who prefer to write code in Unix, and prefer awk, perl, and sed to Firefox, Internet Explorer, Safari, or Chrome.
So why is that important?
With no [...]]]></description>
			<content:encoded><![CDATA[<p>Today, we launched our <a href="http://www.genomequest.com/GenomeQuest-opens-API-for-SDM.xhtml">API’s for Sequence Data Management</a> on the cloud.</p>
<p>So what?</p>
<p>GenomeQuest is now for bioinformatics and computational biologists (we call them <em>developers</em> for short). These are people who prefer to write code in Unix, and prefer awk, perl, and sed to Firefox, Internet Explorer, Safari, or Chrome.</p>
<p>So why is that important?</p>
<p>With no up-front investment, developers can use the <a href="http://wiki.genomequest.com/index.php/DeveloperAPIOverview">GQ API</a> to write large-scale sequence comparison applications for the cloud without regard for the details of the computing and the reference data. And, they can publish their applications to the <a href="http://www.genomequest.com/basic-registration/">GenomeQuest Web Application</a>, so research biologist customers can use and reuse them.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.genomequest.com/2010/03/bioinformatics-and-compuational-biologists-wanted/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>An SDM Cloud?</title>
		<link>http://blog.genomequest.com/2010/03/an-sdm-cloud/</link>
		<comments>http://blog.genomequest.com/2010/03/an-sdm-cloud/#comments</comments>
		<pubDate>Wed, 10 Mar 2010 23:05:33 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Informatics Industry]]></category>
		<category><![CDATA[SDM]]></category>

		<guid isPermaLink="false">http://blog.genomequest.com/?p=154</guid>
		<description><![CDATA[Executives in the industry sometimes ask me if we are moving our software to the “cloud”. When I say we are already a cloud, then they wonder: “<a href="http://aws.amazon.com/ec2/">then what is Amazon offering</a>?”
Its helpful to think of the cloud as a layered architecture. The <a href="http://www.bvp.com/Default.aspx">VC Bessemer</a> provides a nice definition of this layering.


Infrastructure-as-a-Service: Web-services [...]]]></description>
			<content:encoded><![CDATA[<p>Executives in the industry sometimes ask me if we are moving our software to the “cloud”. When I say we are already a cloud, then they wonder: “<a href="http://aws.amazon.com/ec2/">then what is Amazon offering</a>?”</p>
<p>Its helpful to think of the cloud as a layered architecture. The <a href="http://www.bvp.com/Default.aspx">VC Bessemer</a> provides a nice definition of this layering.</p>
<p><img class="aligncenter" title="Bessemer definition of &quot;cloud&quot;" src="http://www.bvp.com/uploadedImages/About/Investment_Practice/Cloud%20Computing%20Ecosystem(1).gif" alt="" width="347" height="249" /></p>
<ul>
<li>Infrastructure-as-a-Service: Web-services applications framework for provisioning computers, networks, and storage resources without regard to the actual hardware topology. Examples of IaaS are Amazon EC2 and <a href="http://www.eweek.com/c/a/IT-Infrastructure/SGI-Offers-Cyclone-Cloud-Computing-for-HPC-810779/">SGI</a></li>
</ul>
<ul>
<li>Platform-as-a-Service:  Web-services application framework for provisioning data, algorithm, and analysis services and developing new applications. Leveraging all the benefits of the IaaS layer, the <a href="http://wiki.genomequest.com/index.php/DeveloperAPIOverview">GenomeQuest Developer API Framework</a> provides services for managing, comparing, and mining sequence databases without regard to the underlying computers and storage.</li>
</ul>
<ul>
<li>Software-as-a-Service: A finished Web-appliction end-to-end solution for a given end-user application. Examples of this in the Life Science space include the <a href="http://www.ingenuity.com/">Ingenuity</a> IPA product and the GenomeQuest 6.3 SDM platform.</li>
</ul>
<p>To my knowledge, we have the industry&#8217;s only cloud-based &#8220;PaaS&#8221;. With a few commands from a remote desktop, developers can upload data, apply algorithms on hundreds of cores while accessing terabytes of well managed sequence data. In the future, we intend to leverage the extraordinary economies of scale offered by IaaS providers such as Amazon.</p>
<p>If you&#8217;re curious about the API&#8217;s, take a look <a href="http://wiki.genomequest.com/index.php/GenomeQuest_Documentation#Developer_API">here</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.genomequest.com/2010/03/an-sdm-cloud/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Xconomy covers GQ</title>
		<link>http://blog.genomequest.com/2010/02/xconomy-covers-gq/</link>
		<comments>http://blog.genomequest.com/2010/02/xconomy-covers-gq/#comments</comments>
		<pubDate>Fri, 26 Feb 2010 16:38:34 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Informatics Industry]]></category>
		<category><![CDATA[SDM]]></category>

		<guid isPermaLink="false">http://blog.genomequest.com/?p=146</guid>
		<description><![CDATA[Ryan McBride from Xconomy posted this nice <a href="http://www.xconomy.com/boston/2010/02/25/genomequest-wants-to-be-the-google-of-dna-data-searches/?single_page=true">article</a> on GQ yesterday. After our discussion, he asked me if the &#8220;Google of Genomics&#8221; metaphor applied and I did not deny, thus the title: &#8220;GenomeQuest Wants to Be the Google of DNA Data Searches&#8221;
]]></description>
			<content:encoded><![CDATA[<p>Ryan McBride from Xconomy posted this nice <a href="http://www.xconomy.com/boston/2010/02/25/genomequest-wants-to-be-the-google-of-dna-data-searches/?single_page=true">article</a> on GQ yesterday. After our discussion, he asked me if the &#8220;Google of Genomics&#8221; metaphor applied and I did not deny, thus the title: &#8220;GenomeQuest Wants to Be the Google of DNA Data Searches&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.genomequest.com/2010/02/xconomy-covers-gq/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>So, what&#8217;s the argument for cloud computing?</title>
		<link>http://blog.genomequest.com/2010/01/so-whats-the-argument-for-cloud-computing/</link>
		<comments>http://blog.genomequest.com/2010/01/so-whats-the-argument-for-cloud-computing/#comments</comments>
		<pubDate>Mon, 25 Jan 2010 20:48:23 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Informatics Industry]]></category>
		<category><![CDATA[SDM]]></category>
		<category><![CDATA[Bioinformatics]]></category>
		<category><![CDATA[NGS software]]></category>
		<category><![CDATA[Sequence Data Management]]></category>

		<guid isPermaLink="false">http://blog.genomequest.com/?p=122</guid>
		<description><![CDATA[A plot of the <a href="http://images.google.com/imgres?imgurl=http://www.mocom2020.com/data/2009/05/computer-power-future.gif&#38;imgrefurl=http://www.mocom2020.com/2009/05/evolution-of-computer-capacity-and-costs/&#38;usg=__nNgm1nlSJX4QpgaOrsZQnHi0TjM=&#38;h=768&#38;w=1205&#38;sz=116&#38;hl=en&#38;start=1&#38;um=1&#38;tbnid=HchqHMRDWWHr9M:&#38;tbnh=96&#38;tbnw=150&#38;prev=/images%3Fq%3Dcost%2Bof%2Bcomputing%2Bpower%26hl%3Den%26sa%3DN%26um%3D1">Evolution of Computer Capacity and Costs</a> shows that compute power will be 1,000X cheaper in 10 years. How much lower can it go? As this happens the relative cost of managing another computer goes asymptotic to zero, regardless of whether its hosted internally or externally. I don&#8217;t think there is [...]]]></description>
			<content:encoded><![CDATA[<p>A plot of the <a href="http://images.google.com/imgres?imgurl=http://www.mocom2020.com/data/2009/05/computer-power-future.gif&amp;imgrefurl=http://www.mocom2020.com/2009/05/evolution-of-computer-capacity-and-costs/&amp;usg=__nNgm1nlSJX4QpgaOrsZQnHi0TjM=&amp;h=768&amp;w=1205&amp;sz=116&amp;hl=en&amp;start=1&amp;um=1&amp;tbnid=HchqHMRDWWHr9M:&amp;tbnh=96&amp;tbnw=150&amp;prev=/images%3Fq%3Dcost%2Bof%2Bcomputing%2Bpower%26hl%3Den%26sa%3DN%26um%3D1">Evolution of Computer Capacity and Costs</a> shows that compute power will be 1,000X cheaper in 10 years. How much lower can it go? As this happens the relative cost of managing another computer goes asymptotic to zero, regardless of whether its hosted internally or externally. I don&#8217;t think there is an economic argument that shows everyone belongs on the cloud based just on hardware and system administration cost.</p>
<p>Dave Dooling at <a href="http://www.politigenomics.com/2010/01/cloudy-with-a-chance-of-sunshine.html?utm_source=feedburner&amp;utm_medium=feed&amp;utm_campaign=Feed%3A+Politigenomics+%28PolITiGenomics%29">PolITiGenomics</a> finds two good reasons for considering cloud options: when organizations have peak demands for compute power and when limitations on space/power/cooling preclude building a system in-house. These are two good reasons, but hardly enough to justify all the cloud computing hype.</p>
<p>So, what&#8217;s the argument for cloud computing?</p>
<p>Unlike computing which gets cheaper every year, people cost more every year. So, it makes sense to evaluate the annual software development and maintenance costs, the cost of managing the reference databases; integrating and maintaining new applications, the productivity of the end-users and how to change the ratio of end-user-to-support-programmer from 2-to-1 to 10-to-1 or 20-to-1. Cloud computing defined as &#8220;Infrastructure&#8221; (computers, networks, and storage) doesn&#8217;t alleviate these costs.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.genomequest.com/2010/01/so-whats-the-argument-for-cloud-computing/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Challenges in 1000 Genomes Data</title>
		<link>http://blog.genomequest.com/2010/01/challenges-in-1000-genomes-data/</link>
		<comments>http://blog.genomequest.com/2010/01/challenges-in-1000-genomes-data/#comments</comments>
		<pubDate>Mon, 18 Jan 2010 21:30:31 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[SDM]]></category>

		<guid isPermaLink="false">http://blog.genomequest.com/?p=89</guid>
		<description><![CDATA[Variant reports are not the right deliverable for a re-sequencing study.
A well written technical blog &#8216;<a href="http://www.massgenomics.org/2010/01/capture-and-subassembly-with-jay-shendure.html">MassGenomics</a>&#8216; written by Dan Koboldt illustrates why. Dan says &#8220;What’s more, with the advent of next-generation sequencing, I hate to tell you, but people are going to be reporting a lot of false positives.  I guarantee it.  So when [...]]]></description>
			<content:encoded><![CDATA[<p>Variant reports are not the right deliverable for a re-sequencing study.</p>
<p>A well written technical blog &#8216;<a href="http://www.massgenomics.org/2010/01/capture-and-subassembly-with-jay-shendure.html">MassGenomics</a>&#8216; written by Dan Koboldt illustrates why. Dan says &#8220;What’s more, with the advent of next-generation sequencing, I hate to tell you, but people are going to be reporting a lot of false positives.  I guarantee it.  So when you filter all of the variants, you might actually remove the ones you’re looking for.&#8221;</p>
<p>Its easy to see why researchers are not enthusiastic about tabular reports. They want to get into the data on their own, without intermediaries, and they want software to facilitate that, not be in the way.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.genomequest.com/2010/01/challenges-in-1000-genomes-data/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

