<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>GenomeQuest Industry &#187; Next Generation Sequencing</title>
	<atom:link href="http://blog.genomequest.com/tag/next-generation-sequencing/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.genomequest.com</link>
	<description>Conversations on the convergence of SDM, cloud computing, and applications to personalized medicine</description>
	<lastBuildDate>Thu, 12 Jan 2012 23:33:19 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.2</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>How fast is your read mapping algorithm?</title>
		<link>http://blog.genomequest.com/2011/06/how-fast-is-your-read-mapping-algorithm/</link>
		<comments>http://blog.genomequest.com/2011/06/how-fast-is-your-read-mapping-algorithm/#comments</comments>
		<pubDate>Tue, 14 Jun 2011 07:25:38 +0000</pubDate>
		<dc:creator>Henk Heus</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[Message from Technology Team]]></category>
		<category><![CDATA[Bioinformatics]]></category>
		<category><![CDATA[Bowtie]]></category>
		<category><![CDATA[BWA]]></category>
		<category><![CDATA[GASSST]]></category>
		<category><![CDATA[GenomeQuest]]></category>
		<category><![CDATA[GenomeQuest Engine]]></category>
		<category><![CDATA[GQ-Engine]]></category>
		<category><![CDATA[Next Generation Sequencing]]></category>
		<category><![CDATA[Sequence Data Management]]></category>

		<guid isPermaLink="false">http://blog.genomequest.com/?p=379</guid>
		<description><![CDATA[
This is a question that is often asked when I demo the GenomeQuest platform to potential customers. I always answer that question in three phases.
The first phase goes like this: &#8220;It&#8217;s really fast, it&#8217;s certainly not any slower than BWA/Bowtie or anything else out there.&#8221;. Next question is always: &#8220;Well, do you have any benchmarks?&#8221;.
Which [...]]]></description>
			<content:encoded><![CDATA[<div>
<p><!-- p.p1 {margin: 0.0px 0.0px 0.0px 0.0px; line-height: 19.0px; font: 13.0px Georgia} p.p2 {margin: 0.0px 0.0px 0.0px 0.0px; line-height: 19.0px; font: 13.0px Georgia; min-height: 15.0px} li.li1 {margin: 0.0px 0.0px 0.0px 0.0px; line-height: 19.0px; font: 13.0px Georgia} ul.ul1 {list-style-type: square} -->This is a question that is often asked when I demo the GenomeQuest platform to potential customers. I always answer that question in three phases.</p>
<p>The first phase goes like this: &#8220;It&#8217;s really fast, it&#8217;s certainly not any slower than BWA/Bowtie or anything else out there.&#8221;. Next question is always: &#8220;Well, do you have any benchmarks?&#8221;.</p>
<p>Which nicely transitions me into the second phase of the answer. This phase is much more rigorous and usually starts with: &#8220;Well, it depends. Let me try to explain.&#8221;:</p>
<ul>
<li>Any good computer scientist can write a mapping algorithm that is really fast. However, that doesn&#8217;t mean anything unless it produces the kind of results that are needed. The NGS application you&#8217;re working with matters here. For example, finding genetic variation in human disease requires very accurate mapping with mismatches and indels. In contrast, with digital gene expression in maize you can cut a lot of corners. You just need to have a rough idea of the number of alignments on a transcript.</li>
<li>Then there is the matter of the data and the technology that produced it. Do you have long reads (more than 120 bp)? Will your mapper handle them? Will it also increase the number of mismatches / indels it can use to align a read? Will it significantly slow down execution, or eat up all your memory when reads get longer? Does this mapper also support local alignments when you need them? Will it align in colorspace? In paired end mode? I could go on.</li>
<li>Next there is the matter of connecting results to the following step in your pipeline. How long does it take to de-duplicate a  1TB alignment file in SAM/BAM format? Or to find those alignments who&#8217;s position overlaps with your exome capture experiment, or all dbSNP entries? At GenomeQuest we have a very efficient way of storing and handling alignments (including sequences/annotation). This saves real time, especially when compared to the alignment step itself (it saves a lot of disk space as well by the way).</li>
</ul>
<p>By the time we get to the third phase of the answer I&#8217;m usually much more confident: &#8220;Well, how fast do you need our mapping algorithm to be?&#8221;.</p>
<ul>
<li>Does it really matter how fast the read mapper is, as long as it&#8217;s comparable in performance to other algorithms for most common use cases? Does it matter if you have the alignments in 2.5 hours instead of 3? Maybe if you analyze thousands of samples per week it matters, but then other things like reliability and professional software support should matter as well.</li>
<li>Do these other algorithms scale with the hardware you throw at them? How easy is it to run a read mapping on 64 compute nodes, with 2 CPUs, 8 cores per CPU per node? What about if you double the amount of hardware? Will you go twice as fast? With the GQ- Engine  you will. Want to run on 1024 nodes? That&#8217;s possible.</li>
<li>Are you asking about speed for a single run, or the throughput for a bunch of runs? Last weekend I ran 2000 NGS read databases though our read mapping workflow (low coverage genome sequencing, about 80M reads per database). I started them on Friday afternoon, went for drinks with my friends that evening, had a nice family dinner on Saturday afternoon, and watched a movie with my kid afterwards. The runs were finished before I woke up on Sunday. No hiccups, no failed runs, no logs to monitor, and &#8211; best of all &#8211; no &#8220;one million-file&#8221; directories to organize. There were a lot of other customers on the system that weekend, doing their NGS analysis as well.</li>
</ul>
<p>If we ever meet for a demo, please ask me this question. I love to talk about it.</p>
<p>Henk Heus, Ph.D.<br />
VP Product Management &amp; Services<br />
GenomeQuest Inc.</p>
<p>At GenomeQuest we use an extended version of the GASSST read mapping algorithm (among others). Read about it here in Bioinformatics here: <a title="http://bioinformatics.oxfordjournals.org/content/26/20/2534.abstract" href="http://bioinformatics.oxfordjournals.org/content/26/20/2534.abstract" target="_self">http://bioinformatics.oxfordjournals.org/content/26/20/2534.abstract</a></p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://blog.genomequest.com/2011/06/how-fast-is-your-read-mapping-algorithm/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Announcing ChIP-Seq Support</title>
		<link>http://blog.genomequest.com/2009/11/announcing-chip-seq-support/</link>
		<comments>http://blog.genomequest.com/2009/11/announcing-chip-seq-support/#comments</comments>
		<pubDate>Mon, 09 Nov 2009 13:30:19 +0000</pubDate>
		<dc:creator>GenomeQuest</dc:creator>
				<category><![CDATA[Message from Technology Team]]></category>
		<category><![CDATA[ChIP-Seq]]></category>
		<category><![CDATA[GenomeQuest]]></category>
		<category><![CDATA[GenomeQuest 6.0Beta]]></category>
		<category><![CDATA[Next Generation Sequencing]]></category>
		<category><![CDATA[NGS]]></category>
		<category><![CDATA[RNA-Seq]]></category>

		<guid isPermaLink="false">http://blog.genomequest.com/?p=75</guid>
		<description><![CDATA[GenomeQuest released its ChIP-Seq workflow this week, available to anyone with a Free Basic Account inside of GenomeQuest. ]]></description>
			<content:encoded><![CDATA[<p>We&#8217;ve released our ChIP-Seq workflow this week, available to anyone with a Free Basic Account inside of GenomeQuest. Like all of our NGS workflows, it runs in two basic steps: a mapping step and a downstream analysis step. In this case, of course, the downstream analysis is a peak-finding algorithm. We chose the MACS modeling software for peak modeling. (You can see the entire workflow&#8217;s documentation <a href="http://wiki.genomequest.com/index.php/ChipSeq_Workflow">here</a>.) Integrated into the GenomeQuest Sequence Data Management platform, it outputs a heavily annotated <strong>sequence database</strong>, which can then be interactively filtered, grouped, sorted, and mined for peaks of interest. And this can all be connected to your RNA-Seq and resequencing data to get the global picture.</p>
<p>So now researchers can go from their ChIP-Seq NGS runs directly to gene-based annotation of the peaks found by their biology. Select regions of interest, or genes of interest, or peaks of a certain class, and drill down to see the actual evidence that backs up the call.</p>
<p>We&#8217;re giving away free ChIP-Seq runs to the first 100 people to <a href="http://www.genomequest.com/basic-registration/">sign up</a>.</p>
<p>As always, feel free to leave a comment &#8211; we read every one.</p>
<p>Richard J. Resnick<br />
VP Software and Services</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.genomequest.com/2009/11/announcing-chip-seq-support/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Science Advisory Board</title>
		<link>http://blog.genomequest.com/2009/08/science-advisory-board/</link>
		<comments>http://blog.genomequest.com/2009/08/science-advisory-board/#comments</comments>
		<pubDate>Thu, 27 Aug 2009 12:00:20 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[GenomeQuest]]></category>
		<category><![CDATA[Next Generation Sequencing]]></category>
		<category><![CDATA[NGS]]></category>
		<category><![CDATA[Science Advisory Board]]></category>
		<category><![CDATA[SDM]]></category>
		<category><![CDATA[Sequence Data Management]]></category>

		<guid isPermaLink="false">http://blog.genomequest.com/?p=33</guid>
		<description><![CDATA[Dr. Mark Boguski appointed to Science Advisory Board.]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m excited to start our advisory board, with the appointment of Dr. Mark Boguski. Since our initial financing in 2005, our investors have urged us to build the science advisory board.</p>
<p>So what took so long? Until now, it wasn&#8217;t necessary. We provided clear business value to pharma for a well-defined use case. An advisory board might have even been a distraction.</p>
<p>So what&#8217;s changed? As we build out the sequence data management (SDM) platform, we want to see beyond this year’s application of next generation sequencing (NGS), and make sure we understand where the industry is going.</p>
<div id="attachment_44" class="wp-caption alignright" style="width: 107px"><img class="size-full wp-image-44 " title="Mark Boguski" src="http://blog.genomequest.com/wp-content/uploads/2009/08/Mark-Boguski_3.jpg" alt="Mark Boguski" width="97" height="118" /><p class="wp-caption-text">Dr. Mark Boguski</p></div>
<p>Mark is a perfect advisor for this initiative. His practical experience at NCBI, Rosetta, and Novartis, and his current vantage point at Harvard Medical School and Beth Israel, places him squarely with a view to the future uses of sequence data in a clinical setting, and with firm grounding in the practical applications of sequence data for the past 20 years.</p>
<p>It’s an honor to have Mark join us as a scientific advisor. We hope to build a diverse advisory team to complement him with skills and experiences that reflect the diversity of talents converging on the digital revolution in biology.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.genomequest.com/2009/08/science-advisory-board/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>GenomeQuest 6.0Beta in Bio-IT World</title>
		<link>http://blog.genomequest.com/2009/08/genomequest-6-0beta-in-bio-it-world/</link>
		<comments>http://blog.genomequest.com/2009/08/genomequest-6-0beta-in-bio-it-world/#comments</comments>
		<pubDate>Wed, 19 Aug 2009 13:47:41 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[Cloud Computing]]></category>
		<category><![CDATA[SDM]]></category>
		<category><![CDATA[Cloud]]></category>
		<category><![CDATA[GenomeQuest 6.0Beta]]></category>
		<category><![CDATA[Next Generation Sequencing]]></category>
		<category><![CDATA[NGS]]></category>

		<guid isPermaLink="false">http://blog.genomequest.com/?p=19</guid>
		<description><![CDATA[We have deliberately not over-played the "cloud" aspect of our offering, but as the response from the Bio-IT World article shows, there is a lot of interest in the potential economics of the cloud and customer are actively evaluating its potential.]]></description>
			<content:encoded><![CDATA[<p>I was recently <a href="http://www.bio-itworld.com/news/genomequest-cloud-next-gen-sequencing-data.html">interviewed for an article by Kevin Davies at Bio-IT World</a> on our GenomeQuest 6.0Beta launch. He immediately connected our story to &#8220;NGS&#8221; and &#8220;Cloud&#8221; computing. We have deliberately not over-played the &#8220;cloud&#8221; aspect of our offering, but as the response from the article shows, there is a lot of interest in the potential economics of the cloud and customer are actively evaluating its potential.</p>
<p>From my view, cloud vendors such as Amazon and others will follow the commodity pricing curve of the instruments and become a way to process larger and larger data sets with fewer and fewer dollars. For us, cloud computing is an opportunity to offer more customers more services at lower costs. However, the winning strategy will require value added software, knowledge and skills to harness those resources for organizations and their end-users.</p>
<p>More on this topic in a future post.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.genomequest.com/2009/08/genomequest-6-0beta-in-bio-it-world/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

