<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>PolITiGenomics &#187; TCGA</title>
	<atom:link href="http://www.politigenomics.com/tag/tcga/feed" rel="self" type="application/rss+xml" />
	<link>http://www.politigenomics.com</link>
	<description>Politics, Information Technology, and Genomics</description>
	<lastBuildDate>Thu, 21 Apr 2011 17:49:06 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.2.1</generator>
		<item>
		<title>The central dogma of cancer genomics</title>
		<link>http://www.politigenomics.com/2008/10/the-central-dogma-of-cancer-genomics.html</link>
		<comments>http://www.politigenomics.com/2008/10/the-central-dogma-of-cancer-genomics.html#comments</comments>
		<pubDate>Fri, 24 Oct 2008 17:44:45 +0000</pubDate>
		<dc:creator>dd</dc:creator>
				<category><![CDATA[genomics]]></category>
		<category><![CDATA[health]]></category>
		<category><![CDATA[informatics]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[TCGA]]></category>

		<guid isPermaLink="false">http://www.politigenomics.com/?p=466</guid>
		<description><![CDATA[Today, Chris Sander, head of the Computational Biology Center at Memorial Sloan Kettering Cancer Center (MSKCC), gave a talk at The Genome Center. His group at MSKCC performs a wide variety of research on cancer, covering a wide range of biological scales. His talk today spanned this wide range of scales, from interpreting SNPs to [...]]]></description>
			<content:encoded><![CDATA[<p>Today, <a href="http://www.mskcc.org/mskcc/html/9292.cfm">Chris Sander</a>, head of the <a href="http://cbio.mskcc.org/">Computational Biology Center</a> at <a href="http://www.mskcc.org/">Memorial Sloan Kettering Cancer Center (MSKCC)</a>, gave a talk at <a href="http://genome.wustl.edu/">The Genome Center</a>. His <a href="http://cbio.mskcc.org/research/sander.html">group at MSKCC</a> performs a wide variety of <a href="http://cbio.mskcc.org/cancergenomics/gbm/">research on cancer</a>, covering a wide range of biological scales. His talk today spanned this wide range of scales, from interpreting SNPs to modeling cellular pathways of cancer. Thanks to the <a href="http://en.wikipedia.org/wiki/Central_dogma_of_molecular_biology">central dogma of molecular biology</a>, single-nucleotide polymorphisms can readily be translated to amino acid changes. Using knowledge of protein structures and active sites, conformational or activity changes that may lead to changes in protein function can be inferred from these amino acid changes. In addition, the conservation of specific amino acids in homologous proteins across species or across paralogous proteins within a species, can indicate that specific amino acids are important to function; i.e., selective pressures have prevented these amino acids from changing over time. Applying this approach to their participation in the <a href="http://cancergenome.nih.gov/">TCGA</a> study of glioblastoma, they have developed a <a href="http://cbio.mskcc.org/cancergenomics/gbm/mutations/">web interface that allows researchers to evaluate a likelihood that the SNPs found through sequencing of glioblastoma tumors are functionally important</a>. Moving to a larger size scale, he then talked about their <a href="http://cbio.mskcc.org/cancergenomics/gbm/cna/">TCGA work evaluating copy-number variations in glioblastoma samples</a> using RAE. From there, he presented their work developing a glioblastoma pathways network from the genes that were mutated and/or amplified or deleted. Cycling through graphical pathways representations that highlighted amplications and deletions of genes in different patients, it was clear that each patient had their own disease, but there were some commonalities between them. He concluded with their efforts to <a href="http://cbio.mskcc.org/cancergenomics/gbm/pathways/">model these complex cancer pathways</a> and how different combinations of cancer drugs affect them using a set of differential equations with some ideas borrowed from neural networks. As someone who has done modeling of complex systems myself, I found their approach very satisfying; N&times;N interactions, Monte Carlo simulations, gradient-following optimization. Good stuff. The question still remains however: pathways-level analysis will cover 60-80% of people with a specific sub-type of cancer. What about the other 20-40% whose mutations don&#8217;t fall into those pathways?</p>
]]></content:encoded>
			<wfw:commentRss>http://www.politigenomics.com/2008/10/the-central-dogma-of-cancer-genomics.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Towards a cure for cancer</title>
		<link>http://www.politigenomics.com/2008/10/towards-a-cure-for-cancer.html</link>
		<comments>http://www.politigenomics.com/2008/10/towards-a-cure-for-cancer.html#comments</comments>
		<pubDate>Wed, 22 Oct 2008 18:52:37 +0000</pubDate>
		<dc:creator>dd</dc:creator>
				<category><![CDATA[genomics]]></category>
		<category><![CDATA[health]]></category>
		<category><![CDATA[Illumina]]></category>
		<category><![CDATA[privacy]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[TCGA]]></category>
		<category><![CDATA[wustl]]></category>

		<guid isPermaLink="false">http://www.politigenomics.com/?p=419</guid>
		<description><![CDATA[Recently, The Genome Center&#8216;s Dr. Elaine Mardis gave a talk at the Cold Spring Harbor&#8217;s Personal Genomes meeting. The topic of the talk was the ever increasing efforts of the genomics community to understand the molecular nature of cancer. In this blog, I have discussed projects like The Cancer Genome Atlas (TCGA) and Tumor Sequencing [...]]]></description>
			<content:encoded><![CDATA[<p>Recently, <a href="http://genome.wustl.edu/">The Genome Center</a>&#8216;s Dr. Elaine Mardis gave a talk at the Cold Spring Harbor&#8217;s Personal Genomes meeting. The topic of the talk was the ever increasing efforts of the genomics community to understand the molecular nature of cancer. In this blog, I have discussed projects like <a href="http://cancergenome.nih.gov/">The Cancer Genome Atlas (TCGA)</a> and <a href="http://www.genome.gov/19517442">Tumor Sequencing Project (TSP)</a>, but what are we actually doing in these projects to try to better understand and, ultimately, cure cancer? As with most things in genomics, the answer to that question changes day to day; with ever more powerful techniques being applied to study cancer. It is the stated goal of the <a href="http://www.cancer.gov/">National Cancer Institute (NCI)</a> to end suffering and death from cancer by the middle of the next decade. This post is about how we are going to do that.</p>
<p>Cancer is a disease of the genome. Mutations occur over time in the DNA of every cell. Most of these mutations are benign; they have no affect on the normal operation of the cell.  Some mutations or combinations of mutations can be deleterious. If the mutation causes the cell to die, the result is that a single cell dies and there is likely little affect on the organism as a whole. Some mutations, however, may affect the cell cycle (cell replication) in such a way that the cell replicates uncontrollably. Since all of the uncontrollably replicating cell&#8217;s progeny have the same mutations, they and their progeny also replicate uncontrollably. Since these cells grow and divide much more quickly than the surrounding cells, these tumor cells quickly come to dominate the tissue in which they originated, starving other cells in the tissue of nutrients.  The rapid replication of cells often leads to more mutations being accumulated until the tumor metastasizes and the cancer spreads throughout the body.  Over the past few decades, cancer care has improved for many types of cancer as early detection has increased and various new chemotherapeutics have been developed.  More recently, drugs that target specific cell processes, or pathways, whose increase or decrease in function due to mutations play a role in cancer have been developed and used effectively in the clinic.  Unfortunately, the many diseases that fall under the name cancer are widely varied and extremely complex.  Breakthroughs in understanding and treating one subtype of cancer may not be applicable to other cancers or even other subtypes of the same cancer. Indeed, even within a tumor there can be heterogeneity; all cells will have the mutations that led to tumorigenesis but different subpopulations within the tumor may appear, each accumulating its own specific mutations. Thus we need a better molecular, DNA-level understanding of the many diseases collectively called cancer. In other words, for each tumor we need to understand what mutations lead to tumorigenesis, how those mutations affected cell replication, and how to reverse that change of function.</p>
<p>To begin to discover more about the molecular nature of cancer, several years ago we began the Tumor Sequencing Project. In TSP, we use both array-based technologies and DNA sequencing to study tumor and normal tissues from a cohort of about 200 patients with lung adenocarcinoma, a form of lung cancer that affects a disproportionate amount of never-smokers. <a href="http://en.wikipedia.org/wiki/SNP_array">SNP arrays</a> were used to determine <a href="http://en.wikipedia.org/wiki/Copy_number_variation">copy-number variation (CNV)</a>. The CNV work was published last year in Nature, <a href="http://www.nature.com/nature/journal/v450/n7171/abs/nature06358.html">Characterizing the cancer genome in lung adenocarcinoma</a>. While the CNV work provided valuable information about large-scale differences between the tumor and normal tissues over the entire genome, it and other array-based approaches cannot provided detailed information about specific mutations. To get a single base level resolution of the differences between an individual&#8217;s tumor and normal genomes, you have to sequence the DNA. At the time TSP started, the cost of sequencing an entire genome was much greater than running a SNP array. Therefore, the low-resolution information from our SNP array investigations and other, similar investigations in the literature were used to guide the selection of specific genes thought to play a role in cancer in general and lung adenocarcinoma in particular. For each patient, we sequenced these genes in both their tumor and normal tissues and looked for differences between the two sequences. Once you have those differences, genes annotations (definitions) can be used to determine if the mutation in the tumor tissue would lead to a change in the amino acid sequence of the protein that gene encodes. If the protein would be changed, other programs can be used  to predict whether the change would lead to a change in protein conformation and a possible loss or gain of function. The cellular pathways in which the mutated protein participates can also be researched to see if any of the pathways control cell division or other important signaling paths in the cell cycle. Once the gene and its pathways are known, they can be correlated with known <a href="http://en.wikipedia.org/wiki/Oncogene">oncogenes</a> and oncogene families. Once that is established, further experiments, e.g., induction of found mutations in disease model organisms, can be done to determine if the mutations and predicted changes do contribute to carcinogenesis. It is also interesting to see how changes that span several patients correlate with phenotypic/patient information, e.g., cancer subtype, patient outcome, age, sex, and smoking status. The initial sequencing results of the TSP will be in tomorrow&#8217;s issue of Nature, <a href="http://www.nature.com/nature/journal/v455/n7216/abs/nature07423.html">Somatic mutations affect key pathways in lung adenocarcinoma</a>.</p>
<p>The Cancer Genome Atlas pilot project, which is studying <a href="http://en.wikipedia.org/wiki/Glioblastoma_multiforme">brain</a>, <a href="http://www.cancer.gov/cancertopics/types/ovarian">ovarian</a>, and <a href="http://www.cancer.gov/CANCERTOPICS/PDQ/TREATMENT/NON-SMALL-CELL-LUNG/PATIENT">lung</a> cancer, has followed much the same path as TSP, albeit on a larger scale. Compared to TSP, a wider variety of array platforms, more patients, and DNA sequencing of more genes have been employed. Initial gene lists for sequencing were obtained through a review of the literature with subsequent lists obtained through the results of the array-based studies. The initial glioblastoma paper from TCGA was published online by Nature last month and, like the TSP paper, will be in tomorrow&#8217;s issue, <a href="http://www.nature.com/nature/journal/v455/n7216/abs/nature07385.html">Comprehensive genomic characterization defines human glioblastoma genes and core pathways</a>.</p>
<p>With both TSP and TCGA, we are able to look at the whole genome in a coarse-grained way and specific parts of the genome (genes) in a fine-grained way. So the question becomes, what are we missing by not studying the entire genome at single base resolution? The advent of next-generation/massively parallel sequencing has allowed us to begin to answer this question. About a year and a half ago, we began whole-genome sequencing of a single <a href="http://en.wikipedia.org/wiki/Acute_myeloid_leukemia">AML</a> patient&#8217;s tumor and normal genomes using the Illumina/Solexa sequencing platform. Generating such a complete picture of a single human genome raised significant privacy issues that needed to be addressed before publishing and data release. The massive amount of data generated for each of these genomes presented significant challenges to our informatics infrastructure. All of the data had to be combed through to find the small variants (single nucleotide variations and small insertions and deletions of sequence) between the tumor and normal genomes (at the time, it was not possible to detect larger variations, e.g., structural rearrangements, with the Illumina/Solexa technology). With this approach, you are not only able to find mutations in genes that have not previously been implicated in cancer, but you are also able to find mutations in sequences conserved across species, microRNAs, regulatory regions, etc.; basically anything annotated in <a href="http://www.ensembl.org/">Ensembl</a> or <a href="http://genome.ucsc.edu/">UCSC</a>. Of course, the effect of mutations in non-genic regions are more difficult to interpret than those in genic regions that alter proteins, but the efforts of the <a href="http://www.genome.gov/10005107">ENCODE</a> project, disease model organism studies, and sequencing more cancer genomes will greatly aid in the interpretation of these mutations.</p>
<p>In summary, these cancer sequencing efforts, especially whole-genome cancer sequencing, are increasing our understanding of cancer at a very rapid pace, laying the groundwork for more individualized approaches to treatment. As our knowledge of tumorigenesis increases, our ability to detect cancers early and treat cancers effectively will also increase. Ultimately, these sorts of studies will make the goal of ending suffering and death from cancer achievable.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.politigenomics.com/2008/10/towards-a-cure-for-cancer.html/feed</wfw:commentRss>
		<slash:comments>6</slash:comments>
		</item>
		<item>
		<title>TCGA project finds link between methylation and patient outcome</title>
		<link>http://www.politigenomics.com/2008/09/tcga-project-finds-link-between-methylation-and-patient-outcome.html</link>
		<comments>http://www.politigenomics.com/2008/09/tcga-project-finds-link-between-methylation-and-patient-outcome.html#comments</comments>
		<pubDate>Fri, 26 Sep 2008 20:51:15 +0000</pubDate>
		<dc:creator>dd</dc:creator>
				<category><![CDATA[genomics]]></category>
		<category><![CDATA[health]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[TCGA]]></category>

		<guid isPermaLink="false">http://www.politigenomics.com/?p=355</guid>
		<description><![CDATA[Researcher from The University of Texas M. D. Anderson Cancer Center recently presented some of their findings from The Cancer Genome Atlas at the American Association for Cancer Research Molecular Diagnostics in Cancer Therapeutic Development. They found that methylation patterns in CpG islands correlate with patient outcomes in glioblastoma. If you are so inclined, you [...]]]></description>
			<content:encoded><![CDATA[<p>Researcher from The University of Texas M. D. Anderson Cancer Center recently presented some of their findings from <a href="http://cancergenome.nih.gov/">The Cancer Genome Atlas</a> at the American Association for Cancer Research Molecular Diagnostics in Cancer Therapeutic Development. They found that <a href="http://www.eurekalert.org/pub_releases/2008-09/aafc-mlk091508.php">methylation patterns in CpG islands correlate with patient outcomes</a> in glioblastoma. If you are so inclined, you can learn more about <a href="http://en.wikipedia.org/wiki/CpG_island">CpG islands</a> and <a href="http://en.wikipedia.org/wiki/DNA_methylation">methylation</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.politigenomics.com/2008/09/tcga-project-finds-link-between-methylation-and-patient-outcome.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Cancer Genome Atlas</title>
		<link>http://www.politigenomics.com/2008/09/the-cancer-genome-atlas.html</link>
		<comments>http://www.politigenomics.com/2008/09/the-cancer-genome-atlas.html#comments</comments>
		<pubDate>Thu, 04 Sep 2008 20:39:15 +0000</pubDate>
		<dc:creator>dd</dc:creator>
				<category><![CDATA[genomics]]></category>
		<category><![CDATA[health]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[TCGA]]></category>
		<category><![CDATA[wustl]]></category>

		<guid isPermaLink="false">http://www.politigenomics.com/?p=189</guid>
		<description><![CDATA[The Cancer Genome Atlas Research Network has published its first paper, Comprehensive genomic characterization defines human glioblastoma genes and core pathways. This paper reports several novel mutations in brain cancer and reflects a tremendous effort of a truly cross-discipline and cross-institution consortium. I&#8217;ll post more on this and other cancer-related efforts soon. In the meantime, [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://cancergenome.nih.gov/">The Cancer Genome Atlas</a> Research Network has published its first paper, <a href="http://www.nature.com/nature/journal/vaop/ncurrent/abs/nature07385.html">Comprehensive genomic characterization defines human glioblastoma genes and core pathways</a>. This paper reports several novel mutations in brain cancer and reflects a tremendous effort of a truly cross-discipline and cross-institution consortium. I&#8217;ll post more on this and other cancer-related efforts soon. In the meantime, you can read the coverage in the press: <a href="http://cancergenome.nih.gov/media/news_9_4_2008.asp">NIH</a>, <a href="http://www.nature.com/news/2008/080904/full/455148a.html">Nature</a>, <a href="http://mednews.wustl.edu/news/page/normal/12291.html">Washington University Medical News</a>, <a href="http://ap.google.com/article/ALeqM5h7WfW3Tye1QjT6veG61ZdLaIOZIwD9303GFO0">AP</a>, <a href="http://www.genomeweb.com/issues/news/149208-1.html">GenomeWeb</a>, and <a href="http://blog.wired.com/wiredscience/2008/09/war-on-cancer-g.html">Wired</a> (or <a href="http://news.google.com/news?q=%22Cancer%20Genome%20Atlas%22&#038;ie=UTF-8&#038;oe=utf-8&#038;rls=org.mozilla:en-US:official&#038;client=firefox-a&#038;um=1&#038;sa=N&#038;tab=wn">choose your own source</a>).</p>
]]></content:encoded>
			<wfw:commentRss>http://www.politigenomics.com/2008/09/the-cancer-genome-atlas.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>War on cancer</title>
		<link>http://www.politigenomics.com/2008/07/war-on-cancer.html</link>
		<comments>http://www.politigenomics.com/2008/07/war-on-cancer.html#comments</comments>
		<pubDate>Mon, 14 Jul 2008 16:02:02 +0000</pubDate>
		<dc:creator>dd</dc:creator>
				<category><![CDATA[genomics]]></category>
		<category><![CDATA[health]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[TCGA]]></category>
		<category><![CDATA[wustl]]></category>

		<guid isPermaLink="false">http://www.politigenomics.com/?p=106</guid>
		<description><![CDATA[There was an article in US News and World Report calling for a new war on cancer. A couple of the more far out comments on the article confirms my impression of your standard US News and World Report reader (you can recognize them on the street because they are wearing tin-foil hats). More recently, [...]]]></description>
			<content:encoded><![CDATA[<p>There was an article in US News and World Report calling for a new <a href="http://health.usnews.com/articles/health/cancer/2008/06/12/we-need-a-new-war-on-cancer.html">war on cancer</a>. A couple of the more far out <a href="http://health.usnews.com/articles/health/cancer/2008/06/12/we-need-a-new-war-on-cancer/comments/">comments on the article</a> confirms my impression of your standard US News and World Report reader (you can recognize them on the street because they are wearing tin-foil hats). More recently, Science had a news item on the recent <a href="http://www.sciencemag.org/cgi/content/full/321/5885/26a">meeting of NCI&#8217;s Board of Scientific Advisors</a> that included a report on <a href="http://cancergenome.nih.gov/">The Cancer Genome Atlas (TCGA)</a> project. The report  presented our progress in understanding glioblastoma, including some novel glioblastoma mutations discovered in our sequencing here at <a href="http://genome.wustl.edu/">The Genome Center</a>. It seems &#8220;big science&#8221; is once again teaching its critics a thing or two.</p>
<p><strong>Update:</strong> Check out my colleague&#8217;s post on this subject: <a href="http://www.massgenomics.org/2008/07/tcga-the-billion-dollar-cancer-project.html">TCGA: The billion-dollar cancer project</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.politigenomics.com/2008/07/war-on-cancer.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Francis Collins stepping down</title>
		<link>http://www.politigenomics.com/2008/05/francis-collins-stepping-down.html</link>
		<comments>http://www.politigenomics.com/2008/05/francis-collins-stepping-down.html#comments</comments>
		<pubDate>Wed, 28 May 2008 20:30:54 +0000</pubDate>
		<dc:creator>dd</dc:creator>
				<category><![CDATA[genomics]]></category>
		<category><![CDATA[politics]]></category>
		<category><![CDATA[health]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[TCGA]]></category>

		<guid isPermaLink="false">http://www.politigenomics.com/?p=83</guid>
		<description><![CDATA[Francis Collins is stepping down as director of the National Human Genome Research Institute (NHGRI) effective August 1, 2008. He&#8217;s had a heck of a run, from overseeing the Human Genome Project on to the direct application of sequencing technology to human health issues in projects like the Tumor Sequencing Project and The Cancer Genome [...]]]></description>
			<content:encoded><![CDATA[<p><a href="http://www.genome.gov/27026551">Francis Collins is stepping down</a> as director of the <a href="http://www.genome.gov/">National Human Genome Research Institute (NHGRI)</a> effective August 1, 2008.  He&#8217;s had a heck of a run, from overseeing the <a href="http://www.genome.gov/10001772">Human Genome Project</a> on to the direct application of sequencing technology to human health issues in projects like the Tumor Sequencing Project and <a href="http://cancergenome.nih.gov/">The Cancer Genome Atlas</a>.  It is fitting that only days before he announced he is stepping down, the legislation he has worked tirelessly on for over a decade, the <a href="http://www.politigenomics.com/2008/02/genomic-privacy.html">Genetic Information Nondiscrimination Act (GINA)</a>, became <a href="http://www.politigenomics.com/2008/05/gina-becomes-law.html">law</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.politigenomics.com/2008/05/francis-collins-stepping-down.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Sequencing genomes almost as a matter of course</title>
		<link>http://www.politigenomics.com/2008/02/sequencing-genomes-almost-as-matter-of.html</link>
		<comments>http://www.politigenomics.com/2008/02/sequencing-genomes-almost-as-matter-of.html#comments</comments>
		<pubDate>Tue, 05 Feb 2008 16:52:00 +0000</pubDate>
		<dc:creator>admin</dc:creator>
				<category><![CDATA[genomics]]></category>
		<category><![CDATA[IT]]></category>
		<category><![CDATA[health]]></category>
		<category><![CDATA[informatics]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[TCGA]]></category>
		<category><![CDATA[wustl]]></category>

		<guid isPermaLink="false">http://localhost/wordpress/?p=18</guid>
		<description><![CDATA[Here is an article on genome sequencing, specifically using next-generation sequencers, for which I was interviewed. The article appeared in St. Louis Commerce Magazine so it is a bit of a fluff piece; but another way to look at it is that it is a gentle introduction. Here is another, more detailed article I was [...]]]></description>
			<content:encoded><![CDATA[<p>Here is an <a href="http://www.stlcommercemagazine.com/archives/august2007/genomes.html">article on genome sequencing</a>, specifically using next-generation sequencers, for which I was interviewed.  The article appeared in <a href="http://www.stlcommercemagazine.com/">St. Louis Commerce Magazine</a> so it is a bit of a fluff piece; but another way to look at it is that it is a gentle introduction.</p>
<p>Here is another, more detailed article I was interviewed for in <a href="http://www.bio-itworld.com/">Bio-IT World</a> titled <a href="http://www.bio-itworld.com/issues/2007/july-aug/cover-story/">SNPing Away at Genome-Wide Disease Association Studies</a>.  It is a good overview of whole-genome array studies and how they will soon give way to whole genome sequencing using next-generation sequencers.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.politigenomics.com/2008/02/sequencing-genomes-almost-as-matter-of.html/feed</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>You&#8217;ve got a plumbing problem</title>
		<link>http://www.politigenomics.com/2008/01/youve-got-a-plumbing-problem.html</link>
		<comments>http://www.politigenomics.com/2008/01/youve-got-a-plumbing-problem.html#comments</comments>
		<pubDate>Tue, 15 Jan 2008 22:06:00 +0000</pubDate>
		<dc:creator>dd</dc:creator>
				<category><![CDATA[genomics]]></category>
		<category><![CDATA[health]]></category>
		<category><![CDATA[informatics]]></category>
		<category><![CDATA[science]]></category>
		<category><![CDATA[TCGA]]></category>

		<guid isPermaLink="false">http://localhost/wordpress/?p=8</guid>
		<description><![CDATA[Last week I attended the TCGA Data Portal Use Case Workshop. TCGA stands for The Cancer Genome Atlas and is an ambitious project to more fully characterize and understand the molecular, i.e., DNA-level, mechanisms at work in cancer. While the end goal is to gain a rich understanding of all types of cancer, TCGA is [...]]]></description>
			<content:encoded><![CDATA[<p>Last week I attended the <a href="http://cancergenome.nih.gov/">TCGA</a> <a href="http://tcga-data.nci.nih.gov/">Data Portal</a> Use Case Workshop.  TCGA stands for The Cancer Genome Atlas and is an ambitious project to more fully characterize and understand the molecular, i.e., DNA-level, mechanisms at work in cancer.  While the end goal is to gain a rich understanding of all types of cancer, TCGA is a three-year pilot program investigating three types of cancer (brain, ovarian, and lung) and testing whether its approach to study cancer is effective and therefore applicable to many more cancers.  Its approach involves bringing together clinicians, whole genome characterization techniques (e.g., copy number variation (CNV), expression, and methylation platforms), and high-throughput genome sequencing to study the molecular changes that lead to and propagate tumors; allowing each different platform to inform and guide investigations in the others.  For example, whole genome array studies (low resolution) have identified regions which show significant differences in tumor and normal tissues for glioblastoma (brain cancer).  Using this low resolution information, the project has identified genes in these regions that have been sequenced (high resolution) to look for actual DNA changes that lead to the anomalies seen and therefore possibility contribute to some aspect of cancer metabolism.</p>
<p>Despite some unjustified grumblings about &#8220;big science&#8221;, it is a good, very important project that will contribute greatly to the <a href="http://www.cancer.gov/">NCI</a>&#8216;s goal of ending suffering and death from cancer by the middle of the next decade.  Unfortunately, one of the biggest positives of this project is also one of its biggest challenges.  Bringing together all these disparate data sources is monumentally challenging.  Even comparing different platforms that ostensibly do the same thing, e.g., CNV using either Affymetrix or Illumina platforms, can be hard to normalize and cross compare.  Add to that clinical, methylation, expression, segmental duplication, rearrangements, and sequencing data and you have a real data integration problem.  Then after you integrate all the data, you have to make sense of it all.  Oh, and you have to do it very reliably at high throughput.</p>
<p>Getting back to the Data Portal Use Case Workshop, it was apparent that there will be a large and very diverse audience that will want to access these data in a large number of very different ways.  Some people will want to start from the patient samples and see what results correlate with those groupings.  Some will want to looks at specific patient clinical information and see if longevity correlates with anything.  Some researchers will want to see if their favorite gene is sequenced or if any of the genes in the pathway they study have mutations.  Some will want to access the data from a genomic or chromosomal standpoint, finding areas of interest and drilling down to see why they are interesting.  And so on and so on.  Regardless of where they start, researchers and clinicians will want to slice and dice the data as many ways as they can to find correlations and insights.</p>
<p>So how do you design a data portal that addresses all these needs?  How do you design a data model that ties all this data together and allows each of the different use cases described above (and many more that were not thought of during the workshop) to be pursued?  How do you display this highly multidimensional data, allowing the user to zoom in and out, and layering on more information without overwhelming the user?  It is a huge challenge worthy of a research project in and of itself.  Unfortunately, the people at the workshop seemed to be falling back to what they know: the <a href="http://genome.ucsc.edu/">UCSC browser</a>, <a href="http://www.broad.mit.edu/cancer/software/genepattern/">GenePattern</a>, the <a href="http://cgwb.nci.nih.gov/">Cancer Genome Workbench</a>, etc.  Not that these aren&#8217;t good tools, but the problem and the audience are much bigger now.  So is the price of failure.  We need to think <a href="http://www.youtube.com/watch?v=s-DqZ8jAmv0">bigger</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://www.politigenomics.com/2008/01/youve-got-a-plumbing-problem.html/feed</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

