AACR 2016: NCI's Cancer Genomics Cloud Pilots

Sunday, Apr 17, 2016, 3:45 PM - 5:15 PM
Room 352, Morial Convention Center

The growth of large-scale sequence data for cancer research is rapidly outstripping the computational capacity available for storage, processing, network transmission, and analysis. In the past, investigators mined public genomics data by downloading it and analyzing it on local hardware. This model is becoming untenable given the growth of biomedical data since the advent of large-scale programs like The Cancer Genome Atlas (TCGA). At its projected completion, TCGA will have generated 2.5 petabytes (PB) of data. The difficulties caused by such scale include:

  1. Maintaining local copies of data at this scale is not cost-effective
  2. Providing local computational capacity to analyze the data is similarly constrained
  3. Electronic transfer of data at this scale is not practical

In this session, we will describe the NCI Cancer Genomics Cloud (CGC) Pilots, an initiative supported by the NCI Center for Biomedical Informatics and Information Technology (CBIIT) to create a new model for computational analysis of biological data. The three pilot projects combine data from the TCGA project with co-located computational capacity and an Application Programming Interface (API) that provides security and data access for developers of analytic tools. This Cloud model democratizes access to NCI-generated genomic data and offers a more cost-effective way to provide computational support to the cancer research community.

The session will open with a general overview of the Cancer Genomics Cloud Pilot initiative and a discussion of the main repository for the TCGA data, the new Genomic Data Commons (GDC), which will be released to the public in May 2016. Each of the three Cancer Genomics Cloud Pilots (Institute for Systems Biology, Broad Institute, Seven Bridges Genomics) will then introduce its Cloud Pilot product, focusing on how the public can effectively use these new resources for standard data analysis, data upload, and custom data analysis.

Finally, for each Cloud Pilot, a user who has worked with it will discuss their experience and how it enabled them to conduct their research more efficiently. The session will close with a 20-minute roundtable in which the three Cloud Pilot representatives and the three user representatives will address questions from the audience.

  • Anthony Kerlavage. National Cancer Institute-CBIIT, Rockville, MD
  • Ishwar Chandramouliswaran. National Cancer Institute-CBIIT, Bethesda, MD
  • Brandi Davis-Dusenbery. Seven Bridges Genomics, Cambridge, MA
  • Ilya Shmulevich. Institute for Systems Biology, Seattle, WA
  • Gad Getz. MGH Cancer Center & Dept of Pathology, Charlestown, MA; and the Broad Institute of MIT and Harvard University, Cambridge, MA
  • Jeffrey H. Chuang. The Jackson Laboratory for Genomic Medicine, Farmington, CT
  • Kuan-lin Huang. Washington University in St. Louis, St. Louis, MO
  • Ignaty Leshchiner. Broad Institute, Cambridge, MA

