Sharing and Managing Lower Level Sequence Data

As TCGA continued to generate more and more data, the size of lower level sequence files had made it necessary for these data to be stored in a larger database. The Cancer Genomics Hub (CGHub) was a secure repository for storing, cataloging and accessing lower levels of sequence data such as sequences and alignments. CGHub was located at University of California Santa Cruz.

Lower level sequence data and metadata were transmitted from the Genome Sequencing Centers (GSCs) and Genome Characterization Centers (GCCs) to CGHub, where they could be accessed by the research community.

The NCI Genomic Data Commons

In June 2016, NCI's Center for Cancer Genomics launched the Genomic Data Commons (GDC), a data-sharing platform for cancer research. The Genomic Data Commons now stores all TCGA data in one location and provides additional resources for data analysis. The TCGA data in the GDC is also now harmonized, meaning that it is standardized and data can be compared across different TCGA tumor types and with other genomics projects in the GDC. Learn more about the GDC.