Access Tiers

The TCGA dataset has two data access tiers, both of which are available in the Genomic Data Commons (GDC):

  • Open Access data tier
  • Controlled Access data tier
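As a rough illustration of how the two tiers might surface when querying the GDC programmatically, the sketch below builds a filter that restricts a file search to one tier. The endpoint URL and the field names `access` and `cases.project.program.name` are assumptions based on the public GDC API and should be checked against the current GDC documentation. The helper names are hypothetical. The code only constructs the query parameters and does not send a request.

```python
import json

# Assumed GDC files endpoint; confirm against the GDC API documentation.
GDC_FILES_ENDPOINT = "https://api.gdc.cancer.gov/files"

VALID_TIERS = ("open", "controlled")


def build_tier_filter(access, program="TCGA"):
    """Build a GDC-style filter for files in one access tier of one program."""
    if access not in VALID_TIERS:
        raise ValueError(f"access must be one of {VALID_TIERS}, got {access!r}")
    return {
        "op": "and",
        "content": [
            # Field names below are assumptions, not taken from this page.
            {"op": "in",
             "content": {"field": "cases.project.program.name", "value": [program]}},
            {"op": "in",
             "content": {"field": "access", "value": [access]}},
        ],
    }


def build_query_params(access, size=10):
    """Return query parameters suitable for a GET request to the files endpoint."""
    return {
        "filters": json.dumps(build_tier_filter(access)),
        "fields": "file_id,file_name,access",
        "size": str(size),
    }


if __name__ == "__main__":
    print(GDC_FILES_ENDPOINT)
    print(build_query_params("open"))
```

Passing the resulting dictionary as the query string of a GET request would, under these assumptions, list TCGA files in the chosen tier. Listing file metadata is separate from downloading: Controlled Access files would still require an authorized user.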

Open Access Data Tier

The Open Access data tier comprises public data that are not unique to an individual. It does not require user certification.

These data may include:

  • De-identified clinical and demographic data
  • Gene expression data
  • Copy number alterations in regions of the genome
  • Epigenetic data
  • Summaries of data compiled across individuals
  • Anonymized single-amplicon DNA sequence data

Controlled Access Data Tier

The Controlled Access data tier contains data that may be unique to an individual. Direct identifiers are stripped from all data types in this tier. Access requires user certification.

The following data types are in Controlled Access:

  • Primary sequence data (BAM and FASTQ files)
  • SNP6 array level 1 and level 2 data
  • Exon array level 1 and level 2 data
  • VCFs
  • Certain information in MAFs
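The two lists above can be read as a lookup from data type to tier. The following is a minimal sketch of that lookup, using only the data types named on this page. The `Tier` enum, the table, and the function name are hypothetical and are not part of any GDC tool. MAFs are entered as controlled because only certain information in them is restricted, so treating the whole file as controlled is the conservative choice assumed here.

```python
from enum import Enum


class Tier(Enum):
    OPEN = "open"
    CONTROLLED = "controlled"


# Keys are the data types listed on this page, lowercased for lookup.
TIER_BY_DATA_TYPE = {
    "de-identified clinical and demographic data": Tier.OPEN,
    "gene expression data": Tier.OPEN,
    "copy number alterations in regions of the genome": Tier.OPEN,
    "epigenetic data": Tier.OPEN,
    "summaries of data compiled across individuals": Tier.OPEN,
    "anonymized single amplicon dna sequence data": Tier.OPEN,
    "primary sequence data (bam and fastq files)": Tier.CONTROLLED,
    "snp6 array level 1 and level 2 data": Tier.CONTROLLED,
    "exon array level 1 and level 2 data": Tier.CONTROLLED,
    "vcfs": Tier.CONTROLLED,
    # Assumption: only certain MAF information is controlled; be conservative.
    "certain information in mafs": Tier.CONTROLLED,
}


def requires_certification(data_type):
    """Return True if the named data type sits in the Controlled Access tier.

    Raises KeyError for data types not listed on this page.
    """
    tier = TIER_BY_DATA_TYPE[data_type.strip().lower()]
    return tier is Tier.CONTROLLED


if __name__ == "__main__":
    for name in ("VCFs", "Gene expression data"):
        print(name, "->", requires_certification(name))
```

Unknown data types raise an error rather than defaulting to a tier, since guessing "open" for an unlisted type would be the unsafe failure mode.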

Additional Information