Key Datasets

The NCI CRDC provides access to a variety of open and controlled datasets from NCI programs and key external cancer programs. Key datasets include: 

Dataset Name | Description | Accessible Through*
The Cancer Genome Atlas (TCGA) | A collaboration between NCI and the National Human Genome Research Institute (NHGRI) that has characterized tumor and normal tissues from 11,000 patients, covering 33 cancer types | GDC, Broad, SB, ISB, IDC
Therapeutically Applicable Research to Generate Effective Treatments (TARGET) | A consortium of extramural and NCI investigators working to characterize and understand hard-to-treat childhood cancers and translate findings into the clinic | GDC, Broad, SB, ISB
Clinical Proteomic Tumor Analysis Consortium (CPTAC) | A national effort to accelerate the understanding of the molecular basis of cancer through the application of large-scale proteome and genome analysis, or proteogenomics | GDC, PDC, Broad, ISB, SB, IDC
Human Cancer Models Initiative (HCMI) | An international consortium that is generating novel, next-generation, tumor-derived culture models complete with genomic and clinical data | GDC, SB, ISB
Cancer Genome Characterization Initiatives (CGCI) | An initiative examining genomes, exomes, and transcriptomes of various types of adult and pediatric cancers | GDC, SB, ISB
Foundation Medicine (FM) | Targeted sequencing data from ~18,000 adult patients, generated by Foundation Medicine Inc., a molecular information company that seeks to match patients with personalized treatment plans | GDC, SB, ISB
Multiple Myeloma Research Foundation (MMRF) | Extensive molecular and clinical data from nearly 1,000 patients, including longitudinal information collected over the course of disease for many of them | GDC, SB, ISB
Genomics Evidence Neoplasia Information Exchange (GENIE) | Over 44,000 cases from the international pan-cancer registry that the American Association for Cancer Research (AACR) initiative continues to collect | GDC, SB, ISB
International Cancer Proteogenomic Consortium (ICPC) | An international consortium that brings together more than a dozen countries to study the application of proteogenomic analysis in predicting cancer treatment success and to share data and results with researchers worldwide, hastening progress for patients | PDC, SB, ISB
Children's Brain Tumor Tissue Consortium (CBTTC) | A collaborative research consortium focused on identifying therapies for children with brain tumors | PDC, SB, ISB
Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO) | A research collaboration to detect colorectal cancer susceptibility loci using genome-wide sequencing | CDS, SB
Comparative molecular life history of spontaneous canine and human gliomas (GLIOMA01) | Characterization of the genomic and transcriptomic landscape of canine glioma to enable cross-species comparative genomic analysis of sporadic glioma | ICDC, SB
Childhood Cancer Data Initiative (CCDI) | Pediatric clinical care and research data generated by children’s hospitals, clinics, or networks | CDS, SB
Human Tumor Atlas Network (HTAN) | Three-dimensional atlases of the cellular, morphological, and molecular features of human cancers over time, based on a diverse cancer patient population, including minority and underserved patients, as well as individuals with high-risk hereditary tumors | CDS, IDC, SB
Applied Proteogenomics Organizational Learning and Outcomes (APOLLO) | Data from a collaboration between NCI, the Department of Defense (DoD), and the Department of Veterans Affairs (VA) that incorporates proteogenomic data into patient care, with a focus on the activity and expression of the proteins that the genome encodes | GDC, PDC, SB


*Repositories: GDC: Genomic Data Commons; PDC: Proteomics Data Commons; IDC: Imaging Data Commons; ICDC: Integrated Canine Data Commons; CDS: Cancer Data Service

Cloud Resources: Broad: Broad Institute FireCloud; SB: Seven Bridges/Cancer Genomics Cloud; ISB: Institute for Systems Biology/Cancer Gateway in the Cloud

Through the NCI Cloud Resources, users can combine their own data with these existing datasets to perform novel analyses.
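
When planning an analysis, it can help to look up which datasets are reachable from a given repository or cloud resource. The sketch below simply transcribes the "Accessible Through" column of the table into a Python dictionary and filters it. The names `DATASET_ACCESS` and `datasets_available_through` are hypothetical and chosen for illustration; this is not a CRDC tool or API, and the mapping is only as current as the table above.

```python
# Transcription of the "Accessible Through" column from the table above.
# DATASET_ACCESS and datasets_available_through are hypothetical names used
# for illustration only; they are not part of any CRDC tool or API.
DATASET_ACCESS = {
    "TCGA": ["GDC", "Broad", "SB", "ISB", "IDC"],
    "TARGET": ["GDC", "Broad", "SB", "ISB"],
    "CPTAC": ["GDC", "PDC", "Broad", "ISB", "SB", "IDC"],
    "HCMI": ["GDC", "SB", "ISB"],
    "CGCI": ["GDC", "SB", "ISB"],
    "FM": ["GDC", "SB", "ISB"],
    "MMRF": ["GDC", "SB", "ISB"],
    "GENIE": ["GDC", "SB", "ISB"],
    "ICPC": ["PDC", "SB", "ISB"],
    "CBTTC": ["PDC", "SB", "ISB"],
    "GECCO": ["CDS", "SB"],
    "GLIOMA01": ["ICDC", "SB"],
    "CCDI": ["CDS", "SB"],
    "HTAN": ["CDS", "IDC", "SB"],
    "APOLLO": ["GDC", "PDC", "SB"],
}


def datasets_available_through(access_point):
    """Return the sorted dataset abbreviations listed under an access point."""
    return sorted(
        name for name, points in DATASET_ACCESS.items() if access_point in points
    )


if __name__ == "__main__":
    # Datasets listed under the Proteomics Data Commons (PDC)
    print(datasets_available_through("PDC"))
    # ['APOLLO', 'CBTTC', 'CPTAC', 'ICPC']
```

The same call works for the cloud resource abbreviations, for example `datasets_available_through("Broad")`.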