Datasets

The CRDC provides access to a variety of open, registered, and controlled datasets from NCI- and NIH-funded programs and key external cancer programs.The following list showcases a number of these datasets but it is not exhaustive. For the full list of available datasets, explore each of the CRDC Data Commons. Many of the available datasets are also accessible through the NCI Cloud Resources (CRs), where users can also bring their own data to a secure cloud workspace for comparative analysis.   

Datasets
Dataset NameDescriptionAccessible Through*
Applied Proteogenomics Organizational Learning and Outcomes (APOLLO)A collaboration between NCI, the Department of Defense (DoD), and the Department of Veterans Affairs (VA), that incorporates proteogenomic data with patient care, with a focus on the activity and expression of the proteins that the genome encodes
GDC, ISB, PDC, SB
Cancer Genome Characterization Initiatives (CGCI)An initiative examining genomes, exomes, and transcriptomes of various types of adult and pediatric cancersGDC, ISB, SB
Cancer Moonshot BiobankA national initiative to collect biospecimens and clinical data from patients receiving standard of care therapy across the U.S. The study's aim is to enhance the understanding of cancer’s molecular dynamics over time.CTDC
Children's Brain Tumor Tissue Consortium (CBTTC)A collaborative research consortia focused on identifying therapies for children with brain tumorsISB, PDC, SB 
Childhood Cancer Data Initiative (CCDI)A consortium of children’s hospitals, clinics, or networks that make their clinical care and research data accessibleCDS, SB
Clinical Proteomic Tumor Analysis Consortium (CPTAC)A national effort to accelerate the understanding of the molecular basis of cancer through the application of proteogenomics (large-scale proteome and genome analysis)FC, GDC, IDC, ISB, PDC, SB
Comparative molecular life history of spontaneous canine and human gliomas (GLIOMA01)A collaborative effort to characterize the genomic and transcriptomic landscape of canine glioma to enable cross-species comparative genomic analysis of sporadic gliomaICDC, SB
Foundation Medicine (FM)Foundation Medicine Inc., a molecular information company, makes accessible sequencing data from thousands of adult patients, in an effort to match patients with personalized treatment plansGDC, ISB, SB 
Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO)A research collaboration to detect colorectal cancer susceptibility loci using genome-wide sequencingCDS, SB
Human Cancer Model Initiative (HCMI)An international consortium that is generating novel, next-generation, and tumor-derived culture models complete with genomic and clinical dataGDC, ISB, SB
Human Tumor Atlas Network (HTAN)An NIH initiative that makes accessible three-dimensional atlases of the cellular, morphological, and molecular features of human cancers over time, based on a diverse cancer patient populations, including minority and underserved patients, as well as individuals with high-risk hereditary tumors
CDS, IDC, ISB, SB
International Cancer Proteogenome Consortium (ICPC)An international consortium that brings together more than a dozen countries to study the application of proteogenomic analysis in predicting cancer treatment success, and to share data and results with researchers worldwideISB, PDC, SB
Multiple Myeloma Research Foundation (MMRF)The Foundation shares secure, molecular and clinical data, including longitudinal information collected over the course of disease for many patientsGDC, ISB, SB 
The Cancer Genome Atlas (TCGA)A collaboration between NCI and the National Human Genome Research Institute (NHGRI) that has characterized tumor and normal tissues from thousands of patients and dozens of cancer types GDC, FC, IDC, ISB, SB
Therapeutically Applicable Research to Generate Effective Treatments (TARGET)A consortium of extramural and NCI investigators working to characterize and understand hard-to-treat childhood cancers and translate findings into the clinicGDC, ISB, FC, SB 

*Abbreviations: 

Data Commons: GDC: Genomic Data Commons; PDC: Proteomics Data Commons; IDC: Imaging Data Commons; ICDC: Integrated Canine Data Commons; CDS: Cancer Data Service;  CTDC: Clinical and Translational Data Commons

Cloud Resources: FC: FireCloud/Broad Institute; SB: Seven Bridges/Cancer Genomic Cloud from Velsera; ISB: Institute for Systems Biology/Cancer Gateway in the Cloud