The NCI CRDC provides access to a variety of open and controlled datasets from NCI programs and key external cancer programs. Key datasets include:
Dataset Name | Description | Accessible Through* |
---|---|---|
The Cancer Genome Atlas (TCGA) | A collaboration between NCI and the National Human Genome Research Institute (NHGRI) that has characterized tumor and normal tissues from 11,000 patients, covering 33 cancer types | GDC, Broad, SB, ISB, IDC |
Therapeutically Applicable Research to Generate Effective Treatments (TARGET) | A consortium of extramural and NCI investigators working to characterize and understand hard-to-treat childhood cancers and translate findings into the clinic | GDC, Broad, SB, ISB |
Clinical Proteomic Tumor Analysis Consortium (CPTAC) | A national effort to accelerate the understanding of the molecular basis of cancer through the application of large-scale proteome and genome analysis, or proteogenomics | GDC, PDC, Broad, ISB, SB, IDC |
Human Cancer Model Initiative (HCMI) | An international consortium that is generating novel, next-generation, tumor-derived culture models complete with genomic and clinical data | GDC, SB, ISB |
Cancer Genome Characterization Initiatives (CGCI) | An initiative examining genomes, exomes, and transcriptomes of various types of adult and pediatric cancers | GDC, SB, ISB |
Foundation Medicine (FM) | Targeted sequencing data from ~18,000 adult patients generated by the Foundation Medicine Inc., molecular information company seeking to match patients with personalized treatment plans | GDC, SB, ISB |
Multiple Myeloma Research Foundation (MMRF) | Data from nearly 1,000 patients with extensive molecular and clinical data, including longitudinal information collected over the course of disease for many patients | GDC, SB, ISB |
Genomics Evidence Neoplasia Information Exchange (GENIE) | Over 44,000 cases from the international pan-cancer registry continuing to be collected by the American Association for Cancer Research (AACR) initiative | GDC, SB, ISB |
International Cancer Proteogenomic Consortium (ICPC) | An international consortium that brings together more than a dozen countries to study the application of proteogenomic analysis in predicting cancer treatment success and to share data and results with researchers worldwide, hastening progress for patients | PDC, SB, ISB |
Children's Brain Tumor Tissue Consortium (CBTTC) | A collaborative research consortia focused on identifying therapies for children with brain tumors | PDC, SB, ISB |
Genetics and Epidemiology of Colorectal Cancer Consortium (GECCO) | A research collaboration to detect colorectal cancer susceptibility loci using genome-wide sequencing | CDS, SB |
Comparative molecular life history of spontaneous canine and human gliomas (GLIOMA01) | Characterization of the genomic and transciptomic landscape of canine glioma to enable cross-species comparative genomic analysis of sporadic glioma | ICDC, SB |
Childhood Cancer Data Initiative (CCDI) | Pediatric clinical care and research data generated by children’s hospitals, clinics, or networks | CDS, SB |
Human Tumor Atlas Network (HTAN) | 3-dimensional atlases of the cellular, morphological, molecular features of human cancers over time, based on a diverse cancer patient population, including minority and underserved patients, as well as individuals with high-risk hereditary tumors | CDS, IDC, SB |
Applied Proteogenomics Organizational Learning and Outcomes (APOLLO) | Data from a collaboration between NCI, the Department of Defense (DoD), and the Department of Veterans Affairs (VA) that incorporates proteogenomic data into patient care, with a focus on the activity and expression of the proteins that the genome encodes | GDC, PDC, SB |
*Abbreviations:
Repositories: GDC: Genomic Data Commons; PDC: Proteomics Data Commons; IDC: Imaging Data Commons; ICDC: Integrated Canine Data Commons; CDS: Cancer Data Service
Cloud Resources: Broad: Broad Institute FireCloud; SB: Seven Bridges/Cancer Genomic Cloud; ISB: Institute for Systems Biology/Cancer Gateway in the Cloud
Users can bring their own data to combine with the existing data to perform novel analyses through the NCI Cloud Resources.