Cancer Data Service
(CDS)Overview
The Cancer Data Service (CDS) is one of several data commons within the Cancer Research Data Commons (CRDC).
The CDS provides data storage and sharing capabilities for NCI-funded studies that fall under the following categories:
• Studies with data that do not match an existing CRDC data commons
• Studies with data that do not fit current data type criteria and/or the minimum metadata standards for a CRDC data commons
CDS currently hosts a variety of data types from NCI projects such as the Human Tumor Atlas Network (HTAN), Division of Cancer Control and Population Sciences (DCCPS), and Childhood Cancer Data Initiative (CCDI) as well as data from independent research projects.
The CDS is home to both open and controlled access data, however, the CDS Portal is accessible for users to search and browse with no login. This allows users to see if the CDS has data of interest before requesting access.
Data Types
CDS is a data-type agnostic data commons. The CDS currently contains mostly genomic and some imaging data, but is open to accepting multiple data types.
Data Submission
Background information and links to instructions can be found on the CRDC Submit Data page.
Submitters will start the process by submitting the Data Submission Request found on the CRDC Submission Portal.
Contact the CRDC Help Desk for information and support.
Data Access
The CDS hosts datasets that contain controlled access data, with access permissions being controlled by dbGaP. Users should get access to controlled data through the Database for Genotypes and Phenotypes (dbGaP) provided by National Center for Biotechnology Information (NCBI).
Open-access data are publicly accessible; no approval is required.
Being a cloud repository, CDS does not facilitate direct downloads of data, owing to high data download charges. However, programs/initiatives can reach out to the CDS Help Desk to request downloads if they are willing to fund the download.
Learn more about searching the CDS data through the portal’s Access and Analysis page, which provides a step-by-step guide for searching according to various parameters: Study, Experimental Strategy, Gender, Tumor, and File Type, among others.
Data Analysis
Within the CDS portal, researchers can build a data manifest used to export data to the NCI-cloud resource, Seven Bridges Cancer Genomics Cloud, powered by Velsera (SB-CGC), which collaborates with the CDS in making data accessible for analysis. Learn more about the tools and resources available for analyzing CDS data within the secure environment provided by SB-CGC.
Datasets
Links are provided for the studies released on dbGaP and data access is available on SB-CGC
- Childhood Cancer Data Initiative (CCDI): Genomic Analysis in Pediatric Malignancies - phs002430.v1.p1
- DCCPS CIDR: The Role of Rare Coding Variation in Prostate Cancer in Men of African Ancestry - RESPOND Project 2 – phs002637.v1.p1
- CCDI: MCI - Molecular Characterization Initiative – phs002790.v6.p1
- TCGA WGS Variants Across 18 Cancer Types - phs003155.v1.p1
- CCDI: NCI CCSG CCDI Supplement Additional Genomic Submission - phs002599.v1.p1
- LCCC 1108: Development of a Tumor Molecular Analyses Program and Its Use to Support Treatment Decisions (UNCseqTM) - phs001713.v1.p1
- CPTAC Proteogenomic Study – phs001287.v17.p6
- DCCPS CIDR: The Genetic Basis of Aggressive Prostate Cancer: The Role of Rare Variation - phs001524.v2.p2
- CIDR: Discovery, Biology, and Risk of Inherited Variants in Glioma - phs002250.v1.p1
- Detection of Colorectal Cancer Susceptibility Loci Using Genome-Wide Sequencing - phs001554.v1.p1
- Discovery of colorectal cancer susceptibility genes in high-risk families - phs001787.v1.p1
- Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial - phs002011.v1.p1
- Whole genome sequencing to discover familial myeloma risk genes - phs001819.v2.p1
- Feasibility and Clinical Utility of Whole Genome Profiling in Pediatric and Young Adult Cancers - phs002620.v1.p1
- Molecular Pathological Epidemiology of Colorectal Cancer - phs002050.v2.p1
- Human Tumor Atlas Network - phs002371.v5.p1
- Human Tumor Atlas Network Imaging- phs002371_image
- NCI UCSF CCDI Database for the Advancement of JMML - Integration of Metadata with "Omic" Data - phs002504.v1.p1
- CCDI Data Sharing for Pediatric Cancers USC - phs003111.v1.p1
- CCDI Free the Data: Open Sharing of Comprehensive Genomic Childhood Cancer Datasets (Kansas) - phs002529.v1.p1
- Childhood Cancer Data Initiative (CCDI): OncoKids - NGS Panel for Pediatric Malignancies - phs002518.v1.p1
- Wistar PDX Development and Trial Center - phs002432.v1.p1
- Washington University PDX Development and Trial Center - phs002305.v1.p1
- University of Texas PDX Development and Trial Center Grant - phs001980.v2.p1
- Childhood Cancer Data Initiative (CCDI): Molecular Characterization across Pediatric Brain Tumors and Other Solid and Hematologic Malignancies for Research, Diagnostic, and Precision Medicine - phs002517.v2.p2
- HCI – PDX Trial Center for Breast Cancer Therapy - phs002479.v1.p1
- Pediatric Preclinical Testing Consortium (PPTC) PDX Data Models - phs001437.v2.p1
Anatomical Sites
CDS has data on the following tumors:
- Bone
- Brain
- Breast
- Colorectal
- Lung
- Ovarian
- Prostate