Repository

Cancer Data Service

(CDS)
Enabling secure and flexible storage and sharing of data
CDS

Overview

The Cancer Data Service (CDS) is one of several data commons within the Cancer Research Data Commons (CRDC).

The CDS provides data storage and sharing capabilities for NCI-funded studies that fall under the following categories:
•    Studies with data that do not match an existing CRDC data commons 
•    Studies with data that do not fit current data type criteria and/or the minimum metadata standards for a CRDC data commons 

CDS currently hosts a variety of data types from NCI projects such as the Human Tumor Atlas Network (HTAN), Division of Cancer Control and Population Sciences (DCCPS), and Childhood Cancer Data Initiative (CCDI) as well as data from independent research projects.  

The CDS is home to both open and controlled access data, however, the CDS Portal is accessible for users to search and browse with no login. This allows users to see if the CDS has data of interest before requesting access.  

Data Types

CDS is a data-type agnostic data commons. The CDS currently contains mostly genomic and some imaging data, but is open to accepting multiple data types. 


Data Submission

Submitters will find what they need on the CDS portal's Data Submission page, including a downloadable CDS User Guide, with a step-by-step outline of the data submission process.  

Submitters will start the process by submitting the Data Submission Questionnaire

Contact the CDS Help Desk for information and support.
 


Data Access

The CDS hosts datasets that contain controlled access data, with access permissions being controlled by dbGaP. Users should get access to controlled data through the Database for Genotypes and Phenotypes (dbGaP) provided by National Center for Biotechnology Information (NCBI).  

Open-access data are publicly accessible; no approval is required.

Being a cloud repository, CDS does not facilitate direct downloads of data, owing to high data download charges. However, programs/initiatives can reach out to the CDS Help Desk to request downloads if they are willing to fund the download.  

Learn more about searching the CDS data through the portal’s Access and Analysis page, which provides a step-by-step guide for searching according to various parameters: Study, Experimental Strategy, Gender, Tumor, and File Type, among others.  
 


Data Analysis

Within the CDS portal, researchers can build a data manifest used to export data to the NCI-cloud resource, Seven Bridges Cancer Genomics Cloud, powered by Velsera (SB-CGC), which collaborates with the CDS in making data accessible for analysis. Learn more about the tools and resources available for analyzing CDS data within the secure environment provided by SB-CGC.   

Datasets

Links are provided for the studies released on dbGaP and data access is available on SB-CGC

  • Childhood Cancer Data Initiative (CCDI): Genomic Analysis in Pediatric Malignancies - phs002430.v1.p1
  • DCCPS CIDR: The Role of Rare Coding Variation in Prostate Cancer in Men of African Ancestry - RESPOND Project 2 – phs002637.v1.p1
  • CCDI: MCI - Molecular Characterization Initiative – phs002790.v5.p1
  • TCGA WGS Variants Across 18 Cancer Types - phs003155.v1.p1
  • CCDI: NCI CCSG CCDI Supplement Additional Genomic Submission - phs002599.v1.p1
  • LCCC 1108: Development of a Tumor Molecular Analyses Program and Its Use to Support Treatment Decisions (UNCseqTM) - phs001713.v1.p1
  • CPTAC Proteogenomic Study – phs001287.v17.p6
  • DCCPS CIDR: The Genetic Basis of Aggressive Prostate Cancer: The Role of Rare Variation - phs001524.v2.p2
  • CIDR: Discovery, Biology, and Risk of Inherited Variants in Glioma - phs002250.v1.p1
  • Detection of Colorectal Cancer Susceptibility Loci Using Genome-Wide Sequencing - phs001554.v1.p1
  • Discovery of colorectal cancer susceptibility genes in high-risk families - phs001787.v1.p1
  • Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial - phs002011.v1.p1
  • Whole genome sequencing to discover familial myeloma risk genes - phs001819.v2.p1
  • Feasibility and Clinical Utility of Whole Genome Profiling in Pediatric and Young Adult Cancers - phs002620.v1.p1
  • Molecular Pathological Epidemiology of Colorectal Cancer - phs002050.v2.p1
  • Human Tumor Atlas Network - phs002371.v3.p1
  • NCI UCSF CCDI Database for the Advancement of JMML - Integration of Metadata with "Omic" Data - phs002504.v1.p1
  • CCDI Data Sharing for Pediatric Cancers USC - phs003111.v1.p1
  • CCDI Free the Data: Open Sharing of Comprehensive Genomic Childhood Cancer Datasets (Kansas) - phs002529.v1.p1
  • Childhood Cancer Data Initiative (CCDI): OncoKids - NGS Panel for Pediatric Malignancies - phs002518.v1.p1
  • Wistar PDX Development and Trial Center - phs002432.v1.p1
  • Washington University PDX Development and Trial Center - phs002305.v1.p1
  • University of Texas PDX Development and Trial Center Grant - phs001980.v2.p1
  • Childhood Cancer Data Initiative (CCDI): Molecular Characterization across Pediatric Brain Tumors and Other Solid and Hematologic Malignancies for Research, Diagnostic, and Precision Medicine - phs002517.v1.p1
     

 

Anatomical Sites

CDS has data on the following tumors:

  • Bone
  • Brain
  • Breast
  • Colorectal
  • Lung
  • Ovarian
  • Prostate
Learn more about CDS and how to access data