How to Submit Data

The Cancer Research Data Commons (CRDC) is NCI’s primary data science platform, housing data from large NCI-funded programs and projects, including The Cancer Genome Atlas (TCGA), the Clinical Proteomic Tumor Analysis Consortium (CPTAC), and the Human Tumor Atlas Network (HTAN), among others.

Researchers can submit a data submission request to any CRDC repository. Each request is considered subject to that repository’s governance guidelines.

Below is a summary of the types of data each repository considers, with each repository's submission link or contact address for data submission requests. A more detailed description of each repository follows further down.

| Repository / Website | Type of Data | Contact |
| --- | --- | --- |
| Genomic Data Commons (GDC) | Harmonized genomic data | https://gdc.cancer.gov/submit-data/requesting-data-submission |
| Proteomic Data Commons (PDC) | Mass spectrometry data | https://pdc.cancer.gov/pdc/submit-data |
| Imaging Data Commons* (IDC) | Radiology, Digital Pathology, Cellular Imaging, etc. | support+submissions@canceridc.dev |
| Integrated Canine Data Commons (ICDC) | Genomic, Genotyping array data, clinical trial data, etc. from canine studies | https://caninecommons.cancer.gov/#/submit |
| Cancer Data Service (CDS) | Genomic: Non-harmonized data; Proteomic: Reverse-phase protein arrays, etc.; Other types of cancer research data: See notes above. | CDSHelpDesk@nih.gov |

If your cancer research data do not fit any of the categories above, please contact the CRDC helpdesk at NCICRDC@mail.nih.gov.

*All clinical imaging data must be de-identified through The Cancer Imaging Archive (TCIA) before being stored in the Imaging Data Commons (IDC). The process is explained on the TCIA site.
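For groups that track submissions in scripts, the routing information above can be captured as a small lookup. The following is only a minimal sketch: the names `Repository`, `REPOSITORIES`, `CRDC_HELPDESK`, and `submission_contact` are hypothetical and not part of any CRDC tool, and the entries simply restate the links and addresses listed on this page. It assumes that an unrecognized repository abbreviation should fall back to the CRDC helpdesk.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Repository:
    """One row of the submission table (hypothetical helper type)."""
    name: str
    data_types: str
    contact: str


# Entries copied from the table on this page; nothing here is fetched live.
REPOSITORIES = {
    "GDC": Repository(
        "Genomic Data Commons",
        "Harmonized genomic data",
        "https://gdc.cancer.gov/submit-data/requesting-data-submission",
    ),
    "PDC": Repository(
        "Proteomic Data Commons",
        "Mass spectrometry data",
        "https://pdc.cancer.gov/pdc/submit-data",
    ),
    "IDC": Repository(
        "Imaging Data Commons",
        "Radiology, digital pathology, cellular imaging, etc.",
        "support+submissions@canceridc.dev",
    ),
    "ICDC": Repository(
        "Integrated Canine Data Commons",
        "Genomic, genotyping array, and clinical trial data from canine studies",
        "https://caninecommons.cancer.gov/#/submit",
    ),
    "CDS": Repository(
        "Cancer Data Service",
        "Non-harmonized genomic data, reverse-phase protein arrays, other data",
        "CDSHelpDesk@nih.gov",
    ),
}

# Fallback contact for data that fit none of the categories above.
CRDC_HELPDESK = "NCICRDC@mail.nih.gov"


def submission_contact(abbreviation: str) -> str:
    """Return the submission link or address for a repository abbreviation.

    Unknown abbreviations fall back to the CRDC helpdesk (an assumption
    of this sketch, mirroring the guidance on this page).
    """
    repo = REPOSITORIES.get(abbreviation.strip().upper())
    return repo.contact if repo is not None else CRDC_HELPDESK
```

For example, `submission_contact("pdc")` returns the PDC submission link, while an abbreviation that is not in the table returns the helpdesk address. Imaging submitters would still need to complete TCIA de-identification first, which this sketch does not model.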

 

Learn More About the CRDC Repositories

The following repositories are currently accessible within the CRDC.

Genomic Data Commons

The Genomic Data Commons (GDC) is a unified repository for genomic, clinical, and biospecimen data from cancer research programs. The GDC includes data from The Cancer Genome Atlas (TCGA); its pediatric equivalent, the Therapeutically Applicable Research to Generate Effective Treatments (TARGET) program; Foundation Medicine (FMI); the Cancer Cell Line Encyclopedia (CCLE); and a growing number of other sources.

Proteomic Data Commons

The Proteomic Data Commons (PDC) offers access to highly curated and standardized biospecimen, clinical, and proteomic data. The PDC includes data from the Clinical Proteomic Tumor Analysis Consortium (CPTAC) program and will grow to include other sources over time. The data are open access and can be browsed interactively using a series of filters or accessed through an API.

Imaging Data Commons

The Imaging Data Commons (IDC) provides cloud-based access to a wide variety of imaging data sets from The Cancer Imaging Archive and other NCI projects. Because it is connected to a range of analytical tools, researchers and data scientists can train and explore imaging models without downloading data.

Integrated Canine Data Commons

The Integrated Canine Data Commons (ICDC) includes clinical, biospecimen, genomics, proteomics, and imaging data from canine patients with naturally occurring cancer. Researchers can explore multiple types and collections of open-access data directly through the portal or through one of NCI's Cloud Resources.

Cancer Data Service

The Cancer Data Service (CDS) is a repository for cancer research data that are generated by NCI-funded programs but do not meet the submission criteria for a specific CRDC Data Repository. This repository accommodates:

  • data that do not meet the minimum metadata standards for submission to another CRDC Repository,
  • studies that are on a waiting list for submission to a specific CRDC Repository (e.g., GDC), and
  • studies that are in progress and need a place to store and analyze data during the acquisition phase.

Read more about NCI’s Data Sharing and Public Access Policies.