Clinical and Translational Data Commons

Unleashing the power of clinical studies data to advance cancer research


The Clinical and Translational Data Commons (CTDC) is being developed to accelerate scientific discoveries that improve cancer outcomes and help people live longer, healthier lives. The CTDC supports cancer research by sharing data from NCI-funded clinical studies, with features including:

  • Data Exploration dashboard to quickly visualize and refine searches of aggregated public-access data,
  • Multiple data types including clinical (pdf, csv) and molecular/sequencing data (bam/bai, vcf, tsv),
  • Data harmonization across studies through alignment with NCI’s cancer Data Standards Registry and Repository (caDSR) Common Data Elements (CDEs),
  • Data visualization and analysis on the cloud via Seven Bridges Cancer Genomics Cloud,
  • Developer resources including a Graphical User Interface (GUI) and an Application Programming Interface (API), and
  • Federated identity management system for easy, secure access to registered- and controlled-access data.

Data Types

In addition to the metadata for filtering and sorting, the CTDC contains multiple data file types available for analysis.

  • Clinical data and reports – txt, pdf
  • Molecular findings and sequence annotation – vcf, bam
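
As a rough illustration of how downloaded files might be sorted into these two groups, the sketch below maps the file extensions named on this page to a category. The dictionary and the function names `categorize` and `group_by_category` are hypothetical helpers written for this example. They are not part of the CTDC API, and the file names in the usage note are made up.

```python
from pathlib import PurePosixPath

# Extensions are the ones named on this page. The mapping itself is an
# illustrative assumption, not an official CTDC classification.
EXTENSION_CATEGORIES = {
    "pdf": "clinical",
    "csv": "clinical",
    "txt": "clinical",
    "bam": "molecular",
    "bai": "molecular",
    "vcf": "molecular",
    "tsv": "molecular",
}


def categorize(filename: str) -> str:
    """Return 'clinical', 'molecular', or 'unknown' based on the final extension."""
    suffix = PurePosixPath(filename).suffix.lower().lstrip(".")
    return EXTENSION_CATEGORIES.get(suffix, "unknown")


def group_by_category(filenames):
    """Group file names into a dict keyed by category, preserving input order."""
    groups = {}
    for name in filenames:
        groups.setdefault(categorize(name), []).append(name)
    return groups
```

For example, `group_by_category(["case_report.pdf", "tumor.bam", "notes.docx"])` would place the first file under `"clinical"`, the second under `"molecular"`, and the third under `"unknown"`. Only the last extension is inspected, so a compressed file such as `calls.vcf.gz` would also land in `"unknown"` unless the mapping is extended.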


The CTDC will release datasets from studies across NCI on an ongoing basis, including:

Cancer Moonshot Biobank (first release Fall 2023)
The Cancer Moonshot Biobank is collecting tissue, blood samples, and medical information from cancer patients throughout their treatment to help researchers better understand how cancer changes over time and in response to medical treatments.

This section will be updated as more datasets are added.  

Explore Clinical and Translational Data Commons