Connecting Data to Accelerate Cancer Research

The NCI Cancer Research Data Commons (CRDC) is a cloud-based data science infrastructure that provides secure access to a large, comprehensive, and expanding collection of cancer research data contributed by NCI- and NIH-funded researchers. Users can explore the data and analyze it in the cloud with analytical and visualization tools.

200K+ Subjects / Participants | 17PB+ Data | 533 Studies | 1.2K+ Public Tools and Workflows | 100K+ Unique Users / Year

  • CRDC INSIGHTS
    The Genomic Data Commons at the Ten-Year Mark

    In March 2026, the Genomic Data Commons (GDC) reached a major milestone: 233,000 unique visitors in a single month. Launched in 2016, the GDC was established to harmonize genomic data from several NCI-funded projects. Today, researchers worldwide rely on the GDC for access to more than 10 petabytes of data across 91 project datasets.  

  • CRDC INSIGHTS
    Data Sharing, Simplified: The CRDC Submission Portal at Two Years

    The NIH Data Management and Sharing (DMS) Policy sets clear expectations for NIH-funded research: scientific data should be made publicly accessible in a timely and responsible manner. The CRDC Submission Portal makes data sharing easier by providing a single, unified entry point for submitting data across multiple CRDC Data Commons. 

  • CRDC INSIGHTS
    CRDC Components: Updates June 2026

    The CRDC team, whether engaged in activities specific to the CRDC Data Commons, NCI Cloud Resource, or CRDC’s Core Services, remains focused on advancing its mission of making data and resources securely accessible to the cancer research community. The team has provided updates. 

Explore

Data Commons

Share, analyze, and visualize harmonized genomic data, including TCGA, TARGET, and CPTAC.

Share, analyze, and visualize proteomic data, such as CPTAC and The International Cancer Proteogenome Consortium (ICPC).

Share, analyze, and visualize multi-modal imaging data from both clinical and basic cancer research studies.

Share data from canine clinical trials, including the PRE-medical Cancer Immunotherapy Network Canine Trials (PRECINCT) and the Comparative Oncology Program.

Store and share data from NCI-funded Clinical and Translational Studies.

Host and share NCI data of multiple types that do not fit the other CRDC Data Commons.

Host and share NCI-funded population science data.

Core Resources and Services

Enables users to query and connect data distributed across the CRDC for integrative analysis.

Provides secure user authentication and authorization, as well as permanent digital object identifiers for data objects.

ABOUT CRDC

CRDC is built for researchers

  • Enable the cancer research community to share diverse data types
  • Provide secure access to data
  • Facilitate the generation of innovative tools
  • Adhere to FAIR principles of data stewardship: Findable, Accessible, Interoperable, and Reusable