Connecting Data to Accelerate Cancer Research

The NCI Cancer Research Data Commons (CRDC) is a cloud-based data science infrastructure that provides secure access to a large, comprehensive, and expanding collection of cancer research data. Users can explore and use analytical and visualization tools for data analysis in the cloud.

134K+ Subjects / Participants 82.3K+ Data 354 Studies 52M Files 9.4PB+ Unique Users / Year
354 Studies 134K+ Subjects / Participants 52M Files 9.4PB+ Unique Users / Year 82.3K+ Data
  • Latest Blog
    Using Cancer Data Science to Advance Your Research

    How can data science support your cancer research? Explore this helpful quick start guide to find out! We’ll show you an overview of how data science enhances cancer research and how you can get started applying it to your work.

  • CRDCInsights
    CRDC Resources in the Classroom

    Min Zhang, MD, PhD, a professor at Purdue University, is passionate about developing new statistical approaches for high-dimensional data involved in biological research. She often develops lesson plans and workshops for students centered around statistical methods and bioinformatic analysis with a focus on cancer data.

  • Latest Blog
    Using Cancer Data Science to Advance Your Research

    How can data science support your cancer research? Explore this helpful quick start guide to find out! We’ll show you an overview of how data science enhances cancer research and how you can get started applying it to your work.

  • CRDCInsights
    CRDC Resources in the Classroom

    Min Zhang, MD, PhD, a professor at Purdue University, is passionate about developing new statistical approaches for high-dimensional data involved in biological research. She often develops lesson plans and workshops for students centered around statistical methods and bioinformatic analysis with a focus on cancer data.

Explore

Data Commons

Store and share NCI-funded data that are not hosted elsewhere to further advance scientific discovery across a broad range of research areas.

Store and share data from NCI Clinical Studies. The resource is expected to launch soon, pending release of clinical data.

Share, analyze, and visualize harmonized genomic data, including TCGA, TARGET, and CPTAC.

Share, analyze, and visualize multi-modal imaging data from both clinical and basic cancer research studies.

Share data from canine clinical trials, including the PRE-medical Cancer Immunotherapy Network Canine Trials (PRECINCT) and the Comparative Oncology Program.

Share, analyze, and visualize proteomic data, such as CPTAC and The International Cancer Proteogenome Consortium (ICPC).

Infrastructure

Enables users to query and connect data distributed across the CRDC for integrative analysis.

Provides secure user authentication and authorization and permanent digital object identifiers for data objects.

Provides services to facilitate interoperability of data across CRDC.

Cloud Resources

Access NCI-funded datasets TARGET and TCGA along with a rich collection of other datasets and collaborative projects that are part of the biomedical ecosystem. Run analysis tools at scale and collaborate securely on a scalable cloud environment.

Access data sets using fully interactive web-based applications, including BigQuery, which is hosted on Google Cloud Platform.

Explore and analyze large datasets alongside secure and scalable analytical resources for large-scale computational research.

ABOUT CRDC

CRDC is built for researchers

  • Enable the cancer research community to share diverse data types
  • Provide secure access to data
  • Facilitate the generation of innovative tools
  • Adhere to FAIR principles of data stewardship: Findable, Accessible, Interoperable, and Reusable