National Cancer Institute Center for Biomedical Informatics and Information Technology

Explore Data

IN THIS SECTION
  • Overview
  • Exploring Data using Data Commons
  • Aggregated Exploration Across Data Commons
  • Data Exploration through the CRDC Cloud Resources
  • CRDC Core Standards and Services

Overview

Many large NCI-funded research projects share their rich multi-modal data with the public via the Cancer Research Data Commons (CRDC). Users can explore CRDC data in several ways: through the portals of the individual Data Commons, through the Cancer Data Aggregator API and notebooks, and through the CRDC Cloud Resources.

Among the many NCI-funded projects that share their data via the CRDC are:

  • APOLLO: Applied Proteogenomics Organizational Learning and Outcomes Network 
  • CCDI: Childhood Cancer Data Initiative 
  • CPTAC: Clinical Proteomic Tumor Analysis Consortium 
  • HTAN: Human Tumor Atlas Network
  • TARGET: Therapeutically Applicable Research to Generate Effective Treatments 
  • TCGA: The Cancer Genome Atlas

Datasets: Learn more about CRDC-hosted datasets.

Exploring Data using Data Commons

Each data commons provides a search interface to explore data by demographics, site of disease, or the name of a specific study, among other variables. Users can explore data from across multiple programs and initiatives, and can build cross-cutting “virtual” cohorts for aggregated analysis. For further analysis in a cloud-based compute environment, users can build a data manifest to pull cohort data into one of the NCI-funded Cloud Resources.  

In addition to this general exploration, many data commons provide data visualization and other analytical tools within the data portal environment. 
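
The cohort-and-manifest workflow described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the records, the field names (`case_id`, `program`, `primary_site`, `file_id`), and the helper functions `build_cohort` and `build_manifest` are hypothetical and do not reflect the schema, manifest format, or API of any particular data commons.

```python
import csv
import io

# Hypothetical, made-up records standing in for portal search results.
# Field names are illustrative assumptions, not a real CRDC schema.
RECORDS = [
    {"case_id": "case-001", "program": "TCGA", "primary_site": "Lung", "file_id": "file-a"},
    {"case_id": "case-002", "program": "CPTAC", "primary_site": "Lung", "file_id": "file-b"},
    {"case_id": "case-003", "program": "TCGA", "primary_site": "Breast", "file_id": "file-c"},
    {"case_id": "case-002", "program": "CPTAC", "primary_site": "Lung", "file_id": "file-d"},
]


def build_cohort(records, **criteria):
    """Return the records whose fields match every key/value in criteria."""
    return [r for r in records if all(r.get(k) == v for k, v in criteria.items())]


def build_manifest(cohort):
    """Serialize the cohort's files as tab-separated text with a header row."""
    buf = io.StringIO()
    writer = csv.writer(buf, delimiter="\t", lineterminator="\n")
    writer.writerow(["file_id", "case_id", "program"])
    for r in cohort:
        writer.writerow([r["file_id"], r["case_id"], r["program"]])
    return buf.getvalue()


# A cross-cutting "virtual" cohort: lung cases drawn from more than one program.
lung_cohort = build_cohort(RECORDS, primary_site="Lung")
manifest = build_manifest(lung_cohort)
print(manifest)
```

In practice, each data commons portal produces its own manifest format, and each Cloud Resource documents how to import it; the sketch only shows the general idea of filtering across programs and then listing the selected files.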

Data Commons: Learn more about each Data Commons.

Aggregated Exploration Across Data Commons

The Cancer Data Aggregator (CDA) combines descriptive information about CRDC-housed data into a common model, making it possible to search across multiple data commons using variables such as participant, sample, tissue, or disease.

While anyone can browse the CDA’s indexed metadata, researchers who want to work with the underlying controlled-access data files (as opposed to metadata) must still apply for the appropriate access.

Cancer Data Aggregator: Learn more about the Cancer Data Aggregator.

Data Exploration through the CRDC Cloud Resources

The CRDC Cloud Resources (CRs) also serve as entry points for exploring CRDC data. Three NCI-funded CRs, each with distinct features, provide secure workspaces and the ability to use or tailor publicly available analytical tools and workflows from their platforms. 

A key benefit of using the CRs is that users can access the data in the cloud without first downloading large volumes to a local compute environment, which can incur high download costs.

Cloud Resources: Learn more about the Cloud Resources.

CRDC Core Standards and Services

To ensure that CRDC-housed data meet the FAIR standards (Findable, Accessible, Interoperable, and Reusable), data must be organized, stored, and searchable based on common standards and terms. A suite of core data standards and services related to data tracking and secure access provides essential support to the CRDC data ecosystem.

CRDC Standards and Services: Learn more about CRDC standards and services.
