AACR Cancer Research Series on NCI Cancer Research Data Commons

A four-part manuscript series published in March 2024, in Cancer Research, one of the flagship journals of the American Association for Cancer Research (AACR), highlights the CRDC’s accomplishments from the past 10 years. The series lead authors and editors include: Anthony (Tony) Kerlavage, Jill S. Barnholtz-Sloan, Tanja Davidsen, Erika Kim, David Pot, Arthur Brady, Erin Beck, Heather Creasy, and Zhining Wang.

AACR Cancer Research Series

"As the CRDC enters its tenth year, we are showcasing how the CRDC empowers researchers. This manuscript series details how to take full advantage of its resources."

—Jill Barnholtz-Sloan, Associate Director, CBIIT Informatics and Data Science Program  

“In just ten years, the CRDC has proven itself as an important source to the broad cancer research community. This manuscript series in Cancer Research highlights how far we have come in a relatively short time, and points to the opportunities ahead as we continue to serve a foundational role in NCI's cancer data ecosystem."

—Tanja Davidsen, Chief, CBIIT Data Ecosystems Branch

 

Follow the four part series

1 lessons learned and future state

Traces the history of the CRDC over the past 10 years, noting its progress in providing access to data and tools along with training and outreach to support the cancer research community. This review also provides an assessment of the CRDC’s impact, lessons learned, and future plans to promote data sharing, data accessibility, interoperability, and reuse.

Read Part One  

2 Resources to share key Cancer Data

Describes each of the CRDC's data commons, including their unique and shared features, accomplishments, and challenges. This paper also details how the CRDC data commons implement Findable, Accessible, Interoperable, Reusable (FAIR) principles and promote data sharing in support of the NIH Data Management and Sharing Policy. 
 

Read Part Two  

3 Cloud-Based Analytical Resources

Details how the three Cloud Resources (CRs), including the Broad Institute FireCloud, Institute for Systems Biology Cancer Gateway in the Cloud (ISB-CGC), and Seven Bridges’ Cancer Genomics Cloud powered by Velsera (SB-CGC) provide access to large, cloud-hosted multi-modal cancer datasets, as well as offer tools and workspaces for performing data analysis where the data resides. Included is a review of publicly available analytical tools.

Read Part Three  

4 Core Standards and Services

Outlines core CRDC services to aggregate descriptive information from multiple studies for findability via a single interface. These standards and services aggregate and semantically harmonize multiple data types making the CRDC a single point of discovery and access for cancer research data originating from multiple sources. They also facilitate the evolution of the CRDC as one hub for managing, storing, and sharing diverse types of data.  
 

Read Part Four