Data Commons Framework


The National Cancer Institute’s Data Commons Framework Services (NCI DCFS) is a set of software services that makes it easier to develop, operate, and interoperate data commons, data clouds, knowledge bases, and other resources for managing, analyzing, and sharing cancer research data within the Cancer Research Data Commons (CRDC). NCI DCFS supports making data Findable, Accessible, Interoperable, and Reusable (FAIR). It is built on the Gen3 platform, which is developed by the Center for Translational Data Science at the University of Chicago.

The Gen3 data platform organizes data into projects and divides each project's data into two types: project data objects and project core data. Data objects are assigned globally unique identifiers (GUIDs), can be stored in one or more private and public clouds, and are accessed through Gen3 DCF services. Project core data can be structured with data models and enriched with controlled vocabularies and ontologies. Gen3 includes authentication (AuthN) and authorization (AuthZ) services so that controlled-access data can be included in nodes and so that collaborative and team science can be supported. Gen3 DCF services also make it possible to define extensible data models, import data that conforms to a data model, and query data against that model. Currently, the Gen3 authentication, authorization, and digital ID services are integrated into the NCI DCFS that supports the CRDC.
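
The split between data objects and core data described above can be illustrated with a small, self-contained Python sketch. It is not the Gen3 API: the class names, method names, bucket URLs, and the toy data model (a node type mapped to its required properties) are all assumptions made for illustration. It only shows the general idea of a GUID resolving to several cloud locations, and of core data being imported and queried against a data model.

```python
import uuid
from dataclasses import dataclass, field


@dataclass
class DataObject:
    """A project data object: one GUID, one or more storage locations."""
    guid: str
    locations: list  # hypothetical cloud URLs


@dataclass
class Project:
    """Hypothetical project holding data objects plus structured core data."""
    name: str
    data_model: dict  # assumed shape: node type -> set of required properties
    objects: dict = field(default_factory=dict)
    core_data: list = field(default_factory=list)

    def register_object(self, locations):
        """Assign a new GUID to an object stored at the given locations."""
        guid = str(uuid.uuid4())
        self.objects[guid] = DataObject(guid, list(locations))
        return guid

    def resolve(self, guid):
        """Return every known storage location for a GUID."""
        return self.objects[guid].locations

    def import_record(self, node_type, record):
        """Accept a core-data record only if it satisfies the data model."""
        missing = self.data_model[node_type] - set(record)
        if missing:
            raise ValueError(f"{node_type} record is missing {sorted(missing)}")
        self.core_data.append({"type": node_type, **record})

    def query(self, node_type, **filters):
        """Return core-data records of one type that match all filters."""
        return [
            r for r in self.core_data
            if r["type"] == node_type
            and all(r.get(k) == v for k, v in filters.items())
        ]


# Illustrative usage with made-up names and URLs.
project = Project("demo-project", {"case": {"submitter_id", "primary_site"}})
guid = project.register_object(
    ["s3://example-bucket/sample.bam", "gs://example-bucket/sample.bam"]
)
project.import_record("case", {"submitter_id": "case-001", "primary_site": "Lung"})
lung_cases = project.query("case", primary_site="Lung")
```

In this sketch the GUID stays the same no matter how many clouds hold a copy of the object, while core data is rejected at import time if it does not match the model. Access control, which Gen3 handles through its AuthN and AuthZ services, is deliberately left out.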