CRDC Insights

Updates from the Cancer Research Data Commons:
Empowering the Scientific Community to Make New Discoveries

CRDC Features and Resources

September 07, 2024

Data Commons 

  • Genomic Data Commons (GDC)
    • Earlier this spring, the GDC team added a series of video tutorials to their support page focused on building cohorts and using analysis tools for performing both gene-level variant analysis and clinical data analysis. Find these tutorials on the GDC portal
    • The GDC 2.2 McClintock Release, which offers many new features and a new Gene Expression API, was launched in June. This release signifies the official launch of GDC 2.0 and the retirement of the legacy GDC Data Portal (1.0). Note: GDC releases are named alphabetically in honor of distinguished scientists and their significant research contributions. Read more about the GDC 2.2 McClintock Release, which was named after Barbara McClintock.  
  • Integrated Canine Data Commons (ICDC):
    • A library of tutorials has been posted through ICDC’s GitHub. These tutorials demonstrate how to explore data within the ICDC and conduct analysis using the connected cloud resource, the Seven Bridges-Cancer Genomics Cloud (SB-CGC), powered by Velsera. Access the ICDC tutorials.   

Cloud Resources 

  • ISB-CGC 
    • The CRDC Cancer Data Aggregator (CDA) is now accessible through ISB-CGC BigQuery tables. ISB-CGC and the CDA have developed a Google Colab notebook demonstrating  how researchers can use the CDA to build cohorts of patients with data across multiple CRDC Data Commons, and then locate, analyze, and view that data in ISB’s Google BigQuery columnar data tables. The notebook can be found under Code in the Cloud on the CDA page at https://cda.readthedocs.io/.
    • The team presented to the NCI Bioinformatics Training & Education Program on their approach to performing data exploration and analyzing CRDC derived data in Google Cloud’s BigQuery. Access the ISB-CGC BigQuery presentation
  • SB-CGC
    • The software suite, SAS/SAS Studio, is now available for data management and analytics. Researchers can use SAS to manage and mine data, while also providing point-and-click user interfaces that guide a researcher through an analytical process. Users can now seamlessly manage, analyze, and visualize data, all within the user-friendly environment of SAS Studio. 
      • The SB-CGC Knowledge Center has a page with SAS documentation.  
      • SB-CGC users with a login can find onboarding tutorials about SAS. Note that setting up a login is easy and free.  
      • An introductory video from SAS about SAS Studio is available. 
    • The team participated in the recent CBIIT WebEx, providing a general overview of their platform and a review of some of their publicly available analytical tools. They also gave a demonstration. Access that part of the WebEx