The National Cancer Institute (NCI) will deploy an integrating biomedical informatics infrastructure, the cancer Biomedical Informatics Grid (caBIGTM), to expedite the cancer research community's access to key bloinformatics platforms. In partnership with the cancer research community, the NCI is creating a common, extensible informatics platform that integrates diverse data types and supports interoperable analytic tools. This platform will allow research groups to tap into the rich collection of emerging cancer research data while supporting their individual investigations.
This Integrative Cancer Research Workspace will provide tools and systems to enable integration and sharing of information among cancer researchers. These tools will facilitate the integration of data not only from different centers, but also data of different types, thereby enabling translational and integrative research. These tools also provide for the integration of clinical and basic research data. The Workspace is tasked to develop a well-documented and validated toolset for use throughout the cancer research community. Workspace activities will include platforms and standards to facilitate the sharing of datasets and repositories, and those appropriate for testing the caBIGTM infrastructure are being asked to participate. A major goal of this workspace will be a demonstration of how a shared informatics platform can allow a comprehensive, federated grid of information to be made available to the cancer research community.
The main goal of the Integrative Cancer Research Workspace is to assemble data, tools, and infrastructure that facilitate the cross silo use of cancer biology information to promote integrated cancer research. Working towards this goal, the NCICB is developing an integrative application framework, known as calntegrator, designed to facilitate cross data analysis in support of ongoing cancer research.
This SOW is intended to cover three major projects/initiatives at the NCI Center for Bioinformatics:
Computational Portal and Analysis System (CPAS), Cancer Genetic Markers of Susceptibility
(CGEMS) and calntegrator.
The following high level objectives are intended to be achieved by the work described herein.
CPAS: The Computational Portal and Analysis System is part of deliverable from the mouse
Biomarker Discovery Project (BDI
CGEMS: The Cancer Genetic Markers of Susceptibility project is an effort to identify germ-line singleżnucleotide polymorphisms (SNPs) that correlate with disease. As currently scoped, the CGEMS project will conduct two genotyping studies: one for prostate cancer and one fOr breast cancer. For each study, 1200 cases and 1200 controls will be genotyped for 500,000 SNPs. The goal is to identify SNPs that predispose a person to disease.
The CGEMS project is coordinated by the Office of Cancer Genomics (OCG), Division of Cancer Epidemiology and Genetics (DCEG), and the Core Genotyping Facility (CGF). Initial analysis of the data will be performed by CGF. All data to be made public will be delivered to NCICB. NCICB is responsible for long-term storage and presentation of the data through a publicly-accessible portal. This will consist of:
(a) a caBlG object API to the data,
(b) a web interface allowing query by genomic region, significance of association with disease, etc. NCICB expects to store the data in an instance of the Clinical Genomics Object Model (CGOM), expanded to accommodate data types needed for CGEMS
calntegrator:Calntegrator is a translational informatics platform that allows researchers and bioinformaticians to access and analyze clinical and experimental data from clinical trials and studies. The calntegrator framework provides a mechanism for integrating and aggregating biomedical research data and provides access to a variety of data types (e.g. lmmunohistochemistry (I HC), microarray-based gene expression, SN Ps [Single nucleotide polymorphisms], clinical trials data etc.) in a cohesive fashion. |