This article's use of external links may not follow Wikipedia's policies or guidelines.(June 2019) |
Developer(s) | NCI's Center for Biomedical Informatics and Information Technology (CBIIT), The Ohio State University Research Foundation, The University of Chicago - Argonne National Laboratory, Duke University, Booz Allen Hamilton, SemanticBits LLC, Ekagra Software Technologies |
---|---|
Type | Grid computing, Web service |
License | BSD 3-Clause |
Website | cagrid |
The cancer Biomedical Informatics Grid (caBIG) was a US government program to develop an open-source, open access information network called caGrid for secure data exchange on cancer research. The initiative was developed by the National Cancer Institute (part of the National Institutes of Health) and was maintained by the Center for Biomedical Informatics and Information Technology (CBIIT) and program managed by Booz Allen Hamilton. In 2011 a report on caBIG raised significant questions about effectiveness and oversight, and its budget and scope were significantly trimmed. In May 2012, the National Cancer Informatics Program (NCIP) was created as caBIG's successor program.
The National Cancer Institute (NCI) of the United States funded the cancer Biomedical Informatics Grid (caBIG) initiative in spring 2004, headed by Kenneth Buetow. [1] Its goal was to connect US biomedical cancer researchers using technology known as grid computing. The program, led by the Center for Bioinformatics and Information Technology (CBIIT), began with a 3-year pilot phase. The pilot phase concluded in March 2007, and a trial was announced. [2] Buetow promoted the program in 2008. [1] [3]
In addition to caGrid, the underlying infrastructure for data sharing among organizations, caBIG developed software tools, data sharing policies, and common standards and vocabularies to facilitate data sharing.
Software tools targeted:
caBIG sought to provide foundational technology for an approach to biomedicine it called a “learning healthcare system.” [4] This relies on the rapid exchange of information among all sectors of research and care, so that researchers and clinicians are able to collaboratively review and accurately incorporate the latest findings into their work. The ultimate goal was to speed the biomedical research process. It was also promoted for what is often called Personalized Medicine. caBIG technology was used in adaptive clinical trials such as the Investigation of Serial studies to Predict Your Therapeutic Response with Imaging and molecular AnaLysis 2 (I-SPY2), which was designed to use biomarkers to determine the appropriate therapy for women with advanced breast cancer. [5]
Health information technology (HIT) was promoted for management and secure exchange of medical information among researchers, health care providers, and consumers. HIT initiatives mentioning caBIG were: NCI and the American Society of Clinical Oncology initiated a collaboration to create an oncology-specific electronic health record system using caBIG standards for interoperability and that will enable oncologists to manage patient information in an electronic format that accurately captures the specific interventional issues unique to oncology. The Nationwide Health Information Network was an initiative to share patient clinical data across geographically disparate sources and create electronically linked national health information exchange. It might be somehow related.
A BIG Health Consortium was formed in 2008 to promote personalized medicine, but disbanded in 2012. [6] In July 2009, caBIG announced a collaboration with the Dr. Susan Love Research Foundation to build an online cohort of women willing to participate in clinical trials. [7] Called the Army of Women, it had a goal of one million in its database; by December 2009 the site was "launched", and about 30,000 women and men signed up by 2010. [8]
The Cancer Genome Atlas aimed to characterize more than 10,000 tumors across at least 20 cancers by 2015. caBIG provided connectivity, data standards, and tools to collect, organize, share, and analyze the diverse research data in its database. Since 2007, NCI worked with UK National Cancer Research Institute (NCRI). The two organizations shared technologies for collaborative research and the secure exchange of research data using caGrid and the NCRI Oncology Information Exchange (ONIX) web portal announced in August 2009. [9] ONIX shut down in March 2012. [10] The Duke Cancer Institute used caBIG clinical trials tools in their collaboration with the Beijing Cancer Hospital of Peking University. [11]
The project intended to connect 65 NCI-designated cancer centers to enable collaborative research. Participating institutions could either “adopt” caBIG tools to share data directly through caGrid, or “adapt” commercial or in-house developed software to be caBIG-compatible. The caBIG program developed software development kits (SDKs) for interoperable software tools, and instructions on the process of adapting existing tools or developing applications to be caBIG-compatible.
The Enterprise Support Network program included domain-specific expertise, and support service providers, third party organizations that provide assistance on a contract-for-services basis. [12] A web portal using the Liferay software was available from 2008 to 2013. [13]
Since 2004, the caBIG program used open-source communities, adapted from other public-private partnerships. The caBIG program produced software under contract to software development teams largely within the commercial research community.[ citation needed ]
In general, software developed under US government contracts is the property of the US government and the US taxpayers. Depending on the terms in specific contracts, they might be accessible only by request under the Freedom of Information Act (FOIA). The timeliness of response to such requests might preclude a requester from ever gaining any secondary value from software released under a FOIA request.
The caBIG program placed the all caBIG software in a software repository freely accessible for download. Open source means anyone can modify the downloaded software; however, the licensing applied to the downloaded software allows greater flexibility than is typical. An individual or enterprise is allowed to contribute the modified code back to the caBIG program but is not required to do so. Likewise, the modifications can be made available as open source but are not required to be made available as open source. The caBIG licensing even allows the use of the caBIG applications and components, combined with additions and modifications, to be released as commercial products. These aspects of the caBIG program actually encourage commercialization of caBIG technology.
In 2008, GlaxoSmithKline announced it would share cancer cell genomic data with caBIG. [14] Some private companies claimed benefits from caBIG technology in 2010. [15]
A caGrid community web site was created in 2007. [16] The 1.x version of the core software was added to a GitHub project in mid-2013, under the BSD 3-Clause license. [17] It used version 4.03 of the Globus Toolkit, and the Taverna workbench system to manage workflow and the Business Process Execution Language. [17] [18] [19] Software called Introduce was developed around 2006. [20] Contributors included the Ohio State University Center for Clinical and Translational Science, Duke University, University of Chicago - Argonne National Laboratory, and private companies Booze Allen Hamilton, Ekagra Software Technologies and Semantic Bits. [16]
By 2008, some questioned if the program was benefiting large pharmaceutical companies. [21] By 2011, the project had spent an estimated $350 million. [22] Although the goal was considered laudable, much of the software was unevenly adopted after being developed at great expense to compete with commercial offerings. In March 2011, an NCI working group assessment concluded that caBIG "...expanded far beyond those goals to implement an overly complex and ambitious software enterprise of NCI-branded tools, especially in the Clinical Trial Management System (CTMS) space. These have produced limited traction in the cancer community, compete against established commercial vendors, and create financially untenable long-term maintenance and support commitments for the NCI". [2] In 2012, the NCI announced a new program the National Cancer Informatics Program (NCIP) as a successor to caBIG. [23] [24] [25]
Developer(s) | NCI's Center for Biomedical Informatics and Information Technology (CBIIT), The Ohio State University Research Foundation, The University of Chicago - Argonne National Laboratory, Duke University, Booz Allen Hamilton, SemanticBits LLC, Ekagra Software Technologies |
---|---|
Operating system | Cross-platform |
Type | Grid computing, Web service |
License | BSD 3-Clause |
Website | cagrid |
The caGrid computer network and software supported the cancer Biomedical Informatics Grid (caBIG) initiative of the National Cancer Institute of the US National Institutes of Health.
caBIG was a voluntary virtual informatics infrastructure that connects data, research tools, scientists, and organizations.
In 2013, the National Cancer Informatics Program (NCIP) re-released caGrid under the BSD 3-Clause license, and migrated the source repository to github.
caGrid used version 4.03 of the Globus Toolkit, produced by the Globus Alliance.
The caGrid project and much of its funding was managed by Booz Allen Hamilton
The caGrid Portal was a Web-based application built on Liferay that enables users to discover and interact with the services that are available on the caGrid infrastructure. Portal serves as the primary visualization tool for the caGrid middleware. It also served as a caBIG information source. Through the caGrid Portal, users had access to information about caBIG participants, caGrid points of contact (POCs), and caGrid-related news and events.
caGrid workflow uses:
In March 2011, the NCI published an extensive review of CaBIG, the NCI CBIIT program that funded the caGrid software development (see , ), which included a long list of problems with the program, and recommended that most of the software development projects should be discontinued.
Health informatics is the study and implementation of computer structures and algorithms to improve communication, understanding, and management of medical information. It can be view as branch of engineering and applied science.
The National Cancer Institute (NCI) coordinates the United States National Cancer Program and is part of the National Institutes of Health (NIH), which is one of eleven agencies that are part of the U.S. Department of Health and Human Services. The NCI conducts and supports research, training, health information dissemination, and other activities related to the causes, prevention, diagnosis, and treatment of cancer; the supportive care of cancer patients and their families; and cancer survivorship.
The Pediatric Oncology Group (POG) was a U.S. and Canadian clinical trial cooperative group created with the mission of studying childhood cancers. It was formed by the merger of the pediatric divisions of two other cooperative groups, the Southwest Oncology Group (SWOG) and the CALGB. In 2000, POG merged with several other pediatric cooperative groups to form the Children's Oncology Group (COG). This merger has seen its fair share of problems, especially with regard to integrating the various databases associated with each individual cooperative group. One such initiative to consolidate these databases involves caBIG or the cancer Biomedical Informatics Grid, which is guided and supported by the National Cancer Institute (NCI) in Bethesda, Maryland.
Carole Anne Goble, is a British academic who is Professor of Computer Science at the University of Manchester. She is principal investigator (PI) of the myGrid, BioCatalogue and myExperiment projects and co-leads the Information Management Group (IMG) with Norman Paton.
The myGrid consortium produces and uses a suite of tools design to “help e-Scientists get on with science and get on with scientists”. The tools support the creation of e-laboratories and have been used in domains as diverse as systems biology, social science, music, astronomy, multimedia and chemistry.
Apache Taverna was an open source software tool for designing and executing workflows, initially created by the myGrid project under the name Taverna Workbench, then a project under the Apache incubator. Taverna allowed users to integrate many different software components, including WSDL SOAP or REST Web services, such as those provided by the National Center for Biotechnology Information, the European Bioinformatics Institute, the DNA Databank of Japan (DDBJ), SoapLab, BioMOBY and EMBOSS. The set of available services was not finite and users could import new service descriptions into the Taverna Workbench.
Biomax Informatics is a Munich-based software company specializing in research software for bioinformatics. Biomax was founded in 1997 and has its roots in the Munich Information Center for Protein Sequences (MIPS). The company's customer base consists of companies and research organizations in the areas of drug discovery, diagnostics, fine chemicals, food and plant production. In addition to exclusive software tools, Biomax Informatics provides services and curated knowledge bases.
Renaissance Computing Institute (RENCI) was launched in 2004 as a collaboration involving the State of North Carolina, University of North Carolina at Chapel Hill (UNC-CH), Duke University, and North Carolina State University. RENCI is organizationally structured as a research institute within UNC-CH, and its main campus is located in Chapel Hill, NC, a few miles from the UNC-CH campus. RENCI has engagement centers at UNC-CH, Duke University (Durham), and North Carolina State University (Raleigh).
The "war on cancer" is the effort to find a cure for cancer by increased research to improve the understanding of cancer biology and the development of more effective cancer treatments, such as targeted drug therapies. The aim of such efforts is to eradicate cancer as a major cause of death. The signing of the National Cancer Act of 1971 by United States president Richard Nixon is generally viewed as the beginning of this effort, though it was not described as a "war" in the legislation itself.
The LONI Pipeline is a free distributed system for designing, executing, monitoring and sharing scientific workflows on grid computing architectures. Pipeline allows users to connect and run any number of different software tools, and conveniently visualize and download the results.
Discovery Net is one of the earliest examples of a scientific workflow system allowing users to coordinate the execution of remote services based on Web service and Grid Services standards. The system was designed and implemented at Imperial College London as part of the Discovery Net pilot project funded by the UK e-Science Programme. Many of the concepts pioneered by Discovery Net have been later incorporated into a variety of other scientific workflow systems.
LabKey Server is a software suite available for scientists to integrate, analyze, and share biomedical research data. The platform provides a secure data repository that allows web-based querying, reporting, and collaborating across a range of data sources. Specific scientific applications and workflows can be added on top of the basic platform and leverage a data processing pipeline.
REDCap is a browser-based, metadata-driven EDC software and workflow methodology for designing clinical and translational research databases. It is widely used in the academic research community: the REDCap Consortium is a collaborative, international network of more than 5900 institutional partners in 145 countries, with more than 2.1 million total end-users employing the software. Over 19,000 journal articles cite REDCap.
BioMart is a community-driven project to provide a single point of access to distributed research data. The BioMart project contributes open source software and data services to the international scientific community. Although the BioMart software is primarily used by the biomedical research community, it is designed in such a way that any type of data can be incorporated into the BioMart framework. The BioMart project originated at the European Bioinformatics Institute as a data management solution for the Human Genome Project. Since then, BioMart has grown to become a multi-institute collaboration involving various database projects on five continents.
Translational bioinformatics (TBI) is a field that emerged in the 2010s to study health informatics, focused on the convergence of molecular bioinformatics, biostatistics, statistical genetics and clinical informatics. Its focus is on applying informatics methodology to the increasing amount of biomedical and genomic data to formulate knowledge and medical tools, which can be utilized by scientists, clinicians, and patients. Furthermore, it involves applying biomedical research to improve human health through the use of computer-based information system. TBI employs data mining and analyzing biomedical informatics in order to generate clinical knowledge for application. Clinical knowledge includes finding similarities in patient populations, interpreting biological information to suggest therapy treatments and predict health outcomes.
Christopher G. Chute is a Bloomberg Distinguished Professor at Johns Hopkins University, physician-scientist and biomedical informatician known for biomedical terminologies and health information technology (IT) standards. He chairs the World Health Organization Revision Steering Group for the revision of the International Classification of Diseases (ICD-11).
A bioinformatics workflow management system is a specialized form of workflow management system designed specifically to compose and execute a series of computational or data manipulation steps, or a workflow, that relate to bioinformatics.
The OnlineHPC was a free public web service that supplied tools to deal with high performance computers and online workflow editor. OnlineHPC allowed users to design and execute workflows using the online workflow designer and to work with high performance computers – clusters and clouds. Access to high performance resources was available as directly from the service user interface, as from workflow components. The workflow engine of the OnlineHPC service was Taverna as traditionally used for scientific workflow execution in such domains, as bioinformatics, cheminformatics, medicine, astronomy, social science, music, and digital preservation.
Jill Suzanne Barnholtz-Sloan is an American biostatistician and data scientist specialized in cancer epidemiology and etiologic investigations of brain tumors. She is a senior investigator and associate director for informatics and data science at the National Cancer Institute.