GW-led consortium and FDA release new specifications to advance genomic data analysis
WASHINGTON (Sept. 29, 2017) — The George Washington University (GW) and the U.S. Food and Drug Administration (FDA) have published a BioCompute Object Specification Document for research and clinical trial use, which details a new framework for communication of High-throughput Sequencing (HTS) computations and data analysis, known as BioCompute Objects (BCOs). This framework will allow researchers to share and compare their work more effectively, enabling more reproducible efforts across bioinformatics as a whole. It will also facilitate workflow exchange between the FDA, pharmaceutical companies, bioinformatics platform providers and researchers as they move through the regulatory submissions process.
The work was done in collaboration with investigators from Harvard Medical School, Seven Bridges, and more.
Right now, computational researchers use a variety of bioinformatics software with different environments and parameters, which leads to variation in research protocols and difficulty reproducing results. Because there is no standard of communication, or common language used for computational biology, researchers cannot clearly communicate all the variables that impact their data analysis results, leading to the need for other researchers to expend additional time and effort to understand exactly what was done in a particular study.
"We want to make it as easy to get the exact parameters for biocomputation as it is to get a recipe for salmon," Raja Mazumder, PhD, associate professor of biochemistry and molecular medicine at the GW School of Medicine and Health Sciences. "The bioinformatics field is still evolving. With standards for BCOs, researchers will receive better information when comparing or building on existing research. It will allow all bioinformatics researchers to know the important components of the 'recipe,' so everyone is speaking the same language."
The BioCompute Object Specification Document was published on the Open Science Framework site, a free, open-source commons for scientists.
"BioCompute objects represent an important development in community-driven harmonization frameworks that meet the need for standardization as a prerequisite for a deeper understanding of community-generated nucleic acid sequence information," said Vahan Simonyan, PhD, HIVE Team Principal Investigator in FDA's Center for Biologics Evaluation and Research, Office of Biostatistics and Epidemiology. "This effort has the potential to advance modern biological and medical data analysis and help improve patient health outcomes."
Major contributions were made by Gil Alterovitz of Harvard Medical School and FHIR Genomics and Dennis A. Dean of Seven Bridges Genomics.
"BCOs have the potential to profoundly transform and optimize the dissemination and communication of next-generation sequencing among potential stakeholders, including: scientific labs, clinical laboratories, and regulatory agencies," said Alterovitz, assistant professor at Harvard Medical School and the Computational Health Informatics Program, Boston Children's Hospital.
The BCO framework was created with input from over 300 key stakeholders who attended a recent workshop hosted by GW in coordination with the FDA. The attendees included medical researchers, regulatory scientists, HTS or next-generation sequencing data platform developers, pharmaceutical scientists and bioinformaticians, big data experts, and more.
"We work with numerous research and development teams across the industry and standardizing data analysis workflows is one of the biggest challenges our customers face," said Dean, senior scientist at Seven Bridges. "Having a common standard that enables greater transparency and reproducibility is absolutely key when it comes to improving the effectiveness of biomedical research and seeking regulatory approval for new therapies."
Mazumder, Simonyan, Alterovitz, and Dean also collaborated with Carol Goble and Stian Soiland-Reyes of the University of Manchester, the ELIXIR-UK node and the EU's BioExcel Centre of Excellence; Michael Crusoe of the Common Workflow Language; and Eric Donaldson from the FDA.
The BioCompute Object Specification Document is available at https://osf.io/h59uh/ and a commentary paper on the utility of BioCompute Objects tilted "Enabling Precision Medicine via standard communication of NGS provenance, analysis, and results" is available at https://www.biorxiv.org/content/early/2017/09/21/191783.
Media: For inquiries, please contact Lisa Anderson at [email protected] or 202-994-3121.
About the GW School of Medicine and Health Sciences:
Founded in 1824, the GW School of Medicine and Health Sciences (SMHS) was the first medical school in the nation's capital and is the 11th oldest in the country. Working together in our nation's capital, with integrity and resolve, the GW SMHS is committed to improving the health and well-being of our local, national and global communities. smhs.gwu.edu