SBIR Phase II: Sample Classification and Biomarker Discovery by Comprehensive Metabolomic Analysis
This Small Business Innovation Research Phase II project proposes to develop a system for automated classification of biological samples and discovery of biomarkers. The system will be designed to perform comprehensive pattern analysis of state-of-the-art biochemical separations generated by comprehensive two-dimensional gas chromatography (GCxGC) with high-resolution mass spectrometry (HRMS). The pairing of GCxGC and HRMS combines highly effective molecular separations with precise elemental analysis. A critical challenge for effective utilization of GCxGC-HRMS for biochemical sample classification and biomarker discovery is the difficulty of analyzing and interpreting the massive, complex data for metabolomic features. The quantity and complexity of the data, the large dimensionality of the metabolome, and the possibility that significant chemical characteristics may be subtle and involve patterns of multiple constituents necessitate investigation and development of new bioinformatics methods. The principal technical objective is an innovative framework for comprehensive feature matching and analysis across many samples. Specifically, the framework will incorporate advanced methods for multidimensional peak detection, peak pattern matching across large sample sets, data alignment, comprehensive feature matching, and multi-sample analyses (e.g., classification and biomarker discovery) with large sample sets. The anticipated result is a commercial system for automated multi-sample analysis. The broader impact/commercial potential of this project will be realized through improved informatics for biological classification and biomarker discovery. These tools will enable researchers to better understand biochemical processes and to discover metabolic biomarkers, which could lead to improved methods for disease diagnoses and treatments.
These information technologies will foster utilization of advanced GCxGC-HRMS instrumentation, thereby contributing to the impetus for future instrument development. The informatics developed in this project also will be relevant for other classification problems involving multidimensional, multispectral data, including other applications (such as biofuels), other types of chemical analyses (such as multidimensional spectroscopy), and other fields (such as remote-sensing multispectral geospatial imagers). This project will contribute to national competitiveness in the global market for analytical technologies and will contribute to workforce development by involving students in research experiences through internships and student projects. Software developed in the project and an example dataset will be available to educational institutions to allow students to more easily explore biochemical complexity.
GC Image, LLC, is developing informatics for automated classification of biological samples and discovery of biomarkers. The system performs comprehensive pattern analysis of state-of-the-art biochemical separations generated by comprehensive two-dimensional gas chromatography (GCxGC) and comprehensive two-dimensional liquid chromatography (LCxLC) with high-resolution mass spectrometry (HRMS). The pairing of GCxGC or LCxLC with HRMS combines highly effective molecular separations with precise elemental analysis. By developing and disseminating improved informatics for biological classification and biomarker discovery, this project will enable applications researchers to better understand biochemical processes and to discover metabolomic biomarkers that could lead to improved methods for disease diagnoses and treatments.
The post-doctoral researcher will work with the R&D team at GC Image, LLC, to develop and demonstrate innovative methods and tools for bioinformatics data processing, visualization, and analysis. This work is conducted in collaboration with advanced research laboratories in prominent universities and leading companies worldwide, and the results of the research are presented at premier conferences and published in top journals. Some time will be allowed for independent research, and GC Image, LLC, will support an application for adjunct faculty status at the University of Nebraska-Lincoln during the award.
The successful candidate should fit one of two profiles:
* Education and background in computer science, software engineering, or related disciplines.
* Education and background in analytical chemistry, biochemistry, biosciences (esp. metabolomics), or related disciplines.
For candidates with primarily a computing background, prior work in data science, pattern recognition, machine learning, or related areas is desired, but not required.
For candidates with primarily a biochemical science background, prior work in chromatography, mass spectrometry, metabolomics, or related areas is desired, but not required.
For all candidates, prior work in bioinformatics, cheminformatics, and chemometrics is desired, but not required.
For all candidates, the ability to program in a high-level language, preferably Java and/or R, is required.