BBSRC Portfolio Analyser
Award details
14 NSFBIO: Mining of imaging flow cytometry data for label free, single cell analysis
Reference
BB/N005163/1
Principal Investigator / Supervisor
Professor Paul Rees
Co-Investigators /
Co-Supervisors
Professor Huw Summers
,
Professor Catherine Thornton
Institution
Swansea University
Department
College of Engineering
Funding type
Research
Value (£)
147,922
Status
Completed
Type
Research Grant
Start date
25/09/2015
End date
24/09/2018
Duration
36 months
Abstract
In this project, we will leverage our expertise and existing microscopy-oriented software to develop a data mining methodology for the image data generated using an imaging cytometer. With experimental data from our project partners, we will optimize and prove these methodologies in biological applications in areas as diverse as stem cell differentiation, cell cycle analysis, and immune system activation. The novelty of this work includes: 1. Developing label-free assays. In addition to measuring morphological parameters that are standard for fluorescence microscopy images, we will emphasize the extraction of information present in two channels that do not require exogenous labels: brightfield and darkfield (the latter represents the scatter from the cell at right angles to the excitation). Both of these are captured by imaging flow cytometers but have thus far been largely (or completely) ignored. They have also generally been ignored using conventional microscopy because image-processing algorithms struggle to accurately delineate cell borders in both types of images - a problem that is avoided in imaging flow cytometry, because the cells are physically separate as they flow through the instrument. We expect that the inclusion of rich data from these channels, together with advanced machine learning algorithms, will enable scoring phenotypes and cellular states using fewer labels; in fact, in some key cases our proposed advancements will enable biologists to make relevant measurements of their cells with no staining required, expanding the types of cells and biological questions that can be addressed with the technology. 2. Quantifying heterogeneity at multiple levels. We will quantify the variation of the parameters derived from these cell images; both in a cell population from an individual and among cell populations from a cohort of people. 3. Creating a new, open-source workflow for advanced analysis of imaging flow cytometry data.
Summary
The project is a collaboration between researchers at Swansea University, UK and scientists at the Broad Institute of Harvard and MIT, Cambridge, US. The project will develop and demonstrate software to mine data from imaging flow cytometers. These instruments can capture thousands of images of cells per second. The images can in theory be analyzed to precisely measure hundreds of features related to cellular morphology; this project is to develop advanced machine-learning software to accomplish this, unlocking the otherwise hidden information within the images. The software will be developed, improved, and validated in several demonstration experiments involving the cell cycle, the component cells of primary blood, immune cell activation, and stem cell identity. The goal will be to use as few or indeed no fluorescent biomarkers, eliminating the need to perturb cells. The resulting open-source software will be freely available to scientists worldwide for both applied and clinical research, and will be accompanied by user-friendly training materials and in-person workshops. The project is collaborative and interdisciplinary and includes training early career-stage scientists in computational biology, via the existing Scientists without Borders program. The project involves close collaboration with a host of researchers from both the UK and US who use imaging flow cytometers and builds on a previous successful interdisciplinary collaboration in biological data mining by the teams at the Broad Institute and Swansea University. In order to devise the novel software and methodology to mine the large datasets acquired using imaging flow cytometry, the team will develop algorithms to seamlessly import data from an imaging cytometer, robustly segment cells, quality-filter them (e.g., for debris and blur), and quantify morphological parameters (usually hundreds) for each cell (usually thousands), including various measures of size, shape, and texture. Using these features,trained machine-learning algorithms will identify cell phenotypes of interest or otherwise characterize the state of cell in driving biological projects from project partners who use imaging flow cytometry in a host of biological research studies. The goal will be to use as few or indeed no fluorescent biomarkers, eliminating the need to perturb cells. The project will give the scientific community a validated, open-source software toolbox of image processing and machine learning algorithms readily usable by biologists.
Impact Summary
To ensure broad impact, dissemination, and sustainability of the proposed research, the algorithms we develop for imaging flow cytometry will be packaged into user friendly, open-source software to allow any researcher to apply these advanced image analysis techniques in their own laboratories. Dr. Carpenter's group at the Broad Institute has an excellent track record of serving the needs of end users and ensuring software is disseminated and adopted by the community. For example, their CellProfiler software has already been cited more than 1,000 times. For this project, close collaboration with the laboratories leading the driving biological projects will ensure that end users contribute to the design and evaluation of the tools so that the end result serves the biological community's needs. Specifically, we will disseminate algorithms as modules in the CellProfiler software package following our customary software engineering best practices. CellProfiler is already widely known and mature, is open-source and user-friendly, reads >100 image file formats, enables reproducible research, is cross-platform (Windows, Mac, and Unix), has a visualization/analysis companion called CellProfiler Analyst, and is being interfaced with other open-source projects (e.g., ImageJ/NIH Image, MicroManager). Contributing to existing software eliminates the waste of building and maintaining a separate software package solely for imaging flow cytometry data, and facilitates sustainability. Online, we will distribute example images, CellProfiler analysis pipelines, and written tutorials so the algorithms can be used easily by biologists. For computer scientists developing and testing new algorithms, we will contribute the validation images and ground truth to our Broad Bioimage Benchmark Collection (BBBC, www.broadinstitute.org/bbbc).
Committee
Research Committee D (Molecules, cells and industrial biotechnology)
Research Topics
Technology and Methods Development
Research Priority
X – Research Priority information not available
Research Initiative
UK BBSRC-US NSF/BIO (NSFBIO) [2014]
Funding Scheme
X – not Funded via a specific Funding Scheme
I accept the
terms and conditions of use
(opens in new window)
export PDF file
back to list
new search