Computational Analysis of Flow Cytometry Data

dc.contributor.advisorDundar, Murat
dc.contributor.authorIrvine, Allison W.
dc.contributor.otherTuceryan, Mihran
dc.contributor.otherMukhopadhyay, Snehasis
dc.contributor.otherFang, Shiaofen
dc.date.accessioned2013-07-12T17:21:14Z
dc.date.available2013-07-12T17:21:14Z
dc.date.issued2013-07-12
dc.degree.date2012en_US
dc.degree.disciplineDepartment of Computer and Information Scienceen_US
dc.degree.grantorPurdue Universityen_US
dc.degree.levelM.S.en_US
dc.descriptionIndiana University-Purdue University Indianapolis (IUPUI)en_US
dc.description.abstractThe objective of this thesis is to compare automated methods for performing analysis of flow cytometry data. Flow cytometry is an important and efficient tool for analyzing the characteristics of cells. It is used in several fields, including immunology, pathology, marine biology, and molecular biology. Flow cytometry measures light scatter from cells and fluorescent emission from dyes which are attached to cells. There are two main tasks that must be performed. The first is the adjustment of measured fluorescence from the cells to correct for the overlap of the spectra of the fluorescent markers used to characterize a cell’s chemical characteristics. The second is to use the amount of markers present in each cell to identify its phenotype. Several methods are compared to perform these tasks. The Unconstrained Least Squares, Orthogonal Subspace Projection, Fully Constrained Least Squares and Fully Constrained One Norm methods are used to perform compensation and compared. The fully constrained least squares method of compensation gives the overall best results in terms of accuracy and running time. Spectral Clustering, Gaussian Mixture Modeling, Naive Bayes classification, Support Vector Machine and Expectation Maximization using a gaussian mixture model are used to classify cells based on the amounts of dyes present in each cell. The generative models created by the Naive Bayes and Gaussian mixture modeling methods performed classification of cells most accurately. These supervised methods may be the most useful when online classification is necessary, such as in cell sorting applications of flow cytometers. Unsupervised methods may be used to completely replace manual analysis when no training data is given. Expectation Maximization combined with a cluster merging post-processing step gives the best results of the unsupervised methods considered.en_US
dc.identifier.urihttps://hdl.handle.net/1805/3367
dc.identifier.urihttp://dx.doi.org/10.7912/C2/2297
dc.language.isoen_USen_US
dc.subjectmachine learningen_US
dc.subjectbioinformaticsen_US
dc.subjectflow cytometryen_US
dc.subject.lcshBioinformaticsen_US
dc.subject.lcshFlow cytometryen_US
dc.subject.lcshSupport vector machinesen_US
dc.subject.lcshSupervised learning (Machine learning)en_US
dc.subject.lcshLeast squares -- Computer programsen_US
dc.subject.lcshComputational intelligenceen_US
dc.subject.lcshMultivariate analysisen_US
dc.subject.lcshNonlinear control theoryen_US
dc.subject.lcshFunctions of several complex variablesen_US
dc.titleComputational Analysis of Flow Cytometry Dataen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
thesisAndForms-07-24.pdf
Size:
931.08 KB
Format:
Adobe Portable Document Format
Description:
MS Thesis
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.88 KB
Format:
Item-specific license agreed upon to submission
Description: