Please use this identifier to cite or link to this item: https://hdl.handle.net/2440/112036
Citations
Scopus Web of ScienceĀ® Altmetric
?
?
Full metadata record
DC FieldValueLanguage
dc.contributor.advisorKoch, Inge-
dc.contributor.authorConway, Annie-
dc.date.issued2016-
dc.identifier.urihttp://hdl.handle.net/2440/112036-
dc.description.abstractThis thesis presents a toolbox for the exploratory analysis of multivariate data, in particular proteomics imaging mass spectrometry data. Typically such data consist of 15000 - 20000 spectra with a spatial component, and for each spectrum ion intensities are recorded at specific masses. Clustering is a focus of this thesis, with discussion of k-means clustering and clustering with principal component analysis (PCA). Theoretical results relating PCA and clustering are given based on Ding and He (2004), and detailed and corrected proofs of the authors' results are presented. The benefits of transformations prior to clustering of the data are explored. Transformations include normalisation, peak intensity correction (PIC), binary and log transformations. A number of techniques for comparing different clustering results are also discussed and these include set based comparisons with the Jaccard distance, an information based criterion (variation of information), point-pair comparisons (Rand index) and a modified version of the prediction strength of Tibshirani and Walther (2005). These exploratory analyses are applied to imaging mass spectrometry data taken from patients with ovarian cancer. The data are taken from slices of cancerous tissue. The analyses in this thesis are primarily focused on data from one patient, with some techniques demonstrated on other patients for comparison.en
dc.subjectclusteringen
dc.subjectproteomicsen
dc.subjectmultivariate data analysisen
dc.subjecthigh-dimensional data analysisen
dc.subjectmachine learningen
dc.titleClustering of proteomics imaging mass spectrometry dataen
dc.typeThesesen
dc.contributor.schoolSchool of Mathematical Sciencesen
dc.provenanceThis electronic version is made publicly available by the University of Adelaide in accordance with its open access policy for student theses. Copyright in this thesis remains with the author. This thesis may incorporate third party material which has been used by the author pursuant to Fair Dealing exceptions. If you are the owner of any included third party copyright material you wish to be removed from this electronic version, please complete the take down form located at: http://www.adelaide.edu.au/legalsen
dc.description.dissertationThesis (M.Phil.) -- University of Adelaide, School of Mathematical Sciences, 2016.en
dc.identifier.doi10.4225/55/5af4e722c538f-
Appears in Collections:Research Theses

Files in This Item:
File Description SizeFormat 
01front.pdf227.86 kBAdobe PDFView/Open
02whole.pdf1.88 MBAdobe PDFView/Open
Permissions
  Restricted Access
Library staff access only484.7 kBAdobe PDFView/Open
Restricted
  Restricted Access
Library staff access only1.93 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.