Integrated Correlation Analysis of Proteomics and Transcriptomics Data in Alzheimer's Disease

dc.contributor.advisorLiu, Xiaowen
dc.contributor.authorModekurty, Suneeta
dc.contributor.otherWan, Jun
dc.contributor.otherZheng, Jiaping
dc.date.accessioned2021-01-18T12:07:06Z
dc.date.available2021-01-18T12:07:06Z
dc.date.issued2020-12
dc.degree.date2020en_US
dc.degree.disciplineSchool of Informaticsen
dc.degree.grantorIndiana Universityen_US
dc.degree.levelM.S.en_US
dc.descriptionIndiana University-Purdue University Indianapolis (IUPUI)en_US
dc.description.abstractWe wanted to see if there existed any significant correlations between two -omics layers. So, here, we performed a correlation analysis to study the disease. The pipeline building consisted of first performing the differential expression of two datasets (proteomics and transcriptomics) individually. An in-depth analysis of the proteomics data was performed, followed by differential expression analysis of RNA seq data and then a correlational analysis of the differentially expressed proteins (from proteomics data) and genes (from RNA seq data). From our analysis, we found fascinating information about the correlations between proteins and genes in AD. We performed a correlation analysis of AD (N= 84), Control (N = 31), and PSP (N = 85) samples for proteomics data and got 114 differentially expressed proteins (DEPs = 114). The RNA seq data had AD (N = 82), Control (N = 31) and PSP (N = 84) samples which gave us 61 differentially expressed genes (DEGs = 61). A correlation analysis using Spearman’s correlation coefficient method between proteins involved in AD revealed 192 very significant correlations with p-value <= 0.00000000000005. The mean correlation coefficient was quite high (r = 0.52). A correlation analysis using Spearman’s correlation coefficient method between genes involved in AD revealed 208 very significant correlations with p-value <= 0.00000000000005. The mean correlation coefficient was quite high (r = 0.52). A correlation analysis using Spearman’s correlation coefficient method between proteins and genes involved in AD revealed 395 significant correlations with p-value <= 0.0001. The correlation coefficient (quite high of +0.53), which might help in understanding the molecular pathways behind the disease could uncover new prospects of understanding the disease as well as design treatments. We observed that different genes interact with different proteins (correlation coefficient r >= 0.5, p-value < 0.05). We also observed that a single protein interacts with multiple genes, and a single gene is interestingly associated with multiple proteins. The patterns of correlations are also different in that a protein/gene positively correlates with some proteins/genes and negatively with some other proteins/genes. We hope that this observation is quite useful. However, understanding how it works and how they interact with each other needs further assessment at the molecular level.en_US
dc.identifier.doi10.7912/sjaw-8k50
dc.identifier.urihttps://hdl.handle.net/1805/24876
dc.identifier.urihttps://doi.org/10.7912/sjaw-8k50
dc.language.isoen_USen_US
dc.subjectMulti-omicsen_US
dc.subjectIntegrationen_US
dc.subjectCorrelation analysisen_US
dc.subjectProteomicsen_US
dc.subjectTranscriptomicsen_US
dc.subjectAlzheimer's Diseaseen_US
dc.titleIntegrated Correlation Analysis of Proteomics and Transcriptomics Data in Alzheimer's Diseaseen_US
dc.typeThesisen
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Thesis_Suneeta_Modekurty.pdf
Size:
5.36 MB
Format:
Adobe Portable Document Format
Description:
Main article
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: