Network analysis of pseudogene-gene relationships: from pseudogene evolution to their functional potentials

dc.contributor.authorJohnson, Travis S.
dc.contributor.authorLi, Sihong
dc.contributor.authorKho, Jonathan R.
dc.contributor.authorHuang, Kun
dc.contributor.authorZhang, Yan
dc.contributor.departmentMedicine, School of Medicineen_US
dc.date.accessioned2018-06-08T17:11:51Z
dc.date.available2018-06-08T17:11:51Z
dc.date.issued2018
dc.description.abstractPseudogenes are fossil relatives of genes. Pseudogenes have long been thought of as "junk DNAs", since they do not code proteins in normal tissues. Although most of the human pseudogenes do not have noticeable functions, ∼20% of them exhibit transcriptional activity. There has been evidence showing that some pseudogenes adopted functions as lncRNAs and work as regulators of gene expression. Furthermore, pseudogenes can even be "reactivated" in some conditions, such as cancer initiation. Some pseudogenes are transcribed in specific cancer types, and some are even translated into proteins as observed in several cancer cell lines. All the above have shown that pseudogenes could have functional roles or potentials in the genome. Evaluating the relationships between pseudogenes and their gene counterparts could help us reveal the evolutionary path of pseudogenes and associate pseudogenes with functional potentials. It also provides an insight into the regulatory networks involving pseudogenes with transcriptional and even translational activities.In this study, we develop a novel approach integrating graph analysis, sequence alignment and functional analysis to evaluate pseudogene-gene relationships, and apply it to human gene homologs and pseudogenes. We generated a comprehensive set of 445 pseudogene-gene (PGG) families from the original 3,281 gene families (13.56%). Of these 438 (98.4% PGG, 13.3% total) were non-trivial (containing more than one pseudogene). Each PGG family contains multiple genes and pseudogenes with high sequence similarity. For each family, we generate a sequence alignment network and phylogenetic trees recapitulating the evolutionary paths. We find evidence supporting the evolution history of olfactory family (both genes and pseudogenes) in human, which also supports the validity of our analysis method. Next, we evaluate these networks in respect to the gene ontology from which we identify functions enriched in these pseudogene-gene families and infer functional impact of pseudogenes involved in the networks. This demonstrates the application of our PGG network database in the study of pseudogene function in disease context.en_US
dc.eprint.versionAuthor's manuscripten_US
dc.identifier.citationJohnson, T. S., Li, S., Kho, J. R., Huang, K., & Zhang, Y. (2018). Network analysis of pseudogene-gene relationships: from pseudogene evolution to their functional potentials. Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing, 23, 536–547.en_US
dc.identifier.urihttps://hdl.handle.net/1805/16429
dc.language.isoen_USen_US
dc.publisherPacific Symposium on Biocomputingen_US
dc.relation.journalPacific Symposium on Biocomputingen_US
dc.rightsPublisher Policyen_US
dc.sourcePMCen_US
dc.subjectPseudogene-gene (PGG) relationshipen_US
dc.subjectNetwork analysisen_US
dc.subjectPseudogene functionen_US
dc.subjectPGG network databaseen_US
dc.titleNetwork analysis of pseudogene-gene relationships: from pseudogene evolution to their functional potentialsen_US
dc.typeConference proceedingsen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
nihms921878.pdf
Size:
805.16 KB
Format:
Adobe Portable Document Format
Description:
Main article
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.99 KB
Format:
Item-specific license agreed upon to submission
Description: