- Browse by Author
Browsing by Author "Shen, Jikui"
Now showing 1 - 4 of 4
Results Per Page
Sort Options
Item Advanced Functions Embedded in the Second Version of Database, Global Evaluation of SARS-CoV-2/hCoV-19 Sequences 2(Frontiers Media, 2022-04-11) Li, Kailing; Wang, Audrey K.Y.; Liu, Sheng; Fang, Shuyi; Lu, Alex Z.; Shen, Jikui; Yang, Lei; Hu, Chang-Deng; Yang, Kai; Wan, Jun; BioHealth Informatics, School of Informatics and ComputingThe Global Evaluation of SARS-CoV-2/hCoV-19 Sequences 2 (GESS v2 https://shiny.ph.iu.edu/GESS_v2/) is an updated version of GESS, which has offered a handy query platform to analyze single-nucleotide variants (SNVs) on millions of high coverages and high-quality severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) complete genomes provided by the Global Initiative on Sharing Avian Influenza Data (GISAID). Including the tools in the first version, the GESS v2 is embedded with new functions, which allow users to search SNVs, given the viral nucleotide or amino acid sequence. The GESS v2 helps users to identify SNVs or SARS-CoV-2 lineages enriched in countries of user's interest and show the migration path of a selected lineage on a world map during specific time periods chosen by the users. In addition, the GESS v2 can recognize the dynamic variations of newly emerging SNVs in each month to help users monitor SNVs, which will potentially become dominant soon. More importantly, multiple sets of analyzed results about SNVs can be downloaded directly from the GESS v2 by which users can conduct their own independent research. With these significant updates, the GESS v2 will continue to serve as a public open platform for researchers to explore SARS-CoV-2 evolutionary patterns from the perspectives of the prevalence and impact of SNVs.Item Genetic Spectrum and Distinct Evolution Patterns of SARS-CoV-2(Frontiers Media, 2020-09-25) Liu, Sheng; Shen, Jikui; Fang, Shuyi; Li, Kailing; Liu, Juli; Yang, Lei; Hu, Chang-Deng; Wan, Jun; Medical and Molecular Genetics, School of MedicineFour signature groups of frequently occurred single-nucleotide variants (SNVs) were identified in over twenty-eight thousand high-quality and high-coverage SARS-CoV-2 complete genome sequences, representing different viral strains. Some SNVs predominated but were mutually exclusively presented in patients from different countries and areas. These major SNV signatures exhibited distinguishable evolution patterns over time. A few hundred patients were detected with multiple viral strain-representing mutations simultaneously, which may stand for possible co-infection or potential homogenous recombination of SARS-CoV-2 in environment or within the viral host. Interestingly nucleotide substitutions among SARS-CoV-2 genomes tended to switch between bat RaTG13 coronavirus sequence and Wuhan-Hu-1 genome, indicating the higher genetic instability or tolerance of mutations on those sites or suggesting that major viral strains might exist between Wuhan-Hu-1 and RaTG13 coronavirus.Item GESS: a database of global evaluation of SARS-CoV-2/hCoV-19 sequences(Oxford University Press, 2020-10-12) Fang, Shuyi; Li, Kailing; Shen, Jikui; Liu, Sheng; Liu, Juli; Yang, Lei; Hu, Chang-Deng; Wan, Jun; BioHealth Informatics, School of Informatics and ComputingThe COVID-19 outbreak has become a global emergency since December 2019. Analysis of SARS-CoV-2 sequences can uncover single nucleotide variants (SNVs) and corresponding evolution patterns. The Global Evaluation of SARS-CoV-2/hCoV-19 Sequences (GESS, https://wan-bioinfo.shinyapps.io/GESS/) is a resource to provide comprehensive analysis results based on tens of thousands of high-coverage and high-quality SARS-CoV-2 complete genomes. The database allows user to browse, search and download SNVs at any individual or multiple SARS-CoV-2 genomic positions, or within a chosen genomic region or protein, or in certain country/area of interest. GESS reveals geographical distributions of SNVs around the world and across the states of USA, while exhibiting time-dependent patterns for SNV occurrences which reflect development of SARS-CoV-2 genomes. For each month, the top 100 SNVs that were firstly identified world-widely can be retrieved. GESS also explores SNVs occurring simultaneously with specific SNVs of user's interests. Furthermore, the database can be of great help to calibrate mutation rates and identify conserved genome regions. Taken together, GESS is a powerful resource and tool to monitor SARS-CoV-2 migration and evolution according to featured genomic variations. It provides potential directive information for prevalence prediction, related public health policy making, and vaccine designs.Item Updated SARS-CoV-2 Single Nucleotide Variants and Mortality Association(Cold Spring Harbor Laboratory Press, 2021) Fang, Shuyi; Liu, Sheng; Shen, Jikui; Lu, Alex Z.; Zhang, Yucheng; Li, Kailing; Liu, Juli; Yang, Lei; Hu, Chang-Deng; Wan, Jun; BioHealth Informatics, School of Informatics and ComputingSince its outbreak in December 2019, COVID-19 has caused 100,5844,555 cases and 2,167,313 deaths as of Jan 27, 2021. Comparing our previous study of SARS-CoV-2 single nucleotide variants (SNVs) before June 2020, we found out that the SNV clustering had changed considerably since June 2020. Apart from that the group SNVs represented by two non-synonymous mutations A23403G (S: D614G) and C14408T (ORF1ab: P4715L) became dominant and carried by over 95% genomes, a few emerging groups of SNVs were recognized with sharply increased monthly occurrence ratios up to 70% in November 2020. Further investigation revealed that several SNVs were strongly associated with the mortality, but they presented distinct distribution in specific countries, e.g., Brazil, USA, Saudi Arabia, India, and Italy. SNVs including G25088T, T25A, G29861T and G29864A were adopted in a regularized logistic regression model to predict the mortality status in Brazil with the AUC of 0.84. Protein structure analysis showed that the emerging subgroups of non-synonymous SNVs and those mortality-related ones in Brazil were located on protein surface area. The clashes in protein structure introduced by these mutations might in turn affect virus pathogenesis through conformation changes, leading to the difference in transmission and virulence. Particularly, we found that SNVs tended to occur in intrinsic disordered regions (IDRs) of Spike (S) and ORF1ab, suggesting a critical role of SNVs in protein IDRs to determine protein folding and immune evasion.