Identification of host transcriptome-guided repurposable drugs for SARS-CoV-1 infections and their validation with SARS-CoV-2 infections by using the integrated bioinformatics approaches

Fee Faysal Ahmed, Md. Selim Reza, Md. Shahin Sarker, Md. Samiul Islam, Md. Parvez Mosharaf, Sohel Hasan,Md. Nurul Haque Mollah

Abstract

Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2) is one of the most severe global pandemic due to its high pathogenicity and death rate starting from the end of 2019. Though there are some vaccines available against SAER-CoV-2 infections, we are worried about their effectiveness, due to its unstable sequence patterns. Therefore, beside vaccines, globally effective supporting drugs are also required for the treatment against SARS-CoV-2 infection. To explore commonly effective repurposable drugs for the treatment against different variants of coronavirus infections, in this article, an attempt was made to explore host genomic biomarkers guided repurposable drugs for SARS-CoV-1 infections and their validation with SARS-CoV-2 infections by using the integrated bioinformatics approaches. At first, we identified 138 differentially expressed genes (DEGs) between SARS-CoV-1 infected and control samples by analyzing high throughput gene-expression profiles to select drug target key receptors. Then we identified top-ranked 11 key DEGs (SMAD4, GSK3B, SIRT1, ATM, RIPK1, PRKACB, MED17, CCT2, BIRC3, ETS1 and TXN) as hub genes (HubGs) by protein-protein interaction (PPI) network analysis of DEGs highlighting their functions, pathways, regulators and linkage with other disease risks that may influence SARS-CoV-1 infections.

Introduction

The severe acute respiratory syndrome coronavirus (SARS-CoV) is an alarming global health concern starting from the early 21st century. Now this virus is known as SARS-CoV-1. The SARS-CoV-1 is a feverish respiratory tract disease which was first identified in Guangdong Province, China in 2002. It then spread to 29 countries and was first officially recognized in March 2003 [1]. This virus is named Coronaviruses (CoVs) because of its characteristic halo structure under an electron microscope (corona, crown-like). Latin word “corona” means crown or “halo” and coronavirus particles display a crown-like fringe typically referred to as “spikes” under electron microscopy. The CoVs are non-segmented single-stranded RNA viruses covered with envelop which can cause illness ranging in severity from the common cold to severe and fatal illness or even death. On the basis of serotype and genome, the coronavirus subfamily is divided into four genera: α, β, γ and which has long been recognized as important veterinary pathogens that causes severe to lethal respiratory and enteric diseases in birds as well as mammals. More than 8,000 cases of infection and 774 deaths were reported worldwide due to the outbreak of this coronavirus (CoV) between March 2003 and July 2003 [2]. During the outbreak, the average mortality rate was around 9.6% [3, 4]. Koch’s postulated that SARS-CoV-1 was related to pathogenesis and poses a significant threat to human health [5]. Acute respiratory distress syndrome (ARDS) was developed in 16% of the total SARS-CoV-1 patients and the mortality rate became 50% of these types of SARS-CoV-1 patients [6, 7].

Materials and methods

2.1 Data sources and descriptions

We used both original data and metadata associated with SARS-CoV infections to reach the goal of this study as described in subsections 2.1.1–2.1.2.
2.1.1 Collection of host microarray gene-expression profiles to explore drug target proteins.
We collected gene expression profiling of peripheral blood mononuclear cells (PBMC) with SARS-CoV-1 infection as original data to explore host genomic biomarkers (drug target proteins). The dataset was downloaded from the affymetrix human HG-Focus target array platform under the NCBI Gene Expression Omnibus (GEO) data repository (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE1739) with the accession number GSE1739 [47]. It consisted of 14 samples, where the number of SARS cases was 10 and the number matched control sample was 4. This dataset was first analyzed by Reghunathan et al. [2005] to understand the host response to SARS-CoV-1 infection from gene expression level [48].

Results

3.1 Identification of differential expression of genes (DEGs)

To identify DEGs between SARS-CoV-1 infected and control samples, we analyzed a publicly available gene expression dataset (GSE-1739) by using statistical LIMMA approach and identified 141 DEGs, where 87 DEGs were up-regulated and 51 DEGs were down-regulated. A volcano plot was constructed to display the status of all genes simultaneously, where red color indicate significant up-regulated, blue color are significant down-regulated genes and black color indicate insignificant genes. The DEGs were selected with the threshold of adjusted p-values <0.05 and the absolute of fold change values >1.0 (see Fig 2A). We constructed Heatmap to observe the performance of DEGs on clustering/classification of samples into case and control groups through the hierarchical clustering approach (see Fig 2C). We observed that DEGs were separated into up-regulated and down-regulated groups, and samples were separated into case and control groups properly (see Fig 2B). We provided the up-regulated and down-regulated DEGs in S2 Table for further investigation by the other researchers.

Discussions

In this study, we analyzed high throughput host gene-expression profiles for SARS-CoV-1 infections to identify key genomic biomarkers (drug target proteins) highlighting their functions, pathways, regulatory factors (TFs, miRNAs and chemicals), associated comorbidities and repurposable drugs for SARS-CoV-1 infections by using the integrated bioinformatics approaches. At first we identified 138 DEGs from host gene-expression profiles. Then we detected 11 HubGs as genomic biomarkers (SMAD4, GSK3B, SIRT1, ATM, RIPK1, PRKACB, MED17, CCT2, BIRC3, ETS1 and TXN) by the PPI network analysis of 138 DEGs (Fig 3, S3 Table). These 11 genomic biomarkers were treated as the key players for signal transduction during disease development. In particular, the key gene GSK3B may lead to the viral replication, initiation of oxidative stress, and inflammation during SARS-COV-2 infection [76, 77]. Recent studies also demonstrate that GSK3B is a putative therapeutic target to combat the SARS-COV-2 pandemic [41, 76]. The 2nd key gene SIRT1 is a key regulator of ACE2 levels [78]. The SIRT1 gene could control viral entry of SARS-CoV and viral replication [79–81]. So, SIRT1 inhibitors may be a valid solution to treat novel coronavirus. The 4th key gene PRKACB is strongly linked with SARS-COV-2 infections [82].

Conclusion

In this article, we attempted to explore commonly effective supporting drugs for the treatment against different variants of coronavirus infections. Selection of both drug target proteins and agents from a large number of alternatives are equally important in drug discovery by molecular docking. Therefore, firstly, we identified 17 drug target proteins (SMAD4, GSK3B, SIRT1, ATM, RIPK1, PRKACB, MED17, CCT2, BIRC3, ETS1, TXN, FOXC1, GATA2, YY1, FOXL1, TP53 and SRF) from a large number of alternatives by analyzing microarray gene-expression profiles of SARS-CoV-1 infected and control samples based on the integrated statistics and bioinformatics approaches. Then, we identified our proposed target proteins guided top-ranked 7 repurposable drugs (Rapamycin, Tacrolimus, Torin-2, Redotinib, Ivermectin, Danoprevir, Daclatasvir) for the treatment against SARS-CoV-1infections. Then, we validated these 7 candidate-drugs against the state-of-the-arts top-ranked 11 target proteins (MX1, IRF7, NFKBIA, STAT1, IL6, TNF, CCL20, CXCL8, VEGFA, CASP3, ICAM1) that are associated with different variants of SARS-CoV-2 infections by molecular docking simulation and found their significant binding affinity scores.

Acknowledgments

We humbly apologized to those scientists whose research is not cited here. We are very much grateful and thankful to all reviewers for their valuable comments suggestions that help us to improve the quality of the manuscript.

Citation: Ahmed FF, Reza MS, Sarker MS, Islam MS, Mosharaf MP, Hasan S, et al. (2022) Identification of host transcriptome-guided repurposable drugs for SARS-CoV-1 infections and their validation with SARS-CoV-2 infections by using the integrated bioinformatics approaches. PLoS ONE 17(4): e0266124. https://doi.org/10.1371/journal.pone.0266124

Editor: Usman Ali Ashfaq, Government College University Faisalabad, PAKISTAN

Received: July 31, 2021; Accepted: March 15, 2022; Published: April 7, 2022

Copyright: © 2022 Ahmed et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Data Availability: All relevant data are within the paper and its Supporting Information files.

Funding: The author(s) received no specific funding for this work.

Competing interests: The authors have declared that no competing interests exist.

https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0266124#abstract0