NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|190360135|sp|P0C6U0|]
View 

RecName: Full=Replicase polyprotein 1a; Short=pp1a; AltName: Full=ORF1a polyprotein; Contains: RecName: Full=Non-structural protein 1; Short=nsp1; AltName: Full=p28; Contains: RecName: Full=Non-structural protein 2; Short=nsp2; AltName: Full=p65; Contains: RecName: Full=Papain-like protease nsp3; Short=PL-PRO; AltName: Full=Non-structural protein 3; Short=nsp3; AltName: Full=PL1-PRO/PL2-PRO; AltName: Full=PL1/PL2; AltName: Full=PL2-PRO; AltName: Full=Papain-like proteinases 1/2; AltName: Full=p210; Contains: RecName: Full=Non-structural protein 4; Short=nsp4; AltName: Full=Peptide HD2; AltName: Full=p44; Contains: RecName: Full=3C-like proteinase nsp5; Short=3CL-PRO; Short=3CLp; AltName: Full=M-PRO; AltName: Full=nsp5; AltName: Full=p27; Contains: RecName: Full=Non-structural protein 6; Sh

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
TM_Y_MHV-like_Nsp3_C cd21714
C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine ...
2195-2749 0e+00

C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In MHV and the related Severe acute respiratory syndrome-related coronavirus (SARS-CoV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


:

Pssm-ID: 409662  Cd Length: 555  Bit Score: 1114.06  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2195 VARGACIIATIFLLWFNFIYANVIFSDFYLPKIGFLPTFVGKIAQWIKNTFSLVTICDLYSIQDVGFKNQYCNGSIACQF 2274
Cdd:cd21714     1 VARGFFIIATIFLLWFNFLYANVIFSDFYLPNIGFLPTFVGKIVQWFKNTFGLVTICDLYSVSDVGFKSQFCNGSMACQL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2275 CLAGFDMLDNYKAIDVVQYEADRRAFVDYTGVLKIVIELIVSYALYTAWFYPLFALISIQILTTWLPELFMLSTLHWSVR 2354
Cdd:cd21714    81 CLSGFDMLDNYKAIDVVQYEVDRRVFFDYTSVLKLVVELVVSYALYTVWFYPLFCLIGLQLLTTWLPEFFMLETLHWSVR 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2355 LLVSLANMLPAHVFMRFYIIIASFIKLFSLFRHVAYGCSKSGCLFCYKRNRSLRVKCSTIVGGMIRYYDAMANGGTGFCS 2434
Cdd:cd21714   161 LFVFLANMLPAHVFLRFYIVVTAMYKIFCLFRHVVYGCSKPGCLFCYKRNRSVRVKCSTIVGGMLRYYDVMANGGTGFCS 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2435 KHQWNCIDCDSYKPGNTFITVEAALDLSKELKRPIQPTDVAYHTVTDVKQVGCYMRLFYDRDGQRTYDDVNASLFVDYSN 2514
Cdd:cd21714   241 KHQWNCINCDSYKPGNTFITVEAAAELSKELKRPVNPTDVAYYTVTDVKQVGCSMRLFYERDGQRVYDDVNASLFVDMNG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2515 LLHSKVKSVPNMHVVVVENDADKANFLNAAVFYAQSLFRPILMVDKNLITTANTGTSVTETMFDVYVDTFLSMFDVDKKS 2594
Cdd:cd21714   321 LLHSKVKGVPNTHVVVVENDADKANFLNAAVFYAQSLFRPMLMVDKKLITTANTGTSVSQTMFDVYVDTFLSMFDVDRKS 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2595 LNALIATAHSSIKQGTQICKVLDTFLSCARKSCSIDSDVDTKCLADSVMSAVSAGLELTDESCNNLVPTYLKGDNIVAAD 2674
Cdd:cd21714   401 LNSFINTAHSSLKEGVQLEKVLDTFIGCARKSCSIDSDVDTKCIAKSVMSAVAAGLEFTDESCNNLVPTYIKSDNIVAAD 480
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 190360135 2675 LGVLIQNSAKHVQGNVAKIAGVSCIWSVDAFNQLSSDFQHKLKKACCKTGLKLKLTYNKQMANVSVLTTPFSLKG 2749
Cdd:cd21714   481 LGVLIQNSAKHVQGNVAKAANVACIWSVDAFNQLSSDFQHKLKKACVKTGLKLKLTYNKQEANVSILTTPFSLKG 555
betaCoV_Nsp2_MHV-like cd21519
betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins ...
249-851 0e+00

betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins from betacoronaviruses in the A lineage; Coronavirus non-structural proteins (Nsps) are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases Nsp3 and Nsp5 into smaller subunits. Nsp2 is one of these subunits. This subgroup includes Nsp2 from Murine hepatitis virus (MHV) and betacoronaviruses in the embecovirus subgenus (A lineage). It belongs to a family which includes Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2. The function of Nsp2 remains unclear. SARS-CoV Nsp2, rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers, and it has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2 which have been implicated in cellular functions, including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2, also known as p65, different from SARS-CoV Nsp2, may play an important role in the viral life cycle.


:

Pssm-ID: 394870  Cd Length: 586  Bit Score: 1085.10  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  249 PLLYVDQYGCDYTGGLADGLEAYADKTLQEMKALFPIWSQELPFDVTVAWHVVRDPRYVMRLQSASTIRSVAYVANPTED 328
Cdd:cd21519     1 PLLFVDQYGCDYTGKLAEGLEAYGDFSLQEMKELFPVWSQSLDFDVVVAWHVVRDPRFVMRLQTLATIRSIEYVAQPTED 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  329 LCDGSVVIKEPVHVYADDSIILRQHNLVDIMSCFYmeaDAVVNAFYGVDLKDCGFVMQFGYIDCEQDLCDFKGWVPGNMI 408
Cdd:cd21519    81 LVDGDVVIREPVHLLAADAIVLKLPKLVDVMQHTD---DSVVESIYKVKLCDCGFVMQFGYVDCCQDDCDFRGWVPGNMI 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  409 DGFACTTCGHVYETGDLLAQSSGVLPVNPVLHTKSAAGYGGfgckDSFTLYGQTVVYFGGCVYWSPARNIWIPILKSSVK 488
Cdd:cd21519   158 DGFACPSCGHVYGPSELLAQSSGVIPENPVLFTNSTDTVNQ----DSFKLYGHSVVPFGGCVYWSPYPGMWIPIIKSSVK 233
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  489 SYDGLVYTGVVGCKAIVKETNLICKALYLDYVQHKCGNLHQRELLGVSDVWHKQLLLNRGVYKPLLENIDYFNMRRAKFS 568
Cdd:cd21519   234 SYDGMVYTGVVGCKTIVKETDAICKALYLDYVQHKCGNLEQREILGLDDVWHKQLLLNRGDYSLLLENIDYFVMRRAKFS 313
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  569 LETFTVCADGFMPFLLDDLVPRAYYLAVSGQAFCDYAGKICHAVVSKSKELLDVSLDSLGAAIHYLNSKIVDLAQHFSDF 648
Cdd:cd21519   314 CETATVCDEGFVPFLLDGLVPRSYYLIKSGQAFTSLMSKFGQEVADMCMEMLVLSMDSVSVATFYIKKNVGKLASQFKAL 393
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  649 GTSFVSKIVHFFKTFTTSTALAFAWVLFHVLHGAYIVVESDIYFVKNIPRYASAVAQAFRSVAKVVLDSLRVTFIDGLSC 728
Cdd:cd21519   394 GAKFVKKLIEWFKAFTDTTALAFAWLLYHVLNGAYIVVESDIYFVKSVPDYARNVVRKFQTFFKMLLDCVKVTFLKGLSV 473
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  729 FKIGRRRICLSGSKIYEVERGLLHSSQLPLDVYDltmpsqvqktkQKPIYLKGSGSDFSLADSVVEVVTTSLTPCGYSEP 808
Cdd:cd21519   474 FKTGRGRVCFAGNKVYKVSRGLLSGFVLPSDVQE-----------SQLTFLEGVAEPVVVEDDVVEVVKTPLTPCGYCKP 542
                         570       580       590       600
                  ....*....|....*....|....*....|....*....|....
gi 190360135  809 PKVADKICIVDNVYMAKAGDKYYPVVVDGH-VGLLDQAWRVPCA 851
Cdd:cd21519   543 PKSAEKICIVDNVYMAKCGDKFYPVVVDDDtIGLLDQAWRFPCA 586
B-CoV_A_NSP1 pfam11963
Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus ...
1-354 0e+00

Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus polyprotein which contains non-structural protein 1 (Nsp1) from Betacoronavirus lineage A. This protein is important for viral replication and pathogenesis. It suppresses the host innate immune functions by inhibiting type I interferon expression and host antiviral signalling pathways.


:

Pssm-ID: 152398  Cd Length: 355  Bit Score: 660.48  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135     1 MSKINKYGLELHWAPEFPWMFEDAEEKLDNPSSSEVDIVCSTTAQKLETGGICPENHVMVDCRRLLKQECCVQSSLIREI 80
Cdd:pfam11963    1 MAKMGKYGLGFKWAPEFPWMLPDASEKLGNPERSEEDGFCPSTAQEPEVKGKTLVNHVRVDCRRLLAQECCVQSALIRDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135    81 VMNTRPYDLEVLLQDALQSREAVLVTPPLGMSLEACYVRGCNPNGWTMGLFRRRSVCNTGRCAVNKHVAYQLYMIDPAGV 160
Cdd:pfam11963   81 FVDEDPQKVEVLTMMALQSGSAVLVKPPLRLSVQAWHSLGVLPKGYAMGLFRRYCLCNTRECKCDAHVAFQLFMVQPDGV 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   161 CFGAGQFVGWVIPLAFMPVQSRKFIVPWVMYLRKCGEKGAYNKDHKRGGFE-HVYNFKVEDAYDLVHDEPKGKFSKKAYA 239
Cdd:pfam11963  161 CFGNGRFIGWFVPVTFMPEYAKKWLQPWSIYLRKGGNKGSVTSDHFRRAFTmPVYDFNVEDAYAEVHDEPKGKYSQKAYA 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   240 LIRGYRGVKPLLYVDQYGCDYTGGLADGLEAYADKTLQEMKALFPIWSQELPFDVTVAWHVVRDPRYVMRLQSASTIRSV 319
Cdd:pfam11963  241 LLRGYRGVKPVLFVDQYGCDYTGCLADGLEAYGDYTLQDMKQLQPVWLANLDFDVVVAWHVVRDPRAVMRLQTIATICGI 320
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 190360135   320 AYVANPTEDLCDGSVVIKEPVHVYADDSIILRQHN 354
Cdd:pfam11963  321 AYVAQPTEDVVDGDVVIKEPVHLLSADAIVLRLPS 355
CoV_Nsp5_Mpro super family cl40471
coronavirus non-structural protein 5, also called Main protease (Mpro); This family contains ...
3250-3543 2.12e-177

coronavirus non-structural protein 5, also called Main protease (Mpro); This family contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro/Nsp5 requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


The actual alignment was detected with superfamily member cd21666:

Pssm-ID: 424102  Cd Length: 297  Bit Score: 547.00  E-value: 2.12e-177
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3250 VKMVNPTSKVEPCIVSVTYGNMTLNGLWLGDKVYCPRHVICSASDMTNPDYTNLLCRVTSSDFTVLFDRLSLTVMSYQMQ 3329
Cdd:cd21666     1 RKMAFPSGKVEGCMVQVTCGTMTLNGLWLDDTVYCPRHVICTAEDMLNPNYEDLLIRKTNHSFLVQAGNVQLRVIGHSMQ 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3330 GCMLVLTVTLQNSRTPKYTFGVVKPGETFTVLAAYNGKPQGAFHVTMRSSYTIKGSFLCGSCGSVVYVIMGDCVKFVYMH 3409
Cdd:cd21666    81 GCLLRLTVDTSNPKTPKYKFVRVKPGQTFSVLACYNGSPSGVYQCAMRPNHTIKGSFLCGSCGSVGYNIDGDCVSFCYMH 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3410 QLELSTGCHTGTDFNGDFYGPYKDAQVVQLPVQDYIQSVNFVAWLYAAILNNCNWFVQSDKCSVEDFNVWALSNGFSQVK 3489
Cdd:cd21666   161 QMELPTGVHTGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVLNGDRWFVNRFTTTLNDFNLWAMKYNYEPLT 240
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 190360135 3490 SD--LVIDALASMTGVSLETLLAAIKHLKNGF-QGRQIMGSCSFEDELTPSDVYQQL 3543
Cdd:cd21666   241 QDhvDILDPLAAQTGIAVEDMLAALKELLQGGmQGRTILGSTILEDEFTPFDVVRQC 297
cv_Nsp4_TM cd21473
coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in ...
2761-3143 8.85e-163

coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in coronavirus-induced membrane remodeling. In order to assemble the replication-transcription complex (RTC), coronavirus induces the rearrangement of host endoplasmic reticulum (ER) membrane into double membrane vesicles (DMVs), zippered ER, or ER spherules. DMV formation has been observed in SARS-CoV cells overexpressing the three transmembrane-containing non-structural proteins of viral replicase polyprotein 1ab: Nsp3, Nsp4 and Nsp6. Together, Nsp3, Nsp4, and Nsp6 have the ability to induce the formation of DMVs that are similar to those seen in SARS-CoV-infected cells.


:

Pssm-ID: 394836  Cd Length: 376  Bit Score: 508.67  E-value: 8.85e-163
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2761 FVLSLVCFIGLWCLMPTYTVHKSDFQLPVYASYKVLDNGVIRDVSVEDVCFANKFEQFDQWYESTFGlSYYSNSMACPIV 2840
Cdd:cd21473     1 FLWLLLAAILLYAFLPSYSVFTVTVSSFPGYDFKVIENGVLRDIRSTDTCFANKFVNFDSWYQAKYG-SVPTNSKSCPIV 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2841 VAVVDQDFGStVFNVPTKVLRYGYHVLHFITHALSADGVQCYTPHSQISYSNFYASGCVLSSACTMFAMaDGSPQPYCYT 2920
Cdd:cd21473    80 VGVIDDVRGS-VPGVPAGVLLVGKTLVHFVQTVFFGDTVVCYTPDGVITYDSFYTSACVFNSACTYLTG-LGGRQLYCYD 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2921 DGLMQNASLYSSLVPHVRYNLANAkGFIRFPEVLREGLVRIVRTRSMPYCRVGLCEEADEGICFNFNGSWVLNNDYYRsl 3000
Cdd:cd21473   158 TGLVEGAKLYSDLLPHVRYKLVDG-NYIKFPEVILEGGPRIVRTLATTYCRVGECEDSKAGVCVSFDGFWVYNNDYYG-- 234
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3001 PGTFCGRDVFDLIYQLFKGLAQPVDFLALTASSIAGAILAVIVVLVFYYLIKLKRAFGDyTSIVFVNVIVWCVNFMMLFV 3080
Cdd:cd21473   235 PGVYCGDGLFDLLTNLLSGFFQPVSVFALSGQLLFNTIVAILAVLACYYVQKFKRAFGD-MSVVVVTVVAAALVNNVLYV 313
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 190360135 3081 FQVYPTLSCVYAICYFYATLYFPSEISVIMHLQWLVMYGTIMPLWFCLLYISVVVSNHAFWVF 3143
Cdd:cd21473   314 VTQNPLLMIVYAVLYFYATLYLTYERAWIMHLGWVVAYGPIAPWWLLALYVVAVLYDYLPWFF 376
betaCoV_PLPro cd21732
betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1563-1861 3.03e-156

betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of betacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. In SARS-CoV and murine hepatitis virus (MHV), the C-terminal non-structural protein 3 region spanning transmembrane regions TM1 and TM2 with 3Ecto domain in between, are important for the PL2pro domain to process Nsp3-Nsp4 cleavage. Besides cleaving the polyproteins, PLPro also possesses a related enzymatic activity to promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both, ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15), are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain of many of these CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation. Interactions of SARS-CoV and MERS-CoV with antiviral interferon (IFN) responses of human cells are remarkably different; high-dose IFN treatment (type I and type III) shows MERS-CoV was substantially more IFN sensitive than SARS-CoV. This may be due to differences in the architecture of the oxyanion hole and of the S3 as well as the S5 specificity sites, despite the overall structures of SARS-CoV and MERS-CoV PLPro being similar.


:

Pssm-ID: 409649  Cd Length: 304  Bit Score: 486.32  E-value: 3.03e-156
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1563 DKVDILLTVDGVNFTNRFVPVGESFGKSLGNVFCDGVNVTKHKCDINYKGKVFFQFDNLSSEDLKAVRSSFNFDQK-ELL 1641
Cdd:cd21732     1 KTIEVLTTVDGVNFRTVLVNNGETFGKQLGNVFCDGVDVTKTKPSAKYEGKVLFQADNLSAEELEAVEYYYGFDDPtFLL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1642 AYYNMLVNCSKWQVVFNGKYFTFKQANNNCFVNVSCLMLQSLNLKFKIVQWQEAWLEFRSGRPARFVSLVLAKGGFKFGD 1721
Cdd:cd21732    81 RYYSALAHVKKWKFVVVDGYFSLKQADNNCYLNAACLMLQQLDLKFNTPALQEAYYEFRAGDPLRFVALVLAYGNFTFGE 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1722 PADSRDFLRVVFSQVDLTGAICDFEIACK-CGVKQEQRTGVDAVMHFGTLSREDLEIGYTVDCSCGKKLIHCVRFDV-PF 1799
Cdd:cd21732   161 PDDARDFLRVVLSHADLVSARRVLEEVCKvCGVKQEQRTGVDAVMYFGTLSLDDLYKGYTIDCSCGRKAIRYLVEQVpPF 240
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 190360135 1800 LICSNTPASVKLPKG-VGSANIFKGDK-VGHYVHVKCEQSYQLYDASNVKKVTDVTGNLSDCLY 1861
Cdd:cd21732   241 LLMSNTPTEVPLPTGdFVAANVFTGDEsVGHYTHVKNKSLLYLYDAGNVKKTSDLKGPVTDVLY 304
betaCoV-Nsp6 cd21560
betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell ...
3550-3836 1.44e-141

betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, have the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


:

Pssm-ID: 394846  Cd Length: 290  Bit Score: 443.61  E-value: 1.44e-141
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3550 SKRTRLVKGIVCWIMASTFLFSCIITAFVKWTMFMYVTTNMLSITFCALCVIS-LTMLLVKHKHLYLTMYIIPVLFTLLY 3628
Cdd:cd21560     1 SKVKRVVKGTLHWLLATFVLFYLIILQLTKWTMFMYLTETMLLPLTPALCCVSaCVMLLVKHKHTFLTLFLLPVLLTLAY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3629 NNYLVVYKQTFRGYVYAWLSYYVPSVEYTYTDEVIYGMLLLIGMVfVTLRSINQYLFSFIMFVGRVISVVSLWYMGSnLE 3708
Cdd:cd21560    81 YNYVYVPKSSFLGYVYNWLNYVNPYVDYTYTDEVTYGSLLLVLML-VTMRLVNHDAFSRVWAVCRVITWVYMWYTGS-LE 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3709 EEILLMLASLFGTYTWTTA---LSMAAAK-VIAKWVAVNVLYFTDIPQIKIVLVCYLFIGYIISCYWGLFSLMNSLFRMP 3784
Cdd:cd21560   159 ESALSYLTFLFSVTTNYTGvvtVSLALAKfITALWLAYNPLLFLDIPEVKCVLLVYLFIGYICTCYFGVFSLLNRLFRCP 238
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 190360135 3785 LGVYNYKISVQELRYMNANGLRPPKNSFEALMLNFKLLGIGGVPIIEVSQFQ 3836
Cdd:cd21560   239 LGVYDYKVSTQEFRYMNANGLRPPRNSWEALMLNFKLLGIGGVPCIKVSTVQ 290
betaCoV_Nsp8 cd21831
betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
3929-4122 2.41e-112

betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) the highly pathogenic betacoronaviruses that include Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. SARS-CoV Nsp8 forms a 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


:

Pssm-ID: 409258  Cd Length: 196  Bit Score: 356.02  E-value: 2.41e-112
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3929 SEFVNMASFVEYEVAKKNLDEACSSGSANQQQLKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSK 4008
Cdd:cd21831     1 SEFSNLASYAEYETAQKAYDEAVASGDASPQVLKALKKAVNVAKSAYEKDKAVARKLERMADQAMTSMYKQARAEDKKSK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4009 VVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLAANTLTIIVPDKSVYDQVVDNVYVTYAGNVWQIQTIQDS 4088
Cdd:cd21831    81 VVSAMQTMLFGMIRKLDNDALNNIINNARNGCVPLSIIPLTAANKLRVVVPDYSVYKQVVDGPTLTYAGALWDIQQINDA 160
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 190360135 4089 DGTNKQLHEISDDC---NWPLVIIANRHNEvSATALQ 4122
Cdd:cd21831   161 DGKIVQLSDITEDSenlAWPLVVTATRANS-SAVKLQ 196
alpha_betaCoV_Nsp10 cd21901
alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the ...
4233-4362 7.80e-86

alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of alpha- and betacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), Middle East respiratory syndrome-related (MERS) CoV, and alphacoronaviruses such as Human coronavirus 229E. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, and not its N7-Mtase activity. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts sequentially to Nsp14 MTase in RNA capping methylation, and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure. The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity. Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16 and their complex can interact with DII4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.


:

Pssm-ID: 409326  Cd Length: 130  Bit Score: 276.86  E-value: 7.80e-86
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4233 AGTATEYASNSSILSLCAFSVDPKKTYLDFIQQGGTPIANCVKMLCDHAGTGMAITVKPDATTSQDSYGGASVCIYCRAR 4312
Cdd:cd21901     1 AGKQTEVASNSSLLTLCAFAVDPAKTYLDAVKSGGKPVGNCVKMLTNGTGTGQAITVKPEANTNQDSYGGASVCLYCRAH 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 190360135 4313 VEHPDVDGLCKLRGKFVQVPVGIKDPVSYVLTHDVCQVCGFWRDGSCSCV 4362
Cdd:cd21901    81 VEHPDMDGVCKLKGKYVQVPLGTNDPVRFCLENDVCKVCGCWLGNGCSCD 130
MHV-like_Nsp3_NAB cd21824
nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and ...
1898-2011 1.00e-76

nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the nucleic acid binding (NAB) domain of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The NAB domain represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. NAB is a cytoplasmic domain located between the papain-like protease (PLPro) and betacoronavirus-specific marker (betaSM) domains of CoV Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. The NAB domain both binds ssRNA and unwinds dsDNA. It prefers to bind ssRNA containing repeats of three consecutive guanines. A group of residues that form a positively charged patch on the protein surface of SARS-CoV Nsp3 NAB serves as the binding site of nucleic acids. This site is conserved in the NAB of Nsp3 from betacoronavirus in the sarbecovirus subgenus (B lineage), but is not conserved in the Nsp3 NAB from betacoronaviruses in the A lineage.


:

Pssm-ID: 409350  Cd Length: 119  Bit Score: 250.45  E-value: 1.00e-76
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1898 GKYYTQRIIKAQFKTFEKVDGVYTNFKLIGHTVCDILNAKLGFDSSKEFVEYKVTEWPTATGDVVLATDDLYVKRYERGC 1977
Cdd:cd21824     1 GKYYTKPIIKAQFKTFEKVDGVYTNFKLVGHTICDKLNAKLGFDSSKPFVEYKVTEWPTATGDVVLASDDLYVKRYEKGC 80
                          90       100       110
                  ....*....|....*....|....*....|....
gi 190360135 1978 ITFGKPVIWLSHEQASLNSLTYFNRPLLVDENKF 2011
Cdd:cd21824    81 ITFGKPVIWLGHEEASLNSLTYFNRPSLVDENKF 114
MHV-like_Nsp3_betaSM cd21812
betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and ...
2026-2146 1.93e-75

betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the betacoronavirus-specific marker (betaSM), also called group 2-specific marker (G2M), of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The betaSM/G2M is located C-terminal to the nucleic acid-binding (NAB) domain. This region is absent in alpha- and deltacoronavirus Nsp3; there is a gammacoronavirus-specific marker (gammaSM) at this position in gammacoronavirus Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. Little is known about the betaSM/G2M domain; it is predicted to be non-enzymatic and may be an intrinsically disordered region. The betaSM/G2M domain is part of the predicted PLnc domain (made up of 385 amino acids) of the related SARS-CoV Nsp3 that may function as a replication/transcription scaffold, with interactions to Nsp5, Nsp12, Nsp13, Nsp14, and Nsp16.


:

Pssm-ID: 409627  Cd Length: 125  Bit Score: 247.21  E-value: 1.93e-75
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2026 SESDAKEPKEINIIKLSGVKKPFKVEDSVIVNDDTSEIKYVKSLSIVDVYDMWLTGCRCVVRTANALSRAVNVPTIRKFI 2105
Cdd:cd21812     5 SQSDSKQAKPVKIVKLNGVKKPFKVEDSVVVNDDTSETKVVKSLSIVDVYDMWLTGCRYVVWTANALSRLVNVPTVREYV 84
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|.
gi 190360135 2106 KFGMTLVSIPIDLLNLREIKPVFNVVKAVRNKISACFNFIK 2146
Cdd:cd21812    85 KFGMTVISIPIDLLNLRDDKQEFVVPKVVKAKVSACYNFIK 125
Peptidase_C16 super family cl03374
Peptidase C16 family;
1049-1285 1.68e-74

Peptidase C16 family;


The actual alignment was detected with superfamily member pfam01831:

Pssm-ID: 460353  Cd Length: 249  Bit Score: 249.61  E-value: 1.68e-74
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  1049 FEFYTTEPEFV--KVLDLYVPKATRNNCWLRSVLAVMQKLPCQFKDKNLQDLWVLYKQQYSQLFVDTLVNKIPANIVVPQ 1126
Cdd:pfam01831   11 FAFAAEFPDELhfASCGFGNPAIEEEDCFCPSAAIEMKSKGKEFKDHEMQKCSLLPAAECCQCFADILDIFVDEDIIKPE 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  1127 GGYVADFAYWFLTLCDWQCVAYWKCIKCDLALKLKGLDAMFFYGDVVSHVCKCGESMVLIDVDVPFTAHFALKDKLFCAF 1206
Cdd:pfam01831   91 AGTMAAFAFFFASLCKFKARANIQALECDGELKKQAADALFFRGCLCNHMCCCCDAHTAFHADIPQPDGFCLGDDKFCAF 170
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 190360135  1207 ITKRSVYKAACVVDVNDSHSMAVVDGKQIDDHRITSITSDKFDFIIGHGMSFSMTTFEIAQLYGSCITPNVCFVKGDII 1285
Cdd:pfam01831  171 FTPRKAFPAAAAQDLNDCHILARKEGKKGDGKSGHFFIADKFDFMDFNGEDACEEPFELAKGKGSCIAPALCFGKGDVI 249
betaCoV_Nsp9 cd21898
betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4123-4232 3.91e-70

betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from betacoronaviruses including highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoVs replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


:

Pssm-ID: 409331  Cd Length: 111  Bit Score: 231.52  E-value: 3.91e-70
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4123 NNELMPAKLKTQVVNSGPDQT-CNTPTQCYYNNSYNGKIVYAILSDVDGLKYTKILKDDGNFVVLELDPPCKFTVQDVKG 4201
Cdd:cd21898     1 NNELMPQGLKTMVVTAGPDQTaCNTPALAYYNNVQGGRMVMAILSDVDGLKYAKVEKSDGGFVVLELDPPCKFLVQTPKG 80
                          90       100       110
                  ....*....|....*....|....*....|.
gi 190360135 4202 LKIKYLYFVKGCNTLARGWVVGTISSTVRLQ 4232
Cdd:cd21898    81 PKVKYLYFVKGLNNLHRGQVLGTIAATVRLQ 111
CoV_NSP4_C pfam16348
Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus ...
3157-3244 4.30e-46

Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus nonstructural protein 4 (NSP4). NSP4 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. It is a membrane-spanning protein which is thought to anchor the viral replication-transcription complex (RTC) to modified endoplasmic reticulum membranes. This predominantly alpha-helical domain may be involved in protein-protein interactions. It has been shown that in Betacoronavirus, the coexpression of NSP3 and NSP4 results in a membrane rearrangement to induce double-membrane vesicles (DMVs) and convoluted membranes (CMs), playing a critical role in SARS-CoV replication. There are two well conserved amino acid residues (H120 and F121) in NSP4 among Betacoronavirus, essential for membrane rearrangements during interaction with NSP3.


:

Pssm-ID: 465099  Cd Length: 92  Bit Score: 161.93  E-value: 4.30e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3157 GTFEEMALTTFMITKDSYCKLKNSLSDVAFNRYLSLYNKYRYYSGKMDTAAYREAACSQLAKAMDTFtNNNGSDVLYQPP 3236
Cdd:pfam16348    6 GTFEEAALGTFVIDKESYEKLKNSISLDKFNRYLSLYNKYKYYSGKMDEADYREACCAHLAKALEDF-SNSGNDVLYTPP 84

                   ....*...
gi 190360135  3237 TASVSTSF 3244
Cdd:pfam16348   85 TVSVTSSL 92
DPUP_MHV_Nsp3 cd21524
DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis ...
1489-1562 1.37e-44

DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis virus and related betacoronaviruses in the A lineage; This subfamily contains the DPUP (domain preceding Ubl2 and PLP2) of murine hepatitis virus (MHV) non-structural protein 3 (Nsp3) and other Nsp3s from betacoronaviruses in the embecovirus subgenera (A lineage), including human CoV OC43, rabbit CoV HKU14 and porcine hemagglutinating encephalomyelitis virus (HEV), among others. Non-structural protein 3 (Nsp3) is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. MHV Nsp3 contains a DPUP that is located N-terminal to the ubiquitin-like domain 2 (Ubl2) and papain-like protease 2 (PLP2) catalytic domain. It is structurally similar to the Severe Acute Respiratory Syndrome (SARS) CoV unique domain C (SUD-C), adopting a frataxin-like fold that has structural similarity to DNA-binding domains of DNA-modifying enzymes. SUD-C is also located N-terminal to Ubl2 and PLP2 in SARS Nsp3, similar to the DPUP of MHV Nsp3; however, unlike DPUP, it is preceded by SUD-N and SUD-M macrodomains that are absent in MHV Nsp3. Though structurally similar, there is little sequence similarity between DPUP and SUD-C. SARS SUD-C has been shown to bind to single-stranded RNA and recognize purine bases more strongly than pyrimidine bases; it also regulates the RNA binding behavior of the SARS SUD-M macrodomain. It is not known whether DPUP functions in the same way.


:

Pssm-ID: 394840  Cd Length: 75  Bit Score: 156.81  E-value: 1.37e-44
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 190360135 1489 LDDDARTFVQSNVDVVPEGWRVVNKFYQINGVRTVKYFECPGGIDICSQDKVFGYVQQGSFNKATVAQIKALFL 1562
Cdd:cd21524     2 LDDDARVFVQANMDNLPEDWRLVNKFDVINGVRTIKYFECPGGIFICSQGKDFGYVQNGSFKKATVSQIRALLA 75
Macro_X_Nsp3-like cd21557
X-domain (or Mac1 domain) of viral non-structural protein 3 and related macrodomains; The ...
1294-1418 1.95e-41

X-domain (or Mac1 domain) of viral non-structural protein 3 and related macrodomains; The X-domain, also called Mac1, is the macrodomain found in riboviral non-structural protein 3 (Nsp3), including the Nsp3 of Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV) as well as SARS-CoV-2, and other coronaviruses (alpha-, beta-, gamma-, and deltacoronavirus), among others. The SARS-CoV-2 Nsp3 Mac1 is highly conserved among all CoVs, and binds to and hydrolyzes mono-ADP-ribose (MAR) from target proteins. It appears to counter host-mediated antiviral ADP-ribosylation, a post-translational modification that is part of the host response to viral infections. Mac1 is essential for pathogenesis in multiple animal models of CoV infection, implicating it as a virulence factor and potential therapeutic target. Assays show that the de-MARylating activity leads to a rapid loss of substrate, and that Mac1 could not hydrolyze poly-ADP-ribose; thus, Mac1 is a MAR-hydrolase (mono-ADP ribosylhydrolase). Mac1 was originally named ADP-ribose-1"-phosphatase (ADRP) based on data demonstrating that it could remove the phosphate group from ADP-ribose-1"-phosphate; however, activity was modest and was unclear why this would impact a virus infection. This family also includes the X-domain of Avian infectious bronchitis virus (IBV) strain Beaudette coronavirus that does not bind ADP-ribose; the triple glycine sequence found in the X-domains of SARS-CoV and human coronavirus 229E (HCoV229E), which are involved in ADP-ribose binding, is not conserved in the IBV X-domain. SARS-CoVs have two other macrodomains referred to as the SUD-N (N-terminal subdomain, or Mac2) and SUD-M (middle SUD subdomain, or Mac3) of the SARS-unique domain (SUD), which also do not bind ADP-ribose; these bind G-quadruplexes (unusual nucleic-acid structures formed by consecutive guanosine nucleotides). SARS-CoV SUD-N and SUD-M are not included in this group.


:

Pssm-ID: 438957  Cd Length: 127  Bit Score: 150.01  E-value: 1.95e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1294 EVVVNPANGHMAHGGGVAKAIAVAAGQQFVKEtTDMVKSKGVCATGDCYVSTGGKLCKTVLNVVGPDARTqgKQSYALLE 1373
Cdd:cd21557     2 DVVVNAANENLKHGGGVAGAIYKATGGAFQKE-SDYIKKNGPLKVGTAVLLPGHGLAKNIIHVVGPRKRK--GQDDQLLA 78
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 190360135 1374 RVYKHLNK-YDCVVTTLISAGIFSVPSDVSLTYLLGTAKK---QVVLVS 1418
Cdd:cd21557    79 AAYKAVNKeYGSVLTPLLSAGIFGVPPEQSLNALLDAVDTtdaDVTVYC 127
betaCoV_Nsp7 cd21827
betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 ...
3837-3925 2.70e-37

betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of betacoronaviruses including the highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially where it concerns alpha-helix 4. SARS-CoV Nsp7 forms a 8:8 hexadecameric supercomplex with Nsp8 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.


:

Pssm-ID: 409253  Cd Length: 83  Bit Score: 136.42  E-value: 2.70e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3837 SKLTDVKCANVVLLNCLQHLHVASNSKLWQYCSTLHNEILATSDLGVAFEKLAQLLIVLFANPAAVDskCLTSIEEVCDD 3916
Cdd:cd21827     1 SKLTDVKCTSVVLLSVLQQLHVESNSKLWAYCVKLHNDILAAKDPTEAFEKFVSLLSVLLSFPGAVD--LDALCSELLDN 78

                  ....*....
gi 190360135 3917 YakdnTVLQ 3925
Cdd:cd21827    79 P----TVLQ 83
Ubl1_cv_Nsp3_N-like cd21467
first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV ...
872-961 3.48e-34

first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV non-structural protein 3 (Nsp3) and related proteins; This ubiquitin-like (Ubl) domain (Ubl1) is found at the N-terminus of coronavirus Nsp3, a large multi-functional multi-domain protein which is an essential component of the replication/transcription complex (RTC). The functions of Ubl1 in CoVs are related to single-stranded RNA (ssRNA) binding and to interacting with the nucleocapsid (N) protein. SARS-CoV Ubl1 has been shown to bind ssRNA having AUA patterns, and since the 5'-UTR of the SARS-CoV genome has a number of AUA repeats, it may bind there. In mouse hepatitis virus (MHV), this Ubl1 domain binds the cognate N protein. Adjacent to Ubl1 is a Glu-rich acidic region (also referred to as hypervariable region, HVR); Ubl1 together with HVR has been called Nsp3a. Currently, the function of HVR in CoVs is unknown. This model corresponds to one of two Ubl domains in Nsp3; the other is located N-terminal to the papain-like protease (PLpro) and is not represented by this model.


:

Pssm-ID: 394822  Cd Length: 89  Bit Score: 127.69  E-value: 3.48e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  872 TIKVFYELDKDFNTILNTACGVFEVDDTVDMEEFYAVVIDAIEEKLSPCKELEgVGAKVSAFLQKLEDNSLFLFDEAGEE 951
Cdd:cd21467     1 TVKVTYELDEVLDTILNKACSPFEVEKDLTVEEFADVVQDAVEEKLSPLLELP-LGDKVDADLDDFIDNPCYLFDEDGDE 79
                          90
                  ....*....|
gi 190360135  952 VLAPKLYCAF 961
Cdd:cd21467    80 VLASEMYCSF 89
 
Name Accession Description Interval E-value
TM_Y_MHV-like_Nsp3_C cd21714
C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine ...
2195-2749 0e+00

C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In MHV and the related Severe acute respiratory syndrome-related coronavirus (SARS-CoV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409662  Cd Length: 555  Bit Score: 1114.06  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2195 VARGACIIATIFLLWFNFIYANVIFSDFYLPKIGFLPTFVGKIAQWIKNTFSLVTICDLYSIQDVGFKNQYCNGSIACQF 2274
Cdd:cd21714     1 VARGFFIIATIFLLWFNFLYANVIFSDFYLPNIGFLPTFVGKIVQWFKNTFGLVTICDLYSVSDVGFKSQFCNGSMACQL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2275 CLAGFDMLDNYKAIDVVQYEADRRAFVDYTGVLKIVIELIVSYALYTAWFYPLFALISIQILTTWLPELFMLSTLHWSVR 2354
Cdd:cd21714    81 CLSGFDMLDNYKAIDVVQYEVDRRVFFDYTSVLKLVVELVVSYALYTVWFYPLFCLIGLQLLTTWLPEFFMLETLHWSVR 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2355 LLVSLANMLPAHVFMRFYIIIASFIKLFSLFRHVAYGCSKSGCLFCYKRNRSLRVKCSTIVGGMIRYYDAMANGGTGFCS 2434
Cdd:cd21714   161 LFVFLANMLPAHVFLRFYIVVTAMYKIFCLFRHVVYGCSKPGCLFCYKRNRSVRVKCSTIVGGMLRYYDVMANGGTGFCS 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2435 KHQWNCIDCDSYKPGNTFITVEAALDLSKELKRPIQPTDVAYHTVTDVKQVGCYMRLFYDRDGQRTYDDVNASLFVDYSN 2514
Cdd:cd21714   241 KHQWNCINCDSYKPGNTFITVEAAAELSKELKRPVNPTDVAYYTVTDVKQVGCSMRLFYERDGQRVYDDVNASLFVDMNG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2515 LLHSKVKSVPNMHVVVVENDADKANFLNAAVFYAQSLFRPILMVDKNLITTANTGTSVTETMFDVYVDTFLSMFDVDKKS 2594
Cdd:cd21714   321 LLHSKVKGVPNTHVVVVENDADKANFLNAAVFYAQSLFRPMLMVDKKLITTANTGTSVSQTMFDVYVDTFLSMFDVDRKS 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2595 LNALIATAHSSIKQGTQICKVLDTFLSCARKSCSIDSDVDTKCLADSVMSAVSAGLELTDESCNNLVPTYLKGDNIVAAD 2674
Cdd:cd21714   401 LNSFINTAHSSLKEGVQLEKVLDTFIGCARKSCSIDSDVDTKCIAKSVMSAVAAGLEFTDESCNNLVPTYIKSDNIVAAD 480
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 190360135 2675 LGVLIQNSAKHVQGNVAKIAGVSCIWSVDAFNQLSSDFQHKLKKACCKTGLKLKLTYNKQMANVSVLTTPFSLKG 2749
Cdd:cd21714   481 LGVLIQNSAKHVQGNVAKAANVACIWSVDAFNQLSSDFQHKLKKACVKTGLKLKLTYNKQEANVSILTTPFSLKG 555
betaCoV_Nsp2_MHV-like cd21519
betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins ...
249-851 0e+00

betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins from betacoronaviruses in the A lineage; Coronavirus non-structural proteins (Nsps) are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases Nsp3 and Nsp5 into smaller subunits. Nsp2 is one of these subunits. This subgroup includes Nsp2 from Murine hepatitis virus (MHV) and betacoronaviruses in the embecovirus subgenus (A lineage). It belongs to a family which includes Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2. The function of Nsp2 remains unclear. SARS-CoV Nsp2, rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers, and it has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2 which have been implicated in cellular functions, including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2, also known as p65, different from SARS-CoV Nsp2, may play an important role in the viral life cycle.


Pssm-ID: 394870  Cd Length: 586  Bit Score: 1085.10  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  249 PLLYVDQYGCDYTGGLADGLEAYADKTLQEMKALFPIWSQELPFDVTVAWHVVRDPRYVMRLQSASTIRSVAYVANPTED 328
Cdd:cd21519     1 PLLFVDQYGCDYTGKLAEGLEAYGDFSLQEMKELFPVWSQSLDFDVVVAWHVVRDPRFVMRLQTLATIRSIEYVAQPTED 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  329 LCDGSVVIKEPVHVYADDSIILRQHNLVDIMSCFYmeaDAVVNAFYGVDLKDCGFVMQFGYIDCEQDLCDFKGWVPGNMI 408
Cdd:cd21519    81 LVDGDVVIREPVHLLAADAIVLKLPKLVDVMQHTD---DSVVESIYKVKLCDCGFVMQFGYVDCCQDDCDFRGWVPGNMI 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  409 DGFACTTCGHVYETGDLLAQSSGVLPVNPVLHTKSAAGYGGfgckDSFTLYGQTVVYFGGCVYWSPARNIWIPILKSSVK 488
Cdd:cd21519   158 DGFACPSCGHVYGPSELLAQSSGVIPENPVLFTNSTDTVNQ----DSFKLYGHSVVPFGGCVYWSPYPGMWIPIIKSSVK 233
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  489 SYDGLVYTGVVGCKAIVKETNLICKALYLDYVQHKCGNLHQRELLGVSDVWHKQLLLNRGVYKPLLENIDYFNMRRAKFS 568
Cdd:cd21519   234 SYDGMVYTGVVGCKTIVKETDAICKALYLDYVQHKCGNLEQREILGLDDVWHKQLLLNRGDYSLLLENIDYFVMRRAKFS 313
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  569 LETFTVCADGFMPFLLDDLVPRAYYLAVSGQAFCDYAGKICHAVVSKSKELLDVSLDSLGAAIHYLNSKIVDLAQHFSDF 648
Cdd:cd21519   314 CETATVCDEGFVPFLLDGLVPRSYYLIKSGQAFTSLMSKFGQEVADMCMEMLVLSMDSVSVATFYIKKNVGKLASQFKAL 393
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  649 GTSFVSKIVHFFKTFTTSTALAFAWVLFHVLHGAYIVVESDIYFVKNIPRYASAVAQAFRSVAKVVLDSLRVTFIDGLSC 728
Cdd:cd21519   394 GAKFVKKLIEWFKAFTDTTALAFAWLLYHVLNGAYIVVESDIYFVKSVPDYARNVVRKFQTFFKMLLDCVKVTFLKGLSV 473
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  729 FKIGRRRICLSGSKIYEVERGLLHSSQLPLDVYDltmpsqvqktkQKPIYLKGSGSDFSLADSVVEVVTTSLTPCGYSEP 808
Cdd:cd21519   474 FKTGRGRVCFAGNKVYKVSRGLLSGFVLPSDVQE-----------SQLTFLEGVAEPVVVEDDVVEVVKTPLTPCGYCKP 542
                         570       580       590       600
                  ....*....|....*....|....*....|....*....|....
gi 190360135  809 PKVADKICIVDNVYMAKAGDKYYPVVVDGH-VGLLDQAWRVPCA 851
Cdd:cd21519   543 PKSAEKICIVDNVYMAKCGDKFYPVVVDDDtIGLLDQAWRFPCA 586
B-CoV_A_NSP1 pfam11963
Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus ...
1-354 0e+00

Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus polyprotein which contains non-structural protein 1 (Nsp1) from Betacoronavirus lineage A. This protein is important for viral replication and pathogenesis. It suppresses the host innate immune functions by inhibiting type I interferon expression and host antiviral signalling pathways.


Pssm-ID: 152398  Cd Length: 355  Bit Score: 660.48  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135     1 MSKINKYGLELHWAPEFPWMFEDAEEKLDNPSSSEVDIVCSTTAQKLETGGICPENHVMVDCRRLLKQECCVQSSLIREI 80
Cdd:pfam11963    1 MAKMGKYGLGFKWAPEFPWMLPDASEKLGNPERSEEDGFCPSTAQEPEVKGKTLVNHVRVDCRRLLAQECCVQSALIRDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135    81 VMNTRPYDLEVLLQDALQSREAVLVTPPLGMSLEACYVRGCNPNGWTMGLFRRRSVCNTGRCAVNKHVAYQLYMIDPAGV 160
Cdd:pfam11963   81 FVDEDPQKVEVLTMMALQSGSAVLVKPPLRLSVQAWHSLGVLPKGYAMGLFRRYCLCNTRECKCDAHVAFQLFMVQPDGV 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   161 CFGAGQFVGWVIPLAFMPVQSRKFIVPWVMYLRKCGEKGAYNKDHKRGGFE-HVYNFKVEDAYDLVHDEPKGKFSKKAYA 239
Cdd:pfam11963  161 CFGNGRFIGWFVPVTFMPEYAKKWLQPWSIYLRKGGNKGSVTSDHFRRAFTmPVYDFNVEDAYAEVHDEPKGKYSQKAYA 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   240 LIRGYRGVKPLLYVDQYGCDYTGGLADGLEAYADKTLQEMKALFPIWSQELPFDVTVAWHVVRDPRYVMRLQSASTIRSV 319
Cdd:pfam11963  241 LLRGYRGVKPVLFVDQYGCDYTGCLADGLEAYGDYTLQDMKQLQPVWLANLDFDVVVAWHVVRDPRAVMRLQTIATICGI 320
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 190360135   320 AYVANPTEDLCDGSVVIKEPVHVYADDSIILRQHN 354
Cdd:pfam11963  321 AYVAQPTEDVVDGDVVIKEPVHLLSADAIVLRLPS 355
betaCoV_Nsp5_Mpro cd21666
betacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily ...
3250-3543 2.12e-177

betacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro), found in betacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


Pssm-ID: 394887  Cd Length: 297  Bit Score: 547.00  E-value: 2.12e-177
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3250 VKMVNPTSKVEPCIVSVTYGNMTLNGLWLGDKVYCPRHVICSASDMTNPDYTNLLCRVTSSDFTVLFDRLSLTVMSYQMQ 3329
Cdd:cd21666     1 RKMAFPSGKVEGCMVQVTCGTMTLNGLWLDDTVYCPRHVICTAEDMLNPNYEDLLIRKTNHSFLVQAGNVQLRVIGHSMQ 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3330 GCMLVLTVTLQNSRTPKYTFGVVKPGETFTVLAAYNGKPQGAFHVTMRSSYTIKGSFLCGSCGSVVYVIMGDCVKFVYMH 3409
Cdd:cd21666    81 GCLLRLTVDTSNPKTPKYKFVRVKPGQTFSVLACYNGSPSGVYQCAMRPNHTIKGSFLCGSCGSVGYNIDGDCVSFCYMH 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3410 QLELSTGCHTGTDFNGDFYGPYKDAQVVQLPVQDYIQSVNFVAWLYAAILNNCNWFVQSDKCSVEDFNVWALSNGFSQVK 3489
Cdd:cd21666   161 QMELPTGVHTGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVLNGDRWFVNRFTTTLNDFNLWAMKYNYEPLT 240
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 190360135 3490 SD--LVIDALASMTGVSLETLLAAIKHLKNGF-QGRQIMGSCSFEDELTPSDVYQQL 3543
Cdd:cd21666   241 QDhvDILDPLAAQTGIAVEDMLAALKELLQGGmQGRTILGSTILEDEFTPFDVVRQC 297
CoV_NSP3_C pfam19218
Coronavirus replicase NSP3, C-terminal; This family represents the C-terminal region of ...
2248-2736 4.34e-173

Coronavirus replicase NSP3, C-terminal; This family represents the C-terminal region of non-structural protein NSP3 (also known as nsp3). NSP3 is the product of ORF1a. It is found in human SARS coronavirus polyprotein 1a and 1ab, and in related coronavirus polyproteins. It is a multifunctional protein comprising up to 16 different domains and regions. NSP3 binds to viral RNA, nucleocapsid protein, as well as other viral proteins and participates in polyprotein processing.


Pssm-ID: 466002  Cd Length: 463  Bit Score: 541.93  E-value: 4.34e-173
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2248 VTICD----LYSIQDVgFKNQYCNGSIACQFCLAGFDMLDNYKAIDVVQYEADRRAFVDYTGVLKIVIELIVSYALYTAW 2323
Cdd:pfam19218    1 GYPCDgyvdGYSNSSF-NKSDYCNGSILCKACLSGYDSLHDYPHLKVVQQPVKDPLFVDVTPLFYFAIELFVALALFGGT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2324 FYPLFALISIQILTTWLPELFMLSTLHWsvrllvsLANMLPAHVFMRFYIIIASFIKLFSLFRHVAYGCSKSGCLFCYKR 2403
Cdd:pfam19218   80 FVRVFLLYFLQQYVNFFGVYLGLQDYSW-------FLTLIPFDSFLREYVVLFYVIKLYRFLKHVVFGCKKPSCLACSKS 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2404 NRSLRVKCSTIVGGMIRYYDAMANGGTGFCSKHQWNCIDCDSYKPGNTFITVEAALDLSKELKRPIQPTDVAYHTVTDVK 2483
Cdd:pfam19218  153 ARLTRVPVSTVVNGSKKSFYVNANGGTKFCKKHNFFCKNCDSYGPGNTFINDEVAEDLSNVTKRSVKPTDPAYYEVDKVE 232
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2484 QVGCYMRLFYDRDGQRTYDDVNASLFVDYSNLLHSKVKSVPNMHVVVVEND-ADKANFLNAAVFYAQSLFRPILMVDKNL 2562
Cdd:pfam19218  233 FQNGFYYLYSGREFWRYYFDVTVSKYSDKEVLKNCNIKGYPLDDFIVYNSNgSNLAQAKNACVYYSQLLCKPIKLVDSNL 312
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2563 ITTANTGTSVTETMFDVYVDTFLSMFDVDKKSLNALIATAHssikqgtqickvldtflscarkscSIDSDVDTKCLADSV 2642
Cdd:pfam19218  313 LSSLGDSVDVNGALHDAFVEVLLNSFNVDLSKCKTLIECKK------------------------DLGSDVDTDSFVNAV 368
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2643 MSAVSAGLELTDESCNNLVPTYLK-GDNIVAADLGVLIQNSAKHVQGNVAKIAGVSCIWSVDAFNQLSSDFQHKLKKACC 2721
Cdd:pfam19218  369 LNAHRYDVLLTDDSFNNFVPTYAKpEDSLSTHDLAVCIRFGAKIVNHNVLKKENVPVVWSADDFLKLSEEARKYIVKTAK 448
                          490
                   ....*....|....*
gi 190360135  2722 KTGLKLKLTYNKQMA 2736
Cdd:pfam19218  449 KKGVTFMLTFNTNRM 463
cv_Nsp4_TM cd21473
coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in ...
2761-3143 8.85e-163

coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in coronavirus-induced membrane remodeling. In order to assemble the replication-transcription complex (RTC), coronavirus induces the rearrangement of host endoplasmic reticulum (ER) membrane into double membrane vesicles (DMVs), zippered ER, or ER spherules. DMV formation has been observed in SARS-CoV cells overexpressing the three transmembrane-containing non-structural proteins of viral replicase polyprotein 1ab: Nsp3, Nsp4 and Nsp6. Together, Nsp3, Nsp4, and Nsp6 have the ability to induce the formation of DMVs that are similar to those seen in SARS-CoV-infected cells.


Pssm-ID: 394836  Cd Length: 376  Bit Score: 508.67  E-value: 8.85e-163
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2761 FVLSLVCFIGLWCLMPTYTVHKSDFQLPVYASYKVLDNGVIRDVSVEDVCFANKFEQFDQWYESTFGlSYYSNSMACPIV 2840
Cdd:cd21473     1 FLWLLLAAILLYAFLPSYSVFTVTVSSFPGYDFKVIENGVLRDIRSTDTCFANKFVNFDSWYQAKYG-SVPTNSKSCPIV 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2841 VAVVDQDFGStVFNVPTKVLRYGYHVLHFITHALSADGVQCYTPHSQISYSNFYASGCVLSSACTMFAMaDGSPQPYCYT 2920
Cdd:cd21473    80 VGVIDDVRGS-VPGVPAGVLLVGKTLVHFVQTVFFGDTVVCYTPDGVITYDSFYTSACVFNSACTYLTG-LGGRQLYCYD 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2921 DGLMQNASLYSSLVPHVRYNLANAkGFIRFPEVLREGLVRIVRTRSMPYCRVGLCEEADEGICFNFNGSWVLNNDYYRsl 3000
Cdd:cd21473   158 TGLVEGAKLYSDLLPHVRYKLVDG-NYIKFPEVILEGGPRIVRTLATTYCRVGECEDSKAGVCVSFDGFWVYNNDYYG-- 234
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3001 PGTFCGRDVFDLIYQLFKGLAQPVDFLALTASSIAGAILAVIVVLVFYYLIKLKRAFGDyTSIVFVNVIVWCVNFMMLFV 3080
Cdd:cd21473   235 PGVYCGDGLFDLLTNLLSGFFQPVSVFALSGQLLFNTIVAILAVLACYYVQKFKRAFGD-MSVVVVTVVAAALVNNVLYV 313
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 190360135 3081 FQVYPTLSCVYAICYFYATLYFPSEISVIMHLQWLVMYGTIMPLWFCLLYISVVVSNHAFWVF 3143
Cdd:cd21473   314 VTQNPLLMIVYAVLYFYATLYLTYERAWIMHLGWVVAYGPIAPWWLLALYVVAVLYDYLPWFF 376
MHV-like_Nsp1 cd21879
non-structural protein 1 from murine hepatitis virus and betacoronavirus in the A lineage; ...
6-241 2.86e-158

non-structural protein 1 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the non-structural protein 1 (Nsp1) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV), bovine coronavirus (BCoV) and Human coronavirus HKU1. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. Nsp1 is the N-terminal cleavage product released from the ORF1a polyprotein by the action of papain-like protease (PLpro). Though Nsp1s of alphaCoVs and betaCoVs share structural similarity, they show no significant sequence similarity and may be considered as genus-specific markers. Despite low sequence similarity, the Nsp1s of alphaCoVs and betaCoVs exhibit remarkably similar biological functions, and are involved in the regulation of both host and viral gene expression. CoV Nsp1 induces suppression of host gene expression and interferes with host immune response. It inhibits host gene expression in two ways: by targeting the translation and stability of cellular mRNAs, and by inhibiting mRNA translation and inducing an endonucleolytic RNA cleavage in the 5'-UTR of cellular mRNAs through its tight association with the 40S ribosomal subunit, a key component of the cellular translation machinery. Inhibition of host mRNA translation includes that of type I interferons, major components of the host innate immune response. Nsp1 is critical in regulating viral replication and gene expression, as shown by multiple evidences, including: mutations in the Nsp1 coding region of the transmissible gastroenteritis virus (TGEV) and MHV genomes cause drastic reduction or elimination of infectious virus; BCoV Nsp1 is an RNA-binding protein that interacts with cis-acting replication elements in the 5'-UTR of the BCoV genome, implying its potential role in the regulation of viral translation or replication; and SARS-CoV Nsp1 enhances virus replication by binding to a stem-loop structure in the 5'-UTR of its genome.


Pssm-ID: 409341  Cd Length: 236  Bit Score: 489.20  E-value: 2.86e-158
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135    6 KYGLELHWAPEFPWMFEDAEEKLDNPSSSEVDIVCSTTAQKLETGGICPENHVMVDCRRLLKQECCVQSSLIREIVMNTR 85
Cdd:cd21879     1 KYGLGLKWAPEFPWMFEDAEEKLGNPSSSEEDGFCPTTAQKLETVGICLENHVKVDCRRLLKQECCVQSNLIRDIFVDTD 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   86 PYDLEVLLQDALQSREAVLVTPPLGMSLEACYVRGCNPNGWTMGLFRRRSVCNTGRCAVNKHVAYQLYMIDPAGVCFGAG 165
Cdd:cd21879    81 PYDVEVLTQDALQSGEAVLVKPPLRMSLEACYKLGCLPKGWAMGLFRRRCVCNTGRCGVDKHVAYQLFMIDPDGVCLGAG 160
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 190360135  166 QFVGWVIPLAFMPVQSRKFIVPWVMYLRKCGEKGAYNKDHKRGGFEHVYNFKVEDAYDLVHDEPKGKFSKKAYALI 241
Cdd:cd21879   161 RFIGWVVPLAFIPEYARKWLQPWVIYLRKYGEKGAYTKGHKRGGFGHVYDFKVEDAYDEVHDEPKGKYSKKAYALL 236
betaCoV_PLPro cd21732
betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1563-1861 3.03e-156

betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of betacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. In SARS-CoV and murine hepatitis virus (MHV), the C-terminal non-structural protein 3 region spanning transmembrane regions TM1 and TM2 with 3Ecto domain in between, are important for the PL2pro domain to process Nsp3-Nsp4 cleavage. Besides cleaving the polyproteins, PLPro also possesses a related enzymatic activity to promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both, ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15), are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain of many of these CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation. Interactions of SARS-CoV and MERS-CoV with antiviral interferon (IFN) responses of human cells are remarkably different; high-dose IFN treatment (type I and type III) shows MERS-CoV was substantially more IFN sensitive than SARS-CoV. This may be due to differences in the architecture of the oxyanion hole and of the S3 as well as the S5 specificity sites, despite the overall structures of SARS-CoV and MERS-CoV PLPro being similar.


Pssm-ID: 409649  Cd Length: 304  Bit Score: 486.32  E-value: 3.03e-156
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1563 DKVDILLTVDGVNFTNRFVPVGESFGKSLGNVFCDGVNVTKHKCDINYKGKVFFQFDNLSSEDLKAVRSSFNFDQK-ELL 1641
Cdd:cd21732     1 KTIEVLTTVDGVNFRTVLVNNGETFGKQLGNVFCDGVDVTKTKPSAKYEGKVLFQADNLSAEELEAVEYYYGFDDPtFLL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1642 AYYNMLVNCSKWQVVFNGKYFTFKQANNNCFVNVSCLMLQSLNLKFKIVQWQEAWLEFRSGRPARFVSLVLAKGGFKFGD 1721
Cdd:cd21732    81 RYYSALAHVKKWKFVVVDGYFSLKQADNNCYLNAACLMLQQLDLKFNTPALQEAYYEFRAGDPLRFVALVLAYGNFTFGE 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1722 PADSRDFLRVVFSQVDLTGAICDFEIACK-CGVKQEQRTGVDAVMHFGTLSREDLEIGYTVDCSCGKKLIHCVRFDV-PF 1799
Cdd:cd21732   161 PDDARDFLRVVLSHADLVSARRVLEEVCKvCGVKQEQRTGVDAVMYFGTLSLDDLYKGYTIDCSCGRKAIRYLVEQVpPF 240
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 190360135 1800 LICSNTPASVKLPKG-VGSANIFKGDK-VGHYVHVKCEQSYQLYDASNVKKVTDVTGNLSDCLY 1861
Cdd:cd21732   241 LLMSNTPTEVPLPTGdFVAANVFTGDEsVGHYTHVKNKSLLYLYDAGNVKKTSDLKGPVTDVLY 304
CoV_NSP4_N pfam19217
Coronavirus replicase NSP4, N-terminal; This is the N-terminal domain of the coronavirus ...
2772-3128 3.73e-151

Coronavirus replicase NSP4, N-terminal; This is the N-terminal domain of the coronavirus nonstructural protein 4 (NSP4). NSP4 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. NSP4 is a membrane-spanning protein which is thought to anchor the viral replication-transcription complex to modified endoplasmic reticulum membranes. This N-terminal region represents the membrane spanning region, covering four transmembrane regions.


Pssm-ID: 466001  Cd Length: 351  Bit Score: 474.07  E-value: 3.73e-151
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2772 WCLMPTYTVHKSDFQLPVYASYKVLDNGVIRDVSVEDVCFANKFEQFDQWYESTFGlsYYSNSMACPIVVAVVDQDFGST 2851
Cdd:pfam19217    1 YALSPTFFNTVVYFVSDPVYDFKVIENGVLRDFRSTDTCFHNKFDNFDSWHQAKFG--SPTNSRSCPIVVGVVDEVVGRV 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2852 VFNVPTKVLRYGYHVLHFITHALSADGVQCYTPHSQISYSNFYASGCVLSSACTMFAMADGSPQPYCYTDGLMQNASLYS 2931
Cdd:pfam19217   79 VPGVPAGVALVGGTILHFVTRVFFGAGNVCYTPSGVVTYESFSASACVFNSACTTLTGLGGTRVLYCYDDGLVEGAKLYS 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2932 SLVPHVRYNLANAkGFIRFPEVLREGLVRIVRTRSMPYCRVGLCEEADEGICFNFNGSWVLNNDYYrslPGTFCGRDVFD 3011
Cdd:pfam19217  159 DLVPHVRYKLVDG-NYVKLPEVLFRGGFRIVRTLATTYCRVGECEDSKAGVCVGFDRSFVYNNDFG---PGVYCGSGFLS 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3012 LIYQLFKGLAQPVDFLALTASSIAGAILAVIVVLVFYYLIKLKRAFGDYTSIVFVNVIVWCVNFMMLFVFQVYPTLSCVY 3091
Cdd:pfam19217  235 LLTNVFSGFNTPISVFALTGQLMFNCVVALIAVCVCYYVLKFKRAFGDYSTGVLTVVLATLVNNLSYFVTQVNPVLMIVY 314
                          330       340       350
                   ....*....|....*....|....*....|....*..
gi 190360135  3092 AICYFYATLYFPSEISVIMHLQWLVMYGTIMPLWFCL 3128
Cdd:pfam19217  315 AVLYFYATLYVTPEYAWIWHLGFLVAYVPLAPWWVLL 351
Peptidase_C30 pfam05409
Coronavirus endopeptidase C30; This Coronavirus (CoV) domain, peptidase C30, is also known as ...
3275-3549 6.84e-149

Coronavirus endopeptidase C30; This Coronavirus (CoV) domain, peptidase C30, is also known as 3C-like proteinase (3CL-pro), or CoV main protease (M-pro) domain. CoV M-pro is a dimer where each subunit is composed of three domains I, II and III,,. Domains I and II consist of six-stranded antiparallel beta barrels and together resemble the architecture of chymotrypsin, and of picornaviruses 3C proteinases. The substrate-binding site is located in a cleft between these two domains. The catalytic site is situated at the centre of the cleft. A long loop connects domain II to the C-terminal domain (domain III). This latter domain has been implicated in the proteolytic activity of M-pro. In the active site of M-pro, Cys and His form a catalytic dyad,.


Pssm-ID: 398852  Cd Length: 274  Bit Score: 464.22  E-value: 6.84e-149
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3275 GLWLGDKVYCPRHVICSASDMTnPDYTNLLCRVTSSDFTVLFDRLSLTVMSYQMQGCMLVLTVTLQNSRTPKYTFGVVKP 3354
Cdd:pfam05409    1 GLWLGDTVYCPRHVIGSFTGML-PQYEHLLSIARNHDFCVVSGGVQLTVVSAKMQGAILVLKVHTNNPNTPKYKFVRLKP 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3355 GETFTVLAAYNGKPQGAFHVTMRSSYTIKGSFLCGSCGSVVYVIMGDCVKFVYMHQLELSTGCHTGTDFNGDFYGPYKDA 3434
Cdd:pfam05409   80 GESFTILAAYDGCPQGVYHVTMRSNHTIKGSFLNGACGSVGYNLKGGTVCFVYMHHLELPNGSHTGTDLEGVFYGPYVDE 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3435 QVVQLPVQDYIQSVNFVAWLYAAILNNCNWFVQSDKCSVEDFNVWALSNGFSQVKSDLVIDALASMTGVSLETLLAAIKH 3514
Cdd:pfam05409  160 EVAQLEGTDQTYTDNVVAWLYAAIINGPRWFLASTTVSLEDFNAWAMTNGFTPFPCEDAILGLAAKTGVSVERLLAAIKV 239
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 190360135  3515 LKNGFQGRQIMGSCSFEDELTPSDVYQQLAGIKLQ 3549
Cdd:pfam05409  240 LNNGFGGRTILGSPSLEDEFTPEDVYNQMAGVTLQ 274
betaCoV-Nsp6 cd21560
betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell ...
3550-3836 1.44e-141

betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, have the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394846  Cd Length: 290  Bit Score: 443.61  E-value: 1.44e-141
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3550 SKRTRLVKGIVCWIMASTFLFSCIITAFVKWTMFMYVTTNMLSITFCALCVIS-LTMLLVKHKHLYLTMYIIPVLFTLLY 3628
Cdd:cd21560     1 SKVKRVVKGTLHWLLATFVLFYLIILQLTKWTMFMYLTETMLLPLTPALCCVSaCVMLLVKHKHTFLTLFLLPVLLTLAY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3629 NNYLVVYKQTFRGYVYAWLSYYVPSVEYTYTDEVIYGMLLLIGMVfVTLRSINQYLFSFIMFVGRVISVVSLWYMGSnLE 3708
Cdd:cd21560    81 YNYVYVPKSSFLGYVYNWLNYVNPYVDYTYTDEVTYGSLLLVLML-VTMRLVNHDAFSRVWAVCRVITWVYMWYTGS-LE 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3709 EEILLMLASLFGTYTWTTA---LSMAAAK-VIAKWVAVNVLYFTDIPQIKIVLVCYLFIGYIISCYWGLFSLMNSLFRMP 3784
Cdd:cd21560   159 ESALSYLTFLFSVTTNYTGvvtVSLALAKfITALWLAYNPLLFLDIPEVKCVLLVYLFIGYICTCYFGVFSLLNRLFRCP 238
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 190360135 3785 LGVYNYKISVQELRYMNANGLRPPKNSFEALMLNFKLLGIGGVPIIEVSQFQ 3836
Cdd:cd21560   239 LGVYDYKVSTQEFRYMNANGLRPPRNSWEALMLNFKLLGIGGVPCIKVSTVQ 290
betaCoV_Nsp8 cd21831
betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
3929-4122 2.41e-112

betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) the highly pathogenic betacoronaviruses that include Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. SARS-CoV Nsp8 forms a 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409258  Cd Length: 196  Bit Score: 356.02  E-value: 2.41e-112
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3929 SEFVNMASFVEYEVAKKNLDEACSSGSANQQQLKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSK 4008
Cdd:cd21831     1 SEFSNLASYAEYETAQKAYDEAVASGDASPQVLKALKKAVNVAKSAYEKDKAVARKLERMADQAMTSMYKQARAEDKKSK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4009 VVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLAANTLTIIVPDKSVYDQVVDNVYVTYAGNVWQIQTIQDS 4088
Cdd:cd21831    81 VVSAMQTMLFGMIRKLDNDALNNIINNARNGCVPLSIIPLTAANKLRVVVPDYSVYKQVVDGPTLTYAGALWDIQQINDA 160
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 190360135 4089 DGTNKQLHEISDDC---NWPLVIIANRHNEvSATALQ 4122
Cdd:cd21831   161 DGKIVQLSDITEDSenlAWPLVVTATRANS-SAVKLQ 196
CoV_NSP8 pfam08717
Coronavirus replicase NSP8; Viral NSP8 (non structural protein 8) forms a hexadecameric ...
3926-4116 1.74e-97

Coronavirus replicase NSP8; Viral NSP8 (non structural protein 8) forms a hexadecameric supercomplex with NSP7 that adopts a hollow cylinder-like structure. The dimensions of the central channel and positive electrostatic properties of the cylinder imply that it confers processivity on RNA-dependent RNA polymerase. NSP7 and NSP8 heterodimers play a role in the stabilization of NSP12 regions involved in RNA binding and are essential for a highly active NSP12 polymerase complex. It has been demonstrated that NSP8 acts as an oligo(U)-templated polyadenylyltransferase but also has robust (mono/oligo) adenylate transferase activities. NSP8 has N- and C-terminal D/ExD/E conserved motifs, being the N-terminal motif critical for RNA polymerase activity as these residues are part of the Mg2-binding active site.


Pssm-ID: 400866  Cd Length: 197  Bit Score: 313.32  E-value: 1.74e-97
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3926 ALQSEFVNMASFVEYEVAKKNLDEACSSGSAnQQQLKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDK 4005
Cdd:pfam08717    1 SVASEFSSLPSYAAYETAKEAYEEAVANGSS-QQVLKQLKKACNIAKSEFDRDAAVQKKLEKMAEQAMTQMYKEARAVDR 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  4006 KSKVVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLAANTLTIIVPDKSVYDQVVDNVYVTYAGNVWQIQTI 4085
Cdd:pfam08717   80 KSKVVSAMHTLLFSMLRKLDNSALNTIINNARNGVVPLNIIPATTAAKLTVVVPDYETFVKVVDGNTVTYAGAVWEIQEV 159
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 190360135  4086 QDSDGTNKQLHEI----SDDCNWPLVIIANRHNEV 4116
Cdd:pfam08717  160 KDADGKIVHLKEItmdnSPNLAWPLIVTAERANSA 194
alpha_betaCoV_Nsp10 cd21901
alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the ...
4233-4362 7.80e-86

alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of alpha- and betacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), Middle East respiratory syndrome-related (MERS) CoV, and alphacoronaviruses such as Human coronavirus 229E. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, and not its N7-Mtase activity. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts sequentially to Nsp14 MTase in RNA capping methylation, and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure. The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity. Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16 and their complex can interact with DII4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.


Pssm-ID: 409326  Cd Length: 130  Bit Score: 276.86  E-value: 7.80e-86
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4233 AGTATEYASNSSILSLCAFSVDPKKTYLDFIQQGGTPIANCVKMLCDHAGTGMAITVKPDATTSQDSYGGASVCIYCRAR 4312
Cdd:cd21901     1 AGKQTEVASNSSLLTLCAFAVDPAKTYLDAVKSGGKPVGNCVKMLTNGTGTGQAITVKPEANTNQDSYGGASVCLYCRAH 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 190360135 4313 VEHPDVDGLCKLRGKFVQVPVGIKDPVSYVLTHDVCQVCGFWRDGSCSCV 4362
Cdd:cd21901    81 VEHPDMDGVCKLKGKYVQVPLGTNDPVRFCLENDVCKVCGCWLGNGCSCD 130
MHV-like_Nsp3_NAB cd21824
nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and ...
1898-2011 1.00e-76

nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the nucleic acid binding (NAB) domain of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The NAB domain represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. NAB is a cytoplasmic domain located between the papain-like protease (PLPro) and betacoronavirus-specific marker (betaSM) domains of CoV Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. The NAB domain both binds ssRNA and unwinds dsDNA. It prefers to bind ssRNA containing repeats of three consecutive guanines. A group of residues that form a positively charged patch on the protein surface of SARS-CoV Nsp3 NAB serves as the binding site of nucleic acids. This site is conserved in the NAB of Nsp3 from betacoronavirus in the sarbecovirus subgenus (B lineage), but is not conserved in the Nsp3 NAB from betacoronaviruses in the A lineage.


Pssm-ID: 409350  Cd Length: 119  Bit Score: 250.45  E-value: 1.00e-76
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1898 GKYYTQRIIKAQFKTFEKVDGVYTNFKLIGHTVCDILNAKLGFDSSKEFVEYKVTEWPTATGDVVLATDDLYVKRYERGC 1977
Cdd:cd21824     1 GKYYTKPIIKAQFKTFEKVDGVYTNFKLVGHTICDKLNAKLGFDSSKPFVEYKVTEWPTATGDVVLASDDLYVKRYEKGC 80
                          90       100       110
                  ....*....|....*....|....*....|....
gi 190360135 1978 ITFGKPVIWLSHEQASLNSLTYFNRPLLVDENKF 2011
Cdd:cd21824    81 ITFGKPVIWLGHEEASLNSLTYFNRPSLVDENKF 114
MHV-like_Nsp3_betaSM cd21812
betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and ...
2026-2146 1.93e-75

betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the betacoronavirus-specific marker (betaSM), also called group 2-specific marker (G2M), of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The betaSM/G2M is located C-terminal to the nucleic acid-binding (NAB) domain. This region is absent in alpha- and deltacoronavirus Nsp3; there is a gammacoronavirus-specific marker (gammaSM) at this position in gammacoronavirus Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. Little is known about the betaSM/G2M domain; it is predicted to be non-enzymatic and may be an intrinsically disordered region. The betaSM/G2M domain is part of the predicted PLnc domain (made up of 385 amino acids) of the related SARS-CoV Nsp3 that may function as a replication/transcription scaffold, with interactions to Nsp5, Nsp12, Nsp13, Nsp14, and Nsp16.


Pssm-ID: 409627  Cd Length: 125  Bit Score: 247.21  E-value: 1.93e-75
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2026 SESDAKEPKEINIIKLSGVKKPFKVEDSVIVNDDTSEIKYVKSLSIVDVYDMWLTGCRCVVRTANALSRAVNVPTIRKFI 2105
Cdd:cd21812     5 SQSDSKQAKPVKIVKLNGVKKPFKVEDSVVVNDDTSETKVVKSLSIVDVYDMWLTGCRYVVWTANALSRLVNVPTVREYV 84
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|.
gi 190360135 2106 KFGMTLVSIPIDLLNLREIKPVFNVVKAVRNKISACFNFIK 2146
Cdd:cd21812    85 KFGMTVISIPIDLLNLRDDKQEFVVPKVVKAKVSACYNFIK 125
Peptidase_C16 pfam01831
Peptidase C16 family;
1049-1285 1.68e-74

Peptidase C16 family;


Pssm-ID: 460353  Cd Length: 249  Bit Score: 249.61  E-value: 1.68e-74
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  1049 FEFYTTEPEFV--KVLDLYVPKATRNNCWLRSVLAVMQKLPCQFKDKNLQDLWVLYKQQYSQLFVDTLVNKIPANIVVPQ 1126
Cdd:pfam01831   11 FAFAAEFPDELhfASCGFGNPAIEEEDCFCPSAAIEMKSKGKEFKDHEMQKCSLLPAAECCQCFADILDIFVDEDIIKPE 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  1127 GGYVADFAYWFLTLCDWQCVAYWKCIKCDLALKLKGLDAMFFYGDVVSHVCKCGESMVLIDVDVPFTAHFALKDKLFCAF 1206
Cdd:pfam01831   91 AGTMAAFAFFFASLCKFKARANIQALECDGELKKQAADALFFRGCLCNHMCCCCDAHTAFHADIPQPDGFCLGDDKFCAF 170
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 190360135  1207 ITKRSVYKAACVVDVNDSHSMAVVDGKQIDDHRITSITSDKFDFIIGHGMSFSMTTFEIAQLYGSCITPNVCFVKGDII 1285
Cdd:pfam01831  171 FTPRKAFPAAAAQDLNDCHILARKEGKKGDGKSGHFFIADKFDFMDFNGEDACEEPFELAKGKGSCIAPALCFGKGDVI 249
CoV_NSP10 pfam09401
Coronavirus RNA synthesis protein NSP10; Non-structural protein 10 (NSP10) is involved in RNA ...
4244-4362 1.91e-74

Coronavirus RNA synthesis protein NSP10; Non-structural protein 10 (NSP10) is involved in RNA synthesis. It is synthesized as a polyprotein whose cleavage generates many non-structural proteins. NSP10 contains two zinc binding motifs and forms two anti-parallel helices which are stacked against an irregular beta sheet. A cluster of basic residues on the protein surface suggests a nucleic acid-binding function. NSP10 interacts with NSP14 and NSP16 and regulates their respective ExoN and 2-O-MTase activities. When binding to the N-terminal of NSP14, nsp10 allows the ExoN active site to adopt a stably closed conformation and is an allosteric regulator that stabilizes NSP16. The residue Tyr-96 plays a crucial role in the NSP10-NSP16/NSP14 interaction. This residue is specific for SARS-CoV NSP10 and is a phenylalanine in most other Coronavirus homologs.


Pssm-ID: 462788  Cd Length: 119  Bit Score: 243.89  E-value: 1.91e-74
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  4244 SILSLCAFSVDPKKTYLDFIQQGGTPIANCVKMLCDHAGTGMAITVKPDATTSQDSYGGASVCIYCRARVEHPDVDGLCK 4323
Cdd:pfam09401    1 SLLSLCAFAVDPAKAYLDYLAQGGQPITNCVKMLCNHAGTGMAITVKPEANTDQDSYGGASVCLYCRAHIEHPNVDGLCQ 80
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 190360135  4324 LRGKFVQVPVGIKDPVSYVLTHDVCQVCGFWRDGSCSCV 4362
Cdd:pfam09401   81 LKGKFVQIPTGTKDPVSFCLTNTVCTVCGCWLGYGCSCD 119
CoV_NSP6 pfam19213
Coronavirus replicase NSP6; This entry represents proteins found in Coronaviruses and includes ...
3577-3836 6.79e-71

Coronavirus replicase NSP6; This entry represents proteins found in Coronaviruses and includes the Non-structural Protein 6 (NSP6). Coronaviruses encode large replicase polyproteins which are proteolytically processed by viral proteases to generate mature Nonstructural Proteins (NSPs). NSP6 is a membrane protein containing 6 transmembrane domains with a large C-terminal tail. NSP6 from the avian coronavirus, infectious bronchitis virus (IBV) and the mouse hepatitis virus (MHV) have been shown to localize to the ER and to generate autophagosomes. Coronavirus NSP6 proteins have also been shown to limit autophagosome expansion. This may favour coronavirus infection by reducing the ability of autophagosomes to deliver viral components to lysosomes for degradation. NSP6 from IBV, MHV and severe acute respiratory syndrome coronavirus (SARS-CoV) have also been found to activate autophagy.


Pssm-ID: 465997  Cd Length: 260  Bit Score: 239.84  E-value: 6.79e-71
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3577 FVKWTMFMYVTTNMLS-ITFCALCVISLTMLLVKHKHLYLTMYIIPVLFTLLYNNYLVVYK-QTFRGYVYAWlsyyvpSV 3654
Cdd:pfam19213    1 LLMYTALYWLPPNLITpVLPVLTCVSAILTLFIKHKVLFLTTFLLPSVVVMAYYNFTWDYYpNSFLRTVYDY------HF 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3655 EYTYTDEVIYGMLLLIGMVFV--TLRSINqYLFSFIMFVGRVISVVSLWYMGSNLEEE-----ILLMLASLFGTYTWTTA 3727
Cdd:pfam19213   75 SLTSFDLQGYFNIASCVFVNVlhTYRFVR-SKYSIATYLVSLVVSVYMYVIGYALLTAtdvlsLLFMVLSLLTSYWYVGA 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3728 LSMAAAKVIAKWVavNVLYFTDIPQIKIVLVCYLFIGYIISCYWGLFSLMNSLFRMPLGVYNYKISVQELRYMNANGLRP 3807
Cdd:pfam19213  154 IAYKLAKYIVVYV--PPSLIAVFGDIKVVLLVYVCIGYVCCVYFGILYWINRFTKLTLGVYDFKVSAAEFKYMVANGLSA 231
                          250       260
                   ....*....|....*....|....*....
gi 190360135  3808 PKNSFEALMLNFKLLGIGGVPIIEVSQFQ 3836
Cdd:pfam19213  232 PRNVFEALILNFKLLGIGGNRTIKISTVQ 260
betaCoV_Nsp9 cd21898
betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4123-4232 3.91e-70

betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from betacoronaviruses including highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoVs replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409331  Cd Length: 111  Bit Score: 231.52  E-value: 3.91e-70
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4123 NNELMPAKLKTQVVNSGPDQT-CNTPTQCYYNNSYNGKIVYAILSDVDGLKYTKILKDDGNFVVLELDPPCKFTVQDVKG 4201
Cdd:cd21898     1 NNELMPQGLKTMVVTAGPDQTaCNTPALAYYNNVQGGRMVMAILSDVDGLKYAKVEKSDGGFVVLELDPPCKFLVQTPKG 80
                          90       100       110
                  ....*....|....*....|....*....|.
gi 190360135 4202 LKIKYLYFVKGCNTLARGWVVGTISSTVRLQ 4232
Cdd:cd21898    81 PKVKYLYFVKGLNNLHRGQVLGTIAATVRLQ 111
CoV_NSP9 pfam08710
Coronavirus replicase NSP9; Nsp9 is a single-stranded RNA-binding viral protein involved in ...
4123-4232 2.17e-52

Coronavirus replicase NSP9; Nsp9 is a single-stranded RNA-binding viral protein involved in RNA synthesis. Several crystallographic structures of nsp9 have shown that it is composed of seven beta strands and a single alpha helix. Nsp9 proteins have N-finger motifs and highly conserved GXXXG motifs that both play critical roles in dimerization. The conserved helix-helix dimer interface containing a GXXXG protein-protein interaction motif is biologically relevant to SARS-CoV replication.


Pssm-ID: 285872  Cd Length: 111  Bit Score: 180.75  E-value: 2.17e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  4123 NNELMPAKLKTQVVNSGPDQT-CNTPTQCYYNNSYNGKIVYAILSDVDGLKYTKILKDDGNFVVLELDPPCKFTVQDVKG 4201
Cdd:pfam08710    1 NNELMPGKLKTKACKAGVTDAhCSVEGKAYYNNEGGGSFVYAILSSNPNLKYAKFEKEDGNVIYVELEPPCRFVVDTPKG 80
                           90       100       110
                   ....*....|....*....|....*....|.
gi 190360135  4202 LKIKYLYFVKGCNTLARGWVVGTISSTVRLQ 4232
Cdd:pfam08710   81 PEVKYLYFVKNLNNLRRGMVLGYISATVRLQ 111
bCoV_NAB pfam16251
Betacoronavirus nucleic acid-binding (NAB); This is the nucleic acid-binding domain (NAB) from ...
1902-2011 2.22e-46

Betacoronavirus nucleic acid-binding (NAB); This is the nucleic acid-binding domain (NAB) from the multidomain nonstructural protein NSP3, and described as NSP3e domain. NSP3 is part of Orf1a polyproteins in SARS-CoV. It is an essential component of the replication/transcription complex. The global domain of the NAB represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands and a group of residues form a positively charged patch on the protein surface as the binding site responsible for binding affinity for nucleic acids. When binding to ssRNA, the NAB prefers sequences with repeats of three consecutive Gs, such as (GGGA)5 and (GGGA)2. A positively charged surface patch (Lys75, Lys76, Lys99, and Arg106) is involved in RNA binding.


Pssm-ID: 406621  Cd Length: 129  Bit Score: 164.26  E-value: 2.22e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  1902 TQRIIKAQFKTFEKVDGVYTNFKLI--GHTVCDILNAKLGFDSSKEFV-EYKVTEWPTATGDVVLATDDLYVKRYERGCI 1978
Cdd:pfam16251   11 TKPIIKAQFRTFEKVDGVYDNFKLTcsGHKFADDLNAKLGFDCNKPASrELKITEFPDANGDVVAADDDHYSARFKKGAI 90
                           90       100       110
                   ....*....|....*....|....*....|....
gi 190360135  1979 TFGKPVIWLSHEQASLNSLTYFNRPLLV-DENKF 2011
Cdd:pfam16251   91 LFGKPIVWLGHEEAALKKLTFFNKPNTVcLECKF 124
CoV_NSP4_C pfam16348
Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus ...
3157-3244 4.30e-46

Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus nonstructural protein 4 (NSP4). NSP4 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. It is a membrane-spanning protein which is thought to anchor the viral replication-transcription complex (RTC) to modified endoplasmic reticulum membranes. This predominantly alpha-helical domain may be involved in protein-protein interactions. It has been shown that in Betacoronavirus, the coexpression of NSP3 and NSP4 results in a membrane rearrangement to induce double-membrane vesicles (DMVs) and convoluted membranes (CMs), playing a critical role in SARS-CoV replication. There are two well conserved amino acid residues (H120 and F121) in NSP4 among Betacoronavirus, essential for membrane rearrangements during interaction with NSP3.


Pssm-ID: 465099  Cd Length: 92  Bit Score: 161.93  E-value: 4.30e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3157 GTFEEMALTTFMITKDSYCKLKNSLSDVAFNRYLSLYNKYRYYSGKMDTAAYREAACSQLAKAMDTFtNNNGSDVLYQPP 3236
Cdd:pfam16348    6 GTFEEAALGTFVIDKESYEKLKNSISLDKFNRYLSLYNKYKYYSGKMDEADYREACCAHLAKALEDF-SNSGNDVLYTPP 84

                   ....*...
gi 190360135  3237 TASVSTSF 3244
Cdd:pfam16348   85 TVSVTSSL 92
DPUP_MHV_Nsp3 cd21524
DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis ...
1489-1562 1.37e-44

DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis virus and related betacoronaviruses in the A lineage; This subfamily contains the DPUP (domain preceding Ubl2 and PLP2) of murine hepatitis virus (MHV) non-structural protein 3 (Nsp3) and other Nsp3s from betacoronaviruses in the embecovirus subgenera (A lineage), including human CoV OC43, rabbit CoV HKU14 and porcine hemagglutinating encephalomyelitis virus (HEV), among others. Non-structural protein 3 (Nsp3) is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. MHV Nsp3 contains a DPUP that is located N-terminal to the ubiquitin-like domain 2 (Ubl2) and papain-like protease 2 (PLP2) catalytic domain. It is structurally similar to the Severe Acute Respiratory Syndrome (SARS) CoV unique domain C (SUD-C), adopting a frataxin-like fold that has structural similarity to DNA-binding domains of DNA-modifying enzymes. SUD-C is also located N-terminal to Ubl2 and PLP2 in SARS Nsp3, similar to the DPUP of MHV Nsp3; however, unlike DPUP, it is preceded by SUD-N and SUD-M macrodomains that are absent in MHV Nsp3. Though structurally similar, there is little sequence similarity between DPUP and SUD-C. SARS SUD-C has been shown to bind to single-stranded RNA and recognize purine bases more strongly than pyrimidine bases; it also regulates the RNA binding behavior of the SARS SUD-M macrodomain. It is not known whether DPUP functions in the same way.


Pssm-ID: 394840  Cd Length: 75  Bit Score: 156.81  E-value: 1.37e-44
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 190360135 1489 LDDDARTFVQSNVDVVPEGWRVVNKFYQINGVRTVKYFECPGGIDICSQDKVFGYVQQGSFNKATVAQIKALFL 1562
Cdd:cd21524     2 LDDDARVFVQANMDNLPEDWRLVNKFDVINGVRTIKYFECPGGIFICSQGKDFGYVQNGSFKKATVSQIRALLA 75
Macro_X_Nsp3-like cd21557
X-domain (or Mac1 domain) of viral non-structural protein 3 and related macrodomains; The ...
1294-1418 1.95e-41

X-domain (or Mac1 domain) of viral non-structural protein 3 and related macrodomains; The X-domain, also called Mac1, is the macrodomain found in riboviral non-structural protein 3 (Nsp3), including the Nsp3 of Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV) as well as SARS-CoV-2, and other coronaviruses (alpha-, beta-, gamma-, and deltacoronavirus), among others. The SARS-CoV-2 Nsp3 Mac1 is highly conserved among all CoVs, and binds to and hydrolyzes mono-ADP-ribose (MAR) from target proteins. It appears to counter host-mediated antiviral ADP-ribosylation, a post-translational modification that is part of the host response to viral infections. Mac1 is essential for pathogenesis in multiple animal models of CoV infection, implicating it as a virulence factor and potential therapeutic target. Assays show that the de-MARylating activity leads to a rapid loss of substrate, and that Mac1 could not hydrolyze poly-ADP-ribose; thus, Mac1 is a MAR-hydrolase (mono-ADP ribosylhydrolase). Mac1 was originally named ADP-ribose-1"-phosphatase (ADRP) based on data demonstrating that it could remove the phosphate group from ADP-ribose-1"-phosphate; however, activity was modest and was unclear why this would impact a virus infection. This family also includes the X-domain of Avian infectious bronchitis virus (IBV) strain Beaudette coronavirus that does not bind ADP-ribose; the triple glycine sequence found in the X-domains of SARS-CoV and human coronavirus 229E (HCoV229E), which are involved in ADP-ribose binding, is not conserved in the IBV X-domain. SARS-CoVs have two other macrodomains referred to as the SUD-N (N-terminal subdomain, or Mac2) and SUD-M (middle SUD subdomain, or Mac3) of the SARS-unique domain (SUD), which also do not bind ADP-ribose; these bind G-quadruplexes (unusual nucleic-acid structures formed by consecutive guanosine nucleotides). SARS-CoV SUD-N and SUD-M are not included in this group.


Pssm-ID: 438957  Cd Length: 127  Bit Score: 150.01  E-value: 1.95e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1294 EVVVNPANGHMAHGGGVAKAIAVAAGQQFVKEtTDMVKSKGVCATGDCYVSTGGKLCKTVLNVVGPDARTqgKQSYALLE 1373
Cdd:cd21557     2 DVVVNAANENLKHGGGVAGAIYKATGGAFQKE-SDYIKKNGPLKVGTAVLLPGHGLAKNIIHVVGPRKRK--GQDDQLLA 78
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 190360135 1374 RVYKHLNK-YDCVVTTLISAGIFSVPSDVSLTYLLGTAKK---QVVLVS 1418
Cdd:cd21557    79 AAYKAVNKeYGSVLTPLLSAGIFGVPPEQSLNALLDAVDTtdaDVTVYC 127
betaCoV_Nsp7 cd21827
betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 ...
3837-3925 2.70e-37

betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of betacoronaviruses including the highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially where it concerns alpha-helix 4. SARS-CoV Nsp7 forms a 8:8 hexadecameric supercomplex with Nsp8 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.


Pssm-ID: 409253  Cd Length: 83  Bit Score: 136.42  E-value: 2.70e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3837 SKLTDVKCANVVLLNCLQHLHVASNSKLWQYCSTLHNEILATSDLGVAFEKLAQLLIVLFANPAAVDskCLTSIEEVCDD 3916
Cdd:cd21827     1 SKLTDVKCTSVVLLSVLQQLHVESNSKLWAYCVKLHNDILAAKDPTEAFEKFVSLLSVLLSFPGAVD--LDALCSELLDN 78

                  ....*....
gi 190360135 3917 YakdnTVLQ 3925
Cdd:cd21827    79 P----TVLQ 83
CoV_NSP7 pfam08716
Coronavirus replicase NSP7; NSP7 (non structural protein 7) has been implicated in viral RNA ...
3837-3925 6.44e-37

Coronavirus replicase NSP7; NSP7 (non structural protein 7) has been implicated in viral RNA replication and is predominantly alpha helical in structure. It forms a hexadecameric supercomplex with NSP8 that adopts a hollow cylinder-like structure. The dimensions of the central channel and positive electrostatic properties of the cylinder imply that it confers processivity on RNA-dependent RNA polymerase. NSP7 and NSP8 heterodimers play a role in the stabilization of NSP12 regions involved in RNA binding and are essential for a highly active NSP12 polymerase complex.


Pssm-ID: 285878  Cd Length: 83  Bit Score: 135.27  E-value: 6.44e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3837 SKLTDVKCANVVLLNCLQHLHVASNSKLWQYCSTLHNEILATSDLGVAFEKLAQLLIVLFANPAAVDskcltsIEEVCDD 3916
Cdd:pfam08716    1 SKLTDVKCTNVVLLGLLQKLHVESNSKLWAYCVELHNEILLCDDPTEAFEKLLALLAVLLSKHSAVD------LSDLCDS 74

                   ....*....
gi 190360135  3917 YAKDNTVLQ 3925
Cdd:pfam08716   75 YLENRTILQ 83
CoV_peptidase pfam08715
Coronavirus papain-like peptidase; This entry contains coronavirus cysteine endopeptidases ...
1567-1868 3.35e-35

Coronavirus papain-like peptidase; This entry contains coronavirus cysteine endopeptidases that belong to MEROPS peptidase family C16 and are required for proteolytic processing of the replicase polyprotein. All coronaviruses encode between one and two accessory cysteine proteinases that recognize and process one or two sites in the amino-terminal half of the replicase polyprotein during assembly of the viral replication complex. HCoV and TGEV encode two accessory proteinases, called coronavirus papain-like proteinase 1 and 2 (PL1-PRO and PL2-PRO). IBV and SARS encodes only one called PL-PRO. The structure of this protein has shown it adopts a fold similar that of de-ubiquitinating enzymes. The peptidase family C16 domain is about 260 amino acids in length. This domain is predicted to have an alpha-beta structural organization known as the papain-like fold. It consists of three alpha-helices and three strands of antiparallel beta-sheet.


Pssm-ID: 430171  Cd Length: 318  Bit Score: 138.96  E-value: 3.35e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  1567 ILLTVDGVNFTNRFVPVGESFGKSLGNVFCDGVNVTKHKCDINYKGKVFFQF---DNLSSEDLKAVRSSFNFDQKELLAY 1643
Cdd:pfam08715    7 IYLTEDGVNYHSIVVKPGDSLGQQFGQVYAKNKDLSGVFPADDVEDKEILYVpttDWVEFYGFKSILEYYTLDASKYVIY 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  1644 YNMLvncsKWQVVFNGKYFTFKQANNNCFVNVSCLMLQSLNLKFKIVQWQEAWLEFRSGRPARFVSLVLAKGGFKFGDPA 1723
Cdd:pfam08715   87 LSAL----TKNVQYVDGFLILKWRDNNCWISSVIVALQAAKIRFKGQFLTEAWAKLLGGDPTDFVAWCYASCTAKVGDFG 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  1724 DSRDFLRVVFSQVDL--TGAICDFEIACKCGVKQEQRTGVDAVMHFGTLSREDLEIGYTVDCSCGKKLIHCVR-FDVPFL 1800
Cdd:pfam08715  163 DANWTLTNLAEHFDAeyTNAFLKKRVCCNCGIKSYELRGLEACIQVRATNLDHFKTGYSNCCVCGANNTDEVIeASLPYL 242
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 190360135  1801 ICSNT---PASVKLPKGVGSANIFKGDKVGHYVHVKCEQSyqLYDASNVKKVTDVTGNLSDCLYLKNLKQT 1868
Cdd:pfam08715  243 LLSATdgpAAVDCLEDGVGTVAFVGSTNSGHYTYQTAKQA--FYDGAKDRKFGKKSPYVTAVYTRFAFKNE 311
Ubl1_cv_Nsp3_N-like cd21467
first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV ...
872-961 3.48e-34

first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV non-structural protein 3 (Nsp3) and related proteins; This ubiquitin-like (Ubl) domain (Ubl1) is found at the N-terminus of coronavirus Nsp3, a large multi-functional multi-domain protein which is an essential component of the replication/transcription complex (RTC). The functions of Ubl1 in CoVs are related to single-stranded RNA (ssRNA) binding and to interacting with the nucleocapsid (N) protein. SARS-CoV Ubl1 has been shown to bind ssRNA having AUA patterns, and since the 5'-UTR of the SARS-CoV genome has a number of AUA repeats, it may bind there. In mouse hepatitis virus (MHV), this Ubl1 domain binds the cognate N protein. Adjacent to Ubl1 is a Glu-rich acidic region (also referred to as hypervariable region, HVR); Ubl1 together with HVR has been called Nsp3a. Currently, the function of HVR in CoVs is unknown. This model corresponds to one of two Ubl domains in Nsp3; the other is located N-terminal to the papain-like protease (PLpro) and is not represented by this model.


Pssm-ID: 394822  Cd Length: 89  Bit Score: 127.69  E-value: 3.48e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  872 TIKVFYELDKDFNTILNTACGVFEVDDTVDMEEFYAVVIDAIEEKLSPCKELEgVGAKVSAFLQKLEDNSLFLFDEAGEE 951
Cdd:cd21467     1 TVKVTYELDEVLDTILNKACSPFEVEKDLTVEEFADVVQDAVEEKLSPLLELP-LGDKVDADLDDFIDNPCYLFDEDGDE 79
                          90
                  ....*....|
gi 190360135  952 VLAPKLYCAF 961
Cdd:cd21467    80 VLASEMYCSF 89
A1pp smart00506
Appr-1"-p processing enzyme; Function determined by Martzen et al. Extended family detected by ...
1276-1406 3.39e-29

Appr-1"-p processing enzyme; Function determined by Martzen et al. Extended family detected by reciprocal PSI-BLAST searches (unpublished results, and Pehrson _ Fuji).


Pssm-ID: 214701  Cd Length: 133  Bit Score: 115.10  E-value: 3.39e-29
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   1276 NVCFVKGDIIKVskrvKAEVVVNPANGHMAHGGGVAKAIAVAAGQQFVKEttDMVKS-KGVCATGDCYVSTGGKL-CKTV 1353
Cdd:smart00506    1 ILKVVKGDITKP----RADAIVNAANSDGAHGGGVAGAIARAAGKALSKE--EVRKLaGGECPVGTAVVTEGGNLpAKYV 74
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*....
gi 190360135   1354 LNVVGPDARTQGKQSYALLERVYKHL------NKYDCVVTTLISAGIFSVPSDVSLTYL 1406
Cdd:smart00506   75 IHAVGPRASGHSKEGFELLENAYRNClelaieLGITSVALPLIGTGIYGVPKDRSAQAL 133
Macro pfam01661
Macro domain; The Macro or A1pp domain is a module of about 180 amino acids which can bind ...
1297-1400 4.60e-23

Macro domain; The Macro or A1pp domain is a module of about 180 amino acids which can bind ADP-ribose (an NAD metabolite) or related ligands. Binding to ADP-ribose could be either covalent or non-covalent: in certain cases it is believed to bind non-covalently; while in other cases (such as Aprataxin) it appears to bind both non-covalently through a zinc finger motif, and covalently through a separate region of the protein. This domain is found in a number of otherwise unrelated proteins. It is found at the C-terminus of the macro-H2A histone protein 4 and also in the non-structural proteins of several types of ssRNA viruses such as NSP3 from alpha-viruses and coronaviruses. This domain is also found on its own in a family of proteins from bacteria, archaebacteria and eukaryotes. The 3D structure of the SARS-CoV Macro domain has a mixed alpha/beta fold consisting of a central seven-stranded twisted mixed beta sheet sandwiched between two alpha helices on one face, and three on the other. The final alpha-helix, located on the edge of the central beta-sheet, forms the C terminus of the protein. The crystal structure of AF1521 (a Macro domain-only protein from Archaeoglobus fulgidus) has also been reported and compared with other Macro domain containing proteins. Several Macro domain only proteins are shorter than AF1521, and appear to lack either the first strand of the beta-sheet or the C-terminal helix 5. Well conserved residues form a hydrophobic cleft and cluster around the AF1521-ADP-ribose binding site.


Pssm-ID: 460286  Cd Length: 116  Bit Score: 96.87  E-value: 4.60e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  1297 VNPANGHMAHGGGVAKAIAVAAGQQFVKETTDMVksKGVCATGDCYVSTGGKL-CKTVLNVVGPDARTQGKQS-YALLER 1374
Cdd:pfam01661    1 VNAANSRLLGGGGVAGAIHRAAGPELLEECRELK--KGGCPTGEAVVTPGGNLpAKYVIHTVGPTWRHGGSHGeEELLES 78
                           90       100       110
                   ....*....|....*....|....*....|..
gi 190360135  1375 VYKHL------NKYDCVVTTLISAGIFSVPSD 1400
Cdd:pfam01661   79 CYRNAlalaeeLGIKSIAFPAISTGIYGFPWE 110
YmdB COG2110
O-acetyl-ADP-ribose deacetylase (regulator of RNase III), contains Macro domain [Translation, ...
1277-1398 8.88e-20

O-acetyl-ADP-ribose deacetylase (regulator of RNase III), contains Macro domain [Translation, ribosomal structure and biogenesis];


Pssm-ID: 441713  Cd Length: 168  Bit Score: 89.47  E-value: 8.88e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1277 VCFVKGDIIKVskrvKAEVVVNPANGHMAHGGGVAKAIAVAAGQQFVKETTDMVKSKGvCATGDCYVSTGGKL-CKTVLN 1355
Cdd:COG2110     1 IEIVQGDITEL----DVDAIVNAANSSLLGGGGVAGAIHRAAGPELLEECRRLCKQGG-CPTGEAVITPAGNLpAKYVIH 75
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 190360135 1356 VVGPDARTQGKQSYALLERVYKHL----NKYDC--VVTTLISAGIFSVP 1398
Cdd:COG2110    76 TVGPVWRGGGPSEEELLASCYRNSlelaEELGIrsIAFPAIGTGVGGFP 124
CoV_NSP2_C pfam19212
Coronavirus replicase NSP2, C-terminal; This entry corresponds to a presumed domain found at ...
678-851 3.70e-17

Coronavirus replicase NSP2, C-terminal; This entry corresponds to a presumed domain found at the C-terminus of Coronavirus non-structural protein 2 (NSP2). NSP2 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. The function of NSP2 is uncertain. This presumed domain is found in two copies in some viral NSP2 proteins. This domain is found in both alpha and betacoronaviruses.


Pssm-ID: 465996  Cd Length: 156  Bit Score: 81.54  E-value: 3.70e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   678 VLHGAYIVVESDIYFVknIPRYASAVAQAFRSVAKVVLDSLRVTFIDGLSCFKIGRRRICLSGskiyevergLLHSsqlp 757
Cdd:pfam19212    1 LKNAKFTVVNGGIVFV--VPKKFKSLVGTLLDLLNKLFDSLVDTVKIAGVKFKAGGTYYLFSN---------ALVK---- 65
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   758 ldVYDLTMPSQVQKT-KQKPIYLKGSGSDFSLADSVVEVVTTSLTPCGYSEPPKVADKICIVDNVYMAKAGDKYYPVVVD 836
Cdd:pfam19212   66 --VVSVKLKGKKQAGlKGAKEATVFVGATVPVTPTRVEVVTVELEEVDYVPPPVVVGYVVVIDGYAFYKSGDEYYPASTD 143
                          170
                   ....*....|....*
gi 190360135   837 GHVglLDQAWRVPCA 851
Cdd:pfam19212  144 GVV--VPPVFKLKGG 156
PRK00431 PRK00431
ADP-ribose-binding protein;
1280-1426 3.94e-15

ADP-ribose-binding protein;


Pssm-ID: 234759  Cd Length: 177  Bit Score: 76.42  E-value: 3.94e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1280 VKGDIIKVSkrvkAEVVVNPANGHMAHGGGVAKAIAVAAGQQFVKETTDMVKSKGVCATGDCYVSTGGKL-CKTVLNVVG 1358
Cdd:PRK00431    8 VQGDITELE----VDAIVNAANSSLLGGGGVDGAIHRAAGPEILEECRELRQQQGPCPTGEAVITSAGRLpAKYVIHTVG 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1359 PDARTQGKQSYALLERVYKH-LNKYDCV-VTTL----ISAGIFSVPSD---------VSLTYLLGTAKKQVVLVSNNQED 1423
Cdd:PRK00431   84 PVWRGGEDNEAELLASAYRNsLRLAAELgLRSIafpaISTGVYGYPLEdaariavktVREFLTRHKSPEEVYFVCYDEEA 163

                  ...
gi 190360135 1424 FDL 1426
Cdd:PRK00431  164 YRL 166
 
Name Accession Description Interval E-value
TM_Y_MHV-like_Nsp3_C cd21714
C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine ...
2195-2749 0e+00

C-terminus of non-structural protein 3, including transmembrane and Y domains, from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In MHV and the related Severe acute respiratory syndrome-related coronavirus (SARS-CoV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409662  Cd Length: 555  Bit Score: 1114.06  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2195 VARGACIIATIFLLWFNFIYANVIFSDFYLPKIGFLPTFVGKIAQWIKNTFSLVTICDLYSIQDVGFKNQYCNGSIACQF 2274
Cdd:cd21714     1 VARGFFIIATIFLLWFNFLYANVIFSDFYLPNIGFLPTFVGKIVQWFKNTFGLVTICDLYSVSDVGFKSQFCNGSMACQL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2275 CLAGFDMLDNYKAIDVVQYEADRRAFVDYTGVLKIVIELIVSYALYTAWFYPLFALISIQILTTWLPELFMLSTLHWSVR 2354
Cdd:cd21714    81 CLSGFDMLDNYKAIDVVQYEVDRRVFFDYTSVLKLVVELVVSYALYTVWFYPLFCLIGLQLLTTWLPEFFMLETLHWSVR 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2355 LLVSLANMLPAHVFMRFYIIIASFIKLFSLFRHVAYGCSKSGCLFCYKRNRSLRVKCSTIVGGMIRYYDAMANGGTGFCS 2434
Cdd:cd21714   161 LFVFLANMLPAHVFLRFYIVVTAMYKIFCLFRHVVYGCSKPGCLFCYKRNRSVRVKCSTIVGGMLRYYDVMANGGTGFCS 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2435 KHQWNCIDCDSYKPGNTFITVEAALDLSKELKRPIQPTDVAYHTVTDVKQVGCYMRLFYDRDGQRTYDDVNASLFVDYSN 2514
Cdd:cd21714   241 KHQWNCINCDSYKPGNTFITVEAAAELSKELKRPVNPTDVAYYTVTDVKQVGCSMRLFYERDGQRVYDDVNASLFVDMNG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2515 LLHSKVKSVPNMHVVVVENDADKANFLNAAVFYAQSLFRPILMVDKNLITTANTGTSVTETMFDVYVDTFLSMFDVDKKS 2594
Cdd:cd21714   321 LLHSKVKGVPNTHVVVVENDADKANFLNAAVFYAQSLFRPMLMVDKKLITTANTGTSVSQTMFDVYVDTFLSMFDVDRKS 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2595 LNALIATAHSSIKQGTQICKVLDTFLSCARKSCSIDSDVDTKCLADSVMSAVSAGLELTDESCNNLVPTYLKGDNIVAAD 2674
Cdd:cd21714   401 LNSFINTAHSSLKEGVQLEKVLDTFIGCARKSCSIDSDVDTKCIAKSVMSAVAAGLEFTDESCNNLVPTYIKSDNIVAAD 480
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 190360135 2675 LGVLIQNSAKHVQGNVAKIAGVSCIWSVDAFNQLSSDFQHKLKKACCKTGLKLKLTYNKQMANVSVLTTPFSLKG 2749
Cdd:cd21714   481 LGVLIQNSAKHVQGNVAKAANVACIWSVDAFNQLSSDFQHKLKKACVKTGLKLKLTYNKQEANVSILTTPFSLKG 555
betaCoV_Nsp2_MHV-like cd21519
betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins ...
249-851 0e+00

betacoronavirus non-structural protein 2 (Nsp2) similar to MHV Nsp2/p65 and related proteins from betacoronaviruses in the A lineage; Coronavirus non-structural proteins (Nsps) are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases Nsp3 and Nsp5 into smaller subunits. Nsp2 is one of these subunits. This subgroup includes Nsp2 from Murine hepatitis virus (MHV) and betacoronaviruses in the embecovirus subgenus (A lineage). It belongs to a family which includes Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2. The function of Nsp2 remains unclear. SARS-CoV Nsp2, rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers, and it has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2 which have been implicated in cellular functions, including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2, also known as p65, different from SARS-CoV Nsp2, may play an important role in the viral life cycle.


Pssm-ID: 394870  Cd Length: 586  Bit Score: 1085.10  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  249 PLLYVDQYGCDYTGGLADGLEAYADKTLQEMKALFPIWSQELPFDVTVAWHVVRDPRYVMRLQSASTIRSVAYVANPTED 328
Cdd:cd21519     1 PLLFVDQYGCDYTGKLAEGLEAYGDFSLQEMKELFPVWSQSLDFDVVVAWHVVRDPRFVMRLQTLATIRSIEYVAQPTED 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  329 LCDGSVVIKEPVHVYADDSIILRQHNLVDIMSCFYmeaDAVVNAFYGVDLKDCGFVMQFGYIDCEQDLCDFKGWVPGNMI 408
Cdd:cd21519    81 LVDGDVVIREPVHLLAADAIVLKLPKLVDVMQHTD---DSVVESIYKVKLCDCGFVMQFGYVDCCQDDCDFRGWVPGNMI 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  409 DGFACTTCGHVYETGDLLAQSSGVLPVNPVLHTKSAAGYGGfgckDSFTLYGQTVVYFGGCVYWSPARNIWIPILKSSVK 488
Cdd:cd21519   158 DGFACPSCGHVYGPSELLAQSSGVIPENPVLFTNSTDTVNQ----DSFKLYGHSVVPFGGCVYWSPYPGMWIPIIKSSVK 233
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  489 SYDGLVYTGVVGCKAIVKETNLICKALYLDYVQHKCGNLHQRELLGVSDVWHKQLLLNRGVYKPLLENIDYFNMRRAKFS 568
Cdd:cd21519   234 SYDGMVYTGVVGCKTIVKETDAICKALYLDYVQHKCGNLEQREILGLDDVWHKQLLLNRGDYSLLLENIDYFVMRRAKFS 313
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  569 LETFTVCADGFMPFLLDDLVPRAYYLAVSGQAFCDYAGKICHAVVSKSKELLDVSLDSLGAAIHYLNSKIVDLAQHFSDF 648
Cdd:cd21519   314 CETATVCDEGFVPFLLDGLVPRSYYLIKSGQAFTSLMSKFGQEVADMCMEMLVLSMDSVSVATFYIKKNVGKLASQFKAL 393
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  649 GTSFVSKIVHFFKTFTTSTALAFAWVLFHVLHGAYIVVESDIYFVKNIPRYASAVAQAFRSVAKVVLDSLRVTFIDGLSC 728
Cdd:cd21519   394 GAKFVKKLIEWFKAFTDTTALAFAWLLYHVLNGAYIVVESDIYFVKSVPDYARNVVRKFQTFFKMLLDCVKVTFLKGLSV 473
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  729 FKIGRRRICLSGSKIYEVERGLLHSSQLPLDVYDltmpsqvqktkQKPIYLKGSGSDFSLADSVVEVVTTSLTPCGYSEP 808
Cdd:cd21519   474 FKTGRGRVCFAGNKVYKVSRGLLSGFVLPSDVQE-----------SQLTFLEGVAEPVVVEDDVVEVVKTPLTPCGYCKP 542
                         570       580       590       600
                  ....*....|....*....|....*....|....*....|....
gi 190360135  809 PKVADKICIVDNVYMAKAGDKYYPVVVDGH-VGLLDQAWRVPCA 851
Cdd:cd21519   543 PKSAEKICIVDNVYMAKCGDKFYPVVVDDDtIGLLDQAWRFPCA 586
TM_Y_betaCoV_Nsp3_C cd21713
C-terminus of betacoronavirus non-structural protein 3, including transmembrane and Y domains; ...
2195-2749 0e+00

C-terminus of betacoronavirus non-structural protein 3, including transmembrane and Y domains; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In SARS-CoV and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409661  Cd Length: 545  Bit Score: 672.67  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2195 VARGACIIATIFLLWFNFIYANVIFSDFylpkigflPTFVGKIAQWIKNTFSLVTICDLYSIQDVGFKNQYCNGSIACQF 2274
Cdd:cd21713     1 VSLLLFLCLTVLLLWFNFLYANFILSDS--------PTFVGSIVAWFKYTLGISTICDFYQVTYLGDISEFCTGSMLCSL 72
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2275 CLAGFDMLDNYKAIDVVQYEADRRAFVDYtgVLKIVIELIVSYALYTAWFYPLFALISIQILTTWLPELFMLStlHWSVR 2354
Cdd:cd21713    73 CLSGMDSLDNYDALNMVQHTVSSRLSDDY--IFKLVLELFFAYLLYTVAFYVLGLLAILQLFFSYLPLFFMLN--SWLVV 148
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2355 LLVSLANMLPAHVFMRFYIIIASFIKLFSLFRHVAYGCSKSGCLFCYKRNRSLRVKCSTIVGGMIRYYDAMANGGTGFCS 2434
Cdd:cd21713   149 LFVYVINMVPASTLVRMYIVVASLYFVYKLYVHVVYGCNDTACLMCYKRNRATRVECSTVVNGSKRSFYVMANGGTGFCT 228
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2435 KHQWNCIDCDSYKPGNTFITVEAALDLSKELKRPIQPTDVAYHTVTDVKQVGCYMRLFYDRDGQRTYDDVNASLFVDYSN 2514
Cdd:cd21713   229 KHNWNCVNCDTYGPGNTFICDEVAADLSTQFKRPINPTDSSYYSVTSVEVKNGSVHLYYERDGQRVYERFSLSLFVNLDK 308
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2515 LLHSKVKSVP--NMHVVVVENDADKANFLNAAVFYAQSLFRPILMVDKNLITTANTGTSVTETMFDVYVDTFLSMFDVDK 2592
Cdd:cd21713   309 LKHSEVKGSPpfNVIVFDASNRAEENGAKSAAVYYSQLLCKPILLVDKKLVTTVGDSAEVARKMFDAYVNSFLSTYNVTM 388
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2593 KSLNALIATAHSSIKQGTQICKVLDTFLSCARKSCSIDSDVDTKCLADSVMSAVSAGLELTDESCNNLVPTYLKGDNIVA 2672
Cdd:cd21713   389 DKLKTLVSTAHNSLKEGVQLEQVLKTFIGAARQKAAVESDVETKDIVKCVQLAHQADVDFTTDSCNNLVPTYVKVDTITT 468
                         490       500       510       520       530       540       550
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 190360135 2673 ADLGVLIQNSAKHVQGNVAKIAGVSCIWSVDAFNQLSSDFQHKLKKACCKTGLKLKLTYNKQMANVSVLTTPFSLKG 2749
Cdd:cd21713   469 ADLGVLIDNNAKHVNANVAKAANVALIWNVAAFLKLSESLRRQLRSAARKTGLNFKLTTSKLRAVVPILTTPFSLKG 545
B-CoV_A_NSP1 pfam11963
Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus ...
1-354 0e+00

Betacoronavirus, lineage A, NSP1; This family the N-terminal region of the Betacoronavirus polyprotein which contains non-structural protein 1 (Nsp1) from Betacoronavirus lineage A. This protein is important for viral replication and pathogenesis. It suppresses the host innate immune functions by inhibiting type I interferon expression and host antiviral signalling pathways.


Pssm-ID: 152398  Cd Length: 355  Bit Score: 660.48  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135     1 MSKINKYGLELHWAPEFPWMFEDAEEKLDNPSSSEVDIVCSTTAQKLETGGICPENHVMVDCRRLLKQECCVQSSLIREI 80
Cdd:pfam11963    1 MAKMGKYGLGFKWAPEFPWMLPDASEKLGNPERSEEDGFCPSTAQEPEVKGKTLVNHVRVDCRRLLAQECCVQSALIRDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135    81 VMNTRPYDLEVLLQDALQSREAVLVTPPLGMSLEACYVRGCNPNGWTMGLFRRRSVCNTGRCAVNKHVAYQLYMIDPAGV 160
Cdd:pfam11963   81 FVDEDPQKVEVLTMMALQSGSAVLVKPPLRLSVQAWHSLGVLPKGYAMGLFRRYCLCNTRECKCDAHVAFQLFMVQPDGV 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   161 CFGAGQFVGWVIPLAFMPVQSRKFIVPWVMYLRKCGEKGAYNKDHKRGGFE-HVYNFKVEDAYDLVHDEPKGKFSKKAYA 239
Cdd:pfam11963  161 CFGNGRFIGWFVPVTFMPEYAKKWLQPWSIYLRKGGNKGSVTSDHFRRAFTmPVYDFNVEDAYAEVHDEPKGKYSQKAYA 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   240 LIRGYRGVKPLLYVDQYGCDYTGGLADGLEAYADKTLQEMKALFPIWSQELPFDVTVAWHVVRDPRYVMRLQSASTIRSV 319
Cdd:pfam11963  241 LLRGYRGVKPVLFVDQYGCDYTGCLADGLEAYGDYTLQDMKQLQPVWLANLDFDVVVAWHVVRDPRAVMRLQTIATICGI 320
                          330       340       350
                   ....*....|....*....|....*....|....*
gi 190360135   320 AYVANPTEDLCDGSVVIKEPVHVYADDSIILRQHN 354
Cdd:pfam11963  321 AYVAQPTEDVVDGDVVIKEPVHLLSADAIVLRLPS 355
betaCoV_Nsp5_Mpro cd21666
betacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily ...
3250-3543 2.12e-177

betacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro), found in betacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


Pssm-ID: 394887  Cd Length: 297  Bit Score: 547.00  E-value: 2.12e-177
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3250 VKMVNPTSKVEPCIVSVTYGNMTLNGLWLGDKVYCPRHVICSASDMTNPDYTNLLCRVTSSDFTVLFDRLSLTVMSYQMQ 3329
Cdd:cd21666     1 RKMAFPSGKVEGCMVQVTCGTMTLNGLWLDDTVYCPRHVICTAEDMLNPNYEDLLIRKTNHSFLVQAGNVQLRVIGHSMQ 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3330 GCMLVLTVTLQNSRTPKYTFGVVKPGETFTVLAAYNGKPQGAFHVTMRSSYTIKGSFLCGSCGSVVYVIMGDCVKFVYMH 3409
Cdd:cd21666    81 GCLLRLTVDTSNPKTPKYKFVRVKPGQTFSVLACYNGSPSGVYQCAMRPNHTIKGSFLCGSCGSVGYNIDGDCVSFCYMH 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3410 QLELSTGCHTGTDFNGDFYGPYKDAQVVQLPVQDYIQSVNFVAWLYAAILNNCNWFVQSDKCSVEDFNVWALSNGFSQVK 3489
Cdd:cd21666   161 QMELPTGVHTGTDLEGKFYGPFVDRQTAQAAGTDTTITLNVLAWLYAAVLNGDRWFVNRFTTTLNDFNLWAMKYNYEPLT 240
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 190360135 3490 SD--LVIDALASMTGVSLETLLAAIKHLKNGF-QGRQIMGSCSFEDELTPSDVYQQL 3543
Cdd:cd21666   241 QDhvDILDPLAAQTGIAVEDMLAALKELLQGGmQGRTILGSTILEDEFTPFDVVRQC 297
CoV_NSP3_C pfam19218
Coronavirus replicase NSP3, C-terminal; This family represents the C-terminal region of ...
2248-2736 4.34e-173

Coronavirus replicase NSP3, C-terminal; This family represents the C-terminal region of non-structural protein NSP3 (also known as nsp3). NSP3 is the product of ORF1a. It is found in human SARS coronavirus polyprotein 1a and 1ab, and in related coronavirus polyproteins. It is a multifunctional protein comprising up to 16 different domains and regions. NSP3 binds to viral RNA, nucleocapsid protein, as well as other viral proteins and participates in polyprotein processing.


Pssm-ID: 466002  Cd Length: 463  Bit Score: 541.93  E-value: 4.34e-173
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2248 VTICD----LYSIQDVgFKNQYCNGSIACQFCLAGFDMLDNYKAIDVVQYEADRRAFVDYTGVLKIVIELIVSYALYTAW 2323
Cdd:pfam19218    1 GYPCDgyvdGYSNSSF-NKSDYCNGSILCKACLSGYDSLHDYPHLKVVQQPVKDPLFVDVTPLFYFAIELFVALALFGGT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2324 FYPLFALISIQILTTWLPELFMLSTLHWsvrllvsLANMLPAHVFMRFYIIIASFIKLFSLFRHVAYGCSKSGCLFCYKR 2403
Cdd:pfam19218   80 FVRVFLLYFLQQYVNFFGVYLGLQDYSW-------FLTLIPFDSFLREYVVLFYVIKLYRFLKHVVFGCKKPSCLACSKS 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2404 NRSLRVKCSTIVGGMIRYYDAMANGGTGFCSKHQWNCIDCDSYKPGNTFITVEAALDLSKELKRPIQPTDVAYHTVTDVK 2483
Cdd:pfam19218  153 ARLTRVPVSTVVNGSKKSFYVNANGGTKFCKKHNFFCKNCDSYGPGNTFINDEVAEDLSNVTKRSVKPTDPAYYEVDKVE 232
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2484 QVGCYMRLFYDRDGQRTYDDVNASLFVDYSNLLHSKVKSVPNMHVVVVEND-ADKANFLNAAVFYAQSLFRPILMVDKNL 2562
Cdd:pfam19218  233 FQNGFYYLYSGREFWRYYFDVTVSKYSDKEVLKNCNIKGYPLDDFIVYNSNgSNLAQAKNACVYYSQLLCKPIKLVDSNL 312
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2563 ITTANTGTSVTETMFDVYVDTFLSMFDVDKKSLNALIATAHssikqgtqickvldtflscarkscSIDSDVDTKCLADSV 2642
Cdd:pfam19218  313 LSSLGDSVDVNGALHDAFVEVLLNSFNVDLSKCKTLIECKK------------------------DLGSDVDTDSFVNAV 368
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2643 MSAVSAGLELTDESCNNLVPTYLK-GDNIVAADLGVLIQNSAKHVQGNVAKIAGVSCIWSVDAFNQLSSDFQHKLKKACC 2721
Cdd:pfam19218  369 LNAHRYDVLLTDDSFNNFVPTYAKpEDSLSTHDLAVCIRFGAKIVNHNVLKKENVPVVWSADDFLKLSEEARKYIVKTAK 448
                          490
                   ....*....|....*
gi 190360135  2722 KTGLKLKLTYNKQMA 2736
Cdd:pfam19218  449 KKGVTFMLTFNTNRM 463
cv_Nsp4_TM cd21473
coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in ...
2761-3143 8.85e-163

coronavirus non-structural protein 4 (Nsp4) transmembrane domain; Nsp4 may be involved in coronavirus-induced membrane remodeling. In order to assemble the replication-transcription complex (RTC), coronavirus induces the rearrangement of host endoplasmic reticulum (ER) membrane into double membrane vesicles (DMVs), zippered ER, or ER spherules. DMV formation has been observed in SARS-CoV cells overexpressing the three transmembrane-containing non-structural proteins of viral replicase polyprotein 1ab: Nsp3, Nsp4 and Nsp6. Together, Nsp3, Nsp4, and Nsp6 have the ability to induce the formation of DMVs that are similar to those seen in SARS-CoV-infected cells.


Pssm-ID: 394836  Cd Length: 376  Bit Score: 508.67  E-value: 8.85e-163
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2761 FVLSLVCFIGLWCLMPTYTVHKSDFQLPVYASYKVLDNGVIRDVSVEDVCFANKFEQFDQWYESTFGlSYYSNSMACPIV 2840
Cdd:cd21473     1 FLWLLLAAILLYAFLPSYSVFTVTVSSFPGYDFKVIENGVLRDIRSTDTCFANKFVNFDSWYQAKYG-SVPTNSKSCPIV 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2841 VAVVDQDFGStVFNVPTKVLRYGYHVLHFITHALSADGVQCYTPHSQISYSNFYASGCVLSSACTMFAMaDGSPQPYCYT 2920
Cdd:cd21473    80 VGVIDDVRGS-VPGVPAGVLLVGKTLVHFVQTVFFGDTVVCYTPDGVITYDSFYTSACVFNSACTYLTG-LGGRQLYCYD 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2921 DGLMQNASLYSSLVPHVRYNLANAkGFIRFPEVLREGLVRIVRTRSMPYCRVGLCEEADEGICFNFNGSWVLNNDYYRsl 3000
Cdd:cd21473   158 TGLVEGAKLYSDLLPHVRYKLVDG-NYIKFPEVILEGGPRIVRTLATTYCRVGECEDSKAGVCVSFDGFWVYNNDYYG-- 234
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3001 PGTFCGRDVFDLIYQLFKGLAQPVDFLALTASSIAGAILAVIVVLVFYYLIKLKRAFGDyTSIVFVNVIVWCVNFMMLFV 3080
Cdd:cd21473   235 PGVYCGDGLFDLLTNLLSGFFQPVSVFALSGQLLFNTIVAILAVLACYYVQKFKRAFGD-MSVVVVTVVAAALVNNVLYV 313
                         330       340       350       360       370       380
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 190360135 3081 FQVYPTLSCVYAICYFYATLYFPSEISVIMHLQWLVMYGTIMPLWFCLLYISVVVSNHAFWVF 3143
Cdd:cd21473   314 VTQNPLLMIVYAVLYFYATLYLTYERAWIMHLGWVVAYGPIAPWWLLALYVVAVLYDYLPWFF 376
MHV-like_Nsp1 cd21879
non-structural protein 1 from murine hepatitis virus and betacoronavirus in the A lineage; ...
6-241 2.86e-158

non-structural protein 1 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the non-structural protein 1 (Nsp1) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV), bovine coronavirus (BCoV) and Human coronavirus HKU1. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. Nsp1 is the N-terminal cleavage product released from the ORF1a polyprotein by the action of papain-like protease (PLpro). Though Nsp1s of alphaCoVs and betaCoVs share structural similarity, they show no significant sequence similarity and may be considered as genus-specific markers. Despite low sequence similarity, the Nsp1s of alphaCoVs and betaCoVs exhibit remarkably similar biological functions, and are involved in the regulation of both host and viral gene expression. CoV Nsp1 induces suppression of host gene expression and interferes with host immune response. It inhibits host gene expression in two ways: by targeting the translation and stability of cellular mRNAs, and by inhibiting mRNA translation and inducing an endonucleolytic RNA cleavage in the 5'-UTR of cellular mRNAs through its tight association with the 40S ribosomal subunit, a key component of the cellular translation machinery. Inhibition of host mRNA translation includes that of type I interferons, major components of the host innate immune response. Nsp1 is critical in regulating viral replication and gene expression, as shown by multiple evidences, including: mutations in the Nsp1 coding region of the transmissible gastroenteritis virus (TGEV) and MHV genomes cause drastic reduction or elimination of infectious virus; BCoV Nsp1 is an RNA-binding protein that interacts with cis-acting replication elements in the 5'-UTR of the BCoV genome, implying its potential role in the regulation of viral translation or replication; and SARS-CoV Nsp1 enhances virus replication by binding to a stem-loop structure in the 5'-UTR of its genome.


Pssm-ID: 409341  Cd Length: 236  Bit Score: 489.20  E-value: 2.86e-158
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135    6 KYGLELHWAPEFPWMFEDAEEKLDNPSSSEVDIVCSTTAQKLETGGICPENHVMVDCRRLLKQECCVQSSLIREIVMNTR 85
Cdd:cd21879     1 KYGLGLKWAPEFPWMFEDAEEKLGNPSSSEEDGFCPTTAQKLETVGICLENHVKVDCRRLLKQECCVQSNLIRDIFVDTD 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   86 PYDLEVLLQDALQSREAVLVTPPLGMSLEACYVRGCNPNGWTMGLFRRRSVCNTGRCAVNKHVAYQLYMIDPAGVCFGAG 165
Cdd:cd21879    81 PYDVEVLTQDALQSGEAVLVKPPLRMSLEACYKLGCLPKGWAMGLFRRRCVCNTGRCGVDKHVAYQLFMIDPDGVCLGAG 160
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 190360135  166 QFVGWVIPLAFMPVQSRKFIVPWVMYLRKCGEKGAYNKDHKRGGFEHVYNFKVEDAYDLVHDEPKGKFSKKAYALI 241
Cdd:cd21879   161 RFIGWVVPLAFIPEYARKWLQPWVIYLRKYGEKGAYTKGHKRGGFGHVYDFKVEDAYDEVHDEPKGKYSKKAYALL 236
CoV_Nsp5_Mpro cd21646
coronavirus non-structural protein 5, also called Main protease (Mpro); This family contains ...
3250-3542 1.71e-156

coronavirus non-structural protein 5, also called Main protease (Mpro); This family contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro/Nsp5 requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


Pssm-ID: 394885  Cd Length: 292  Bit Score: 486.54  E-value: 1.71e-156
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3250 VKMVNPTSKVEPCIVSVTYGNMTLNGLWLGDKVYCPRHVICSASDmTNPDYTNLLCRVTSSDFTVLFDRLSLTVMSYQMQ 3329
Cdd:cd21646     1 KKMAQPSGKVERCMVSVTYGSTTLNGLWLDDTVYCPRHVICKSTT-SGPDYDDLLSRARNHNFSVQSGGVQLRVVGVTMQ 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3330 GCMLVLTVTLQNSRTPKYTFGVVKPGETFTVLAAYNGKPQGAFHVTMRSSYTIKGSFLCGSCGSVVYVIMGDCVKFVYMH 3409
Cdd:cd21646    80 GALLRLKVDTSNPHTPKYKFVTVKPGDSFTILACYNGSPSGVYGVNMRSNYTIKGSFLNGACGSVGYNIDGGTVEFCYMH 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3410 QLELSTGCHTGTDFNGDFYGPYKDAQVVQLPVQDYIQSVNFVAWLYAAILNNCNWFVQSDKCSVEDFNVWALSNGFSQVK 3489
Cdd:cd21646   160 HLELPNGCHTGTDLTGKFYGPYVDQQVAQVEGADTLITDNVVAWLYAAIINGDRWWLNSSRTTVNDFNEWAMANGFTPVS 239
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|...
gi 190360135 3490 SDLVIDALASMTGVSLETLLAAIKHLKNGFQGRQIMGSCSFEDELTPSDVYQQ 3542
Cdd:cd21646   240 QVDCLSILAAKTGVSVERLLAAIQQLHQNFGGKQILGSTSLEDEFTPEDVVRQ 292
betaCoV_PLPro cd21732
betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1563-1861 3.03e-156

betacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of betacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. In SARS-CoV and murine hepatitis virus (MHV), the C-terminal non-structural protein 3 region spanning transmembrane regions TM1 and TM2 with 3Ecto domain in between, are important for the PL2pro domain to process Nsp3-Nsp4 cleavage. Besides cleaving the polyproteins, PLPro also possesses a related enzymatic activity to promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both, ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15), are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain of many of these CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation. Interactions of SARS-CoV and MERS-CoV with antiviral interferon (IFN) responses of human cells are remarkably different; high-dose IFN treatment (type I and type III) shows MERS-CoV was substantially more IFN sensitive than SARS-CoV. This may be due to differences in the architecture of the oxyanion hole and of the S3 as well as the S5 specificity sites, despite the overall structures of SARS-CoV and MERS-CoV PLPro being similar.


Pssm-ID: 409649  Cd Length: 304  Bit Score: 486.32  E-value: 3.03e-156
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1563 DKVDILLTVDGVNFTNRFVPVGESFGKSLGNVFCDGVNVTKHKCDINYKGKVFFQFDNLSSEDLKAVRSSFNFDQK-ELL 1641
Cdd:cd21732     1 KTIEVLTTVDGVNFRTVLVNNGETFGKQLGNVFCDGVDVTKTKPSAKYEGKVLFQADNLSAEELEAVEYYYGFDDPtFLL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1642 AYYNMLVNCSKWQVVFNGKYFTFKQANNNCFVNVSCLMLQSLNLKFKIVQWQEAWLEFRSGRPARFVSLVLAKGGFKFGD 1721
Cdd:cd21732    81 RYYSALAHVKKWKFVVVDGYFSLKQADNNCYLNAACLMLQQLDLKFNTPALQEAYYEFRAGDPLRFVALVLAYGNFTFGE 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1722 PADSRDFLRVVFSQVDLTGAICDFEIACK-CGVKQEQRTGVDAVMHFGTLSREDLEIGYTVDCSCGKKLIHCVRFDV-PF 1799
Cdd:cd21732   161 PDDARDFLRVVLSHADLVSARRVLEEVCKvCGVKQEQRTGVDAVMYFGTLSLDDLYKGYTIDCSCGRKAIRYLVEQVpPF 240
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 190360135 1800 LICSNTPASVKLPKG-VGSANIFKGDK-VGHYVHVKCEQSYQLYDASNVKKVTDVTGNLSDCLY 1861
Cdd:cd21732   241 LLMSNTPTEVPLPTGdFVAANVFTGDEsVGHYTHVKNKSLLYLYDAGNVKKTSDLKGPVTDVLY 304
CoV_NSP4_N pfam19217
Coronavirus replicase NSP4, N-terminal; This is the N-terminal domain of the coronavirus ...
2772-3128 3.73e-151

Coronavirus replicase NSP4, N-terminal; This is the N-terminal domain of the coronavirus nonstructural protein 4 (NSP4). NSP4 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. NSP4 is a membrane-spanning protein which is thought to anchor the viral replication-transcription complex to modified endoplasmic reticulum membranes. This N-terminal region represents the membrane spanning region, covering four transmembrane regions.


Pssm-ID: 466001  Cd Length: 351  Bit Score: 474.07  E-value: 3.73e-151
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2772 WCLMPTYTVHKSDFQLPVYASYKVLDNGVIRDVSVEDVCFANKFEQFDQWYESTFGlsYYSNSMACPIVVAVVDQDFGST 2851
Cdd:pfam19217    1 YALSPTFFNTVVYFVSDPVYDFKVIENGVLRDFRSTDTCFHNKFDNFDSWHQAKFG--SPTNSRSCPIVVGVVDEVVGRV 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2852 VFNVPTKVLRYGYHVLHFITHALSADGVQCYTPHSQISYSNFYASGCVLSSACTMFAMADGSPQPYCYTDGLMQNASLYS 2931
Cdd:pfam19217   79 VPGVPAGVALVGGTILHFVTRVFFGAGNVCYTPSGVVTYESFSASACVFNSACTTLTGLGGTRVLYCYDDGLVEGAKLYS 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  2932 SLVPHVRYNLANAkGFIRFPEVLREGLVRIVRTRSMPYCRVGLCEEADEGICFNFNGSWVLNNDYYrslPGTFCGRDVFD 3011
Cdd:pfam19217  159 DLVPHVRYKLVDG-NYVKLPEVLFRGGFRIVRTLATTYCRVGECEDSKAGVCVGFDRSFVYNNDFG---PGVYCGSGFLS 234
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3012 LIYQLFKGLAQPVDFLALTASSIAGAILAVIVVLVFYYLIKLKRAFGDYTSIVFVNVIVWCVNFMMLFVFQVYPTLSCVY 3091
Cdd:pfam19217  235 LLTNVFSGFNTPISVFALTGQLMFNCVVALIAVCVCYYVLKFKRAFGDYSTGVLTVVLATLVNNLSYFVTQVNPVLMIVY 314
                          330       340       350
                   ....*....|....*....|....*....|....*..
gi 190360135  3092 AICYFYATLYFPSEISVIMHLQWLVMYGTIMPLWFCL 3128
Cdd:pfam19217  315 AVLYFYATLYVTPEYAWIWHLGFLVAYVPLAPWWVLL 351
Peptidase_C30 pfam05409
Coronavirus endopeptidase C30; This Coronavirus (CoV) domain, peptidase C30, is also known as ...
3275-3549 6.84e-149

Coronavirus endopeptidase C30; This Coronavirus (CoV) domain, peptidase C30, is also known as 3C-like proteinase (3CL-pro), or CoV main protease (M-pro) domain. CoV M-pro is a dimer where each subunit is composed of three domains I, II and III,,. Domains I and II consist of six-stranded antiparallel beta barrels and together resemble the architecture of chymotrypsin, and of picornaviruses 3C proteinases. The substrate-binding site is located in a cleft between these two domains. The catalytic site is situated at the centre of the cleft. A long loop connects domain II to the C-terminal domain (domain III). This latter domain has been implicated in the proteolytic activity of M-pro. In the active site of M-pro, Cys and His form a catalytic dyad,.


Pssm-ID: 398852  Cd Length: 274  Bit Score: 464.22  E-value: 6.84e-149
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3275 GLWLGDKVYCPRHVICSASDMTnPDYTNLLCRVTSSDFTVLFDRLSLTVMSYQMQGCMLVLTVTLQNSRTPKYTFGVVKP 3354
Cdd:pfam05409    1 GLWLGDTVYCPRHVIGSFTGML-PQYEHLLSIARNHDFCVVSGGVQLTVVSAKMQGAILVLKVHTNNPNTPKYKFVRLKP 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3355 GETFTVLAAYNGKPQGAFHVTMRSSYTIKGSFLCGSCGSVVYVIMGDCVKFVYMHQLELSTGCHTGTDFNGDFYGPYKDA 3434
Cdd:pfam05409   80 GESFTILAAYDGCPQGVYHVTMRSNHTIKGSFLNGACGSVGYNLKGGTVCFVYMHHLELPNGSHTGTDLEGVFYGPYVDE 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3435 QVVQLPVQDYIQSVNFVAWLYAAILNNCNWFVQSDKCSVEDFNVWALSNGFSQVKSDLVIDALASMTGVSLETLLAAIKH 3514
Cdd:pfam05409  160 EVAQLEGTDQTYTDNVVAWLYAAIINGPRWFLASTTVSLEDFNAWAMTNGFTPFPCEDAILGLAAKTGVSVERLLAAIKV 239
                          250       260       270
                   ....*....|....*....|....*....|....*
gi 190360135  3515 LKNGFQGRQIMGSCSFEDELTPSDVYQQLAGIKLQ 3549
Cdd:pfam05409  240 LNNGFGGRTILGSPSLEDEFTPEDVYNQMAGVTLQ 274
betaCoV-Nsp6 cd21560
betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell ...
3550-3836 1.44e-141

betacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, have the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394846  Cd Length: 290  Bit Score: 443.61  E-value: 1.44e-141
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3550 SKRTRLVKGIVCWIMASTFLFSCIITAFVKWTMFMYVTTNMLSITFCALCVIS-LTMLLVKHKHLYLTMYIIPVLFTLLY 3628
Cdd:cd21560     1 SKVKRVVKGTLHWLLATFVLFYLIILQLTKWTMFMYLTETMLLPLTPALCCVSaCVMLLVKHKHTFLTLFLLPVLLTLAY 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3629 NNYLVVYKQTFRGYVYAWLSYYVPSVEYTYTDEVIYGMLLLIGMVfVTLRSINQYLFSFIMFVGRVISVVSLWYMGSnLE 3708
Cdd:cd21560    81 YNYVYVPKSSFLGYVYNWLNYVNPYVDYTYTDEVTYGSLLLVLML-VTMRLVNHDAFSRVWAVCRVITWVYMWYTGS-LE 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3709 EEILLMLASLFGTYTWTTA---LSMAAAK-VIAKWVAVNVLYFTDIPQIKIVLVCYLFIGYIISCYWGLFSLMNSLFRMP 3784
Cdd:cd21560   159 ESALSYLTFLFSVTTNYTGvvtVSLALAKfITALWLAYNPLLFLDIPEVKCVLLVYLFIGYICTCYFGVFSLLNRLFRCP 238
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 190360135 3785 LGVYNYKISVQELRYMNANGLRPPKNSFEALMLNFKLLGIGGVPIIEVSQFQ 3836
Cdd:cd21560   239 LGVYDYKVSTQEFRYMNANGLRPPRNSWEALMLNFKLLGIGGVPCIKVSTVQ 290
alphaCoV_Nsp5_Mpro cd21665
alphacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily ...
3251-3545 2.19e-115

alphacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro), found in alphacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro/Nsp5 requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


Pssm-ID: 394886  Cd Length: 296  Bit Score: 368.93  E-value: 2.19e-115
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3251 KMVNPTSKVEPCIVSVTYGNMTLNGLWLGDKVYCPRHVIcsASDMTNP-DYTNLLCRVTSSDFTVLFDRLSLTVMSYQMQ 3329
Cdd:cd21665     3 KMAQPSGVVEKCVVRVSYGNMVLNGLWLGDTVYCPRHVI--ASDTTSTiDYDHEYSLMRLHNFSISVGNVFLGVVGVTMR 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3330 GCMLVLTVTLQNSRTPKYTFGVVKPGETFTVLAAYNGKPQGAFHVTMRSSYTIKGSFLCGSCGSVVYVIMGDCVKFVYMH 3409
Cdd:cd21665    81 GALLVIKVNQNNVNTPKYTFRTLKPGDSFNILACYDGVPSGVYGVNMRTNYTIKGSFINGACGSPGYNLNNGTVEFCYMH 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3410 QLELSTGCHTGTDFNGDFYGPYKDAQVVQLPVQDYIQSVNFVAWLYAAILNNCNWFVQSDKCSVEDFNVWALSNGFSQVK 3489
Cdd:cd21665   161 QLELGSGCHVGSDLDGVMYGGYEDQPTLQVEGANVLVTENVVAFLYAALLNGCNWWLSSDRVTVEAFNEWAVANGFTTVS 240
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 190360135 3490 SDLVIDALASMTGVSLETLLAAIKHLKNGFQGRQIMGSCSFEDELTPSDVYQQLAG 3545
Cdd:cd21665   241 STDCFSILAAKTGVDVERLLAAIQRLSKGFGGKTILGYTSLTDEFTLSEVIKQMYG 296
betaCoV_Nsp8 cd21831
betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
3929-4122 2.41e-112

betacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) the highly pathogenic betacoronaviruses that include Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. SARS-CoV Nsp8 forms a 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409258  Cd Length: 196  Bit Score: 356.02  E-value: 2.41e-112
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3929 SEFVNMASFVEYEVAKKNLDEACSSGSANQQQLKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSK 4008
Cdd:cd21831     1 SEFSNLASYAEYETAQKAYDEAVASGDASPQVLKALKKAVNVAKSAYEKDKAVARKLERMADQAMTSMYKQARAEDKKSK 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4009 VVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLAANTLTIIVPDKSVYDQVVDNVYVTYAGNVWQIQTIQDS 4088
Cdd:cd21831    81 VVSAMQTMLFGMIRKLDNDALNNIINNARNGCVPLSIIPLTAANKLRVVVPDYSVYKQVVDGPTLTYAGALWDIQQINDA 160
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 190360135 4089 DGTNKQLHEISDDC---NWPLVIIANRHNEvSATALQ 4122
Cdd:cd21831   161 DGKIVQLSDITEDSenlAWPLVVTATRANS-SAVKLQ 196
TM_Y_SARS-CoV-like_Nsp3_C cd21717
C-terminus of non-structural protein 3, including transmembrane and Y domains, from Severe ...
2265-2749 1.33e-110

C-terminus of non-structural protein 3, including transmembrane and Y domains, from Severe acute respiratory syndrome-related coronavirus and betacoronavirus in the B lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the sarbecovirus subgenus (B lineage), including highly pathogenic human coronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV). This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In SARS-CoV and the related murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409665  Cd Length: 531  Bit Score: 365.08  E-value: 1.33e-110
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2265 YCNGSIACQFCLAGFDMLDNYKAIDVVQYEADRRAFvDYTgVLKIVIELIVSYALYTAWFYPLFALISIQILTTWLPELF 2344
Cdd:cd21717    50 FCEGSFPCSVCLSGLDSLDSYPALETIQVTISSYKL-DLT-ILGLAAEWFLAYMLFTKFFYLLGLSAIMQVFFGYFASHF 127
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2345 MLSTlhWSVRLLVSLANMLPAHVFMRFYIIIASFIKLFSLFRHVAYGCSKSGCLFCYKRNRSLRVKCSTIVGGMIRYYDA 2424
Cdd:cd21717   128 ISNS--WLMWFIISIVQMAPVSAMVRMYIFFASFYYIWKSYVHIMDGCTSSTCMMCYKRNRATRVECTTIVNGMKRSFYV 205
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2425 MANGGTGFCSKHQWNCIDCDSYKPGNTFITVEAALDLSKELKRPIQPTDVAYHTVTDVKQVGCYMRLFYDRDGQRTYDDV 2504
Cdd:cd21717   206 YANGGRGFCKTHNWNCLNCDTFCAGSTFISDEVARDLSLQFKRPINPTDQSSYVVDSVAVKNGALHLYFDKAGQKTYERH 285
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2505 NASLFVDYSNLLHSKVKSVPNMHVVVVENDA--DKANFLNAAVFYAQSLFRPILMVDKNLITTANTGTSVTETMFDVYVD 2582
Cdd:cd21717   286 PLSHFVNLDNLRANNTKGSLPINVIVFDGKSkcDESAAKSASVYYSQLMCQPILLLDQALVSDVGDSTEVSVKMFDAYVD 365
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2583 TFLSMFDVDKKSLNALIATAHSSIKQGTQICKVLDTFLSCARKSCsIDSDVDTKCLADSVMSAVSAGLELTDESCNNLVP 2662
Cdd:cd21717   366 TFSATFSVPMEKLKALVATAHSELAKGVALDGVLSTFVSAARQGV-VDTDVDTKDVIECLKLSHHSDLEVTGDSCNNFML 444
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2663 TYLKGDNIVAADLGVLIQNSAKHVQGNVAKIAGVSCIWSVDAFNQLSSDFQHKLKKACCKTGLKLKLTYNKQMANVSVLT 2742
Cdd:cd21717   445 TYNKVENMTPRDLGACIDCNARHINAQVAKSHNVSLIWNVKDYMSLSEQLRKQIRSAAKKNNIPFRLTCATTRQVVNVIT 524

                  ....*..
gi 190360135 2743 TPFSLKG 2749
Cdd:cd21717   525 TKISLKG 531
CoV_NSP8 pfam08717
Coronavirus replicase NSP8; Viral NSP8 (non structural protein 8) forms a hexadecameric ...
3926-4116 1.74e-97

Coronavirus replicase NSP8; Viral NSP8 (non structural protein 8) forms a hexadecameric supercomplex with NSP7 that adopts a hollow cylinder-like structure. The dimensions of the central channel and positive electrostatic properties of the cylinder imply that it confers processivity on RNA-dependent RNA polymerase. NSP7 and NSP8 heterodimers play a role in the stabilization of NSP12 regions involved in RNA binding and are essential for a highly active NSP12 polymerase complex. It has been demonstrated that NSP8 acts as an oligo(U)-templated polyadenylyltransferase but also has robust (mono/oligo) adenylate transferase activities. NSP8 has N- and C-terminal D/ExD/E conserved motifs, being the N-terminal motif critical for RNA polymerase activity as these residues are part of the Mg2-binding active site.


Pssm-ID: 400866  Cd Length: 197  Bit Score: 313.32  E-value: 1.74e-97
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3926 ALQSEFVNMASFVEYEVAKKNLDEACSSGSAnQQQLKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDK 4005
Cdd:pfam08717    1 SVASEFSSLPSYAAYETAKEAYEEAVANGSS-QQVLKQLKKACNIAKSEFDRDAAVQKKLEKMAEQAMTQMYKEARAVDR 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  4006 KSKVVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLAANTLTIIVPDKSVYDQVVDNVYVTYAGNVWQIQTI 4085
Cdd:pfam08717   80 KSKVVSAMHTLLFSMLRKLDNSALNTIINNARNGVVPLNIIPATTAAKLTVVVPDYETFVKVVDGNTVTYAGAVWEIQEV 159
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 190360135  4086 QDSDGTNKQLHEI----SDDCNWPLVIIANRHNEV 4116
Cdd:pfam08717  160 KDADGKIVHLKEItmdnSPNLAWPLIVTAERANSA 194
TM_Y_MERS-CoV-like_Nsp3_C cd21716
C-terminus of non-structural protein 3, including transmembrane and Y domains, from Middle ...
2263-2746 4.86e-92

C-terminus of non-structural protein 3, including transmembrane and Y domains, from Middle East respiratory syndrome-related coronavirus and betacoronavirus in the C lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the merbecovirus subgenus (C lineage), including Middle East respiratory syndrome-related coronavirus (MERS-CoV) and Tylonycteris bat coronavirus HKU4. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In the related betacoronaviruses, Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409664  Cd Length: 566  Bit Score: 312.51  E-value: 4.86e-92
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2263 NQYC-NGSIACQFCLAGFDMLDNYKAIDVVQYEADRRAF-VDYtgvLKIVIELIVSYALYTAWFYPLFALISIQ------ 2334
Cdd:cd21716    71 PDFCaNRSALCNWCLIGQDSITHYSALKMVQTHLSHYVLnIDW---LWFALELLLAYVLYTSAFNWLLLACTLQyffaqt 147
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2335 -ILTTWLPELFMLSTLHWsvrllvsLANMLPAHVFMRFYIIIASFIKLFSLFRHVAYGCSKSGCLFCYKRNRSLRVKCST 2413
Cdd:cd21716   148 sAFVDWRSYNYVVSGIFL-------LFTHIPLDGLVRIYNVLACLWFLRKFYNHVINGCKDTACLLCYKRNRLTRVEAST 220
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2414 IVGGMIRYYDAMANGGTGFCSKHQWNCIDCDSYKPGNTFITVEAALDLSKELKRPIQPTDVAYHTVTDVKQVGCYMRLFY 2493
Cdd:cd21716   221 VVCGGKRTFYITANGGTSFCRRHNWNCVDCDTAGVGNTFICEEVANDLTTSLRRLVKPTDRSHYYVDSVEVKDTVVQLNY 300
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2494 DRDGQRTYDDVNASLFvdySNLLHSKVKSV-------PNMHVVVVE-NDADKANFLNAA-VFYAQSLFRPILMVDKNLIT 2564
Cdd:cd21716   301 RRDGQSCYERFPLCYF---TNLDKLKFKEVcktttgiPEHNFIIYDsSDRGQENLARSAcVYYSQVLCKPILLVDSNLVT 377
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2565 TANTGTSVTETMFDVYVDTFLSMFDVDKKSLNALIATAHSSIKQGTQICKVLDTFLSCARKSCSIDSDVDTKCLADSVMS 2644
Cdd:cd21716   378 SVGDSSEIAIKMFDSFVNSFVSLYNVTRDKLEKLISTARDGVKRGDNFQSVLKTFIDAARGPAGVESDVETNEIVDAVQY 457
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2645 AVSAGLELTDESCNNLVPTYLKGDNIVAADLGVLIQNSAKHVQGNVAKIAGVSCIWSVDAFNQLSSDFQHKLKKACCKTG 2724
Cdd:cd21716   458 AHKHDIQLTTESYNNYVPSYVKPDSVATSDLGSLIDCNAASVNQTSMRNANGACIWNAAAYMKLSDSLKRQIRIACRKCN 537
                         490       500
                  ....*....|....*....|..
gi 190360135 2725 LKLKLTYNKQMANVSVLTTPFS 2746
Cdd:cd21716   538 LNFRLTTSKLRANDNILSVKFS 559
TM_Y_HKU9-like_Nsp3_C cd21715
C-terminus of non-structural protein 3, including transmembrane and Y domains, from Rousettus ...
2241-2749 6.44e-91

C-terminus of non-structural protein 3, including transmembrane and Y domains, from Rousettus bat coronavirus HKU9 and betacoronavirus in the D lineage; This model represents the C-terminus of non-structural protein 3 (Nsp3) from betacoronavirus in the nobecovirus subgenus (D lineage), including Rousettus bat coronavirus HKU9. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In the related betacoronaviruses, Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409663  Cd Length: 526  Bit Score: 307.94  E-value: 6.44e-91
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2241 IKNTFSLVTICDLYSIQDVGfknqycNGSIAcQFCLAGFDMLDnYKAIDVVQYEADrrAFVDYTGVLkIVIELIVSYALY 2320
Cdd:cd21715    28 FKHFLGIVMPCDYVLVNETG------TGWLH-HLCMAGMDGLD-YPALRMQQHRYG--SPYDYTYIL-MLLEAFCAYLLY 96
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2321 TAWFYPLFALISIQILTTWLPELFMLStlhWSVRLLVSLANMLPAHVFMRFYIIIASFIKLFSLFRHVAYGCSKSGCLFC 2400
Cdd:cd21715    97 TPALPIVGILAVLHLLVLYLPIPLGNS---WLVVFLYYIIRLVPFTSMLRMYIVIAFLWLCYKGFVHVRYGCNNVACLMC 173
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2401 YKRNRSLRVKCSTIVGGMIRYYDAMANGGTGFCSKHQWNCIDCDSYKPGNTFITVEAALDLSKELKRPIQPTDVAYHTVT 2480
Cdd:cd21715   174 YKKNVAKRIECSTVVNGVKRMFYVNANGGTYFCTKHNWNCVSCDTYTVDSTFISRQVALDLSAQFKRPINHTDEAYYEVT 253
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2481 DVKQVGCYMRLFYDRDGQRTYDDVNASLFVDYSNLLHSKVKSV-PNMHVVVVE--NDADKANFLNAAVFYAQSLFRPILM 2557
Cdd:cd21715   254 SVEVRNGYVYCYFDSDGQRSYERFPMDAFTNVSKLHYSELKGAaPAFNVLVFDatNRIEENAVKTAAIYYAQLACKPILL 333
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2558 VDKNLITTANTGTSVTETMFDVYVDTFLSMFDVDKKSLNALIATAHSSIKQGTQICKVLDTFLSCAR-KSCSIDSDVDTK 2636
Cdd:cd21715   334 VDKRMVGVVGDDATIAKAMFEAYAQNYLLKYSIAMDKVKHLYSTALQQIASGMTVESVLKVFVGSTRaEAKDLESDVDTN 413
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2637 CLADSVMSAVSAGLELTDESCNNLVPTYLKGDNIVAADLGVLIQNSAKHVQGNVAKIAGVSCIWSVDAFNQLSSDFQHKL 2716
Cdd:cd21715   414 DLVSCIRLCHQEGWDWTTDSWNNLVPTYIKQDTLSTLEVGQFMTANARYVNANVAKGAAVNLVWRYADFIKLSESMRRQL 493
                         490       500       510
                  ....*....|....*....|....*....|...
gi 190360135 2717 KKACCKTGLKLKLTYNKQMANVSVLTTPFSLKG 2749
Cdd:cd21715   494 RVAARKTGLNLLVTTSSLKADVPCVVTPFKIVG 526
gammaCoV_Nsp5_Mpro cd21667
gammacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily ...
3248-3549 2.53e-87

gammacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro), found in gammacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro/Nsp5 requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


Pssm-ID: 394888  Cd Length: 306  Bit Score: 288.99  E-value: 2.53e-87
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3248 GIVKMVNPTSKVEPCIVSVTYGNMTLNGLWLGDKVYCPRHVIcsaSDMTNPDYTNLLCRVTSSDFTVLF-DRLSLTVMSY 3326
Cdd:cd21667     1 GFKKLVSPSSAVEKCIVSVSYRGNNLNGLWLGDSIYCPRHVL---GKFSGDQWQDVLNLANNHEFEVVTqNGVTLNVVSR 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3327 QMQGCMLVLTVTLQNSRTPKYTFGVVKPGETFTVLAAYNGKPQGAFHVTMRSSYTIKGSFLCGSCGSVVYVIMGDCVKFV 3406
Cdd:cd21667    78 RLKGAVLILQTAVANANTPKYKFVKANCGDSFTIACSYGGTVVGLYPVTMRSNGTIRASFLAGACGSVGFNIEKGVVNFF 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3407 YMHQLELSTGCHTGTDFNGDFYGPYKDAQVVQLPVQDYIQSVNFVAWLYAAILNNCN---WF---VQSDKCSVEDFNVWA 3480
Cdd:cd21667   158 YMHHLELPNALHTGTDLMGEFYGGYVDEEVAQRVPPDNLVTNNIVAWLYAAIISVKEssfSLpkwLESTTVSVEDYNKWA 237
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 190360135 3481 LSNGFSQVKSDLVIDALASMTGVSLETLLAAIKHLKNGFQGRQIMGSCSFEDELTPSDVYQQLAGIKLQ 3549
Cdd:cd21667   238 SDNGFTPFSTSTAITKLSAITGVDVCKLLRTIMVKSAQWGSDPILGQYNFEDELTPESVFNQVGGVRLQ 306
alpha_betaCoV_Nsp10 cd21901
alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the ...
4233-4362 7.80e-86

alphacoronavirus and betacoronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of alpha- and betacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), Middle East respiratory syndrome-related (MERS) CoV, and alphacoronaviruses such as Human coronavirus 229E. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, and not its N7-Mtase activity. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts sequentially to Nsp14 MTase in RNA capping methylation, and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure. The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity. Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16 and their complex can interact with DII4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.


Pssm-ID: 409326  Cd Length: 130  Bit Score: 276.86  E-value: 7.80e-86
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4233 AGTATEYASNSSILSLCAFSVDPKKTYLDFIQQGGTPIANCVKMLCDHAGTGMAITVKPDATTSQDSYGGASVCIYCRAR 4312
Cdd:cd21901     1 AGKQTEVASNSSLLTLCAFAVDPAKTYLDAVKSGGKPVGNCVKMLTNGTGTGQAITVKPEANTNQDSYGGASVCLYCRAH 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 190360135 4313 VEHPDVDGLCKLRGKFVQVPVGIKDPVSYVLTHDVCQVCGFWRDGSCSCV 4362
Cdd:cd21901    81 VEHPDMDGVCKLKGKYVQVPLGTNDPVRFCLENDVCKVCGCWLGNGCSCD 130
CoV_PLPro cd21688
Coronavirus (CoV) papain-like protease (PLPro); This model represents the papain-like protease ...
1564-1861 1.39e-85

Coronavirus (CoV) papain-like protease (PLPro); This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of alpha-, beta-, gamma-, and deltacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. Besides cleaving the polyproteins, PLPro also possesses a related enzymatic activity to promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both, ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15), are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain in many of these CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation.


Pssm-ID: 409647  Cd Length: 299  Bit Score: 283.61  E-value: 1.39e-85
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1564 KVDILLTVDGVNFTNRFVPVGESFGKSLGNVFCDGVNVTKHKCDiNYKGKVFFQfdnLSSEDLKAVRSSFNF-DQKELLA 1642
Cdd:cd21688     2 TKKVLVTVDGVNFRTIVVTTGDTYGQQLGPVYLDGADVTKGKPD-NHEGETFFV---LPSTPDKAALEYYGFlDPSFLGR 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1643 YYNMLVNCSKWQVVfnGKYFTFKQANNNCFVNVSCLMLQSLNLKFKIVQWQEAWLEFRSGRPARFVSLVLAKGGFKFGDP 1722
Cdd:cd21688    78 YLSTLAHKWKVKVV--DGLRSLKWSDNNCYVSAVILALQQLKIKFKAPALQEAWNKFLGGDPARFVALIYASGNKTVGEP 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1723 ADSRDFLRVVFSQVDLTGAICDFEIACK-CGVKQEQRTGVDAVMHFGTLSREDLEIGYTVDCSCG-KKLIHCVRFDVPFL 1800
Cdd:cd21688   156 GDVRETLTHLLQHADLSSATRVLRVVCKhCGIKTTTLTGVEAVMYVGALSYDDLKTGVSIPCPCGgEWTVQVIQQESPFL 235
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 190360135 1801 ICSN-TPASVKLPKGVG-SANIFKGD-KVGHYVHVKCEQSYQLYDASNVKKVTDVTGNLSDCLY 1861
Cdd:cd21688   236 LLSAaPPAEYKLQQDTFvAANVFTGNtNVGHYTHVTAKELLQKFDGAKVTKTSEDKGPVTDVLY 299
MHV-like_Nsp3_NAB cd21824
nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and ...
1898-2011 1.00e-76

nucleic acid binding domain of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the nucleic acid binding (NAB) domain of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The NAB domain represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. NAB is a cytoplasmic domain located between the papain-like protease (PLPro) and betacoronavirus-specific marker (betaSM) domains of CoV Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. The NAB domain both binds ssRNA and unwinds dsDNA. It prefers to bind ssRNA containing repeats of three consecutive guanines. A group of residues that form a positively charged patch on the protein surface of SARS-CoV Nsp3 NAB serves as the binding site of nucleic acids. This site is conserved in the NAB of Nsp3 from betacoronavirus in the sarbecovirus subgenus (B lineage), but is not conserved in the Nsp3 NAB from betacoronaviruses in the A lineage.


Pssm-ID: 409350  Cd Length: 119  Bit Score: 250.45  E-value: 1.00e-76
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1898 GKYYTQRIIKAQFKTFEKVDGVYTNFKLIGHTVCDILNAKLGFDSSKEFVEYKVTEWPTATGDVVLATDDLYVKRYERGC 1977
Cdd:cd21824     1 GKYYTKPIIKAQFKTFEKVDGVYTNFKLVGHTICDKLNAKLGFDSSKPFVEYKVTEWPTATGDVVLASDDLYVKRYEKGC 80
                          90       100       110
                  ....*....|....*....|....*....|....
gi 190360135 1978 ITFGKPVIWLSHEQASLNSLTYFNRPLLVDENKF 2011
Cdd:cd21824    81 ITFGKPVIWLGHEEASLNSLTYFNRPSLVDENKF 114
MHV-like_Nsp3_betaSM cd21812
betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and ...
2026-2146 1.93e-75

betacoronavirus-specific marker of non-structural protein 3 from murine hepatitis virus and betacoronavirus in the A lineage; This model represents the betacoronavirus-specific marker (betaSM), also called group 2-specific marker (G2M), of non-structural protein 3 (Nsp3) from betacoronavirus in the embecovirus subgenus (A lineage), including murine hepatitis virus (MHV) and Human coronavirus HKU1. The betaSM/G2M is located C-terminal to the nucleic acid-binding (NAB) domain. This region is absent in alpha- and deltacoronavirus Nsp3; there is a gammacoronavirus-specific marker (gammaSM) at this position in gammacoronavirus Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. Little is known about the betaSM/G2M domain; it is predicted to be non-enzymatic and may be an intrinsically disordered region. The betaSM/G2M domain is part of the predicted PLnc domain (made up of 385 amino acids) of the related SARS-CoV Nsp3 that may function as a replication/transcription scaffold, with interactions to Nsp5, Nsp12, Nsp13, Nsp14, and Nsp16.


Pssm-ID: 409627  Cd Length: 125  Bit Score: 247.21  E-value: 1.93e-75
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2026 SESDAKEPKEINIIKLSGVKKPFKVEDSVIVNDDTSEIKYVKSLSIVDVYDMWLTGCRCVVRTANALSRAVNVPTIRKFI 2105
Cdd:cd21812     5 SQSDSKQAKPVKIVKLNGVKKPFKVEDSVVVNDDTSETKVVKSLSIVDVYDMWLTGCRYVVWTANALSRLVNVPTVREYV 84
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|.
gi 190360135 2106 KFGMTLVSIPIDLLNLREIKPVFNVVKAVRNKISACFNFIK 2146
Cdd:cd21812    85 KFGMTVISIPIDLLNLRDDKQEFVVPKVVKAKVSACYNFIK 125
Peptidase_C16 pfam01831
Peptidase C16 family;
1049-1285 1.68e-74

Peptidase C16 family;


Pssm-ID: 460353  Cd Length: 249  Bit Score: 249.61  E-value: 1.68e-74
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  1049 FEFYTTEPEFV--KVLDLYVPKATRNNCWLRSVLAVMQKLPCQFKDKNLQDLWVLYKQQYSQLFVDTLVNKIPANIVVPQ 1126
Cdd:pfam01831   11 FAFAAEFPDELhfASCGFGNPAIEEEDCFCPSAAIEMKSKGKEFKDHEMQKCSLLPAAECCQCFADILDIFVDEDIIKPE 90
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  1127 GGYVADFAYWFLTLCDWQCVAYWKCIKCDLALKLKGLDAMFFYGDVVSHVCKCGESMVLIDVDVPFTAHFALKDKLFCAF 1206
Cdd:pfam01831   91 AGTMAAFAFFFASLCKFKARANIQALECDGELKKQAADALFFRGCLCNHMCCCCDAHTAFHADIPQPDGFCLGDDKFCAF 170
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 190360135  1207 ITKRSVYKAACVVDVNDSHSMAVVDGKQIDDHRITSITSDKFDFIIGHGMSFSMTTFEIAQLYGSCITPNVCFVKGDII 1285
Cdd:pfam01831  171 FTPRKAFPAAAAQDLNDCHILARKEGKKGDGKSGHFFIADKFDFMDFNGEDACEEPFELAKGKGSCIAPALCFGKGDVI 249
CoV_NSP10 pfam09401
Coronavirus RNA synthesis protein NSP10; Non-structural protein 10 (NSP10) is involved in RNA ...
4244-4362 1.91e-74

Coronavirus RNA synthesis protein NSP10; Non-structural protein 10 (NSP10) is involved in RNA synthesis. It is synthesized as a polyprotein whose cleavage generates many non-structural proteins. NSP10 contains two zinc binding motifs and forms two anti-parallel helices which are stacked against an irregular beta sheet. A cluster of basic residues on the protein surface suggests a nucleic acid-binding function. NSP10 interacts with NSP14 and NSP16 and regulates their respective ExoN and 2-O-MTase activities. When binding to the N-terminal of NSP14, nsp10 allows the ExoN active site to adopt a stably closed conformation and is an allosteric regulator that stabilizes NSP16. The residue Tyr-96 plays a crucial role in the NSP10-NSP16/NSP14 interaction. This residue is specific for SARS-CoV NSP10 and is a phenylalanine in most other Coronavirus homologs.


Pssm-ID: 462788  Cd Length: 119  Bit Score: 243.89  E-value: 1.91e-74
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  4244 SILSLCAFSVDPKKTYLDFIQQGGTPIANCVKMLCDHAGTGMAITVKPDATTSQDSYGGASVCIYCRARVEHPDVDGLCK 4323
Cdd:pfam09401    1 SLLSLCAFAVDPAKAYLDYLAQGGQPITNCVKMLCNHAGTGMAITVKPEANTDQDSYGGASVCLYCRAHIEHPNVDGLCQ 80
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 190360135  4324 LRGKFVQVPVGIKDPVSYVLTHDVCQVCGFWRDGSCSCV 4362
Cdd:pfam09401   81 LKGKFVQIPTGTKDPVSFCLTNTVCTVCGCWLGYGCSCD 119
CoV_NSP6 pfam19213
Coronavirus replicase NSP6; This entry represents proteins found in Coronaviruses and includes ...
3577-3836 6.79e-71

Coronavirus replicase NSP6; This entry represents proteins found in Coronaviruses and includes the Non-structural Protein 6 (NSP6). Coronaviruses encode large replicase polyproteins which are proteolytically processed by viral proteases to generate mature Nonstructural Proteins (NSPs). NSP6 is a membrane protein containing 6 transmembrane domains with a large C-terminal tail. NSP6 from the avian coronavirus, infectious bronchitis virus (IBV) and the mouse hepatitis virus (MHV) have been shown to localize to the ER and to generate autophagosomes. Coronavirus NSP6 proteins have also been shown to limit autophagosome expansion. This may favour coronavirus infection by reducing the ability of autophagosomes to deliver viral components to lysosomes for degradation. NSP6 from IBV, MHV and severe acute respiratory syndrome coronavirus (SARS-CoV) have also been found to activate autophagy.


Pssm-ID: 465997  Cd Length: 260  Bit Score: 239.84  E-value: 6.79e-71
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3577 FVKWTMFMYVTTNMLS-ITFCALCVISLTMLLVKHKHLYLTMYIIPVLFTLLYNNYLVVYK-QTFRGYVYAWlsyyvpSV 3654
Cdd:pfam19213    1 LLMYTALYWLPPNLITpVLPVLTCVSAILTLFIKHKVLFLTTFLLPSVVVMAYYNFTWDYYpNSFLRTVYDY------HF 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3655 EYTYTDEVIYGMLLLIGMVFV--TLRSINqYLFSFIMFVGRVISVVSLWYMGSNLEEE-----ILLMLASLFGTYTWTTA 3727
Cdd:pfam19213   75 SLTSFDLQGYFNIASCVFVNVlhTYRFVR-SKYSIATYLVSLVVSVYMYVIGYALLTAtdvlsLLFMVLSLLTSYWYVGA 153
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3728 LSMAAAKVIAKWVavNVLYFTDIPQIKIVLVCYLFIGYIISCYWGLFSLMNSLFRMPLGVYNYKISVQELRYMNANGLRP 3807
Cdd:pfam19213  154 IAYKLAKYIVVYV--PPSLIAVFGDIKVVLLVYVCIGYVCCVYFGILYWINRFTKLTLGVYDFKVSAAEFKYMVANGLSA 231
                          250       260
                   ....*....|....*....|....*....
gi 190360135  3808 PKNSFEALMLNFKLLGIGGVPIIEVSQFQ 3836
Cdd:pfam19213  232 PRNVFEALILNFKLLGIGGNRTIKISTVQ 260
betaCoV_Nsp9 cd21898
betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4123-4232 3.91e-70

betacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from betacoronaviruses including highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoVs replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409331  Cd Length: 111  Bit Score: 231.52  E-value: 3.91e-70
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4123 NNELMPAKLKTQVVNSGPDQT-CNTPTQCYYNNSYNGKIVYAILSDVDGLKYTKILKDDGNFVVLELDPPCKFTVQDVKG 4201
Cdd:cd21898     1 NNELMPQGLKTMVVTAGPDQTaCNTPALAYYNNVQGGRMVMAILSDVDGLKYAKVEKSDGGFVVLELDPPCKFLVQTPKG 80
                          90       100       110
                  ....*....|....*....|....*....|.
gi 190360135 4202 LKIKYLYFVKGCNTLARGWVVGTISSTVRLQ 4232
Cdd:cd21898    81 PKVKYLYFVKGLNNLHRGQVLGTIAATVRLQ 111
deltaCoV_Nsp5_Mpro cd21668
deltacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily ...
3246-3542 1.05e-66

deltacoronavirus non-structural protein 5, also called Main protease (Mpro); This subfamily contains the coronavirus (CoV) non-structural protein 5 (Nsp5) also called the Main protease (Mpro), or 3C-like protease (3CLpro), found in deltacoronaviruses. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Mpro/Nsp5 is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. These enzymes belong to the MEROPS peptidase C30 family, where the active site residues His and Cys form a catalytic dyad. The structures of Mpro/Nsp5 consist of three domains with the first two containing anti-parallel beta barrels and the third consisting of an arrangement of alpha-helices. The catalytic residues are found in a cleft between the first two domains. Mpro/Nsp5 requires a Gln residue in the P1 position of the substrate and space for only small amino-acid residues such as Gly, Ala, or Ser in the P1' position; since there is no known human protease with a specificity for Gln at the cleavage site of the substrate, these viral proteases are suitable targets for the development of antiviral drugs.


Pssm-ID: 394889  Cd Length: 302  Bit Score: 229.31  E-value: 1.05e-66
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3246 QSGIVKMVNPTSKVEPCIVSVTYGNMTLNGLWLGDKVYCPRHVICSASDMTNPDYTNLL-CRvtssDFTVL--FDRLSLT 3322
Cdd:cd21668     1 QAGIKILLHPSGVVERCMVSVTYNGSTLNGIWLHNVVYCPRHVIGKYTGSQWQDMVSIAdCR----DFVIFcpTQGIQLT 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3323 VMSYQMQGCMLVLTVTLQNSRTPKYTFGVVKPGETFTVLAAYNGKPQGAFHVTMRSSYTIKGSFLCGSCGSVVYVIMGDC 3402
Cdd:cd21668    77 VQSVKMVGAVLQLTVHTKNLHTPDYEFERATPGSSMTIACAYDGIVRNVYHVVLQTNNLIYASFLNGACGSVGYTLKGKT 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3403 VKFVYMHQLELSTGCHTGTDFNGDFYGPYKDAQVVQLPVQDYIQSVNFVAWLYAAILN---NCNWFVQSDkCSVEDFNVW 3479
Cdd:cd21668   157 LLLHYMHHLEFNNKTHGGTDLHGHFYGPYVDEEVAQHQTAFQYYTDNVVAQIYAHLLTidaKPKWLASQE-ISVEDFNEW 235
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 190360135 3480 ALSNGFSQVKSDL----VIDALASMTGVSLETLLAAIKHLKNGFQGRQIMGSCSFEDELTPSDVYQQ 3542
Cdd:cd21668   236 AANNSFANFPCESsnmaYLEGLAQTTKVSVGRVLNTIIQLTLNRGGALIMGKPDFECDWTPEMVYNQ 302
alphaCoV_Nsp8 cd21830
alphacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
3929-4112 7.90e-66

alphacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) region of alphacoronaviruses that include Feline infectious peritonitis virus (FCoV), Human coronavirus NL63 (HCoV-NL63), and Porcine epidemic diarrhea coronavirus (PEDV), among others. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. FCoV Nsp8 forms a 1:2 heterotrimer with Nsp7; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409257  Cd Length: 195  Bit Score: 222.61  E-value: 7.90e-66
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3929 SEFVNMASFVEYEVAKKNLDEACSSGSaNQQQLKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSK 4008
Cdd:cd21830     4 STFANMPSFIAYETARQDYEDAVKNGS-SPQLIKQLKKAMNIAKSEFDREASVQRKLDRMAEQAAAQMYKEARAVNRKSK 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4009 VVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLAANTLTIIVPDKSVYDQVVDNVYVTYAGNVWQIQTIQDS 4088
Cdd:cd21830    83 VISAMHSLLFGMLRRLDMSSVDTILNLAKDGVVPLSIIPAASATRLVVVVPDLESFSKIRRDGCVHYAGVVWTIVDIKDN 162
                         170       180
                  ....*....|....*....|....*...
gi 190360135 4089 DGTNKQLHEI----SDDCNWPLVIIANR 4112
Cdd:cd21830   163 DGKVVHLKEVtaanEESLAWPLHLNCER 190
CoV_Nsp8 cd21816
Coronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) ...
3929-4116 1.12e-65

Coronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) of alpha-, beta-, gamma- and deltacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. SARS-CoV Nsp8 forms a 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder, while Feline infectious peritonitis virus Nsp8 forms a 1:2 heterotrimer with Nsp7. Regardless of their oligomeric structure, the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409256  Cd Length: 194  Bit Score: 222.01  E-value: 1.12e-65
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3929 SEFVNMASFVEYEVAKKNLDEACSSGsANQQQLKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSK 4008
Cdd:cd21816     1 SEFSHLPSYAAYATAQAAYEQAVKNG-DSPQELKKLTKALNIAKSEFDRDAAVQKKLEKMADQAMTSMYKEARAEDRRAK 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4009 VVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLAANTLTIIVPDKSVYDQVVDNVYVTYAGNVWQIQTIQDS 4088
Cdd:cd21816    80 ITSAMHALLFSMLKKLDSDAVNNIFEQARDGVVPLNIIPLTTANKLMVVIPDYETYKKTVDGNTFTYAGALWSIVTVVDA 159
                         170       180       190
                  ....*....|....*....|....*....|..
gi 190360135 4089 DGTNKQLHEI----SDDCNWPLVIIANRHNEV 4116
Cdd:cd21816   160 DGKIVHLSEInmdnSPNIAWPLIVTCLRAGAV 191
CoV_Nsp10 cd21872
coronavirus non-structural protein 10; This model represents the non-structural protein 10 ...
4233-4361 5.77e-62

coronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of alpha-, beta-, gamma- and deltacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, and not its N7-Mtase activity. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts sequentially to Nsp14 MTase in RNA capping methylation, and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure. The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity. Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16, and their complex can interact with DII4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.


Pssm-ID: 409325  Cd Length: 131  Bit Score: 208.86  E-value: 5.77e-62
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4233 AGTATEYASNSSILSLCAFSVDPKKTYLDFIQQGGTPIANCVKMLCDHAGTGMAITVKPDATTSQDSYGGASVCIYCRAR 4312
Cdd:cd21872     1 AGNATEVPANSTVLSFCAFAVDPAKAYKDYLASGGQPITNCVKMLCTHTGTGQAITVKPEANMDQESFGGASVCLYCRAH 80
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 190360135 4313 VEHPDVDGLCKLRGKFVQVPV-GIKDPVSYVLTHDVCQVCGFWRDGSCSC 4361
Cdd:cd21872    81 IDHPNPDGFCDYKGKFVQIPTtCANDPVGFTLRNTVCTVCQMWKGYGCSC 130
betaCoV_Nsp2_SARS_MHV-like cd21515
betacoronavirus non-structural protein 2 (Nsp2), similar to SARS-CoV Nsp2 and MHV Nsp2 (p65), ...
251-800 3.44e-57

betacoronavirus non-structural protein 2 (Nsp2), similar to SARS-CoV Nsp2 and MHV Nsp2 (p65), and related proteins; Coronavirus non-structural proteins (Nsps) are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases Nsp3 and Nsp5 into smaller subunits. Nsp2 is one of these subunits. This family includes Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2, SARS-CoV-2 Nsp2, and Murine hepatitis virus (MHV) Nsp2 (also known as p65). The function of Nsp2 remains unclear. SARS-CoV Nsp2 rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers. It has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2 which have been implicated in cellular functions, including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2/p65, different from SARS-CoV Nsp2, may play an important role in the viral life cycle.


Pssm-ID: 439198  Cd Length: 562  Bit Score: 210.40  E-value: 3.44e-57
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  251 LYVDQYGCDYTGGLADGLEAYADKTLQE----------MKALFPIWSQELPFDVTVAWHVVRdPRYVMRLQSASTIRSVA 320
Cdd:cd21515     3 RYVDQYFCGPDGYPLECIKDLLAKAGKSsctlsdeqldFKELKRGGYCCRDHEHEIAWYVER-SDAPYELQTPFTIKSAK 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  321 YvanptedlcdGSVVIKEPVHVYADDSII-LRQHNLVDIMSCFYMEADAVvnaFYGVD-LKDCGFVMQFGYIDCeqDLCD 398
Cdd:cd21515    82 K----------DTFKGEVPAFVFPLNSKVkVLKPRVVKKKLEGFMGKIRT---VYPVAsPNECNPMTLSALMKC--DHCD 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  399 FKGWVPGNMIdGFACtTCGHVYETGDLLAQSSGVLPVNPVLHTKSAAG--------------YGGFGCKdSFTLYGQTVV 464
Cdd:cd21515   147 ETSWQTGNFV-GATC-LCGAEYTLTKEDATSAGYLPPGAVVKMPCPACkndevgpehsfadyHNSSGIK-TFLRKGGRTV 223
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  465 YFGGCVYWSP----ARNIWIPILKSSVKSYdglvYTGVVGCKAIVKETNLICKALYldyvqhkcgNLHQRELLGVSDVWH 540
Cdd:cd21515   224 PFGGCVFAYVgcynGCAYWVPRAWSNIGSN----HTGVVGSGVEVLNDDLLEILLR---------EKVNINIVGDFKLNE 290
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  541 KQLLLNRGVYKPLLENIDYFNMRRAKFSleTFTVCADGFMPFLLDDLVPRAYYLAVSGQ------AFCDYAGKICHAVVS 614
Cdd:cd21515   291 EVVIILASFSASVLAFVDTVKGLDFETF--KFIVESCGNFPVTKGKFVPGAWNLGKSKQvltplpAFPSQAAMVVRSIFA 368
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  615 KSKELLDVSLDSLGAAIHYLNSKIVDLAQHFSDfgtsfvskIVHFFKTFTTSTALAFAWVLfhvlhGAYIVVESDIY--- 691
Cdd:cd21515   369 RTVFTATHSVPALQEAAITIIDGISPQALRLLD--------AMRFTADLVTNSVLAMAYVT-----GGLVQVTSQWLdnl 435
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  692 ------FVKNIPRYASAVAQAFrsvAKVVLDSLRVTFIDGLSCFKIGRRRICLSGSKIYEVERGLLHSSQLPLdvyDLTM 765
Cdd:cd21515   436 fgtvvdLLKPVLEWLEEKISSG---IEFLIDLWEILKLLVTGAYKIVKGQIVLAGKNVSEVVQSFLSVLNKAL---GLLL 509
                         570       580       590
                  ....*....|....*....|....*....|....*
gi 190360135  766 PSQVQKTKQKpiYLKGSGSDFSLADSVVEVVTTSL 800
Cdd:cd21515   510 PLKAPKEELF--LTEGDTVDTSLTSEEVVVKTGVL 542
TM_Y_CoV_Nsp3_C cd21686
C-terminus of coronavirus non-structural protein 3, including transmembrane and Y domains; ...
2262-2744 1.41e-56

C-terminus of coronavirus non-structural protein 3, including transmembrane and Y domains; This model represents the C-terminus of non-structural protein 3 (Nsp3) from alpha-, beta-, gamma-, and deltacoronavirus, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In SARS-CoV and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409657  Cd Length: 476  Bit Score: 206.27  E-value: 1.41e-56
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2262 KNQYCNGSIACQFCLAGFDMLDNYKAIDVVQyeadrrafvdytgvlKIVIELIVSYALYTAWFYPLFALISIQILTTWLP 2341
Cdd:cd21686    52 YNSYCAGDLVCQVCLDGQDSLHLYPHLRVVQ---------------QPLQTTDYTVYALSLILYLANMTLFMGTFIVTFF 116
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2342 ELFMLSTLHWSVRLLVSLAnmlPAHVFMRFYIIIasFIKLFSLFRHVAYGCSKSGCLFCYKRNRSLRVKCSTIVGGMIRY 2421
Cdd:cd21686   117 VNFYGVGIPFYGWLLIDVP---QSAFMMTFSVFF--FYYVLKFFVHVTHGCKIPTCMVCAKLARPPRVEVETVVQGRKYS 191
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2422 YDAMANGGTGFCSKHQWNCIDCDSYKPGNTFITVEAALDLSKELKRPIQPTDVAYHTVTDVKqvgCYMRLFYDRDGQRTY 2501
Cdd:cd21686   192 FYVYTNGGFTFCKEHNFYCKNCDLYGPGCTFISDEVAEELSRATKLSVKPTAPAFLLVDDVE---VQNDVVFARAKYNQN 268
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2502 DDVNASLFVDYSNllhskvksvpnmhVVVVENDADKANFL----NAAVFYAQSLFRPILMVDKNLIttantgtsvtETMF 2577
Cdd:cd21686   269 AHVSLSKFSDIPD-------------FIIAANFGSNCEQLstakNAAVYYSQDLCKPILILDQALS----------RPID 325
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2578 DVYVDTFLSMFDVDKKSLNALIATAhSSIKQGTQIcKVLDTFLScarkscsidsdvdtkcladSVMSAVSAGLELTDESC 2657
Cdd:cd21686   326 NYQEVASRIEKYYPVAKIKPTGDIF-TDIKQGTDG-EASDSAIN-------------------AAVLAHQRDVEFTGDSF 384
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2658 NNLVPTYLKGDNIVAADLGVlIQNSAKHVQGNVAKIAGVSCIWSVDAFNQLSSDFQHKLKKACCKTGLKLKLTYNKQMAN 2737
Cdd:cd21686   385 NNILPSYAKDESKLTAEDQA-MSVIAESGNANVNVKGTIPVVWLVADFIRLSEQARKYIISAAKKNGVTFALTPSTLRMR 463

                  ....*..
gi 190360135 2738 VSVLTTP 2744
Cdd:cd21686   464 GNIATQP 470
CoV_NSP9 pfam08710
Coronavirus replicase NSP9; Nsp9 is a single-stranded RNA-binding viral protein involved in ...
4123-4232 2.17e-52

Coronavirus replicase NSP9; Nsp9 is a single-stranded RNA-binding viral protein involved in RNA synthesis. Several crystallographic structures of nsp9 have shown that it is composed of seven beta strands and a single alpha helix. Nsp9 proteins have N-finger motifs and highly conserved GXXXG motifs that both play critical roles in dimerization. The conserved helix-helix dimer interface containing a GXXXG protein-protein interaction motif is biologically relevant to SARS-CoV replication.


Pssm-ID: 285872  Cd Length: 111  Bit Score: 180.75  E-value: 2.17e-52
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  4123 NNELMPAKLKTQVVNSGPDQT-CNTPTQCYYNNSYNGKIVYAILSDVDGLKYTKILKDDGNFVVLELDPPCKFTVQDVKG 4201
Cdd:pfam08710    1 NNELMPGKLKTKACKAGVTDAhCSVEGKAYYNNEGGGSFVYAILSSNPNLKYAKFEKEDGNVIYVELEPPCRFVVDTPKG 80
                           90       100       110
                   ....*....|....*....|....*....|.
gi 190360135  4202 LKIKYLYFVKGCNTLARGWVVGTISSTVRLQ 4232
Cdd:pfam08710   81 PEVKYLYFVKNLNNLRRGMVLGYISATVRLQ 111
gammaCoV_Nsp8 cd21832
gammacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
3926-4122 1.70e-51

gammacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) region of gammacoronaviruses that include Avian infectious bronchitis virus (IBV) and Canada goose coronavirus (CGCoV), among others. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. SARS-CoV Nsp8 forms a 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder, while Feline infectious peritonitis virus Nsp8 forms a 1:2 heterotrimer with Nsp7. Regardless of their oligomeric structure, the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409259  Cd Length: 210  Bit Score: 182.08  E-value: 1.70e-51
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3926 ALQSEFVNMASFVEYEVAKKNLDEACS---SGSANQQQLKQLEKACNIAKSAYERDRAVARKLERMADLALTNMYKEARI 4002
Cdd:cd21832     1 SVTQEFSHIPSYAEYERAKDLYEKVLAdskNGGVTQQELAAYRKAANIAKSVFDRDLAVQKKLDSMAERAMTTMYKEARV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4003 NDKKSKVVSALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLAANTLTIIVPDKSVYDQVVDNVYVTYAGNVWQI 4082
Cdd:cd21832    81 TDRRAKLVSSLHALLFSMLKKIDSEKLNVLFDQASSGVVPLATVPIVCSNKLTLVIPDPETWVKCVEGMHVTYSTVVWNI 160
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|...
gi 190360135 4083 QTIQDSDGTnkQLHEIS-----------DDCNWPLVI--IANRHNEVSAtALQ 4122
Cdd:cd21832   161 DTVIDADGT--ELHPTStgsgltycisgDNIAWPLKVnlTRNGHNKVDA-VLQ 210
TM_Y_alphaCoV_Nsp3_C cd21712
C-terminus of alphacoronavirus non-structural protein 3, including transmembrane and Y domains; ...
2262-2749 2.23e-48

C-terminus of alphacoronavirus non-structural protein 3, including transmembrane and Y domains; This model represents the C-terminus of non-structural protein 3 (Nsp3) from alphacoronavirus, including Porcine epidemic diarrhea virus and Human coronavirus 229E, among others. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In the related betacoronaviruses, Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409660  Cd Length: 501  Bit Score: 182.83  E-value: 2.23e-48
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2262 KNQYCNGSIACQFCLAGFDMLDNYKAIDVV-QYeadrrafvdytgvlkivieliVSYALYtAWFYPLFALISIQILTTWL 2340
Cdd:cd21712    54 KSEVCGNSLLCKACLAGYDELSDFPHLQVVwDH---------------------VSDPLF-SNVLPLFYFAFLLIFGNNY 111
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2341 PELFML-STLHWSVRLLVSLA--------NMLPAHVFMRFYIIIASFIKLFSLFRHVAYGCSKSGCLFCYKRNRSLRVKC 2411
Cdd:cd21712   112 VRCFLLyFVAQYINNWGVYFGyqdyswflHFVPFDSFSDEIVVIFIVVKVLLFLKHVIFGCDKPSCKACSKSARLTRIPV 191
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2412 STIVGGMIRYYDAMANGGTGFCSKHQWNCIDCDSYKPGNTFITVEAALDLSKELKRPIQPTDVAYHTVTDVKQVGCYMRL 2491
Cdd:cd21712   192 QTIVNGSMKSFYVHANGGGKFCKKHNFFCVNCDSYGVGNTFINDEVARELSNVVKTTVQPTGPAYIEVDKVEFSNGFYYL 271
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2492 FYDRDGQRTYDDVNASlfvDYSnllhskVKSV-PNMHVV---VVEND--ADKANFLNAAVFYAQSLFRPILMVDKNLITT 2565
Cdd:cd21712   272 YSGDTFWRYNFDITEK---KYS------CKEVlKNCNLLddfIVYNNngSNVAQVKNACVYFSQLLCKPIKLVDSALLSS 342
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2566 ANTGTSvtETMFDVYVDTFLSMFDVDKKSlnaliatahssikqgtqiCKVLDTFLSCARKSCSIDSDVdtkclaDSVMSA 2645
Cdd:cd21712   343 LSVDFN--GALHKAFVKVLKNSFNKDLSN------------------CKTLEECKKALGLDVSDDEFE------SAVSNA 396
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2646 VSAGLELTDESCNNLVPTYLK-GDNIVAADLGVLIQNSAKHVQGNVAKIAGVSCIWSVDAFNQLSSDFQHKLKKACCKTG 2724
Cdd:cd21712   397 HRYDVLLTDRSFNNFVTSYAKpEEKLSTHDIAVCMRAGAKVVNHNVLTKENVPIVWLAKDFSALSEEARKYIVKTTKAKG 476
                         490       500
                  ....*....|....*....|....*
gi 190360135 2725 LKLKLTYNKQMANVSVLTTPFSLKG 2749
Cdd:cd21712   477 VNFLLTFNDNRMTTTLPAVSIVSKK 501
bCoV_NAB pfam16251
Betacoronavirus nucleic acid-binding (NAB); This is the nucleic acid-binding domain (NAB) from ...
1902-2011 2.22e-46

Betacoronavirus nucleic acid-binding (NAB); This is the nucleic acid-binding domain (NAB) from the multidomain nonstructural protein NSP3, and described as NSP3e domain. NSP3 is part of Orf1a polyproteins in SARS-CoV. It is an essential component of the replication/transcription complex. The global domain of the NAB represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands and a group of residues form a positively charged patch on the protein surface as the binding site responsible for binding affinity for nucleic acids. When binding to ssRNA, the NAB prefers sequences with repeats of three consecutive Gs, such as (GGGA)5 and (GGGA)2. A positively charged surface patch (Lys75, Lys76, Lys99, and Arg106) is involved in RNA binding.


Pssm-ID: 406621  Cd Length: 129  Bit Score: 164.26  E-value: 2.22e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  1902 TQRIIKAQFKTFEKVDGVYTNFKLI--GHTVCDILNAKLGFDSSKEFV-EYKVTEWPTATGDVVLATDDLYVKRYERGCI 1978
Cdd:pfam16251   11 TKPIIKAQFRTFEKVDGVYDNFKLTcsGHKFADDLNAKLGFDCNKPASrELKITEFPDANGDVVAADDDHYSARFKKGAI 90
                           90       100       110
                   ....*....|....*....|....*....|....
gi 190360135  1979 TFGKPVIWLSHEQASLNSLTYFNRPLLV-DENKF 2011
Cdd:pfam16251   91 LFGKPIVWLGHEEAALKKLTFFNKPNTVcLECKF 124
CoV_NSP4_C pfam16348
Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus ...
3157-3244 4.30e-46

Coronavirus replicase NSP4, C-terminal; This is the C-terminal domain of the coronavirus nonstructural protein 4 (NSP4). NSP4 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. It is a membrane-spanning protein which is thought to anchor the viral replication-transcription complex (RTC) to modified endoplasmic reticulum membranes. This predominantly alpha-helical domain may be involved in protein-protein interactions. It has been shown that in Betacoronavirus, the coexpression of NSP3 and NSP4 results in a membrane rearrangement to induce double-membrane vesicles (DMVs) and convoluted membranes (CMs), playing a critical role in SARS-CoV replication. There are two well conserved amino acid residues (H120 and F121) in NSP4 among Betacoronavirus, essential for membrane rearrangements during interaction with NSP3.


Pssm-ID: 465099  Cd Length: 92  Bit Score: 161.93  E-value: 4.30e-46
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3157 GTFEEMALTTFMITKDSYCKLKNSLSDVAFNRYLSLYNKYRYYSGKMDTAAYREAACSQLAKAMDTFtNNNGSDVLYQPP 3236
Cdd:pfam16348    6 GTFEEAALGTFVIDKESYEKLKNSISLDKFNRYLSLYNKYKYYSGKMDEADYREACCAHLAKALEDF-SNSGNDVLYTPP 84

                   ....*...
gi 190360135  3237 TASVSTSF 3244
Cdd:pfam16348   85 TVSVTSSL 92
gammaCoV_Nsp10 cd21902
gammacoronavirus non-structural protein 10; This model represents the non-structural protein ...
4234-4361 2.69e-45

gammacoronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of gammacoronaviruses, including Infectious bronchitis virus (IBV)and Bottlenose dolphin coronavirus HKU22(BdCoV HKU22). CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, and not its N7-Mtase activity. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts sequentially to Nsp14 MTase in RNA capping methylation and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure. The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity. Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16 and their complex can interact with DII4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.


Pssm-ID: 409327  Cd Length: 134  Bit Score: 161.22  E-value: 2.69e-45
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4234 GTATEYASNSSILSLCAFSVDPKKTYLDFIQQGGTPIANCVKMLCDHAGTGMAITVKPDATTSQDSYGGASVCIYCRARV 4313
Cdd:cd21902     2 GHETEEVDAVGILSLCSFAVDPADTYCKYVAAGNQPLGNCVKMLTVHNGSGFAITSKPSPTPDQDSYGGASVCLYCRAHI 81
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 190360135 4314 EHP----DVDGLCKLRGKFVQVPVGIKDPVSYVLTHDVCQVCGFWRDGSCSC 4361
Cdd:cd21902    82 AHPggagNLDGRCQFKGSFVQIPTTEKDPVGFCLRNKVCTVCQCWIGYGCQC 133
alphaCoV-Nsp6 cd21558
alphacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host ...
3535-3836 5.26e-45

alphacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, have the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394844  Cd Length: 293  Bit Score: 166.60  E-value: 5.26e-45
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3535 TPSDVYQQLAGIKLQS-KRTRLVKGIvcwiMASTFLFSCIITAFVKWTMFMYVTTNMLSITFCALCVIS-LTMLLVKHKH 3612
Cdd:cd21558     2 TTSEVIKQMYGVNLQSgKVKSAFKNV----LLVGVFLFMFWSELLMYTSFFWINPGLVTPVFLVLVLVSlLLTLFLKHKM 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3613 LYLTMYIIPVLFtllynnYLVVYKQTFRGYVYAWLSYYVpSVEYTYTDEVIYGMLLLIGMVFVTLrsinQYLFSFIMFVG 3692
Cdd:cd21558    78 LFLQTFLLPSVI------VTAFYNLAWDYYVTAVLAEYF-DYHVSLMSFDIQGVLNIFVCLFVFF----LHTYRFVTSGT 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3693 RVIS-----VVSLW-YMGSNLEEEILLMLASLFGTYTWTTALSMAAAKVIakwVAVNVLYFTDIPQIKIVLVCYLFIGYI 3766
Cdd:cd21558   147 SWFTyvvslVFVLYnYFYGNDYLSLLMMVLSSITNNWYVGAIAYKLAYYI---VYVPPSLVADFGTVKAVMLVYVALGYL 223
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3767 ISCYWGLFSLMNSLFRMPLGVYNYKISVQELRYMNANGLRPPKNSFEALMLNFKLLGIGGVPIIEVSQFQ 3836
Cdd:cd21558   224 CCVYYGILYWINRFTKLTLGVYDFKVSAAEFKYMVANGLKAPTGVFDALLLSFKLIGIGGERTIKISTVQ 293
Peptidase_C16 pfam01831
Peptidase C16 family;
1-248 7.45e-45

Peptidase C16 family;


Pssm-ID: 460353  Cd Length: 249  Bit Score: 164.48  E-value: 7.45e-45
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135     1 MSKINKYGLELHWAPEFPWMFEDAEEKLDNPSSSEVDIVCSTTAQKLETGGICPENHVMVDCRRLLKQECCVQSSLIREI 80
Cdd:pfam01831    1 AADAGCSEAGFAFAAEFPDELHFASCGFGNPAIEEEDCFCPSAAIEMKSKGKEFKDHEMQKCSLLPAAECCQCFADILDI 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135    81 VMNTRPYDLEVLLQDALQSREAVLVTPPLGMSLEACYVRGCNPNGWTMGLFRRRSVCNTGRCAVNKHVAYQLYMIDPAGV 160
Cdd:pfam01831   81 FVDEDIIKPEAGTMAAFAFFFASLCKFKARANIQALECDGELKKQAADALFFRGCLCNHMCCCCDAHTAFHADIPQPDGF 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   161 CFGAGQFVGWVIPLAFMPVQSRKFIVPWVMYLRKCGEKGAYNKDHKRGGFEH-VYNFKVEDAYDLVHDEPKGKFSKKAYA 239
Cdd:pfam01831  161 CLGDDKFCAFFTPRKAFPAAAAQDLNDCHILARKEGKKGDGKSGHFFIADKFdFMDFNGEDACEEPFELAKGKGSCIAPA 240

                   ....*....
gi 190360135   240 LIRGYRGVK 248
Cdd:pfam01831  241 LCFGKGDVI 249
DPUP_MHV_Nsp3 cd21524
DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis ...
1489-1562 1.37e-44

DPUP (domain preceding Ubl2 and PLP2) of non-structural protein 3 (Nsp3) from murine hepatitis virus and related betacoronaviruses in the A lineage; This subfamily contains the DPUP (domain preceding Ubl2 and PLP2) of murine hepatitis virus (MHV) non-structural protein 3 (Nsp3) and other Nsp3s from betacoronaviruses in the embecovirus subgenera (A lineage), including human CoV OC43, rabbit CoV HKU14 and porcine hemagglutinating encephalomyelitis virus (HEV), among others. Non-structural protein 3 (Nsp3) is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. MHV Nsp3 contains a DPUP that is located N-terminal to the ubiquitin-like domain 2 (Ubl2) and papain-like protease 2 (PLP2) catalytic domain. It is structurally similar to the Severe Acute Respiratory Syndrome (SARS) CoV unique domain C (SUD-C), adopting a frataxin-like fold that has structural similarity to DNA-binding domains of DNA-modifying enzymes. SUD-C is also located N-terminal to Ubl2 and PLP2 in SARS Nsp3, similar to the DPUP of MHV Nsp3; however, unlike DPUP, it is preceded by SUD-N and SUD-M macrodomains that are absent in MHV Nsp3. Though structurally similar, there is little sequence similarity between DPUP and SUD-C. SARS SUD-C has been shown to bind to single-stranded RNA and recognize purine bases more strongly than pyrimidine bases; it also regulates the RNA binding behavior of the SARS SUD-M macrodomain. It is not known whether DPUP functions in the same way.


Pssm-ID: 394840  Cd Length: 75  Bit Score: 156.81  E-value: 1.37e-44
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 190360135 1489 LDDDARTFVQSNVDVVPEGWRVVNKFYQINGVRTVKYFECPGGIDICSQDKVFGYVQQGSFNKATVAQIKALFL 1562
Cdd:cd21524     2 LDDDARVFVQANMDNLPEDWRLVNKFDVINGVRTIKYFECPGGIFICSQGKDFGYVQNGSFKKATVSQIRALLA 75
deltaCoV_Nsp10 cd21903
deltacoronavirus non-structural protein 10; This model represents the non-structural protein ...
4234-4362 1.55e-42

deltacoronavirus non-structural protein 10; This model represents the non-structural protein 10 (Nsp10) of deltacoronaviruses, including Thrush coronavirus HKU12-600 and Wigeon coronavirus HKU20. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Coronaviruses cap their mRNAs; RNA cap methylation may involve at least three proteins: Nsp10, Nsp14, and Nsp16. Nsp10 serves as a cofactor for both Nsp14 and Nsp16. Nsp14 consists of 2 domains with different enzymatic activities: an N-terminal ExoN domain and a C-terminal cap (guanine-N7) methyltransferase (N7-MTase) domain. The association of Nsp10 with Nsp14 enhances Nsp14's exoribonuclease (ExoN) activity, and not its N7-Mtase activity. ExoN is important for proofreading and therefore, the prevention of lethal mutations. The Nsp10/Nsp14 complex hydrolyzes double-stranded RNA in a 3' to 5' direction as well as a single mismatched nucleotide at the 3'-end, mimicking an erroneous replication product, and may function in a replicative mismatch repair mechanism. Nsp16 Cap-0 specific (nucleoside-2'-O-)-methyltransferase (2'OMTase) acts sequentially to Nsp14 MTase in RNA capping methylation and methylates the RNA cap at the ribose 2'-O position; it catalyzes the conversion of the cap-0 structure on m7GpppA-RNA to a cap-1 structure. The association of Nsp10 with Nsp16 enhances Nsp16's 2'OMTase activity, possibly through enhanced RNA binding affinity. Additionally, transmissible gastroenteritis virus (TGEV) Nsp10, Nsp16 and their complex can interact with DII4, which normally binds to Notch receptors; this interaction may disturb Notch signaling. Nsp10 also binds 2 zinc ions with high affinity.


Pssm-ID: 409328  Cd Length: 128  Bit Score: 153.09  E-value: 1.55e-42
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4234 GTATEYASNSSILSLCAFSVDPKKTYLDFIQQGGTPIANCVKMLCDhAGTGMAITVKPDATTSQDSYGGASVCIYCRARV 4313
Cdd:cd21903     2 GTQIEYQENASLLTYLAFAVDPKEAYLKHLADGGKPIQGCIQMIAP-LGPGFAVTTKPQPNEHQYSYGGASICLYCRAHI 80
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 190360135 4314 EHPDVDGLCKLRGKFVQVPVGiKDPVSYVLTHDVCQVCGFWRDGSCSCV 4362
Cdd:cd21903    81 PHPGVDGRCPYKGRFVHIDKD-KEPVSFALTHEPCNSCQRWVNYDCTCG 128
Macro_X_Nsp3-like cd21557
X-domain (or Mac1 domain) of viral non-structural protein 3 and related macrodomains; The ...
1294-1418 1.95e-41

X-domain (or Mac1 domain) of viral non-structural protein 3 and related macrodomains; The X-domain, also called Mac1, is the macrodomain found in riboviral non-structural protein 3 (Nsp3), including the Nsp3 of Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV) as well as SARS-CoV-2, and other coronaviruses (alpha-, beta-, gamma-, and deltacoronavirus), among others. The SARS-CoV-2 Nsp3 Mac1 is highly conserved among all CoVs, and binds to and hydrolyzes mono-ADP-ribose (MAR) from target proteins. It appears to counter host-mediated antiviral ADP-ribosylation, a post-translational modification that is part of the host response to viral infections. Mac1 is essential for pathogenesis in multiple animal models of CoV infection, implicating it as a virulence factor and potential therapeutic target. Assays show that the de-MARylating activity leads to a rapid loss of substrate, and that Mac1 could not hydrolyze poly-ADP-ribose; thus, Mac1 is a MAR-hydrolase (mono-ADP ribosylhydrolase). Mac1 was originally named ADP-ribose-1"-phosphatase (ADRP) based on data demonstrating that it could remove the phosphate group from ADP-ribose-1"-phosphate; however, activity was modest and was unclear why this would impact a virus infection. This family also includes the X-domain of Avian infectious bronchitis virus (IBV) strain Beaudette coronavirus that does not bind ADP-ribose; the triple glycine sequence found in the X-domains of SARS-CoV and human coronavirus 229E (HCoV229E), which are involved in ADP-ribose binding, is not conserved in the IBV X-domain. SARS-CoVs have two other macrodomains referred to as the SUD-N (N-terminal subdomain, or Mac2) and SUD-M (middle SUD subdomain, or Mac3) of the SARS-unique domain (SUD), which also do not bind ADP-ribose; these bind G-quadruplexes (unusual nucleic-acid structures formed by consecutive guanosine nucleotides). SARS-CoV SUD-N and SUD-M are not included in this group.


Pssm-ID: 438957  Cd Length: 127  Bit Score: 150.01  E-value: 1.95e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1294 EVVVNPANGHMAHGGGVAKAIAVAAGQQFVKEtTDMVKSKGVCATGDCYVSTGGKLCKTVLNVVGPDARTqgKQSYALLE 1373
Cdd:cd21557     2 DVVVNAANENLKHGGGVAGAIYKATGGAFQKE-SDYIKKNGPLKVGTAVLLPGHGLAKNIIHVVGPRKRK--GQDDQLLA 78
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 190360135 1374 RVYKHLNK-YDCVVTTLISAGIFSVPSDVSLTYLLGTAKK---QVVLVS 1418
Cdd:cd21557    79 AAYKAVNKeYGSVLTPLLSAGIFGVPPEQSLNALLDAVDTtdaDVTVYC 127
alpha_betaCoV_Nsp2 cd21511
alpha- and betacoronavirus non-structural protein 2; Coronavirus Nsps are encoded in ORF1a and ...
251-500 2.40e-40

alpha- and betacoronavirus non-structural protein 2; Coronavirus Nsps are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases Nsp3 and Nsp5 into smaller subunits. Nsp2 is one of these subunits. This alpha- and betacoronavirus family includes alphacoronavirus human coronavirus 229E (HCoV-229E) Nsp2, betacoronavirus Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2, SARS-CoV-2 Nsp2, and Murine hepatitis virus (MHV) Nsp2 (also known as p65). The function of Nsp2 remains unclear. SARS-CoV Nsp2, rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers. It has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2, which have been implicated in cellular functions, including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2/p65, different from SARS-CoV Nsp2, may play an important role in the viral life cycle. This family may be distantly related to the gammacoronavirus Avian infectious bronchitis virus (IBV) Nsp2; IBV Nsp2 is a weak protein kinase R (PKR) antagonist, which may suggest that it plays a role in interfering with intracellular immunity.


Pssm-ID: 439197  Cd Length: 399  Bit Score: 156.56  E-value: 2.40e-40
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  251 LYVDQYGCDYTGGLADGLEAYADKTLQEM---------KALFPIWSQELPFDVTVAWHVVRdPRYVMRLQSASTIRSVAy 321
Cdd:cd21511     3 TYVDQYGCGPDGKPVECIKDLLDVAKKGSctlseqldgIELKNGVYDLRDHEVVIAWYVER-KDVPYEKQTIFTIKSAK- 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  322 vanptedlcDGSVVIKEPVHVYADDSII-LRQHNLVDIMSCFYMeadAVVNAFYGV-DLKDCGFVMQFGYIDCeqDLCDF 399
Cdd:cd21511    81 ---------FGTFVGEVPAHVFPLNSIVkEIQPRVKKKKKVTLS---GVIRSFYSKaSPNECNPITLSALVKC--THCDE 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  400 KGWVPGNMIDGFACtTCGHVYETGDLLAQSSGVLPVNPVLHTKSAAGYGgfgcKDSFTLYGQTVVYFGGCVYWSPA---- 475
Cdd:cd21511   147 KSWQTGDFVDGFTC-ECGAEYLNWKLDAQSSGVLPPGAVVKTQCPACVN----RETFLRGGGRIVYFGGAVYSYVGcing 221
                         250       260
                  ....*....|....*....|....*
gi 190360135  476 RNIWIPILKSSVKSYdglvYTGVVG 500
Cdd:cd21511   222 VAYWVPRASSSVGCF----HTGVVG 242
CoV_Nsp6 cd21526
coronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell ...
3539-3836 6.55e-38

coronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, have the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394843  Cd Length: 287  Bit Score: 145.75  E-value: 6.55e-38
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3539 VYQQLAGIKLQSKRTRLVKgivCWIMaSTFLFSCIITAF-VKWTMFMYvttnMLSITFCALCVISLTmllVKHKHLYLTM 3617
Cdd:cd21526     1 VYNQAPGVLLQSVFVVKKT---STFW-SHFLFAAFTMLLaAPLVFPVH----AYVILLMCFTVVTFT---VKHKVAFLTT 69
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3618 YIIPVLFTLlynnylVVYKQTFRGYVYAWLSYYVPSVEYTYTDEVIYGMLLLIGMVFVTLRSINQYL----------FSF 3687
Cdd:cd21526    70 FLLPSLITM------VAIANTFWIQVVTFLRTWYDTVFVSPIAQDLYGYTVALYMLIYAGLATNYTLktlryratsfLSF 143
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3688 IMFVGRVISVVSLWYMGSNLEEEILLMLASLFGTYTWTTALSMAAAKVIAKwvavnVLYFTDIPQIKIVLVCYLFIGYII 3767
Cdd:cd21526   144 LMQNFLTLYTAHYAYKLLPWTESLLFTALTMLSSHSLIGAIVFWLARWMLR-----VEYPIIFPDLAIRVLAYNVIGYVC 218
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 190360135 3768 SCYWGLFSLMNSLFRMPLGVYNYKISVQELRYMNANGLRPPKNSFEALMLNFKLLGIGGVPIIEVSQFQ 3836
Cdd:cd21526   219 TCYFGLMWLANRFFTLTLGVYDYMVSVEQFRYMMAVKLNPPKNAFEVFILNIKLLGIGGNRNIKVATVQ 287
betaCoV_Nsp7 cd21827
betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 ...
3837-3925 2.70e-37

betacoronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of betacoronaviruses including the highly pathogenic Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially where it concerns alpha-helix 4. SARS-CoV Nsp7 forms a 8:8 hexadecameric supercomplex with Nsp8 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.


Pssm-ID: 409253  Cd Length: 83  Bit Score: 136.42  E-value: 2.70e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3837 SKLTDVKCANVVLLNCLQHLHVASNSKLWQYCSTLHNEILATSDLGVAFEKLAQLLIVLFANPAAVDskCLTSIEEVCDD 3916
Cdd:cd21827     1 SKLTDVKCTSVVLLSVLQQLHVESNSKLWAYCVKLHNDILAAKDPTEAFEKFVSLLSVLLSFPGAVD--LDALCSELLDN 78

                  ....*....
gi 190360135 3917 YakdnTVLQ 3925
Cdd:cd21827    79 P----TVLQ 83
CoV_NSP7 pfam08716
Coronavirus replicase NSP7; NSP7 (non structural protein 7) has been implicated in viral RNA ...
3837-3925 6.44e-37

Coronavirus replicase NSP7; NSP7 (non structural protein 7) has been implicated in viral RNA replication and is predominantly alpha helical in structure. It forms a hexadecameric supercomplex with NSP8 that adopts a hollow cylinder-like structure. The dimensions of the central channel and positive electrostatic properties of the cylinder imply that it confers processivity on RNA-dependent RNA polymerase. NSP7 and NSP8 heterodimers play a role in the stabilization of NSP12 regions involved in RNA binding and are essential for a highly active NSP12 polymerase complex.


Pssm-ID: 285878  Cd Length: 83  Bit Score: 135.27  E-value: 6.44e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  3837 SKLTDVKCANVVLLNCLQHLHVASNSKLWQYCSTLHNEILATSDLGVAFEKLAQLLIVLFANPAAVDskcltsIEEVCDD 3916
Cdd:pfam08716    1 SKLTDVKCTNVVLLGLLQKLHVESNSKLWAYCVELHNEILLCDDPTEAFEKLLALLAVLLSKHSAVD------LSDLCDS 74

                   ....*....
gi 190360135  3917 YAKDNTVLQ 3925
Cdd:pfam08716   75 YLENRTILQ 83
deltaCoV-Nsp6 cd21561
deltacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host ...
3530-3836 4.73e-36

deltacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, have the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394847  Cd Length: 296  Bit Score: 140.58  E-value: 4.73e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3530 FEDELTPSDVYQQlAGIKLQSKrtrLVKGIVCWIMasTFLFSCIITAFVKWTMFmyvTTNMLSITFC-ALCVISLTMLLV 3608
Cdd:cd21561     1 FECDWTPEMVYNQ-APINLQSG---VVKKTCMWFF--HFLFMAVIFLLAALHVF---PVHLYPIVLPvFTILAFLLTLTI 71
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3609 KHKHLYLTMYIIPVLFTLLYNNYLVVYKQTFRGYVYAWLSYYVPSVEYTYTDEVIYGMLLLIGMVFVTLRSINQYLFSFI 3688
Cdd:cd21561    72 KHTVVFTTTYLLPSLLMMVVNANTFWIPNTYLRSIYEYVFGSFISERLYGYTVALYILVYAQLAINYTLRTRRYRATSFI 151
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3689 MFVGRVIS---VVSLWY--MGSNLEEEILLMLASLFGTYTWTTALSMaaakviakWVAVNVLYFTDIPQIKIVLVCYLFI 3763
Cdd:cd21561   152 SFCMQALQygyVAHIVYrlLTTPWTEGLLFTAFSLLTSHPLLAALSW--------WLAGRIPLPLILPDLAIRVIVYYVI 223
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 190360135 3764 GYIISCYWGLFSLMNSLFRMPLGVYNYKISVQELRYMNANGLRPPKNSFEALMLNFKLLGIGGVPIIEVSQFQ 3836
Cdd:cd21561   224 GYVMCMRFGLFWLINKFTTIPMGTYKYMVSIEQLKYMMAVKMSPPRNAFEVLWANIRLLGLGGNRNIAVSTVQ 296
CoV_Nsp9 cd21881
coronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) ...
4123-4232 4.98e-36

coronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from coronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for CoV replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoVs replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG at the C-terminus; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409329  Cd Length: 111  Bit Score: 133.79  E-value: 4.98e-36
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4123 NNELMPAKLKTQVVNSGPDQTCNTPT-QCYYNNSYNGKIVYAILSDVDGLKYTKILKDDGNFVVLELDPPCKFTVQDVKG 4201
Cdd:cd21881     1 NNELSPVALKQMSCAAGTDQTCTDDEaKAYYNNSKGGRFVLAITSDKPDLKVARFLKEDGGTIYTELEPPCRFVTDVPKG 80
                          90       100       110
                  ....*....|....*....|....*....|.
gi 190360135 4202 LKIKYLYFVKGCNTLARGWVVGTISSTVRLQ 4232
Cdd:cd21881    81 PKVKYLYFIKNLNSLNRGMVLGSISATVRLQ 111
CoV_peptidase pfam08715
Coronavirus papain-like peptidase; This entry contains coronavirus cysteine endopeptidases ...
1567-1868 3.35e-35

Coronavirus papain-like peptidase; This entry contains coronavirus cysteine endopeptidases that belong to MEROPS peptidase family C16 and are required for proteolytic processing of the replicase polyprotein. All coronaviruses encode between one and two accessory cysteine proteinases that recognize and process one or two sites in the amino-terminal half of the replicase polyprotein during assembly of the viral replication complex. HCoV and TGEV encode two accessory proteinases, called coronavirus papain-like proteinase 1 and 2 (PL1-PRO and PL2-PRO). IBV and SARS encodes only one called PL-PRO. The structure of this protein has shown it adopts a fold similar that of de-ubiquitinating enzymes. The peptidase family C16 domain is about 260 amino acids in length. This domain is predicted to have an alpha-beta structural organization known as the papain-like fold. It consists of three alpha-helices and three strands of antiparallel beta-sheet.


Pssm-ID: 430171  Cd Length: 318  Bit Score: 138.96  E-value: 3.35e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  1567 ILLTVDGVNFTNRFVPVGESFGKSLGNVFCDGVNVTKHKCDINYKGKVFFQF---DNLSSEDLKAVRSSFNFDQKELLAY 1643
Cdd:pfam08715    7 IYLTEDGVNYHSIVVKPGDSLGQQFGQVYAKNKDLSGVFPADDVEDKEILYVpttDWVEFYGFKSILEYYTLDASKYVIY 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  1644 YNMLvncsKWQVVFNGKYFTFKQANNNCFVNVSCLMLQSLNLKFKIVQWQEAWLEFRSGRPARFVSLVLAKGGFKFGDPA 1723
Cdd:pfam08715   87 LSAL----TKNVQYVDGFLILKWRDNNCWISSVIVALQAAKIRFKGQFLTEAWAKLLGGDPTDFVAWCYASCTAKVGDFG 162
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  1724 DSRDFLRVVFSQVDL--TGAICDFEIACKCGVKQEQRTGVDAVMHFGTLSREDLEIGYTVDCSCGKKLIHCVR-FDVPFL 1800
Cdd:pfam08715  163 DANWTLTNLAEHFDAeyTNAFLKKRVCCNCGIKSYELRGLEACIQVRATNLDHFKTGYSNCCVCGANNTDEVIeASLPYL 242
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 190360135  1801 ICSNT---PASVKLPKGVGSANIFKGDKVGHYVHVKCEQSyqLYDASNVKKVTDVTGNLSDCLYLKNLKQT 1868
Cdd:pfam08715  243 LLSATdgpAAVDCLEDGVGTVAFVGSTNSGHYTYQTAKQA--FYDGAKDRKFGKKSPYVTAVYTRFAFKNE 311
Ubl1_cv_Nsp3_N-like cd21467
first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV ...
872-961 3.48e-34

first ubiquitin-like (Ubl) domain located at the N-terminus of coronavirus SARS-CoV non-structural protein 3 (Nsp3) and related proteins; This ubiquitin-like (Ubl) domain (Ubl1) is found at the N-terminus of coronavirus Nsp3, a large multi-functional multi-domain protein which is an essential component of the replication/transcription complex (RTC). The functions of Ubl1 in CoVs are related to single-stranded RNA (ssRNA) binding and to interacting with the nucleocapsid (N) protein. SARS-CoV Ubl1 has been shown to bind ssRNA having AUA patterns, and since the 5'-UTR of the SARS-CoV genome has a number of AUA repeats, it may bind there. In mouse hepatitis virus (MHV), this Ubl1 domain binds the cognate N protein. Adjacent to Ubl1 is a Glu-rich acidic region (also referred to as hypervariable region, HVR); Ubl1 together with HVR has been called Nsp3a. Currently, the function of HVR in CoVs is unknown. This model corresponds to one of two Ubl domains in Nsp3; the other is located N-terminal to the papain-like protease (PLpro) and is not represented by this model.


Pssm-ID: 394822  Cd Length: 89  Bit Score: 127.69  E-value: 3.48e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  872 TIKVFYELDKDFNTILNTACGVFEVDDTVDMEEFYAVVIDAIEEKLSPCKELEgVGAKVSAFLQKLEDNSLFLFDEAGEE 951
Cdd:cd21467     1 TVKVTYELDEVLDTILNKACSPFEVEKDLTVEEFADVVQDAVEEKLSPLLELP-LGDKVDADLDDFIDNPCYLFDEDGDE 79
                          90
                  ....*....|
gi 190360135  952 VLAPKLYCAF 961
Cdd:cd21467    80 VLASEMYCSF 89
betaCoV_Nsp3_NAB cd21795
nucleic acid binding domain of betacoronavirus non-structural protein 3; This model represents ...
1908-2010 1.91e-30

nucleic acid binding domain of betacoronavirus non-structural protein 3; This model represents the nucleic acid binding (NAB) domain of non-structural protein 3 (Nsp3) from betacoronavirus including highly pathogenic human coronaviruses (CoVs) such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV). The NAB domain represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. NAB is a cytoplasmic domain located between the papain-like protease (PLPro) and betacoronavirus-specific marker (betaSM) domains of CoV Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. The NAB domain both binds ssRNA and unwinds dsDNA. It prefers to bind ssRNA containing repeats of three consecutive guanines. A group of residues that form a positively charged patch on the protein surface of SARS-CoV Nsp3 NAB serves as the binding site of nucleic acids. This site is conserved in the NAB of Nsp3 from betacoronavirus in the sarbecovirus subgenus (B lineage), but may not be conserved in the Nsp3 NAB from betacoronaviruses in other lineages.


Pssm-ID: 409347  Cd Length: 110  Bit Score: 118.06  E-value: 1.91e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1908 AQFKTFEKVDGVYTNFKLI---GHTVCDILNAKLGFdsSKEFVEYKVTEWPTATGDVVLATDDLYVKRYERGCITFGKPV 1984
Cdd:cd21795     1 LDVPAAPKPVTVYDNFKLVscqNQSIADDFNRTLGF--TKPGSELLLTVYPNTSGDVVAVSDDNYTVVYKKGSLLMGKPV 78
                          90       100
                  ....*....|....*....|....*.
gi 190360135 1985 IWLsHEQASLNSLTYFNRPLLVDENK 2010
Cdd:cd21795    79 LWV-HKNNTWKKLVPLNKPNVVCLRN 103
A1pp smart00506
Appr-1"-p processing enzyme; Function determined by Martzen et al. Extended family detected by ...
1276-1406 3.39e-29

Appr-1"-p processing enzyme; Function determined by Martzen et al. Extended family detected by reciprocal PSI-BLAST searches (unpublished results, and Pehrson _ Fuji).


Pssm-ID: 214701  Cd Length: 133  Bit Score: 115.10  E-value: 3.39e-29
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   1276 NVCFVKGDIIKVskrvKAEVVVNPANGHMAHGGGVAKAIAVAAGQQFVKEttDMVKS-KGVCATGDCYVSTGGKL-CKTV 1353
Cdd:smart00506    1 ILKVVKGDITKP----RADAIVNAANSDGAHGGGVAGAIARAAGKALSKE--EVRKLaGGECPVGTAVVTEGGNLpAKYV 74
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*....
gi 190360135   1354 LNVVGPDARTQGKQSYALLERVYKHL------NKYDCVVTTLISAGIFSVPSDVSLTYL 1406
Cdd:smart00506   75 IHAVGPRASGHSKEGFELLENAYRNClelaieLGITSVALPLIGTGIYGVPKDRSAQAL 133
alphaCoV_Nsp9 cd21897
alphacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4123-4232 3.54e-28

alphacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) of alphacoronaviruses, including Porcine epidemic diarrhea virus (PEDV), Porcine transmissible gastroenteritis coronavirus (TGEV), and Human coronavirus 229E. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoVs replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409330  Cd Length: 108  Bit Score: 111.25  E-value: 3.54e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4123 NNELMPAKLKTQVVNSGPDQTCNTPTQCYynNSYNGK-IVYAILSDVDGLKYTKiLKDDGNFVVLELDPPCKFTVQDVKG 4201
Cdd:cd21897     1 NNEIMPGKLKQRAVKAEGDGFSGDGKALY--NNEGGKtFMYAFIADKPDLKYVK-WEFDGGCNTIELEPPCKFLVDTPNG 77
                          90       100       110
                  ....*....|....*....|....*....|.
gi 190360135 4202 LKIKYLYFVKGCNTLARGWVVGTISSTVRLQ 4232
Cdd:cd21897    78 PQIKYLYFVKNLNTLRRGAVLGYIGATVRLQ 108
gammaCoV_Nsp9 cd21899
gammacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4121-4232 1.60e-26

gammacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from gammacoronaviruses such as Avian infectious bronchitis virus (IBV). CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoVs replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409332  Cd Length: 113  Bit Score: 106.86  E-value: 1.60e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4121 LQNNELMPAKLKTQVVNSGPDQT-CNTPTQCYYNNSYNGKIVYAILSDVDGLKYTKILKDDGNFVVLELDPPCKFTVQDV 4199
Cdd:cd21899     1 LQNNELMPHGVKTKACVAGVDQAhCSVESKCYYTNISGNSVVAAITSSNPNLKVASFLNEAGNQIYVDLDPPCKFGMKVG 80
                          90       100       110
                  ....*....|....*....|....*....|...
gi 190360135 4200 KGLKIKYLYFVKGCNTLARGWVVGTISSTVRLQ 4232
Cdd:cd21899    81 DKVEVVYLYFIKNTRSIVRGMVLGAISNVVVLQ 113
TM_Y_deltaCoV_Nsp3_C cd21711
C-terminus of deltacoronavirus non-structural protein 3, including transmembrane and Y domains; ...
2263-2750 3.91e-26

C-terminus of deltacoronavirus non-structural protein 3, including transmembrane and Y domains; This model represents the C-terminus of non-structural protein 3 (Nsp3) from deltacoronavirus, including Magpie-robin coronavirus HKU18 and Bulbul coronavirus HKU11, among others. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In the related betacoronaviruses, Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409659  Cd Length: 490  Bit Score: 115.96  E-value: 3.91e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2263 NQYCNGSIACQFCLAGFDMLDNYKaidvvqyeadrrafvdytgvlkiviELIVSYALYTAWFYPLFALISIQILTTwlPE 2342
Cdd:cd21711    54 NSFCAGDLTCQACFDGQDSLHLYK-------------------------HLRVNQQPVQTTDYTVYALSIVLLLAN--PT 106
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2343 LFMLSTLhwsvrlLVSLANM------------LPAHVFMRFYIIIASFIKLFSLFRHVAYGCSKSGCLFCYKRNRSLRVK 2410
Cdd:cd21711   107 LVLGTLL------VVFFVNFygvqipfygtlqLDYQNTLVMVFSVYYFYKVMKFFRHLAKGCKKPTCSICAKKRIPPTIT 180
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2411 CSTIVGGMIRYYDAMANGGTGFCSKHQWNCIDCDSYKPGNTFITvEAALDLSKELKRPIQPTDVAYHTVTDVK-QVGCYM 2489
Cdd:cd21711   181 VETVVQGRKYPSVIETNGGFNICKEHNFYCKNCDSQTPGTFIPT-EAVESLSRKTRLSVKPTAPAYLLARDVEcQTDVVV 259
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2490 -RLFYDRDGQRT---YDDVNASLFVDYSNLLHSkvkSVPNmhvVVVENDADKANFLNA----AVFYAQSLFRPILMVDKN 2561
Cdd:cd21711   260 aRATHNGNAHVCiskYSDIRTVDQLLKPTPLFS---YTPD---VIIAADFDNAGSLKTakelAVVLSMDLKRTIIIIDQA 333
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2562 LITTANTGTSVTETMFDVYvdtflsmfdvdkkSLNALIATA--HSSIKQGTQiCKVLDTFLSCArkscsidsdvdtkcla 2639
Cdd:cd21711   334 YSRPIDNYQEVKSRIEKYY-------------PFQKITPTGdiFADIKQATN-GQASDSAINAA---------------- 383
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2640 dsvMSAVSAGLELTDESCNNLVPTY-LKGDNIVAADLGVLIQ-NSAKhvqGNVaKIAGVSCIWSVDAFNQLSSDFQHKLK 2717
Cdd:cd21711   384 ---ILAVQRGLDFTIDNPNNILPHYaFDFSTLSAEDQSTLIEsGCAK---GNL-KGTNVGVVLSANLVTRLSQKAIRVIA 456
                         490       500       510
                  ....*....|....*....|....*....|....
gi 190360135 2718 KACCKTGLKLKLTYNKQMANVSVLTTPFS-LKGG 2750
Cdd:cd21711   457 NAASRNGVTCAVTPSTLVLRGNIATQPLTrIKAG 490
Macro_Af1521_BAL-like cd02907
macrodomain, Af1521-like family; Macrodomains are found in a variety of proteins with diverse ...
1276-1401 3.16e-25

macrodomain, Af1521-like family; Macrodomains are found in a variety of proteins with diverse cellular functions, as a stand-alone domain or in combination with other domains like in histone macroH2A and some PARPs (poly ADP-ribose polymerases). Macrodomains can recognize ADP-ribose (ADPr) in both its free and protein-linked forms, in related ligands, such as O-acyl-ADP-ribose (OAADPr), and even in ligands unrelated to ADPr. The macrodomains in this family show similarity to Af1521, a protein from Archaeoglobus fulgidus containing a stand-alone macrodomain. Af1521 binds ADP-ribose and exhibits phosphatase activity toward ADP-ribose-1"-monophosphate (Appr-1"-p). Also included in this family are the N-terminal (or first) macrodomains of BAL (B-aggressive lymphoma) proteins which contain multiple macrodomains, such as the first macrodomain of mono-ADP-ribosyltransferase PARP14 (PARP-14, also known as ADP-ribosyltransferase diphtheria toxin-like 8, ATRD8, B aggressive lymphoma protein 2, or BAL2). Most BAL proteins also contain a C-terminal PARP active site and are also named as PARPs. Human BAL1 (or PARP-9) was originally identified as a risk-related gene in diffuse large B-cell lymphoma that promotes malignant B-cell migration. Some BAL family proteins exhibit PARP activity. Poly (ADP-ribosyl)ation is an immediate DNA-damage-dependent post-translational modification of histones and other nuclear proteins. BAL proteins may also function as transcriptional repressors.


Pssm-ID: 394877 [Multi-domain]  Cd Length: 158  Bit Score: 104.88  E-value: 3.16e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1276 NVCFVKGDIIKVskrvKAEVVVNPANGHMAHGGGVAKAIAVAAGQQFVKETTDMVKSKGVCATGDCYVSTGGKL-CKTVL 1354
Cdd:cd02907     3 KVSVYKGDITKE----KVDAIVNAANERLKHGGGVAGAISKAGGPEIQEECDKYIKKNGKLRVGEVVVTSAGKLpCKYVI 78
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 190360135 1355 NVVGPDARTQGKQ-SYALLER-VYKHLNKYD-CVVTTL----ISAGIFSVPSDV 1401
Cdd:cd02907    79 HAVGPRWSGGSKEeCEDLLYKaVLNSLEEAEeLKATSIaipaISSGIFGFPLDL 132
deltaCoV_Nsp8 cd21833
deltacoronavirus non-structural protein 8; This model represents the non-structural protein 8 ...
3932-4082 4.53e-25

deltacoronavirus non-structural protein 8; This model represents the non-structural protein 8 (Nsp8) region of deltacoronaviruses that include White-eye coronavirus HKU16 and Quail coronavirus UAE-HKU30, among others. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9, and Nsp10 form functional complexes with CoV core enzymes and thereby stimulate replication. Most importantly, a complex of Nsp8 with Nsp7 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the genes encoding Nsp8 and Nsp7 have been shown to delay virus growth. Nsp8 and Nsp7 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp8 with Nsp7 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp8 has a novel 'golf-club' fold composed of an N-terminal 'shaft' domain and a C-terminal 'head' domain. The shaft domain contains three helices, one of which is very long, while the head domain contains another three helices and seven beta-strands, forming an alpha/beta fold. SARS-CoV Nsp8 forms a 8:8 hexadecameric supercomplex with Nsp7 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder, while Feline infectious peritonitis virus Nsp8 forms a 1:2 heterotrimer with Nsp7. Regardless of their oligomeric structure, the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to the template length.


Pssm-ID: 409260  Cd Length: 189  Bit Score: 105.48  E-value: 4.53e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3932 VNMASFVEYEVAKKNLDEACSSGSANQQQLKQLeKACNIAKSAYERDRAVARKLERMADLALTNMYKEARINDKKSKVVS 4011
Cdd:cd21833     7 INLDSYRIYKEADAAYKKSVELNEPPQEQKKKL-KAVNIAKAEWEREAASQRKLEKLADAAMKSMYLAERAEDRRIKLTS 85
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 190360135 4012 ALQTMLFSMVRKLDNQALNSILDNAVKGCVPLNAIPSLAANTLTIIVPDKSVYDQVVDNVYVTYAGNVWQI 4082
Cdd:cd21833    86 GLTAMLYHMLRRLDSDRVKALFECAKQQILPIHAIVGVSNDNLKVIFNDKESYLQYVDGNTLIYKGVRYTI 156
Macro pfam01661
Macro domain; The Macro or A1pp domain is a module of about 180 amino acids which can bind ...
1297-1400 4.60e-23

Macro domain; The Macro or A1pp domain is a module of about 180 amino acids which can bind ADP-ribose (an NAD metabolite) or related ligands. Binding to ADP-ribose could be either covalent or non-covalent: in certain cases it is believed to bind non-covalently; while in other cases (such as Aprataxin) it appears to bind both non-covalently through a zinc finger motif, and covalently through a separate region of the protein. This domain is found in a number of otherwise unrelated proteins. It is found at the C-terminus of the macro-H2A histone protein 4 and also in the non-structural proteins of several types of ssRNA viruses such as NSP3 from alpha-viruses and coronaviruses. This domain is also found on its own in a family of proteins from bacteria, archaebacteria and eukaryotes. The 3D structure of the SARS-CoV Macro domain has a mixed alpha/beta fold consisting of a central seven-stranded twisted mixed beta sheet sandwiched between two alpha helices on one face, and three on the other. The final alpha-helix, located on the edge of the central beta-sheet, forms the C terminus of the protein. The crystal structure of AF1521 (a Macro domain-only protein from Archaeoglobus fulgidus) has also been reported and compared with other Macro domain containing proteins. Several Macro domain only proteins are shorter than AF1521, and appear to lack either the first strand of the beta-sheet or the C-terminal helix 5. Well conserved residues form a hydrophobic cleft and cluster around the AF1521-ADP-ribose binding site.


Pssm-ID: 460286  Cd Length: 116  Bit Score: 96.87  E-value: 4.60e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  1297 VNPANGHMAHGGGVAKAIAVAAGQQFVKETTDMVksKGVCATGDCYVSTGGKL-CKTVLNVVGPDARTQGKQS-YALLER 1374
Cdd:pfam01661    1 VNAANSRLLGGGGVAGAIHRAAGPELLEECRELK--KGGCPTGEAVVTPGGNLpAKYVIHTVGPTWRHGGSHGeEELLES 78
                           90       100       110
                   ....*....|....*....|....*....|..
gi 190360135  1375 VYKHL------NKYDCVVTTLISAGIFSVPSD 1400
Cdd:pfam01661   79 CYRNAlalaeeLGIKSIAFPAISTGIYGFPWE 110
TM_Y_gammaCoV_Nsp3_C cd21710
C-terminus of gammacoronavirus non-structural protein 3, including transmembrane and Y domains; ...
2201-2683 5.42e-22

C-terminus of gammacoronavirus non-structural protein 3, including transmembrane and Y domains; This model represents the C-terminus of non-structural protein 3 (Nsp3) from gammacoronavirus, including Infectious bronchitis virus. This conserved C-terminus includes two transmembrane (TM) regions TM1 and TM2, an ectodomain (3Ecto) between the TM1 and TM2 that is glycosylated and located on the lumenal side of the ER, an amphiphatic region (AH1) that is not membrane-spanning, and a large Y domain of approximately 370 residues. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. In the related betacoronaviruses, Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and murine hepatitis virus (MHV), the TM1, 3Ecto and TM2 domains are important for the papain-like protease (PL2pro) domain to process Nsp3-Nsp4 cleavage. It has also been shown that the interaction of 3Ecto with the lumenal loop of Nsp4 is essential for ER rearrangements in cells infected with SARS-CoV or MHV. The Y domain, located at the cytosolic side of the ER, consists of the Y1 and CoV-Y subdomains, which are conserved in nidovirus and coronavirus, respectively. Functional information about the Y domain is limited; it has been shown that Nsp3 binding to Nsp4 is less efficient without the Y domain.


Pssm-ID: 409658  Cd Length: 525  Bit Score: 103.68  E-value: 5.42e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2201 IIATIFLLWFNFIYANVIFSDFYLPKIGFLPTFVGKIAQWIKNTFSLVticdlysiqdvgfknQYCNGSIACQFCLAGFD 2280
Cdd:cd21710    12 VFTALLILWFVYTSNPVMFTGIRVLDFLFEGSFCGPYNDYGKDSFDVL---------------RYCGDDFTCRVCLHDKD 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2281 MLDNYK-AIDVVQYEADRRAFVDYTgvlkivielivsyalyTAWFYPLFALISIQ------ILTTWLPELFMLST----- 2348
Cdd:cd21710    77 SLHLYKhAYSVEQFYKDAVSGISFN----------------WNWLYLVFLILFVKpvagfvIICYCVKYLVLSSTvlqtg 140
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2349 ---LHWSVRLLVSLANMLPAHvfmrFYIIIasFIKLFSLFRHVAYgCSKSGCLFCYKRNRSLRVKCSTIVGGMIRYYDAM 2425
Cdd:cd21710   141 vgfLDWFIQTVFTHFNFMGAG----FYFWL--FYKIYIQVHHILY-CKDITCEVCKRVARSNRHEVSVVVGGRKQLVHVY 213
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2426 ANGGTGFCSKHQWNCIDCDSYKPGNTFITVEAALDLSKELKRPIQPTDVAYHTVTDVKQVGCYMRLFYdRDGQRTYDDVN 2505
Cdd:cd21710   214 TNSGYNFCKRHNWYCRNCDKYGHQNTFMSPEVAGELSEKLKRHVKPTAHAYHVVDDACLVDDFVNLKY-KAATPGKDGAH 292
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2506 ASL-------FVDYSNLLHSKVK--SVPNMHVVVVENDADKA--NFLNAAVFYAQSLFRPILMVDKNLITTANTgTSVTE 2574
Cdd:cd21710   293 SAVkcfsvsdFLKKAVFLKDALKceQISNDSFIVCNTQSAHAleEAKNAAIYYAQYLCKPILILDQALYEQLVV-EPVSK 371
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2575 TMFDVYVDTFLSMFDVDKKSLNaliatahssIKQGTqickVLDTFLSCARKSCSIDSDVdtKCLADSVmsavsaglELTD 2654
Cdd:cd21710   372 SVVDKVCSILSNIISVDTAALN---------YKAGT----LRDALLSVTKDEEAVDMAI--FCHNNDV--------EYTS 428
                         490       500       510
                  ....*....|....*....|....*....|
gi 190360135 2655 ESCNNLVPTY-LKGDNIVAADLGVLIQNSA 2683
Cdd:cd21710   429 DGFTNVVPSYgIDTDKLTPRDRGFLINADA 458
alphaCoV_Nsp7 cd21826
alphacoronavirus non-structural protein 7; This model represents the non-structural protein 7 ...
3837-3925 1.74e-21

alphacoronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of alphacoronaviruses that include Feline infectious peritonitis virus (FCoV), Human coronavirus NL63 (HCoV-NL63), and Porcine transmissible gastroenteritis coronavirus (TGEV), among others. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially where it concerns alpha-helix 4. FCoV Nsp7 forms a 2:1 heterotrimer with Nsp8; the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.


Pssm-ID: 409252  Cd Length: 83  Bit Score: 91.27  E-value: 1.74e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3837 SKLTDVKCANVVLLNCLQHLHVASNSKLWQYCSTLHNEILATSDLGVAFEKLAQLLIVLFANPAAVDskcltsIEEVCDD 3916
Cdd:cd21826     1 SKLTDIKCTNVVLLGCLSSMNVAANSKEWAYCVDLHNKINLCDDPEKAQEMLLALLAFFLSKQKDFG------LDDLLDS 74

                  ....*....
gi 190360135 3917 YAKDNTVLQ 3925
Cdd:cd21826    75 YFDNNSILQ 83
gammaCoV-Nsp6 cd21559
gammacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host ...
3539-3836 2.71e-21

gammacoronavirus non-structural protein 6; Coronaviruses (CoV) redirect and rearrange host cell membranes as part of the viral genome replication and transcription machinery; they induce the formation of double-membrane vesicles in infected cells. CoV non-structural protein 6 (Nsp6), a transmembrane-containing protein, together with Nsp3 and Nsp4, have the ability to induce double-membrane vesicles that are similar to those observed in severe acute respiratory syndrome (SARS) coronavirus-infected cells. By itself, Nsp6 can generate autophagosomes from the endoplasmic reticulum. Autophagosomes are normally generated as a cellular response to starvation to carry cellular organelles and long-lived proteins to lysosomes for degradation. Degradation through autophagy may provide an innate defense against virus infection, or conversely, autophagosomes can promote infection by facilitating the assembly of replicase proteins. In addition to initiating autophagosome formation, Nsp6 also limits autophagosome expansion regardless of how they were induced, i.e. whether they were induced directly by Nsp6, or indirectly by starvation or chemical inhibition of MTOR signaling. This may favor coronavirus infection by compromising the ability of autophagosomes to deliver viral components to lysosomes for degradation.


Pssm-ID: 394845  Cd Length: 307  Bit Score: 97.92  E-value: 2.71e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3539 VYQQLAGIKLQSK-RTRLVKGIVCWIMASTFLFS-CIITAFVKWTMFMYVTTNMlsITFCALCVISLTmllVKHKHLYLT 3616
Cdd:cd21559     3 VFNQVGGVRLQSSfVKKATSWFWSRCVLACFLFVlCAIVLFTAVPLKYYVHAAV--ILLVAVLFISFT---VKHVMAFMD 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3617 MYIIPVLFTLLYNNYLVVyKQTFRGYVYAWLSYYVPSVEYTYTDEVIYGMLLLIgMVFVTLR------SINQYLFSFIMF 3690
Cdd:cd21559    78 TFLLPTLCTVIIGVCAEV-PFIYNTLISQVVIFFSQWYDPVVFDTVVPWMFLPL-VLYTAFKcvqgcySINSFSTSLLVL 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3691 V-----GRVISVVSLW---YMGSNLEEEILLMLASLFGTYTWTTALSMAAAKvIAKWVA--VNVLYFTDIPQIKIVLVCy 3760
Cdd:cd21559   156 YqfmklGFVIYTSSNTltaYTEGNWELFFELVHTTVLANFSSNSLIGLIVFK-IAKWMLyyCNATYFNSYVLMAVMVNV- 233
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 190360135 3761 lfIGYIISCYWGLFSLMNSLFRMPLGVYNYKISVQELRYMNANGLRPPKNSFEALMLNFKLLGIGGVPIIEVSQFQ 3836
Cdd:cd21559   234 --IGWLFTCYFGLYWWLNKVFGLTLGKYNYKVSVEQYKYMCLHKIRPPKSVWDVFSTNMLIQGIGGERVLPIATVQ 307
YmdB COG2110
O-acetyl-ADP-ribose deacetylase (regulator of RNase III), contains Macro domain [Translation, ...
1277-1398 8.88e-20

O-acetyl-ADP-ribose deacetylase (regulator of RNase III), contains Macro domain [Translation, ribosomal structure and biogenesis];


Pssm-ID: 441713  Cd Length: 168  Bit Score: 89.47  E-value: 8.88e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1277 VCFVKGDIIKVskrvKAEVVVNPANGHMAHGGGVAKAIAVAAGQQFVKETTDMVKSKGvCATGDCYVSTGGKL-CKTVLN 1355
Cdd:COG2110     1 IEIVQGDITEL----DVDAIVNAANSSLLGGGGVAGAIHRAAGPELLEECRRLCKQGG-CPTGEAVITPAGNLpAKYVIH 75
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*....
gi 190360135 1356 VVGPDARTQGKQSYALLERVYKHL----NKYDC--VVTTLISAGIFSVP 1398
Cdd:COG2110    76 TVGPVWRGGGPSEEELLASCYRNSlelaEELGIrsIAFPAIGTGVGGFP 124
gammaCoV_PLPro cd21733
gammacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1567-1829 1.08e-19

gammacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of gammacoronavirus, including Avian coronavirus, Canada goose coronavirus, and Beluga whale coronavirus SW1. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. Besides cleaving the polyproteins, PLPro also possesses a related enzymatic activity to promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both, ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15), are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain in several CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation.


Pssm-ID: 409650  Cd Length: 304  Bit Score: 93.26  E-value: 1.08e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1567 ILLTVDGVNFTNRFVPVGESFGKsLGNVFCDGVNVTKhKCDINYKGKVFfqfdnLSSEDlKAVRSSFNFDQKELLAYYNM 1646
Cdd:cd21733     7 IYLTEDGVKYRSVVVKPGDSLSQ-FGQVFARNKTVFT-ADDVEDKEILF-----IPTTD-KAVLEYYGLDAQKYVIYLQT 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1647 LVNcsKWQVVFNGKYFTFKQANNNCFVNVSCLMLQSLNLKFK-IVqwQEAWLEFRSGRPARFVSLVLAKGGFKFGDPADS 1725
Cdd:cd21733    79 LAQ--KWNVQYRDNFLILEWRDGNCWISSAIVLLQAAKIRFKgFL--AEAWAKFLGGDPTEFVAWCYASCNAKVGDFSDA 154
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1726 R---DFLRVVFSqVDLTGAICDFEIACKCGVKQEQRTGVDA---------VMHFGTlsredleiGYTVDCSCGKKLI-HC 1792
Cdd:cd21733   155 NwllANLAEYFD-ADYTNAFLKRRVSCNCGVKNYELRGLEAciqpvrapnLLHFKT--------QYSNCPTCGANSVdEV 225
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|
gi 190360135 1793 VRFDVPF--LICSNTPASVKLPKGVGSANIFKGD-KVGHY 1829
Cdd:cd21733   226 VEASLPYllLLATDGPATVDCDENAVGNVVFIGStNSGHC 265
CoV_Nsp7 cd21811
coronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) ...
3837-3925 2.74e-19

coronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of alpha-, beta-, gamma- and deltacoronaviruses, including highly pathogenic betacoronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially where it concerns alpha-helix 4. SARS-CoV Nsp7 forms a 8:8 hexadecameric supercomplex with Nsp8 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder, while Feline infectious peritonitis virus Nsp7 forms a 2:1 heterotrimer with Nsp8. Regardless of their oligomeric structure, the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.


Pssm-ID: 409251  Cd Length: 83  Bit Score: 85.23  E-value: 2.74e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3837 SKLTDVKCANVVLLNCLQHLHVASNSKLWQYCSTLHNEILATSDLGVAFEKLAQLLIVLFANPAAVDskcltsIEEVCDD 3916
Cdd:cd21811     1 SKLTDVKCTAVVLLSLLQKLRVESNSKLWKQCVQLHNDILLAKDTTEVFEKLVSLLSVLLSMQGAVD------LNRLCEE 74

                  ....*....
gi 190360135 3917 YAKDNTVLQ 3925
Cdd:cd21811    75 MLENRAVLQ 83
deltaCoV_Nsp9 cd21900
deltacoronavirus non-structural protein 9; This model represents the non-structural protein 9 ...
4123-4232 8.11e-19

deltacoronavirus non-structural protein 9; This model represents the non-structural protein 9 (Nsp9) from deltacoronaviruses such as the Porcine delta coronavirus (PDCoV) Porcine coronavirus HKU15. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. All of these Nsps, except for Nsp1 and Nsp2, are considered essential for transcription, replication, and translation of the viral RNA. Nsp9, with Nsp7, Nsp8, and Nsp10, localizes within the replication complex. Nsp9 is an essential single-stranded RNA-binding protein for coronavirus replication; it shares structural similarity to the oligosaccharide-binding (OB) fold, which is characteristic of proteins that bind to ssDNA or ssRNA. Nsp9 requires dimerization for binding and orienting RNA for subsequent use by the replicase machinery. CoV Nsp9s have diverse forms of dimerization that promote their biological function, which may help elucidate the mechanism underlying CoVs replication and contribute to the development of antiviral drugs. Generally, dimers are formed via interaction of the parallel alpha-helices containing the protein-protein interaction motif GXXXG; additionally, the N-finger region may also play a critical role in dimerization as seen in porcine delta coronavirus (PDCoV) Nsp9. As a member of the replication complex, Nsp9 may not have a specific RNA-binding sequence but may act in conjunction with other Nsps as a processivity factor, as shown by mutation studies indicating that Nsp9 is a key ingredient that intimately engages other proteins in the replicase complex to mediate efficient virus transcription and replication.


Pssm-ID: 409333  Cd Length: 109  Bit Score: 84.79  E-value: 8.11e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 4123 NNELMPAKLKTQVvNSGPDQTCNTPT-QCYYNNSYNGKIVYAILSDVDGLKYTKILKDDGNfVVLELDPPCKFTVQDVKG 4201
Cdd:cd21900     1 NNELCLRNVFTAQ-NTASDGNGNESTaKSFYVSRTGKKILVAVTSTKDNLKTVTCDTDTGK-VVLNLDPPMRFSHVVGGK 78
                          90       100       110
                  ....*....|....*....|....*....|.
gi 190360135 4202 LKIKYLYFVKGCNTLARGWVVGTISSTVRLQ 4232
Cdd:cd21900    79 QSVVYLYFIQNISSLNRGMVIGHISGTTILQ 109
gammaCoV_Nsp7 cd21828
gammacoronavirus non-structural protein 7; This model represents the non-structural protein 7 ...
3837-3925 2.57e-17

gammacoronavirus non-structural protein 7; This model represents the non-structural protein 7 (Nsp7) of gammacoronaviruses that include Avian infectious bronchitis virus (IBV) and Canada goose coronavirus (CGCoV), among others. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. Upon processing of the Nsp7-10 region by protease M (Mpro), the released four small proteins Nsp7, Nsp8, Nsp9 and Nsp10 form functional complexes with CoV core enzymes and stimulate replication. Most importantly, a complex of Nsp7 with Nsp8 has been shown to activate and confer processivity to the RNA-synthesizing activity of Nsp12, the RNA-dependent RNA-polymerase (RdRp); in SARS-CoV, point mutations in the NSP7- or NSP8-coding region have been shown to delay virus growth. Nsp7 and Nsp8 cooperate in activating the primer-dependent activity of the Nsp12 RdRp such that the level of their association may constitute a limiting factor for obtaining a high RNA polymerase activity. The subsequent Nsp7/Nsp8/Nsp12 polymerase complex is then able to associate with an active bifunctional Nsp14, which includes N-terminal 3' to 5' exoribonuclease (ExoN) and C-terminal N7-guanine cap methyltransferase (N7-MTase) activities, thus representing a unique coronavirus Nsp assembly that incorporates RdRp, exoribonuclease, and N7-MTase activities. Interaction of Nsp7 with Nsp8 appears to be conserved across the coronavirus family, making these proteins interesting drug targets. Nsp7 has a 4-helical bundle conformation which is strongly affected by its interaction with Nsp8, especially where it concerns alpha-helix 4. SARS-CoV Nsp7 forms a 8:8 hexadecameric supercomplex with Nsp8 that adopts a hollow cylinder-like structure with a large central channel and positive electrostatic properties in the cylinder, while Feline infectious peritonitis virus Nsp7 forms a 2:1 heterotrimer with Nsp8. Regardless of their oligomeric structure, the Nsp7/Nsp8 complex functions as a noncanonical RNA polymerase capable of synthesizing RNA of up to template length.


Pssm-ID: 409254  Cd Length: 83  Bit Score: 79.45  E-value: 2.57e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 3837 SKLTDVKCANVVLLNCLQHLHVASNSKLWQYCSTLHNEILATSDLGVAFEKLAQLLIVLFANPAAVDskcltsIEEVCDD 3916
Cdd:cd21828     1 SKLTDVKCTTVVLMQLLTKLNVEANSKMHKYLVELHNKILASDDVVECMDNLLGMLVTLLCIDSTID------LSEYCDD 74

                  ....*....
gi 190360135 3917 YAKDNTVLQ 3925
Cdd:cd21828    75 ILKRSTVLQ 83
CoV_NSP2_C pfam19212
Coronavirus replicase NSP2, C-terminal; This entry corresponds to a presumed domain found at ...
678-851 3.70e-17

Coronavirus replicase NSP2, C-terminal; This entry corresponds to a presumed domain found at the C-terminus of Coronavirus non-structural protein 2 (NSP2). NSP2 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. The function of NSP2 is uncertain. This presumed domain is found in two copies in some viral NSP2 proteins. This domain is found in both alpha and betacoronaviruses.


Pssm-ID: 465996  Cd Length: 156  Bit Score: 81.54  E-value: 3.70e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   678 VLHGAYIVVESDIYFVknIPRYASAVAQAFRSVAKVVLDSLRVTFIDGLSCFKIGRRRICLSGskiyevergLLHSsqlp 757
Cdd:pfam19212    1 LKNAKFTVVNGGIVFV--VPKKFKSLVGTLLDLLNKLFDSLVDTVKIAGVKFKAGGTYYLFSN---------ALVK---- 65
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   758 ldVYDLTMPSQVQKT-KQKPIYLKGSGSDFSLADSVVEVVTTSLTPCGYSEPPKVADKICIVDNVYMAKAGDKYYPVVVD 836
Cdd:pfam19212   66 --VVSVKLKGKKQAGlKGAKEATVFVGATVPVTPTRVEVVTVELEEVDYVPPPVVVGYVVVIDGYAFYKSGDEYYPASTD 143
                          170
                   ....*....|....*
gi 190360135   837 GHVglLDQAWRVPCA 851
Cdd:pfam19212  144 GVV--VPPVFKLKGG 156
betaCoV_Nsp3_betaSM cd21727
betacoronavirus-specific marker of betacoronavirus non-structural protein 3; This model ...
2039-2134 1.38e-15

betacoronavirus-specific marker of betacoronavirus non-structural protein 3; This model represents the betacoronavirus-specific marker (betaSM), also called group 2-specific marker (G2M), of non-structural protein 3 (Nsp3) from betacoronavirus, including highly pathogenic human coronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV). The betaSM/G2M is located C-terminal to the nucleic acid-binding (NAB) domain. This region is absent in alpha- and deltacoronavirus Nsp3; there is a gammacoronavirus-specific marker (gammaSM) at this position in gammacoronavirus Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. Little is known about the betaSM/G2M domain; it is predicted to be non-enzymatic and may be an intrinsically disordered region. The betaSM/G2M domain is part of the predicted PLnc domain (made up of 385 amino acids) of SARS-CoV Nsp3 that may function as a replication/transcription scaffold, with interactions to Nsp5, Nsp12, Nsp13, Nsp14, and Nsp16.


Pssm-ID: 409626  Cd Length: 125  Bit Score: 76.03  E-value: 1.38e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 2039 IKLSGVKKPFKVEDSVIVNDDTSEIKYVKSLSIVDVYDMWLTGCR-CVVRTANALSRAVNVPTIRK--FIKFGMTLVSIP 2115
Cdd:cd21727    18 VILKGLKKPFVVNGNVSVVDNDSGTKVVEELSKTDLYTMYVDGKYqVVVLKANELSRVLGLHTVEShaAVNVLASGSVTR 97
                          90
                  ....*....|....*....
gi 190360135 2116 IDLLNLREIKPVFNVVKAV 2134
Cdd:cd21727    98 YAKLLLRASFYFVEFTKAT 116
PRK00431 PRK00431
ADP-ribose-binding protein;
1280-1426 3.94e-15

ADP-ribose-binding protein;


Pssm-ID: 234759  Cd Length: 177  Bit Score: 76.42  E-value: 3.94e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1280 VKGDIIKVSkrvkAEVVVNPANGHMAHGGGVAKAIAVAAGQQFVKETTDMVKSKGVCATGDCYVSTGGKL-CKTVLNVVG 1358
Cdd:PRK00431    8 VQGDITELE----VDAIVNAANSSLLGGGGVDGAIHRAAGPEILEECRELRQQQGPCPTGEAVITSAGRLpAKYVIHTVG 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1359 PDARTQGKQSYALLERVYKH-LNKYDCV-VTTL----ISAGIFSVPSD---------VSLTYLLGTAKKQVVLVSNNQED 1423
Cdd:PRK00431   84 PVWRGGEDNEAELLASAYRNsLRLAAELgLRSIafpaISTGVYGYPLEdaariavktVREFLTRHKSPEEVYFVCYDEEA 163

                  ...
gi 190360135 1424 FDL 1426
Cdd:PRK00431  164 YRL 166
Macro_OAADPr_deacetylase cd02908
macrodomain, O-acetyl-ADP-ribose (OAADPr) family; Macrodomains are found in a variety of ...
1277-1429 2.28e-14

macrodomain, O-acetyl-ADP-ribose (OAADPr) family; Macrodomains are found in a variety of proteins with diverse cellular functions, as a stand-alone domain or in combination with other domains like in histone macroH2A and some PARPs (poly ADP-ribose polymerases). Macrodomains can recognize ADP-ribose (ADPr) in both its free and protein-linked forms, in related ligands, such as O-acyl-ADP-ribose (OAADPr), and even in ligands unrelated to ADPr. This family includes eukaryotic macrodomain proteins such as human MacroD1 and MacroD2, and bacterial proteins such as Escherichia coli YmdB; these have been shown to be O-acetyl-ADP-ribose (OAADPr) deacetylases that efficiently catalyze the hydrolysis of OAADPr to produce ADP-ribose and free acetate. OAADPr is a sirtuin reaction product generated from the NAD+-dependent protein deacetylation reactions and has been implicated as a signaling molecule. By acting on mono-ADP-ribosylated substrates, OAADPr deacetylases may reverse cellular ADP-ribosylation.


Pssm-ID: 438955  Cd Length: 166  Bit Score: 73.70  E-value: 2.28e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1277 VCFVKGDIIKVskrvKAEVVVNPANGHMAHGGGVAKAIAVAAGQQFVKETtdmVKSKGVCATGDCYVSTGGKL-CKTVLN 1355
Cdd:cd02908     2 ISLWRGDITKL----EVDAIVNAANSSLLGGGGVDGAIHRAAGPELLEEC---RKLGGVCPTGEAKITPGYNLpAKYVIH 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1356 VVGPDARTQGKQSYALLERVYKH------LNKYDCVVTTLISAGIFSVPSD----VSLTYLLGTAKK-----QVVLVSNN 1420
Cdd:cd02908    75 TVGPIGEGGVEEEPELLASCYRSslelalENGLKSIAFPCISTGIYGYPNEeaaeIALNTVREWLEEhdkidRIIFVVFL 154

                  ....*....
gi 190360135 1421 QEDFDLISK 1429
Cdd:cd02908   155 DEDYKIYEE 163
Macro_SF cd02749
macrodomain superfamily; Macrodomains are found in a variety of proteins with diverse cellular ...
1295-1407 3.32e-13

macrodomain superfamily; Macrodomains are found in a variety of proteins with diverse cellular functions, as a stand-alone domain or in combination with other domains like in histone macroH2A and some PARPs (poly ADP-ribose polymerases). Macrodomains can recognize ADP-ribose (ADPr) in both its free and protein-linked forms, in related ligands, such as O-acyl-ADP-ribose (OAADPr), and even in ligands unrelated to ADPr. Macrodomains include the yeast macrodomain Poa1 which is a phosphatase of ADP-ribose-1"-phosphate, a by-product of tRNA splicing. Some macrodomains have ADPr-unrelated binding partners such as the coronavirus SUD-N (N-terminal subdomain) and SUD-M (middle subdomain) of the SARS-unique domain (SUD) which bind G-quadruplexes (unusual nucleic-acid structures formed by consecutive guanosine nucleotides). Macrodomains regulate a wide variety of cellular and organismal processes, including DNA damage repair, signal transduction, and immune response.


Pssm-ID: 394871  Cd Length: 121  Bit Score: 68.96  E-value: 3.32e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1295 VVVNPANGHMAHGGGVAKAIAVAAGQQFVKETTDmVKSKGVCATGDCYVSTGGKL-CKTVLNVVGPDARTQgKQSYALLE 1373
Cdd:cd02749     2 AIVNPANNDLYLGGGVAKAISKKAGGDLQEECEE-RKKNGYLKVGEVAVTKGGNLpARYIIHVVGPVASSK-KKTYEPLK 79
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 190360135 1374 RVYKHLN------KYDCVVTTLISAGIFSVPSDVSLTYLL 1407
Cdd:cd02749    80 KCVKNCLsladekGLKSVAFPAIGTGIAGFPPEEAARIML 119
SUD_C_DPUP_CoV_Nsp3 cd21513
C-terminal SARS-Unique Domain (SUD) of betacoronavirus non-structural protein 3 (Nsp3); This ...
1491-1560 1.47e-12

C-terminal SARS-Unique Domain (SUD) of betacoronavirus non-structural protein 3 (Nsp3); This family contains the SUD-C of Nsp3 from Severe Acute Respiratory Syndrome (SARS) coronavirus (CoV), Middle East respiratory syndrome-related (MERS) CoV, and Rousettus bat CoV HKU9, as well as the DPUP (domain preceding Ubl2 and PLP2) of murine hepatitis virus (MHV) Nsp3. Though structurally similar, there is little sequence similarity between these four domain subfamilies: SARS SUD-C, MERS SUD-C, HKU9 SUD-C, and MHV DPUP. Non-structural protein 3 (Nsp3) is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. Nsp3 of SARS coronavirus includes a SARS-unique domain (SUD) consisting of three globular domains separated by short linker peptide segments: SUD-N, SUD-M, and SUD-C. SUD-N and SUD-M are macro domains which bind G-quadruplexes (unusual nucleic-acid structures formed by consecutive guanosine nucleotides). The SUD-C domain adopts a frataxin-like fold and has structural similarity to DNA-binding domains of DNA-modifying enzymes. It binds to single-stranded RNA and recognizes purine bases more strongly than pyrimidine bases. SUD-C also regulates the RNA binding behavior of the SUD-M macrodomain. SUD-C is not as specific to SARS CoV Nsp3 as originally thought, and is conserved in the Nsp3s of all four lineages (A-D) of betacoronavirus.


Pssm-ID: 394838  Cd Length: 71  Bit Score: 65.65  E-value: 1.47e-12
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1491 DDARTFVQSNVDVVPEGWRVVNKFYQINGVRTVKYFECPGGIdICSQDKVFGYVQQGSFNKATVAQIKAL 1560
Cdd:cd21513     1 TDERVFVQAVMLNGPRDWRLVNKFDSVDGVRYKKYLKRGGIF-VCSQDKKFYYVQNDVFLEFSVSKIRAL 69
alphaCoV_PLPro cd21731
alphacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1565-1848 1.55e-11

alphacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in non-structural protein 3 (Nsp3) of alphacoronavirus, including Swine acute diarrhea syndrome coronavirus (SADS-CoV) which causes severe diarrhea in piglets, and Human coronavirus 229E which infects humans and bats and causes the common cold. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. Besides cleaving the polyproteins, PLPro also possesses a related enzymatic activity to promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both, ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15), are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain in SADS-CoV and many others has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation.


Pssm-ID: 409648  Cd Length: 289  Bit Score: 68.42  E-value: 1.55e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1565 VDILLTVDGVNFTNRFVPVGESFGKSLGNVFCDGVNVTKHKCDI-NYKG---------KVFFQFDN---LSSEDlkavRS 1631
Cdd:cd21731     3 VVVKVTEDGRNVKDVVVDTDKTFGEQLGVCSVNDKDVTGVVPPDdSDKVvsvapdvdwDSHYGFPNaavFHTLD----HS 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1632 SFNFDQKELlayynmlvncskwqvvfNGKYfTFKQANNNCFVNVSCLMLQSLNLKFKIVQWQEAWLEFRSGRPARFVSLV 1711
Cdd:cd21731    79 AYAFESDIV-----------------NGKR-VLKQSDNNCWVNAVCLQLQFAKPTFKSEGLQALWNKFLTGDVAGFVHWL 140
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1712 LAKGGFKFGDPADSRDFLRVVFSQVDLTGAI-------CDfeiACKcgvkqEQRTGVDAVMHfGTLSREDLEIGYtvdCS 1784
Cdd:cd21731   141 YWITGANKGDPGDAENTLNKLSKYLVSSGSVtverttgCD---SCN-----SKRTVTTPVVN-ASVLRSGVDDGV---CK 208
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1785 CGKKLI---HCVRFDVPFLICSNTPAS--VKLPKGVgSANIFKGD-KVGHYVhVKCEQSYQLYDASNVKK 1848
Cdd:cd21731   209 HGVKVTtrvVSVKGTVIITSVGKPVVSdaLLLLDGV-SYTAFSGDvDNGHYT-VYDKATGKVYDGDKTVS 276
CoV_NSP2_N pfam19211
Coronavirus replicase NSP2, N-terminal; This entry corresponds to the N-terminal region of ...
252-471 5.17e-10

Coronavirus replicase NSP2, N-terminal; This entry corresponds to the N-terminal region of coronavirus non-structural protein 2. NSP2 is encoded by ORF1a/1ab and proteolytically released from the pp1a/1ab polyprotein. The function of this protein is uncertain. This region contains numerous conserved and semi-conserved cysteine residues.


Pssm-ID: 465995 [Multi-domain]  Cd Length: 204  Bit Score: 62.36  E-value: 5.17e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   252 YVDQYGCDYTGG-LADGLEAYADKTLQEmkalfpiwSQELPFD---VTVAWHVVRDPRYVMRlQSASTIRSVAYVANPTE 327
Cdd:pfam19211    4 PVDQYMCGADGKpVLPEDTWCFKDYFGD--------DGEIVLNggtYRKAWKVVRKNVPYPK-QSLFTINSITYLGDIPH 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   328 DLCDGSV--VIKEPVH----VYADDSIILRQHNLVDIMSCFYMEADAVVNAFYGVdlkdcgfvmqfGYIDCEqdlCDFKG 401
Cdd:pfam19211   75 VLPNGAVlhVAPRVKKskkvVLSEKYKSLYDTYGSPFVTNGSTLLEIVPKPVFHH-----------ALVKCS---CGRES 140
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   402 WVPGNMIdGFACtTCGHVYetGDLLAQSSGVLPVNPVLHTKSAAGYGgfgckdsftlygqtVVYFGGCVY 471
Cdd:pfam19211  141 WTVGDWS-GFKC-LCCGVY--GKPICVSAGDVKPGDVLITKAPVGRG--------------KKFFGGAVL 192
Macro_H2A-like cd02904
macrodomain, macroH2A-like family; Macrodomains are found in a variety of proteins with ...
1280-1359 4.14e-08

macrodomain, macroH2A-like family; Macrodomains are found in a variety of proteins with diverse cellular functions, as a stand-alone domain or in combination with other domains like in histone macroH2A and some PARPs (poly ADP-ribose polymerases). Macrodomains can recognize ADP-ribose (ADPr) in both its free and protein-linked forms, in related ligands, such as O-acyl-ADP-ribose (OAADPr), and even in ligands unrelated to ADPr. Members of this family are similar to macroH2A, a variant of the major-type core histone H2A, which contains an N-terminal H2A domain and a C-terminal nonhistone macrodomain. Histone macroH2A is enriched on the inactive X chromosome of mammalian female cells. It does not bind poly ADP-ribose, but does bind the monomeric SirT1 metabolite O-acetyl-ADP-ribose (OAADPR) with high affinity through its macrodomain. This family also includes the ADP-ribose binding macrodomain of the macroH2A variant, macroH2A1.1. The macroH2A1.1 isoform inhibits PARP1-dependent DNA-damage induced chromatin dynamics. The putative ADP-ribose binding pocket of the human macroH2A2 macrodomain exhibits marked structural differences compared with the macroH2A1.1 variant.


Pssm-ID: 394875  Cd Length: 188  Bit Score: 56.17  E-value: 4.14e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1280 VKGDIIKVskrvKAEVVVNPANGHMAHGGGVAKAIAVAAGQQFVKETTDMVKSKGVCATGDCYVSTGGKL-CKTVLNVVG 1358
Cdd:cd02904    23 VQGDIASI----KADAIVHPTNATFYLGGEVGSALEKAGGKEFVEEVKELRKSNGPLEVAGAAISPGHNLpAKFVIHCNS 98

                  .
gi 190360135 1359 P 1359
Cdd:cd02904    99 P 99
betaCoV_Nsp2_MERS-like cd21517
betacoronavirus non-structural protein 2 (Nsp2) similar to MERS-CoV Nsp2, and related proteins ...
372-522 3.64e-06

betacoronavirus non-structural protein 2 (Nsp2) similar to MERS-CoV Nsp2, and related proteins from betacoronaviruses in the C lineage; Coronavirus non-structural proteins (Nsps) are encoded in ORF1a and ORF1b. Post infection, the genomic RNA is released into the cytoplasm of the cell and translated into two long polyproteins (pp), pp1a and pp1ab, which are then autoproteolytically cleaved by two viral proteases Nsp3 and Nsp5 into smaller subunits. Nsp2 is one of these subunits. This subgroup includes Nsp2 from Middle East respiratory syndrome-related coronavirus (MERS-CoV) and betacoronaviruses in the merbecovirus subgenus (C lineage). It belongs to a family which includes Severe acute respiratory syndrome coronavirus (SARS-CoV) Nsp2, and Murine hepatitis virus (MHV) Nsp2 (also known as p65). The function of Nsp2 remains unclear. SARS-CoV Nsp2, rather than playing a role in viral replication, may be involved in altering the host cell environment; deletion of Nsp2 from the SARS-CoV genome results in only a modest reduction in viral titers. It has been shown to interact with two host proteins, prohibitin 1 (PHB1) and PHB2, which have been implicated in cellular functions, including cell-cycle progression, cell migration, cellular differentiation, apoptosis, and mitochondrial biogenesis. MHV Nsp2/p65, different from SARS-CoV Nsp2, may play an important role in the viral life cycle.


Pssm-ID: 394868  Cd Length: 660  Bit Score: 53.20  E-value: 3.64e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  372 AFYGVD-LKDCGFVMQFGYIDCEQdlCDFKGWVPGNMIDGFACTtCGHVYETGDLLAQSSGVLPVNPVLHTKSAAGYGGf 450
Cdd:cd21517   124 TFYGKDaVENPSYIYHSAFVDCTS--CGNGSWLTGNAVQGFACD-CGASYSANDVELQSSGLVKPNALFCATCPFAKGD- 199
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135  451 GCKDSFT-LYGQTVVY-----------------FGGCVY----WSPARNIWIPILKSSVKSYDGLVYTGVVGCKAIVKET 508
Cdd:cd21517   200 SCSSSCKhTVAQVVSYlsekcvvepdgksftltFGGVVYaymgCSEGTMYFVPRAKSVVSRIGDAIFTGCVGTWSKVTQI 279
                         170
                  ....*....|....
gi 190360135  509 nlicKALYLDYVQH 522
Cdd:cd21517   280 ----ANLFLEQAQR 289
betaCoV_Nsp1 cd21876
non-structural protein 1 from betacoronavirus; This model represents the non-structural ...
57-196 4.65e-06

non-structural protein 1 from betacoronavirus; This model represents the non-structural protein 1 (Nsp1) from betacoronaviruses, including highly pathogenic coronaviruses such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV), SARS-CoV2 (also called 2019 novel CoV or 2019-nCoV), and Middle East respiratory syndrome-related (MERS) CoV. CoVs utilize a multi-subunit replication/transcription machinery assembled from a set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins. Nsp1 is the N-terminal cleavage product released from the ORF1a polyprotein by the action of papain-like protease (PLpro). Though Nsp1s of alphaCoVs and betaCoVs share structural similarity, they show no significant sequence similarity and may be considered as genus-specific markers. Despite low sequence similarity, the Nsp1s of alphaCoVs and betaCoVs exhibit remarkably similar biological functions, and are involved in the regulation of both host and viral gene expression. CoV Nsp1 induces suppression of host gene expression and interferes with host immune response. It inhibits host gene expression in two ways: by targeting the translation and stability of cellular mRNAs, and by inhibiting mRNA translation and inducing an endonucleolytic RNA cleavage in the 5'-UTR of cellular mRNAs through its tight association with the 40S ribosomal subunit, a key component of the cellular translation machinery. Inhibition of host mRNA translation includes that of type I interferons, major components of the host innate immune response. Nsp1 is critical in regulating viral replication and gene expression, as shown by multiple evidences, including: mutations in the Nsp1 coding region of the transmissible gastroenteritis virus (TGEV) and murine hepatitis virus (MHV) genomes cause drastic reduction or elimination of infectious virus; bovine coronavirus (BCoV) Nsp1 is an RNA-binding protein that interacts with cis-acting replication elements in the 5'-UTR of the BCoV genome, implying its potential role in the regulation of viral translation or replication; and SARS-CoV Nsp1 enhances virus replication by binding to a stem-loop structure in the 5'-UTR of its genome.


Pssm-ID: 409338  Cd Length: 114  Bit Score: 48.56  E-value: 4.65e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135   57 HVMVDCRRLLKQECCVQSSLIR-EIVMNTrpydlevlLQDALQSREAVLVTPPLGMS--LEACYV----RGCNPNGWtmg 129
Cdd:cd21876     1 HVSLTLPWLQALENPVQPWIDRpEEALES--------AKAALAEGKLVFVPPYKGLHplLPGPRVflvrRHGNPTRP--- 69
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 190360135  130 lfrrrsvCNTGRCAVNKHVAYqlymidpagvCFGAGQFVGWviplaFMPVQSRKFIVPWVMYLRKCG 196
Cdd:cd21876    70 -------FDVRELAADADGVN----------YGRSGRTIGV-----LVPLDGEQPYGYINILLRKYG 114
Macro_BAL-like cd02903
macrodomain, B-aggressive lymphoma (BAL)-like family; Macrodomains are found in a variety of ...
1280-1359 1.33e-05

macrodomain, B-aggressive lymphoma (BAL)-like family; Macrodomains are found in a variety of proteins with diverse cellular functions, as a stand-alone domain or in combination with other domains like in histone macroH2A and some PARPs (poly ADP-ribose polymerases). Macrodomains can recognize ADP-ribose (ADPr) in both its free and protein-linked forms, in related ligands, such as O-acyl-ADP-ribose (OAADPr), and even in ligands unrelated to ADPr. Members of this family show similarity to BAL (B-aggressive lymphoma) proteins, which contain one to three macrodomains. Most BAL family macrodomains belong to this family except for the most N-terminal domain in multiple-domain containing proteins. This family includes the second and third macrodomains of mono-ADP-ribosyltransferase PARP14 (PARP-14, also known as ADP-ribosyltransferase diphtheria toxin-like 8, ATRD8, B aggressive lymphoma protein 2, or BAL2). Most BAL proteins also contain a C-terminal PARP active site and are also named as PARPs. Human BAL1 (or PARP-9) was originally identified as a risk-related gene in diffuse large B-cell lymphoma that promotes malignant B-cell migration. Some BAL family proteins exhibit PARP activity. Poly (ADP-ribosyl)ation is an immediate DNA-damage-dependent post-translational modification of histones and other nuclear proteins. BAL proteins may also function as transcriptional repressors.


Pssm-ID: 394874  Cd Length: 175  Bit Score: 48.79  E-value: 1.33e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1280 VKGDIIKVskrvKAEVVVNPANGHMAHGGGVAKAIAVAAGQQFVKETTDMVKSKgvcATGDCYVSTGGKL-CKTVLNVVG 1358
Cdd:cd02903    13 VKGDITKE----KTDVIVNSVSSDLLLKGGVSKAILKAAGPELQDECANQGKQP---ASGDVIVTSGGNLpCKYVYHVVL 85

                  .
gi 190360135 1359 P 1359
Cdd:cd02903    86 P 86
MERS-CoV-like_Nsp3_NAB cd21823
nucleic acid binding domain of non-structural protein 3 from Middle East respiratory ...
1919-2005 3.92e-05

nucleic acid binding domain of non-structural protein 3 from Middle East respiratory syndrome-related coronavirus and betacoronavirus in the C lineage; This model represents the nucleic acid binding (NAB) domain of non-structural protein 3 (Nsp3) from betacoronavirus in the merbecovirus subgenus (C lineage), including Middle East respiratory syndrome-related coronavirus (MERS-CoV) and Tylonycteris bat coronavirus HKU4. The NAB domain represents a new fold, with a parallel four-strand beta-sheet holding two alpha-helices of three and four turns that are oriented antiparallel to the beta-strands. NAB is a cytoplasmic domain located between the papain-like protease (PLPro) and betacoronavirus-specific marker (betaSM) domains of CoV Nsp3. Nsp3 is a large multi-functional multi-domain protein that is an essential component of the replication/transcription complex (RTC), which carries out RNA synthesis, RNA processing, and interference with the host cell innate immune system. The NAB domain both binds ssRNA and unwinds dsDNA. It prefers to bind ssRNA containing repeats of three consecutive guanines. A group of residues that form a positively charged patch on the protein surface of SARS-CoV Nsp3 NAB serves as the binding site of nucleic acids. This site is conserved in the NAB of Nsp3 from betacoronavirus in the sarbecovirus subgenus (B lineage), and appears to be partially conserved in the Nsp3 NAB from betacoronaviruses in the C lineage.


Pssm-ID: 409349  Cd Length: 123  Bit Score: 45.89  E-value: 3.92e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1919 VYTNFKLIGH-------TVCDILNAKLGFDSSKEFVE-YKVTEWPTATGDVVLATDDLYVKRYERGCITFGKPVIWLSHe 1990
Cdd:cd21823    21 VYTNSCLVASdgtpggdAISLAFNNLLGFDESKPVSKkLTYSLLPNEDGDVLLAEFSTYDPIYKNGAMLKGKPILWVNN- 99
                          90
                  ....*....|....*.
gi 190360135 1991 qASLNS-LTYFNRPLL 2005
Cdd:cd21823   100 -GLFDSaLNKFNRASL 114
deltaCoV_PLPro cd21734
deltacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) ...
1643-1739 4.51e-05

deltacoronavirus papain-like protease; This model represents the papain-like protease (PLPro) found in the non-structural protein 3 (Nsp3) region of deltacoronavirus, including Porcine deltacoronavirus, Bulbul coronavirus HKU11, and Common moorhen coronavirus HKU21. CoVs utilize a multi-subunit replication/transcription machinery. A set of non-structural proteins (Nsps) generated as cleavage products of the ORF1a and ORF1ab viral polyproteins assemble to facilitate viral replication and transcription. PLPro is a key enzyme in this process, making it a high value target for the development of anti-coronavirus therapeutics. PLPro, which belongs to the MEROPS peptidase C16 family, participates in the proteolytic processing of the N-terminal region of the replicase polyprotein; it can cleave Nsp1|Nsp2, Nsp2|Nsp3, and Nsp3|Nsp4 sites and its activity is dependent on zinc. Besides cleaving the polyproteins, PLPro also possesses a related enzymatic activity to promote virus replication: deubiquitinating (DUB) and de-ISGylating activities. Both, ubiquitin (Ub) and Ub-like interferon-stimulated gene product 15 (ISG15), are involved in preventing viral infection; coronaviruses utilize Ubl-conjugating pathways to counter the pro-inflammatory properties of Ubl-conjugated host proteins via the action of PLPro, which processes both 'Lys-48'- and 'Lys-63'-linked polyubiquitin chains from cellular substrates. The Nsp3 PLPro domain in many of these CoVs has also been shown to antagonize host innate immune induction of type I interferon by interacting with IRF3 and blocking its activation.


Pssm-ID: 409651  Cd Length: 313  Bit Score: 48.96  E-value: 4.51e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1643 YYNMLVNCS-KWQVVFNGKYFTFKQANNNCFVNVSCLMLQSLNLKFKIVqWQEAWLEFRSGRPARFVSLVLAKGGFKFGD 1721
Cdd:cd21734    78 YCVYLKYCHhKWSVSRTNGLMHLKQKDNNCFVSAAINLFQNTHYQLRPA-IDALYQEYLNGNPSRFVAWIYASTNQEIGE 156
                          90
                  ....*....|....*...
gi 190360135 1722 PADSRDFLRVVFSQVDLT 1739
Cdd:cd21734   157 MGCPQQVLSLLVNNSNAK 174
Macro_Ttha0132-like cd03330
Macrodomain, uncharacterized family similar to Thermus thermophilus hypothetical protein ...
1277-1363 2.76e-04

Macrodomain, uncharacterized family similar to Thermus thermophilus hypothetical protein Ttha0132; Macrodomains are found in a variety of proteins with diverse cellular functions, as a stand-alone domain or in combination with other domains like in histone macroH2A and some PARPs (poly ADP-ribose polymerases). Macrodomains can recognize ADP-ribose (ADPr) in both its free and protein-linked forms, in related ligands, such as O-acyl-ADP-ribose (OAADPr), and even in ligands unrelated to ADPr. Macrodomains include the yeast macrodomain Poa1 which is a phosphatase of ADP-ribose-1"-phosphate, a by-product of tRNA splicing. Some macrodomains have ADPr-unrelated binding partners such as the coronavirus SUD-N (N-terminal subdomain) and SUD-M (middle subdomain) of the SARS-unique domain (SUD) which bind G-quadruplexes (unusual nucleic-acid structures formed by consecutive guanosine nucleotides). Macrodomains regulate a wide variety of cellular and organismal processes, including DNA damage repair, signal transduction, and immune response. This family is composed of uncharacterized proteins containing a stand-alone macrodomain, similar to Thermus thermophilus hypothetical protein Ttha0132.


Pssm-ID: 394879  Cd Length: 147  Bit Score: 44.35  E-value: 2.76e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 190360135 1277 VCFVKGDIIKVSkrvkAEVVVNPANGHMAHGGGVAKAIAVAAGQQFVKETTdmvkSKGVCATGDCYVSTGGKL-CKTVLN 1355
Cdd:cd03330     2 LIVVQGDITEQD----ADAIVNAANRRLLMGSGVAGAIKRKGGEEIEREAM----RKGPIRVGEAVETGAGKLpAKYVIH 73
                          90
                  ....*....|
gi 190360135 1356 --VVGPDART 1363
Cdd:cd03330    74 aaVMGMPGRS 83
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH