Conserved domains on  [gi|971745448|ref|YP_009199609|]

spike glycoprotein [BtMr-AlphaCoV/SAX2011]

Protein Classification

List of domain hits

Name Accession Description Interval E-value
alphaCoV_Spike_SD1-2_S1-S2_S2 cd22369
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
652-1316 0e+00

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) protein from alphacoronaviruses; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from alphacoronaviruses, including the human coronaviruses (HCoVs) HCoV-NL63 and HCoV-229E and the porcine coronaviruses transmissible gastroenteritis virus (TGEV) and porcine epidemic diarrhea virus (PEDV), among others. The CoV S protein is an envelope glycoprotein that plays a key role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids), which mediate attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimeric spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the RBD. While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV, and MERS-CoV, use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP), and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM).
After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. To catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HKU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Rousettus bat coronavirus HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.
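
The (R/K)-(2X)n-(R/K) pattern mentioned in the description can be scanned for with a regular expression. The following is a minimal sketch only: the description does not state the range of n, so n = 0, 1 or 2 is an assumption here, and the helper name `find_furin_like_motifs` is hypothetical. A pattern this permissive flags many candidates, so a match is a starting point for inspection, not evidence of a functional cleavage site.

```python
import re

# Assumption: n in (R/K)-(2X)n-(R/K) is taken as 0, 1 or 2;
# the CDD description does not give the range.
# A lookahead is used so that overlapping candidates are all reported.
FURIN_LIKE = re.compile(r"(?=([RK](?:[A-Z]{2}){0,2}[RK]))")


def find_furin_like_motifs(seq):
    """Return (1-based start, matched text) for each candidate motif.

    Alignment gaps ('-') are removed and lowercase residues are
    upper-cased before scanning. At each start position the longest
    allowed match is reported.
    """
    cleaned = seq.upper().replace("-", "")
    return [(m.start() + 1, m.group(1)) for m in FURIN_LIKE.finditer(cleaned)]


if __name__ == "__main__":
    # Illustrative peptide, not taken from this record.
    for start, motif in find_furin_like_motifs("GGGRSVRGGG"):
        print(start, motif)
```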



Pssm-ID: 411956 [Multi-domain]  Cd Length: 666  Bit Score: 1318.44  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  652 DMSQVYLDTCTTYTIYGMTGRGVITRSNNTFITGLYYTSNAGNLLAYKNSTTGVVYNVYPCQLSSQVAVISDAIVGMASS 731
Cdd:cd22369     1 DPSVVHLNVCTDYTIYGITGRGIIRKSNSTYIAGLYYTSNSGQLLGFKNSTTGEVFSVTPCQLSSQVAVVSDNIVGVMSA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  732 TPNVSIDFNVTVVADNFYYLSNSAQPCDQPVLTYAGIGICSDGSITNSTARRAAADPVSPVISGNISVPTNFTFSVQVEY 811
Cdd:cd22369    81 TNNVSLGFNNTIETPSFYYHSNGAENCTEPVLTYGSIGVCADGSITEVTPRSVSPEPVSPIITGNISIPSNFTVSVQVEY 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  812 IQLMLKPVTVDCSVYVCNGNPRCLQLLAQYASACRTIEQALQLSARLESVEVNSMISISQEALALGVIDNFKHDFNLTNV 891
Cdd:cd22369   161 LQMYLKPVSVDCSTYVCNGNPRCLQLLTQYASACRTIEEALQLSARLESVEVNSMITVSEEALRLANISTFFDDYNLSAV 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  892 LPASVGAKSAVEDLLFDKVVTSGLGTVDADYKECASRTANTVAEVGCVQYYNGIMVLPGVVDQSLLAQYSAALTGAMVFG 971
Cdd:cd22369   241 LPAGVGGRSAIEDLLFDKVVTSGLGTVDEDYKACTKGLGIAAADVACAQYYNGIMVLPGVVDAEKMALYTASLTGGMVLG 320
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  972 GVTAGAAVPFSIAVQSRLNYLALQTDVLQRNQQQLANSFNAAMGNITEAFGRVNDAIEQTSHAISTVAQALDKVQTVVND 1051
Cdd:cd22369   321 GFTAAAAIPFSLAVQSRLNYVALQTDVLQRNQQILANSFNSAMGNITVAFSEVNDAIQQTSDAINTVAQALNKVQNVVNE 400
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1052 QGLALSQLTKQLASNFQAISSSIEDLYNRLDRVEADLQVDRLITGRLAALNAFVAQQLTKYTDVRASRQLAQDKINECVK 1131
Cdd:cd22369   401 QGQALSQLTKQLASNFQAISSSIEDIYNRLDGLAADAQVDRLITGRLAALNAFVTQTLTKYTEVRASRQLAQQKINECVK 480
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1132 SQSFRYGFCGNGTHVFSVVNAAPDGMMFFHSVLLPTAYMEVAAFSGLCVEGNGYVLRD-TGNVLFEKNGQYLITARKMFE 1210
Cdd:cd22369   481 SQSSRYGFCGNGTHLFSIVNAAPDGIMFLHTVLLPTEYVTVAAWAGLCVDGKAYVLRDdVVLTLFKLNDKYYVTPRDMFE 560
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1211 PRVPQTSDFVQITGCDVVYLNVTRDELPTVIPDYIDVNSTVEDILSKLPNRTTPEFDLDIFNATYLNLTGEIADLTARSE 1290
Cdd:cd22369   561 PRVPVSSDFVQISNCNVTYVNITSDELPEVIPDYIDVNKTLEEFLANLPNYTLPDLPLDIFNATYLNLTGEIADLENKSE 640
                         650       660
                  ....*....|....*....|....*.
gi 971745448 1291 SLKNTTLELKELIANINATLVDLEWL 1316
Cdd:cd22369   641 SLLNTTVELQELIDNINNTLVDLEWL 666
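
Each alignment block above pairs a row of query residues with a row from the domain model. As a rough illustration of how the two rows can be compared, the sketch below computes percent identity over columns where neither row has a gap. The function name `percent_identity` is hypothetical, and treating lowercase letters the same as uppercase is an assumption made for simplicity.

```python
def percent_identity(query, subject):
    """Percent of gap-free aligned columns where both rows carry the same residue."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    # Keep only columns where both rows have a residue (no '-' gap).
    pairs = [
        (q.upper(), s.upper())
        for q, s in zip(query, subject)
        if q != "-" and s != "-"
    ]
    if not pairs:
        return 0.0
    matches = sum(1 for q, s in pairs if q == s)
    return 100.0 * matches / len(pairs)


if __name__ == "__main__":
    # Final block of the cd22369 alignment (query 1291-1316, model 641-666).
    query_row = "SLKNTTLELKELIANINATLVDLEWL"
    model_row = "SLLNTTVELQELIDNINNTLVDLEWL"
    print(f"{percent_identity(query_row, model_row):.1f}% identity")
```

Applying the same function to the concatenated rows of all blocks gives a figure for the whole hit, as long as the rows are joined in order.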
CoV_S1 pfam01600
Coronavirus spike glycoprotein S1; This family represents the spike glycoprotein (S) of ...
215-641 1.52e-174

Coronavirus spike glycoprotein S1; This family represents the spike glycoprotein (S) of coronaviruses. The spike protein is arranged in trimers on the surface of the viral membrane and is essential for viral entry. The spike protein is translated as a large polypeptide that is subsequently cleaved into the distal S1, responsible for receptor binding, and the membrane-anchored S2, responsible for membrane fusion. The coronavirus (SARS-CoV) S1 subunit is composed of two distinct domains: an N-terminal domain (S1 NTD) and a receptor-binding domain (S1 RBD), also referred to as the S1 CTD or domain B. Each of these domains has been implicated in binding to host receptors. However, most coronaviruses are not known to utilize both the S1 NTD and S1 RBD for viral entry. This entry contains spike proteins from both alpha and gamma coronaviruses but excludes the spike proteins from beta-coronaviruses such as SARS-CoV.



Pssm-ID: 460262  Cd Length: 412  Bit Score: 525.37  E-value: 1.52e-174
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448   215 YTLCDNCTGFPQHVFATMENGEIPPSFNFANWFYLTNSSSPVSSRVVGLQPLLLTCLWPIPALLGTATDITFDRNGtSDV 294
Cdd:pfam01600    1 YSVCTNCDGFPDNVFAVEEGGYIPPSFSFNNWFYLTNSSTPVDGRVVSNQPLLLNCLWPIPSLNGTTLKVYFNGSI-PNG 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448   295 RCNGFAS-NETADAMRFSLNFTDS-AVFAKEGVITLKTLSN-TFKFSCSNSSTYQAP-YVIPFGHIDQPYYCFTTFYINe 370
Cdd:pfam01600   80 RCNGYSNkNGTVDAIRFNLNFTASdSVFAGAGSISLNTVGGvTYSFSCSNSSTPVGAsHQIPFGATDQPYYCFVNYNGN- 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448   371 taGTTTTSFVGMLPPVVREFVITKTGNVYLNGYRIFTVDDVVSVNFNISSTDHRDFWTVAFVKNTEVMLDIEDTYIKQLL 450
Cdd:pfam01600  159 --ISTTSQFVGILPPVVREIVISRYGDFYVNGYRYFSTPPLESVNFNLTSGDTSDFWTVAFANYTDVLVNINNTSIQRIL 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448   451 YCNTPLNVVKCQQLKFVLDDGFYSYSSPVDEVLPRTIVRLPRLMTHNFLNFTIFVSFyfdddkqarpDGGFYECATCAPK 530
Cdd:pfam01600  237 YCDSPLNSIKCQQLSFSLPDGFYSTSSVEVVQLPRTFVTLPKFATHSFVNLTVSVSF----------DGGGGPPSLSALS 306
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448   531 YYKLAFVDDwtsavssNVTSICVNYASFTTRLFTFYAGTTAGVHLGLETGTCPFSFDTLNNYLTFGSLCFSLV-ANGGCT 609
Cdd:pfam01600  307 EVNLTINGT-------NNTSLCVNTSQFTVNLNFTCTSTAYGYTAEIRTGTCPFSFDKLNNYLSFGSICFSLVpSGGGCT 379
                          410       420       430
                   ....*....|....*....|....*....|..
gi 971745448   610 MNIVTQGPYGLPHTIAVLYVSYTEGDNIIGVP 641
Cdd:pfam01600  380 MDIVTKYWNGSFVKVGSLYVSYSEGDNITGVP 411
CoV_S2_C pfam19214
Coronavirus spike glycoprotein S2, intravirion; This entry represents the cysteine rich ...
1346-1387 1.21e-10

Coronavirus spike glycoprotein S2, intravirion; This entry represents the cysteine-rich intravirion region found at the C-terminus of coronavirus spike proteins (S). These cysteine residues are targets for palmitoylation, which is necessary for efficient S incorporation into virions and for S-mediated membrane fusion.



Pssm-ID: 465998  Cd Length: 42  Bit Score: 57.81  E-value: 1.21e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 971745448  1346 FCCCSTGCCGifSCMASSCGACCDIRG--TKLQRYEAIEKVHVQ 1387
Cdd:pfam19214    1 FCCCCTGCCG--CCFGCSCGGCCDSYDkrDDVYPAEVVEKVHVQ 42
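
Every hit in this listing is preceded by a summary line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. A small sketch of turning such a line into typed values follows. The regular expression is written against the lines as they appear here and is not an official format definition. The name `parse_hit_summary` and the dictionary keys are hypothetical.

```python
import re

# Pattern fitted to the summary lines as they appear in this listing;
# the optional bracketed flag covers "[Multi-domain]".
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


def parse_hit_summary(line):
    """Parse one hit summary line into a dict of typed fields."""
    m = SUMMARY.search(line)
    if m is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        # "0e+00" parses to 0.0, i.e. below the reported precision.
        "e_value": float(m["e_value"]),
    }


if __name__ == "__main__":
    line = "Pssm-ID: 465998  Cd Length: 42  Bit Score: 57.81  E-value: 1.21e-10"
    print(parse_hit_summary(line))
```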
 
CoV_S2 pfam01601
Coronavirus spike glycoprotein S2; The coronavirus spike glycoprotein forms the characteristic ...
795-1324 0e+00

Coronavirus spike glycoprotein S2; The coronavirus spike glycoprotein forms the characteristic 'corona' after which the group is named. The spike glycoprotein is translated as a large polypeptide that is subsequently cleaved into S1 (pfam01600) and S2. The S2 subunit normally contains multiple key components, including one or more fusion peptides (FP), a second proteolytic site (S2'), and two conserved heptad repeats (HRs), which drive membrane penetration and virus-cell fusion. The HRs can trimerize into a coiled-coil structure built of three HR1-HR2 helical hairpins, presenting as a canonical six-helix bundle, and drag the virus envelope and the host cell bilayer into close proximity, preparing for fusion to occur.


Pssm-ID: 460263  Cd Length: 502  Bit Score: 779.53  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448   795 GNISVPTNFTFSVQVEYIQLMLKPVTVDCSVYVCNGNPRCLQLLAQYASACRTIEQALQLSARLESVEVNSMISISQEAL 874
Cdd:pfam01601    1 GNISIPTNFTISVQTEYIQTTSPKVSVDCAQYVCNGNERCLQLLVQYGSFCSTIEQALQGSARLEDVEVLSMLSISNRAL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448   875 ALGVIDNFKHDFNLTNVLPASVGAKSAVEDLLFDKVVTSGLGTVDAdYKECAsrTANTVAEVGCVQYYNGIMVLPGVVDQ 954
Cdd:pfam01601   81 TLATISNFGSDFNFSSFLPCLNSGRSAIEDLLFDKVVTSGLGTVDA-YKKCT--KGTSIADLVCAQYYNGIMVLPGVVDA 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448   955 SLLAQYSAALTGAMVFGGVT-AGAAVPFSIAVQSRLNYLALQTDVLQRNQQQLANSFNAAMGNITEAFgrvndaieqtsh 1033
Cdd:pfam01601  158 EKMAMYTASLTGGMAFGGLTgAAAAIPFALAVQARLNYLGLQTDVLQENQKILANAFNNAVGNITDGF------------ 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  1034 aiSTVAQALDKVQTVVNDQGLALSQLTKQLASNFQAISSSIEDLYNRLDRVEADLQVDRLITGRLAALNAFVAQQLTKYT 1113
Cdd:pfam01601  226 --TTTASALSKIQDVVNANAQALNQLTQQLSNNFGAISSSIQDIYSRLDQLEADAQVDRLINGRLAALNAFVTQQLTKAS 303
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  1114 DVRASRQLAQDKINECVKSQSFRYGFCGNGTHVFSVVNAAPDGMMFFHSVLLPTAYMEVAAFSGLCVEGN-GYVLRDTGN 1192
Cdd:pfam01601  304 EVKASRQLAQQKVNECVKSQSSRYGFCGNGTHLFSLPQAAPNGIMFLHTVLVPTEYITVKATPGLCVNGTtGYAPRDGQF 383
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  1193 VLFEkNGQYLITARKMFEPRVPQTSDFVQITGCDVVYLNVTRDELPTVIPDYIDVNSTVEDILSKLpNRTTPEFDLDIFN 1272
Cdd:pfam01601  384 VLNN-TSNWYITPRNMYQPRPITGSDFVQISSCDVNFVNITNTKLPPLIPDYVDFNKELEDIYKNL-NSTLPDLDLDIFN 461
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|..
gi 971745448  1273 ATYLNLTGEIADLTarseslknttlELKELIANINATLVDLEWLNRVETYIK 1324
Cdd:pfam01601  462 ATILNLTDEIKDLE-----------RLQELIDNLNQTLVDLEWLNRYETYIK 502
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
965-1133 1.29e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 45.98  E-value: 1.29e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  965 TGAMVFGGVTAGAAVPFSIAVQSRLNYLALQTDVLQRNQQQLANSFNAAMGNITEAFGRVNDA---IEQTSHAISTVAQA 1041
Cdd:COG3883     1 ALALALAAPTPAFADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALqaeIDKLQAEIAEAEAE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1042 LDKVQTVVNDQgLALSQLTKQLASNFQAI--SSSIEDLynrLDRVEAdlqVDRLITGRLAALNAFVAQQltkyTDVRASR 1119
Cdd:COG3883    81 IEERREELGER-ARALYRSGGSVSYLDVLlgSESFSDF---LDRLSA---LSKIADADADLLEELKADK----AELEAKK 149
                         170
                  ....*....|....
gi 971745448 1120 QLAQDKINECVKSQ 1133
Cdd:COG3883   150 AELEAKLAELEALK 163
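
Several of the hits reported above cover overlapping stretches of the query. The sketch below, using the intervals exactly as listed for five of the hits, reports how many query residues each pair of hits shares. The helper names `overlap` and `overlapping_pairs` are hypothetical, and intervals are treated as inclusive on both ends, which is an assumption consistent with the residue numbering in the alignments.

```python
from itertools import combinations

# Query intervals copied from the hit list (accession -> (start, end), inclusive).
HITS = {
    "cd22369": (652, 1316),
    "pfam01600": (215, 641),
    "pfam19214": (1346, 1387),
    "pfam01601": (795, 1324),
    "COG3883": (965, 1133),
}


def overlap(a, b):
    """Number of query residues shared by two inclusive intervals."""
    return max(0, min(a[1], b[1]) - max(a[0], b[0]) + 1)


def overlapping_pairs(hits):
    """Map each pair of accessions with a non-empty overlap to its size."""
    result = {}
    for x, y in combinations(hits, 2):
        shared = overlap(hits[x], hits[y])
        if shared > 0:
            result[(x, y)] = shared
    return result


if __name__ == "__main__":
    for (x, y), shared in overlapping_pairs(HITS).items():
        print(f"{x} and {y} share {shared} query residues")
```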
 
PDEV-like_Spike_SD1-2_S1-2_S2 cd22376
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
652-1321 0e+00

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from Porcine epidemic diarrhea virus and related alphacoronaviruses; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from alphacoronaviruses, including porcine epidemic diarrhea virus (PEDV), Scotophilus bat coronavirus, and swine enteric coronavirus, among others. The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediate attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV, and MERS-CoV, use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). 
After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HKU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Rousettus bat coronavirus HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.
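The (R/K)-(2X)n-(R/K) pattern named in the description can be scanned for with a few lines of Python. This is a minimal sketch and not part of the CDD record: the function name `find_basic_motifs`, the `max_n` bound of 3, and the `start` offset are assumptions introduced for illustration, because the description does not state the range of n. A hit only flags the sequence pattern; it does not show that cleavage occurs at that position.

```python
def find_basic_motifs(seq, max_n=3, start=1):
    """Report every (R/K)-(2X)n-(R/K) pattern in a protein sequence.

    seq    -- one-letter amino acid string (uppercase)
    max_n  -- largest n to try; the bound of 3 is an assumption
    start  -- residue number of seq[0], so hits use protein numbering

    Returns a list of (position_of_first_basic_residue, n, matched_text).
    """
    hits = []
    for i, residue in enumerate(seq):
        if residue not in "RK":
            continue
        for n in range(max_n + 1):
            j = i + 2 * n + 1  # index of the closing basic residue
            if j < len(seq) and seq[j] in "RK":
                hits.append((start + i, n, seq[i:j + 1]))
    return hits


if __name__ == "__main__":
    # Query residues 777-792, copied from the alignment rows on this page.
    for position, n, text in find_basic_motifs("TNSTARRAAADPVSPV", start=777):
        print(position, n, text)
```

Passing `start` keeps the reported positions in the same numbering as the `gi 971745448` rows of the alignments.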


Pssm-ID: 411963 [Multi-domain]  Cd Length: 673  Bit Score: 1028.16  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  652 DMSQVYLDTCTTYTIYGMTGRGVITRSNNTFITGLYYTSNAGNLLAYKNSTTGVVYNVYPCQLSSQVAVISDAIVGMASS 731
Cdd:cd22376     1 DVSFMTLDVCTKYTIYGFKGEGIITLTNSSLLGGVYYTSDSGQLLAFKNVTSGAIYSVTPCSFSQQAAYVDDDIVGVISS 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  732 TPNVSidFNVTVVADNFYYLSNSAQPCDQPVLTYAGIGICSDGSITnSTARRAAADPVSPVISGNISVPTNFTFSVQVEY 811
Cdd:cd22376    81 LSNST--FNSTRELPGFFYHSNDGSNCTEPVLVYSNIGVCKSGSIG-YVPSQSGQPKIAPMVTGNISIPTNFTMSIRTEY 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  812 IQLMLKPVTVDCSVYVCNGNPRCLQLLAQYASACRTIEQALQLSARLESVEVNSMISISQEALALGVIDNFKHD-FNLTN 890
Cdd:cd22376   158 LQLYNTPVSVDCAMYVCNGNSRCKQLLTQYTSACKTIESALQLSARLESVEVNSMLTISEEALQLATISSFNGGgYNFTN 237
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  891 VLPASVGAKSAVEDLLFDKVVTSGLGTVDADYKECASrtANTVAEVGCVQYYNGIMVLPGVVDQSLLAQYSAALTGAMVF 970
Cdd:cd22376   238 VLGASVQKRSFIEDLLFNKVVTNGLGTVDEDYKRCSN--GLSVADLVCAQYYSGVMVLPGVVDAEKLHMYSASLIGGMVL 315
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  971 GGVTAGAAVPFSIAVQSRLNYLALQTDVLQRNQQQLANSFNAAMGNITEAFGRVNDAIEQTSHAISTVAQALDKVQTVVN 1050
Cdd:cd22376   316 GGITAAAALPFSYAVQARLNYVALQTDVLQRNQQLLAESFNSAIGNITSAFESVKEAISQTSQGLNTVAHALTKVQDVVN 395
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1051 DQGLALSQLTKQLASNFQAISSSIEDLYNRLDRVEADLQVDRLITGRLAALNAFVAQQLTKYTDVRASRQLAQDKINECV 1130
Cdd:cd22376   396 SQGAALNQLTVQLQHNFQAISSSIDDIYSRLDQLSADAQVDRLITGRLSALNAFVAQTLTKYTEVQASRKLAQQKVNECV 475
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1131 KSQSFRYGFCGN-GTHVFSVVNAAPDGMMFFHSVLLPTAYMEVAAFSGLCVEGNGY-VLRDTGN-----VLFEKNGQYLI 1203
Cdd:cd22376   476 KSQSQRYGFCGGdGEHIFSLVQAAPQGLLFLHTVLVPGDFVNVTAIAGLCVDDEIAlTLREPGVlftheVLTYTATEYFV 555
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1204 TARKMFEPRVPQTSDFVQITGCDVVYLNVTRDELPTVIPDYIDVNSTVEDILSKLPNRTTPEFDLDIFNATYLNLTGEIA 1283
Cdd:cd22376   556 SPRKMFEPRKPTVSDFVQIESCVVTYVNLTSDQLPDVIPDYIDVNKTLDEILASLPNRTGPSLPLDVFNATYLNLTGEIA 635
                         650       660       670
                  ....*....|....*....|....*....|....*...
gi 971745448 1284 DLTARSESLKNTTLELKELIANINATLVDLEWLNRVET 1321
Cdd:cd22376   636 DLEQRSESLRNTTEELRSLIYNINNTLVDLEWLNRVET 673
delta-PiCoV-like_Spike_SD1-2_S1-S2_S2 cd22374
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
631-1367 0e+00

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from Pigeon coronavirus UAE-HKU29, and related avian deltacoronaviruses; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from Pigeon coronavirus UAE-HKU29, and related avian deltacoronaviruses including Falcon coronavirus UAE-HKU27, Magpie-robin coronavirus HKU18, Sparrow coronavirus HKU17, and Night heron coronavirus HKU19. The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediate attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV, and MERS-CoV, use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). 
After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HCoV-HKU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Ro-BatCoV HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411961 [Multi-domain]  Cd Length: 739  Bit Score: 951.25  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  631 YTEGDNIIGVPLsniPPLGVSDMSQVYLDTCTTYTIYGMTGRGVITRSNNTFITGLYYTSNAGNLLAYKNSTTGVVYNVY 710
Cdd:cd22374     1 YQPGNSITAMPQ---PSTGTTDISTVYLDVCTKYNIYGKTGTGIIRLTNQSYIAGLYYTSPSGDLLAFKNVTTQTVYSVT 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  711 PCQLSSQVAVISDAIVGMASSTPNVSI-DFNVTVVADNFYYLSNSAQPCDQPVLTYAGIGICSDGSITNSTARRAAADPV 789
Cdd:cd22374    78 PCRLSSQVAVYNGSIIAAFTSTENFTIaDFTYSRATPMFYYHSIGNDTCETPVITFGSIGVCPGGGLHFVDPTSNEFTNV 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  790 SPVISGNISVPTNFTFSVQVEYIQLMLKPVTVDCSVYVCNGNPRCLQLLAQYASACRTIEQALQLSARLESVEVNSMISI 869
Cdd:cd22374   158 VPISTQNISIPKNFTVSIQTEYIQIEQQPVTVDCRQYVCNGNPRCLQLLMQYTSACSTIEQALSLNARLEAASIQTMLTY 237
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  870 SQEALALGVIDNFKHD---FNLTNVLPASVGAKSAVEDLLFDKVVTSGLGTVDADYKECASrtANTVAEVGCVQYYNGIM 946
Cdd:cd22374   238 SPETLKLANITNFQSDdvnYNLTNILPKKYQGRSAIEDLLFDKVVTNGLGTVDQDYKACTN--GVSIADLVCAQYYNGIM 315
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  947 VLPGVVDQSLLAQYSAALTGAMVFGGVTAGAAVPFSIAVQSRLNYLALQTDVLQRNQQQLANSFNAAMGNITEAFGRVND 1026
Cdd:cd22374   316 VLPGVADPEKMAQYTASLTGGMVFGGLTSAAAIPFSLAVQSRLNYVALQTDVLQQNQQILADSFNNAMGNITLAFKEVSE 395
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1027 AIEQTSHAISTVAQALDKVQTVVNDQGLALSQLTKQLASNFQAISSSIEDLYNRLDRVEADLQVDRLITGRLAALNAFVA 1106
Cdd:cd22374   396 GLSQVSGAITTVANALTKIQTVVNSQGQALATLTEQLANNFQAISASIADIYNRLNQLEADAQVDRLITGRLAALNAFVT 475
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1107 QQLTKYTDVRASRQLAQDKINECVKSQSFRYGFCGNGTHVFSVVNAAPDGMMFFHSVLLPTAYMEVAAFSGLCVEGNGYV 1186
Cdd:cd22374   476 QTLSKLAEVRQARQLALDKINECVKSQSSRYGFCGNGTHLFSIVNAAPYGFVFFHTVLLPTQYATVQAYSGICQNGRALA 555
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1187 LRDTGNVLFEKNGQYLITARKMFEPRVPQTSDFVQITGCDVVYLNVTRDELPTVIPDYIDVNSTVEDILSKLPNRTTPEF 1266
Cdd:cd22374   556 LKDPSLALFRGTDKYLVTPRNMYQPRTAAQADFVYIESCTVTYLNLTDTTIDAVIPDYVDVNKTVEDILNNLPNYTKPDL 635
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1267 DLDIFNATYLNLTGEIADLTARSESLKNTTLELKELIANINATLVDLEWLNRVETYIKWPWWVWLIIVLVLILFTCLMLF 1346
Cdd:cd22374   636 DIGRYNNTILNLTTEINDLNGRAENLSQIVENLEEYIKKINATLVDLEWLNRVETYIKWPWWVWLLIALAITAFVCILVT 715
                         730       740
                  ....*....|....*....|..
gi 971745448 1347 CCCSTGCC-GIFSCmassCGAC 1367
Cdd:cd22374   716 IFLCTGCCgGCFGC----CGGC 733
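The paired rows in these alignment blocks can be read programmatically. The sketch below parses one query row and one model row and counts identical aligned columns. It rests on two assumptions that this page does not state: that `-` marks a gap and that lowercase letters mark residues not aligned to the model. The helper names `parse_row`, `residue_count`, and `identity` are hypothetical.

```python
def parse_row(line):
    """Split one alignment row into (label, start, sequence, end)."""
    tokens = line.split()
    start, seq, end = int(tokens[-3]), tokens[-2], int(tokens[-1])
    return " ".join(tokens[:-3]), start, seq, end


def residue_count(seq):
    """Count residues (letters of either case), ignoring gap characters."""
    return sum(ch.isalpha() for ch in seq)


def identity(query, subject):
    """Return (identical_columns, aligned_columns) for two equal-length rows.

    Only columns where both rows hold an uppercase letter are treated as
    aligned (assumption: lowercase = unaligned, '-' = gap).
    """
    aligned = matches = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            matches += q == s
    return matches, aligned


if __name__ == "__main__":
    # Final row pair of the cd22374 alignment, copied from this page.
    _, q_start, q_seq, q_end = parse_row("gi 971745448 1347 CCCSTGCC-GIFSCmassCGAC 1367")
    _, s_start, s_seq, s_end = parse_row("Cdd:cd22374   716 IFLCTGCCgGCFGC----CGGC 733")
    # Sanity check: residue count should match the printed coordinates.
    assert residue_count(q_seq) == q_end - q_start + 1
    assert residue_count(s_seq) == s_end - s_start + 1
    print(identity(q_seq, s_seq))
```

Checking the residue count against the printed start and end coordinates is a cheap way to catch rows that were damaged during copy and paste.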
TGEV-like_Spike_SD1-2_S1-S2_S2 cd22377
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
652-1362 0e+00

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from transmissible gastroenteritis virus and related alphacoronaviruses; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from porcine transmissible gastroenteritis virus (TGEV), canine coronavirus (CCoV), and feline coronavirus (FCoV). They display greater than 96% sequence identity and have been grouped in the same species, alphacoronavirus 1, within the Alphacoronavirus genus. The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediate attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV, and MERS-CoV, use the C-domain to bind their receptors. 
The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HKU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Rousettus bat coronavirus HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411964 [Multi-domain]  Cd Length: 751  Bit Score: 948.81  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  652 DMSQVYLDTCTTYTIYGMTGRGVITRSNNTFITGLYYTSNAGNLLAYKNSTTGVVYNVYPCQLSSQVAVISDAIVGMASS 731
Cdd:cd22377     1 DISVLVKDECTDYNIYGFQGTGIIRNTTSRLVAGLYYTSISGDLLAFKNSTTGEIFTVVPCDLTAQAAVINDEIVGAITS 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  732 -----------------TPNVSIDFNVTVVADNFYYLS---NSAQPCDQPVLTYAGIGICSDGSI-----TNSTARRAAA 786
Cdd:cd22377    81 vnqtdlfefvnhtqsrrSRRSTLGLVHTYTMPQFYYITkwnNDTSTNCTSVITYSSFAICNTGEIkyvnvTHVEIVDDSI 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  787 DPVSPVISGNISVPTNFTFSVQVEYIQLMLKPVTVDCSVYVCNGNPRCLQLLAQYASACRTIEQALQLSARLESVEVNSM 866
Cdd:cd22377   161 GVIKPISTGNITIPKNFTVAVQAEYIQIQVKPVVVDCAKYVCNGNRHCLKLLTQYTSACQTIENALNLGARLESLMLNDM 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  867 ISISQEALALGVIDNFKHDF------------NLTNVLPASVGAKSAVEDLLFDKVVTSGLGTVDADYKECASRTanTVA 934
Cdd:cd22377   241 ITVSDRSLELATVEKFNSTVlggeklggfyfdGLKDLLPPRIGKRSAIEDLLFNKVVTSGLGTVDDDYKKCSAGT--DVA 318
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  935 EVGCVQYYNGIMVLPGVVDQSLLAQYSAALTGAMVFGGVTAGAAVPFSIAVQSRLNYLALQTDVLQRNQQQLANSFNAAM 1014
Cdd:cd22377   319 DLVCAQYYNGIMVLPGVVDDNKMAMYTASLIGGMALGSITSAVAVPFAMQVQARLNYVALQTDVLQENQKILANAFNNAI 398
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1015 GNITEAFGRVNDAIEQTSHAISTVAQALDKVQTVVNDQGLALSQLTKQLASNFQAISSSIEDLYNRLDRVEADLQVDRLI 1094
Cdd:cd22377   399 GNITLALGKVSNAITTTSDGFNTMASALTKIQSVVNQQGEALSQLTSQLQKNFQAISSSIAEIYNRLEKVEADAQVDRLI 478
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1095 TGRLAALNAFVAQQLTKYTDVRASRQLAQDKINECVKSQSFRYGFCGNGTHVFSVVNAAPDGMMFFHSVLLPTAYMEVAA 1174
Cdd:cd22377   479 TGRLAALNAYVSQTLTQYAEVKASRQLAMEKVNECVKSQSDRYGFCGNGTHLFSLVNSAPDGLLFFHTVLLPTEWEEVTA 558
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1175 FSGLCV-EGNGYVLRDTGNVLFEKNGQYLITARKMFEPRVPQTSDFVQITGCDVVYLNVTRDELPTVIPDYIDVNSTVED 1253
Cdd:cd22377   559 WSGICVnDTYAYVLKDFLTSIFSYNGTYMVTPRNMFQPRKPQMSDFVQITSCEVTFLNTTYTTFQEIVIDYIDINKTIAD 638
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1254 ILSKL-PNRTTPEFDL--DIFNATYLNLTGEIADLTARSESLKNTTLELKELIANINATLVDLEWLNRVETYIKWPWWVW 1330
Cdd:cd22377   639 MLEQYnPNYTVPELDLqlEIFNQTKLNLTAEIDQLEQRADNLTNIAHELQQYIDNLNKTLVDLEWLNRIETYVKWPWYVW 718
                         730       740       750
                  ....*....|....*....|....*....|..
gi 971745448 1331 LIIVLVLILFTCLMLFCCCSTGCCGIFSCMAS 1362
Cdd:cd22377   719 LLIGLVVVFCIPLLLFCCLSTGCCGCFGCLGS 750
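Each hit carries a one-line summary (Pssm-ID, Cd Length, Bit Score, E-value) that can be combined with the model coordinates in the alignment rows to estimate how much of the domain model the hit covers. The following is a rough sketch: the field layout is inferred only from the lines shown on this page, and `parse_summary` and `model_coverage` are hypothetical helper names.

```python
import re

# Pattern inferred from the summary lines on this page; other CD-Search
# output layouts are not handled.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?:\[(?P<hit_type>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line):
    """Turn one 'Pssm-ID: ...' line into a dict of typed fields."""
    m = SUMMARY.search(line)
    if m is None:
        raise ValueError("not a hit summary line: %r" % line)
    return {
        "pssm_id": int(m["pssm_id"]),
        "hit_type": m["hit_type"],
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def model_coverage(first, last, cd_length):
    """Fraction of the domain model spanned by model positions first..last."""
    return (last - first + 1) / cd_length


if __name__ == "__main__":
    hit = parse_summary(
        "Pssm-ID: 411964 [Multi-domain]  Cd Length: 751  Bit Score: 948.81  E-value: 0e+00"
    )
    # Model positions 1 and 750 are the first and last Cdd:cd22377 coordinates above.
    print(hit["pssm_id"], round(model_coverage(1, 750, hit["cd_length"]), 4))
```

The span from first to last model position includes any internal gaps, so this figure is an upper bound on the fraction of model columns that are actually aligned.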
HCoV-NL63-229E-like_Spike_SD1-2_S1-S2_S2 cd22375
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
652-1324 0e+00

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoproteins from HCoV-NL63, HCoV-229E, and related alphacoronaviruses; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from alphacoronaviruses, including human coronaviruses (HCoVs), HCoV-NL63 and HCoV-229E. The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediate attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV, and MERS-CoV, use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). 
After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HKU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Rousettus bat coronavirus HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411962 [Multi-domain]  Cd Length: 677  Bit Score: 880.78  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  652 DMSQVYLDTCTTYTIYGMTGRGVITRSNNTFITGLYYTSNAGNLLAYKNSTTGVVYNVYPCQLSSQVAVISDAIVGMASS 731
Cdd:cd22375     1 SFSNVVLNNCTKYNIYDYSGTGVIRSSNDSFIGGITYTSNSGNLLGFKDVSTGTIYSITPCNPPDQVVVYQQAIVGAMLS 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  732 TPNVSIDFNVTVVADNFYYLSNSAQPCDQPVLTYAGIGICSDGSITNSTARRAAADPVSPVISGNISVPTNFTFSVQVEY 811
Cdd:cd22375    81 ENETRYGLSNVVELPNFYYASNGTYNCTDAVLTYSNFGICADGSIIPVRPRNVSDNGVSAIVTANLSIPSNWTTSVQVEY 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  812 IQLMLKPVTVDCSVYVCNGNPRCLQLLAQYASACRTIEQALQLSARLESVEVNSMISISQEALALGVIDNFKhDFNLTNV 891
Cdd:cd22375   161 LQITSTPIVVDCSTYVCNGNPRCVELLKQYTSACKTIEDALRLSARLESADVSSMLTFDSNAFTLANVSSFG-DYNLSSV 239
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  892 LP------ASVGAKSAVEDLLFDKVVTSGLGTVDADYKECASrtANTVAEVGCVQYYNGIMVLPGVVDQSLLAQYSAALT 965
Cdd:cd22375   240 LPqlptsgSRIAGRSAIEDLLFSKVVTSGLGTVDADYKSCTK--GLSIADLACAQYYNGIMVLPGVADAERMAMYTGSLI 317
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  966 GAMVFGGVTAGAAVPFSIAVQSRLNYLALQTDVLQRNQQQLANSFNAAMGNITEAFGRVNDAIEQTSHAISTVAQALDKV 1045
Cdd:cd22375   318 GGMALGGLTSAAAIPFSLALQARLNYVALQTDVLQENQKILAASFNKAMTNIVDAFTGVNDAITQTSQAIQTVATALNKI 397
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1046 QTVVNDQGLALSQLTKQLASNFQAISSSIEDLYNRLDRVEADLQVDRLITGRLAALNAFVAQQLTKYTDVRASRQLAQDK 1125
Cdd:cd22375   398 QDVVNQQGNALNHLTSQLRQNFQAISSSIQAIYDRLDTIQADQQVDRLITGRLAALNAFVSQTLTKYTEVRASRQLAQQK 477
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1126 INECVKSQSFRYGFCGNGTHVFSVVNAAPDGMMFFHSVLLPTAYMEVAAFSGLCVEG-NGYVLRDTGNVLFEKNGQYLIT 1204
Cdd:cd22375   478 VNECVKSQSNRYGFCGNGTHIFSIVNAAPEGLVFLHTVLLPTQYKDVEAWSGLCVDGvNGYVLRQPNLALYKDGGVFRIT 557
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1205 ARKMFEPRVPQTSDFVQITGCDVVYLNVTRDELPTVIPDYIDVNSTVEDILSKLPNRTTPEFDLDIFNATYLNLTGEIAD 1284
Cdd:cd22375   558 SRVMFEPRIPTMADFVQIENCNVTFVNISRSELQTIVPEYVDVNKTLQELIEKLPNYTVPDLDLDQYNQTILNLTSEIST 637
                         650       660       670       680
                  ....*....|....*....|....*....|....*....|
gi 971745448 1285 LTARSESLKNTTLELKELIANINATLVDLEWLNRVETYIK 1324
Cdd:cd22375   638 LENKSAELNYTVQKLQTLIDNINSTLVDLKWLNRVETYIK 677
delta-PDCoV-like_Spike_SD1-2_S1-S2_S2 cd22373
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
659-1307 0e+00

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from porcine coronavirus HKU15, avian coronaviruses, and related deltacoronaviruses; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from porcine coronavirus PDCoV, and several avian coronaviruses such as quail deltacoronavirus (QdCoV) UAE-HKU30, white-eye coronavirus HKU16, common moorhen coronavirus HKU21, thrush CoV HKU12, and munia CoV HKU13, all from the Buldecovirus subgenus of deltacoronaviruses. The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediate attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV, and MERS-CoV, use the C-domain to bind their receptors. 
The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HCoV-HKU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Ro-BatCoV HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411960 [Multi-domain]  Cd Length: 648  Bit Score: 856.83  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  659 DTCTTYTIYGMTGRGVITRSNNTFITGLYYTSNAGNLLAYKNSTTGVVYNVYPCQLSSQVAVISDAIVGMASSTPNVSID 738
Cdd:cd22373     1 DVCTDYTIYGVSGTGIIKPSDLQLHNGIAFTSPTGELYAFKNITTGKTYQVLPCETPSQLIVINNTIVGAITSSNSTENG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  739 FNVTVVADNFYYLSNSAQ-PCDQPVLTYAGIGICSDGSITNSTARRAAADPVSPVISGNISVPTNFTFSVQVEYIQLMLK 817
Cdd:cd22373    81 FTTTIVTPTFYYSTNATSfNCTKPVLSYGPISVCSDGAIVGTSTLQDTRPSIVSLYDGEVEIPSAFTLSVQTEYLQVQAE 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  818 PVTVDCSVYVCNGNPRCLQLLAQYASACRTIEQALQLSARLESVEVNSMISISQEALALGVIDNFKHDFNLTNVLPASVG 897
Cdd:cd22373   161 QVVVDCPQYVCNGNSRCLQLLAQYTSACSNIESALHSSAQLDSREITNMFQTSTQSLELANITNFKGDYNFTSILTTKIG 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  898 AKSAVEDLLFDKVVTSGLGTVDADYKECASRTAntVAEVGCVQYYNGIMVLPGVVDQSLLAQYSAALTGAMVFGGVTAGA 977
Cdd:cd22373   241 GRSAIEDLLFNKVVTNGLGTVDQDYKSCSKDMA--IADLVCSQYYNGIMVLPGVVDAEKMAMYTGSLTGAMVFGGLTAAA 318
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  978 AVPFSIAVQSRLNYLALQTDVLQRNQQQLANSFNAAMGNITEAFGRVNDAIEQTSHAISTVAQALDKVQTVVNDQGLALS 1057
Cdd:cd22373   319 AIPFSTAVQARLNYVALQTNVLQENQKILAESFNQAVGNISLALSSVNDAIQQTSEALNTVANAINKIQTVVNQQGEALS 398
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1058 QLTKQLASNFQAISSSIEDLYNRLDRVEADLQVDRLITGRLAALNAFVAQQLTKYTDVRASRQLAQDKINECVKSQSFRY 1137
Cdd:cd22373   399 HLTAQLSNNFQAISTSIQDIYNRLDEVEANQQVDRLITGRLAALNAYVTQLLNQMSQIRQSRLLAQQKINECVKSQSSRY 478
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1138 GFCGNGTHVFSVVNAAPDGMMFFHSVLLPTAYMEVAAFSGLCVEGNGYVLRDTGNVLFEKNGQYLITARKMFEPRVPQTS 1217
Cdd:cd22373   479 GFCGNGTHLFSITQAAPNGIFFMHAVLVPTKFTRVNASAGICVDNTKGYSLQPQLILYQFNNSWRVTPRNMYEPRLPRQA 558
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1218 DFVQITGCDVVYLNVTRDELPTVIPDYIDVNSTVEDILSKLPNRTTPEFDLDIFNATYLNLTGEIADLTARSESLKNTTL 1297
Cdd:cd22373   559 DFIPLTDCSVTFYNTTAADLPNIIPDYVDVNQTVSDIIDNLPTPTPPQLDVDIYNNTILNLTQEINDLQERSKNLSQIAD 638
                         650
                  ....*....|
gi 971745448 1298 ELKELIANIN 1307
Cdd:cd22373   639 RLQQYIDNLN 648
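
The alignment blocks on this page pair a query row (`gi ...`) with a model row (`Cdd:...`). As a minimal sketch, percent identity for one block can be estimated by counting matching residues over the aligned columns. The helper name `alignment_identity` is hypothetical, and the code assumes that lowercase letters mark unaligned residues and `-` marks gaps in this display; neither convention is stated on the page itself.

```python
def alignment_identity(query, subject):
    """Count identical residues over aligned columns of one alignment block.

    Assumption (not stated on the page): a column counts as aligned only when
    both rows show an uppercase residue; lowercase letters and '-' are skipped.
    Returns (identical, aligned).
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# Final block of the alignment above (query 1298-1307 vs model 639-648).
identical, aligned = alignment_identity("ELKELIANIN", "RLQQYIDNLN")
print(f"{identical}/{aligned} identical")  # 4/10 identical
```

Summing the two counts over every block of a hit gives a rough overall identity; it is not a substitute for the bit score and E-value reported by the search.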
CoV_S2 pfam01601
Coronavirus spike glycoprotein S2; The coronavirus spike glycoprotein forms the characteristic ...
795-1324 0e+00

Coronavirus spike glycoprotein S2; The coronavirus spike glycoprotein forms the characteristic 'corona' after which the group is named. The Spike glycoprotein is translated as a large polypeptide that is subsequently cleaved to S1 (pfam01600) and S2. The S2 subunit normally contains multiple key components, including one or more fusion peptides (FP), a second proteolytic site (S2') and two conserved heptad repeats (HRs), driving membrane penetration and virus-cell fusion. The HRs can trimerize into a coiled-coil structure built of three HR1-HR2 helical hairpins presenting as a canonical six-helix bundle and drag the virus envelope and the host cell bilayer into close proximity, preparing for fusion to occur.


Pssm-ID: 460263  Cd Length: 502  Bit Score: 779.53  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448   795 GNISVPTNFTFSVQVEYIQLMLKPVTVDCSVYVCNGNPRCLQLLAQYASACRTIEQALQLSARLESVEVNSMISISQEAL 874
Cdd:pfam01601    1 GNISIPTNFTISVQTEYIQTTSPKVSVDCAQYVCNGNERCLQLLVQYGSFCSTIEQALQGSARLEDVEVLSMLSISNRAL 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448   875 ALGVIDNFKHDFNLTNVLPASVGAKSAVEDLLFDKVVTSGLGTVDAdYKECAsrTANTVAEVGCVQYYNGIMVLPGVVDQ 954
Cdd:pfam01601   81 TLATISNFGSDFNFSSFLPCLNSGRSAIEDLLFDKVVTSGLGTVDA-YKKCT--KGTSIADLVCAQYYNGIMVLPGVVDA 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448   955 SLLAQYSAALTGAMVFGGVT-AGAAVPFSIAVQSRLNYLALQTDVLQRNQQQLANSFNAAMGNITEAFgrvndaieqtsh 1033
Cdd:pfam01601  158 EKMAMYTASLTGGMAFGGLTgAAAAIPFALAVQARLNYLGLQTDVLQENQKILANAFNNAVGNITDGF------------ 225
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  1034 aiSTVAQALDKVQTVVNDQGLALSQLTKQLASNFQAISSSIEDLYNRLDRVEADLQVDRLITGRLAALNAFVAQQLTKYT 1113
Cdd:pfam01601  226 --TTTASALSKIQDVVNANAQALNQLTQQLSNNFGAISSSIQDIYSRLDQLEADAQVDRLINGRLAALNAFVTQQLTKAS 303
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  1114 DVRASRQLAQDKINECVKSQSFRYGFCGNGTHVFSVVNAAPDGMMFFHSVLLPTAYMEVAAFSGLCVEGN-GYVLRDTGN 1192
Cdd:pfam01601  304 EVKASRQLAQQKVNECVKSQSSRYGFCGNGTHLFSLPQAAPNGIMFLHTVLVPTEYITVKATPGLCVNGTtGYAPRDGQF 383
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  1193 VLFEkNGQYLITARKMFEPRVPQTSDFVQITGCDVVYLNVTRDELPTVIPDYIDVNSTVEDILSKLpNRTTPEFDLDIFN 1272
Cdd:pfam01601  384 VLNN-TSNWYITPRNMYQPRPITGSDFVQISSCDVNFVNITNTKLPPLIPDYVDFNKELEDIYKNL-NSTLPDLDLDIFN 461
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|..
gi 971745448  1273 ATYLNLTGEIADLTarseslknttlELKELIANINATLVDLEWLNRVETYIK 1324
Cdd:pfam01601  462 ATILNLTDEIKDLE-----------RLQELIDNLNQTLVDLEWLNRYETYIK 502
CoV_Spike_S1-S2_S2 cd21698
S1/S2 cleavage region and the S2 fusion subunit of coronavirus spike (S) proteins; This model ...
765-1302 0e+00

S1/S2 cleavage region and the S2 fusion subunit of coronavirus spike (S) proteins; This model represents the S1/S2 cleavage region and the S2 subunit of the spike (S) glycoprotein from coronaviruses (CoVs), including three highly pathogenic human CoVs, Middle East respiratory syndrome coronavirus (MERS-CoV), Severe acute respiratory syndrome (SARS) coronavirus (SARS-CoV), and SARS coronavirus 2 (SARS-CoV-2), also known as 2019 novel coronavirus (2019-nCoV). The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediate attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-terminal domain (C-domain). The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect S1 and S2. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV, and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP), and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). 
After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. Notably, SARS-CoV-2 has a functional polybasic (furin) cleavage site through the insertion of PRRAR*SV (* indicates the cleavage site) at the S1/S2 interface, which is absent in SARS-CoV and other SARS-related CoVs. The S1/S2 cleavage region and the S2 fusion subunit play an essential role in viral entry by initiating fusion of the viral and cellular membranes.
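
The polybasic site described here can be looked for with a simple pattern scan. The sketch below searches a sequence for the (R/K)-(2X)n-(R/K) furin cleavage motif quoted in the other hit descriptions on this page. The function name `find_basic_motifs`, the default range n = 1 to 3, and the short test string (built around the PRRAR*SV insertion mentioned above) are illustrative assumptions; a match only flags a candidate and does not show that the site is actually cleaved.

```python
import re


def find_basic_motifs(seq, min_n=1, max_n=3):
    """Return sorted (start, n, segment) tuples for (R/K)-(2X)n-(R/K) matches.

    start is 1-based. The n range is an assumption chosen for illustration;
    the page does not specify which values of n are relevant.
    """
    hits = []
    for n in range(min_n, max_n + 1):
        # A lookahead is used so that overlapping candidates are all reported.
        pattern = re.compile(r"(?=([RK][A-Z]{%d}[RK]))" % (2 * n))
        for match in pattern.finditer(seq):
            hits.append((match.start() + 1, n, match.group(1)))
    return sorted(hits)


# Toy string built around the PRRAR*SV insertion; not a full spike sequence.
print(find_basic_motifs("NSPRRARSVA"))  # [(4, 1, 'RRAR')]
```

Running the same scan over the S1/S2 region of a query sequence gives a quick first look at whether a polybasic stretch is present before inspecting the alignment in detail.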


Pssm-ID: 411955 [Multi-domain]  Cd Length: 523  Bit Score: 674.13  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  765 YAGIGICSDGSITNSTARRAAADPVSPVISGNISVPTNFTFSVQVEYIQLMLKPVTVDCSVYVCNGNPRCLQLLAQYASA 844
Cdd:cd21698     1 YGGICICYDGAIYTVSTGQEESPSIVAISTENIAIPSNFTLSVTTEYLQVTMTKVSVDCTTYVCGGSPRCKNLLLQYGSA 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  845 CRTIEQALQLSARLESVEVNSMISISQEALALGVIDNFKhDFNLTNVLPA--SVGAKSAVEDLLFDKVVTSGLGTVDADY 922
Cdd:cd21698    81 CDTIEQALRGIAVLEDSEVSNMFSTSKQALKLAIIKSFG-GFNFSQILPTpsRPSGRSAIEDLLFTKVVTAGLGTVDQYK 159
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  923 KeCASRTAntVAEVGCVQYYNGIMVLPGVVDQSLLAQYSAALTGAMVFGGVTAGAAVPFSIAVQSRLNYLALQTDVLQRN 1002
Cdd:cd21698   160 N-CTKGIA--IADLACAQYYNGIMVLPPVADAEKMAMYTGSLTAGMVFGGITAAAAIPFSLAMQARLNYVGLQQNVLLEN 236
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1003 QQQLANSFNAAMGNITEAFgrvndaieqtshaiSTVAQALDKVQTVVNDQGLALSQLTKQLASNFQAISSSIEDLYNRLD 1082
Cdd:cd21698   237 QKLLANSFNKAIGNISDAF--------------SSTSSALQKIQDVVNQQAQALNTLTSQLSNNFGAISSSIQDIYQRLD 302
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1083 RVEADLQVDRLITGRLAALNAFVAQQLTKYTDVRASRQLAQDKINECVKSQSFRYGFCGNGTHVFSVVNAAPDGMMFFHS 1162
Cdd:cd21698   303 KLEADVQVDRLITGRLAALNAFVTQQLIKAAEVRQSRRLAQQKINECVKSQSSRYGFCGNGTHLFSIPQSAPSGIVFLHT 382
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1163 VLLPTAYMEVAAFSGLCVEG-NGYVLRDTGnVLFEKNGQYLITARKMFEPRVPQTSDFVQITGCD--VVYLNVTRdELPT 1239
Cdd:cd21698   383 VLVPTSYKNVTAYPGICVDGkAGSPLEGPL-VFIQNNNHWFVTPRNMYEPRIITTADFVQITSCDanVTIVNNTV-NLDP 460
                         490       500       510       520       530       540
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 971745448 1240 VIPDYIDVNSTVEDILSKLPNRTTPEFDLDIFNATYLNLTGEIADLTARSESLKNTTLELKEL 1302
Cdd:cd21698   461 VIPDYVDVNEELDDYIQNLPNHTLPDLDLSGYNATILNISSEIDRLNEVAKNLNQSVVELQEY 523
CoV_S1 pfam01600
Coronavirus spike glycoprotein S1; This family represents the spike glycoprotein (S) of ...
215-641 1.52e-174

Coronavirus spike glycoprotein S1; This family represents the spike glycoprotein (S) of coronaviruses. The spike protein is arranged in trimers on the surface of the viral membrane and is essential for viral entry. The spike protein is translated as a large polypeptide that is subsequently cleaved to the distal S1, responsible for receptor binding, and the membrane-anchored S2, responsible for membrane fusion. The coronavirus (SARS-CoV) S1 subunit is composed of two distinct domains: an N-terminal domain (S1 NTD) and a receptor-binding domain (S1 RBD), also referred to as the S1 CTD or domain B. Each of these domains has been implicated in binding to host receptors. However, most coronaviruses are not known to utilize both the S1 NTD and S1 RBD for viral entry. This entry contains spike protein from both alpha and gamma coronaviruses but excludes the spike protein from beta-coronaviruses such as SARS-CoV.


Pssm-ID: 460262  Cd Length: 412  Bit Score: 525.37  E-value: 1.52e-174
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448   215 YTLCDNCTGFPQHVFATMENGEIPPSFNFANWFYLTNSSSPVSSRVVGLQPLLLTCLWPIPALLGTATDITFDRNGtSDV 294
Cdd:pfam01600    1 YSVCTNCDGFPDNVFAVEEGGYIPPSFSFNNWFYLTNSSTPVDGRVVSNQPLLLNCLWPIPSLNGTTLKVYFNGSI-PNG 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448   295 RCNGFAS-NETADAMRFSLNFTDS-AVFAKEGVITLKTLSN-TFKFSCSNSSTYQAP-YVIPFGHIDQPYYCFTTFYINe 370
Cdd:pfam01600   80 RCNGYSNkNGTVDAIRFNLNFTASdSVFAGAGSISLNTVGGvTYSFSCSNSSTPVGAsHQIPFGATDQPYYCFVNYNGN- 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448   371 taGTTTTSFVGMLPPVVREFVITKTGNVYLNGYRIFTVDDVVSVNFNISSTDHRDFWTVAFVKNTEVMLDIEDTYIKQLL 450
Cdd:pfam01600  159 --ISTTSQFVGILPPVVREIVISRYGDFYVNGYRYFSTPPLESVNFNLTSGDTSDFWTVAFANYTDVLVNINNTSIQRIL 236
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448   451 YCNTPLNVVKCQQLKFVLDDGFYSYSSPVDEVLPRTIVRLPRLMTHNFLNFTIFVSFyfdddkqarpDGGFYECATCAPK 530
Cdd:pfam01600  237 YCDSPLNSIKCQQLSFSLPDGFYSTSSVEVVQLPRTFVTLPKFATHSFVNLTVSVSF----------DGGGGPPSLSALS 306
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448   531 YYKLAFVDDwtsavssNVTSICVNYASFTTRLFTFYAGTTAGVHLGLETGTCPFSFDTLNNYLTFGSLCFSLV-ANGGCT 609
Cdd:pfam01600  307 EVNLTINGT-------NNTSLCVNTSQFTVNLNFTCTSTAYGYTAEIRTGTCPFSFDKLNNYLSFGSICFSLVpSGGGCT 379
                          410       420       430
                   ....*....|....*....|....*....|..
gi 971745448   610 MNIVTQGPYGLPHTIAVLYVSYTEGDNIIGVP 641
Cdd:pfam01600  380 MDIVTKYWNGSFVKVGSLYVSYSEGDNITGVP 411
gammaCoV_Spike_SD1-2_S1-S2_S2 cd22372
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
656-1316 1.30e-159

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from avian infectious bronchitis coronavirus (IBV) and related gammacoronaviruses; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from gammacoronaviruses, including avian infectious bronchitis virus, and Beluga whale coronavirus SW1 (whale-CoV SW1). The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediate attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). 
After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HCoV-HKU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Ro-BatCoV HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411959 [Multi-domain]  Cd Length: 661  Bit Score: 495.66  E-value: 1.30e-159
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  656 VYLDTCTTYTIYGMTGRGVIT------RSNNTFITG---LYYTSNAGNLLAYKNSTTGVVYNVYPCQLSSQVAVISDA-I 725
Cdd:cd22372     4 ITLNKCVDYNIYGRVGQGFITnvtdsaADYNYLADGglaILDTSGAIDIFVVQGEYGLNYYKVNPCEDVNQQFVVSGGnL 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  726 VGMASSTPNVSIDFnvtvVADNFYYL----------SNSAQPCDQPVLTYAGIGICSDGSItNSTARRAAADPVSPV--I 793
Cdd:cd22372    84 VGILTSRNETGSQL----LENQFYIKltngtrrrrrSISENVTSCPYVSYGKFCIKPDGSI-STIVPQELETFVAPLlnV 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  794 SGNISVPTNFTFSVQVEYIQLMLKPVTVDCSVYVCNGNPRCLQLLAQYASACRTIEQALQLSARLESVEVNSMISISQEA 873
Cdd:cd22372   159 TENVLIPNSFNLTVTDEYIQTRMDKVQINCLQYVCGNSLECRKLFQQYGPVCDNILSIVNSVNQKEDMELLSFYSSTKPG 238
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  874 LALG-VIDNFK-HDFNLTNVL--PASVGAKSAVEDLLFDKVVTSGLGTVDAdYKECASRTANTVAEVGCVQYYNGIMVLP 949
Cdd:cd22372   239 GFNTpVFNNVStGGFNISLLLppPSSPQGRSFIEDLLFTKVETVGLPTDDA-YKKCTAGPLGFLKDLVCAQEYNGLLVLP 317
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  950 GVVDQSLLAQYSAALTGAMVFGGVTAGAAVPFSIAVQSRLNYLALQTDVLQRNQQQLANSFNAAMGNITEAFgrvndaie 1029
Cdd:cd22372   318 PIITAEMQTMYTGSLVASMAFGGITAAGAIPFATQIQARINHLGITQSLLLKNQEKIAASFNKAIGHMQEGF-------- 389
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1030 qtshaiSTVAQALDKVQTVVNDQGLALSQLTKQLASNFQAISSSIEDLYNRLDRVEADLQVDRLITGRLAALNAFVAQQL 1109
Cdd:cd22372   390 ------RSTSLALQQIQDVVNKQSAILTETMASLNKNFGAISSVIQDIYQQLDAIQADAQVDRLITGRLSSLSVLASAKQ 463
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1110 TKYTDVRASRQLAQDKINECVKSQSFRYGFCGNGTHVFSVVNAAPDGMMFFHSVLLPTAYMEVAAFSGLCVEGN-----G 1184
Cdd:cd22372   464 AEYYKVSQQRELATQKINECVKSQSNRYGFCGNGRHVLTIPQNAPNGIVFIHFTYTPESFVNVTAIVGFCVNPAngsqyA 543
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1185 YVLRDTGNVLFEKNGQYLITARKMFEPRVPQTSDFVQITGCDVVYLNVTRDELPT-VIPDYIDVNstveDILSKLPNRTT 1263
Cdd:cd22372   544 IVPANGRGIFIQVNGTYYITARDMYMPRDITAGDIVTLTSCQANYVSVNKTVITTfVDNDDFDFD----DELSKWWNETK 619
                         650       660       670       680       690
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 971745448 1264 PEF-DLDIFNAT--YLNLTGEIADltarseslknttleLKELIANINATLVDLEWL 1316
Cdd:cd22372   620 HELpDFDQFNYTipILNISNEIDR--------------IQEVIQGLNDSLIDLETL 661
betaCoV_Spike_SD1-2_S1-S2_S2 cd22370
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
666-1302 1.21e-140

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from betacoronaviruses; This family contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from betacoronaviruses, including three highly pathogenic human coronaviruses (CoVs), Middle East respiratory syndrome coronavirus (MERS-CoV), Severe acute respiratory syndrome (SARS) coronavirus (SARS-CoV), and SARS coronavirus 2 (SARS-CoV-2), also known as 2019 novel coronavirus (2019-nCoV). The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediate attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). 
After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HKU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Rousettus bat coronavirus HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411957 [Multi-domain]  Cd Length: 667  Bit Score: 445.39  E-value: 1.21e-140
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  666 IYGMTGRGVITRSNNTFIT--GLYYTSNaGNLLAYKNSTTGVVYNVYPCqLSSQVAVISDAivgmaSSTPNVSIDFN--- 740
Cdd:cd22370     1 LYGYTGTGVLTETNATFLPfqNFGYDSN-GNLIAFKDPQTNTIYTILPC-VSGPVSVITPG-----NNTNEVAVLYNgln 73
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  741 ---VTV------------VADNFY--------------YLSNSAQPCDQPVltyaGIGICSDGSI-TNSTARRAAA---- 786
Cdd:cd22370    74 cseVPSaisavsltpwwrVYSSTSnyfdtpvgcllgavNSSNNSYECDLPL----GAGLCASYTTqSVLRSRSVASrsir 149
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  787 ------DPVSPV----ISGN--ISVPTNFTFSVQVEYIQLMLKPVTVDCSVYVCNGNPRCLQLLAQYASACRTIEQALQL 854
Cdd:cd22370   150 lttmsfFAENSVdvevAYSNfsIQIPTNFTIAVTEEFIPTTMPKVTVDCAQYVCGDSSECSNLLLQYGTFCDNINRALTG 229
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  855 SARLESVEVNSMISISQEALALGVIDNFKHDFNLTNVLP-----ASVGAKSAVEDLLFDKVVTSGLGTVDAdYKECASRT 929
Cdd:cd22370   230 VALLQDKNQLEVFASVKQIVKTPAPLKDFGGFNFSSLLPclgsnGGSSARSAIEDLLFNKVTLADVGFMKQ-YDDCTGGS 308
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  930 AntVAEVGCVQYYNGIMVLPGVVDQSLLAQYSAALTGAMVFGGVTAG----AAVPFSIAVQSRLNYLALQTDVLQRNQQQ 1005
Cdd:cd22370   309 A--ARDLICAQSFNGLKVLPPLLTDEMIAAYTSALLGGTATSGWTFGassaAQIPFAMQMAYRFNGIGVTQQVLVENQKL 386
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1006 LANSFNAAMGNITEAFgrvndaieqtshaiSTVAQALDKVQTVVNDQGLALSQLTKQLASNFQAISSSIEDLYNRLDRVE 1085
Cdd:cd22370   387 IANKFNQALGSIQTGF--------------TATNSALAKLQDVVNQNAQALNTLVKQLSNNFGAISSSLNDILSRLDKLE 452
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1086 ADLQVDRLITGRLAALNAFVAQQLTKYTDVRASRQLAQDKINECVKSQSFRYGFCGNGTHVFSVVNAAPDGMMFFHSVLL 1165
Cdd:cd22370   453 ADVQIDRLINGRLQVLQTYVTQQLIRASEIRASAQLAAQKMSECVKGQSKRVDFCGNGTHLMSFPQSAPNGVVFLHVTYK 532
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1166 PTAYMEVAAFSGLCVEGNGYVLRDtgNVLFEKNGQYLITARKMFEPrVPQTSDFVQITG-CDVVYLNVTRDELPTVIPDY 1244
Cdd:cd22370   533 PTSYKNVTTAPAICHNGKAYFPKE--GVFVKNNNSWMFTGRNFYEP-EIITTDNTFYSGsCDVNFTYVNNTVYNPLQPEL 609
                         650       660       670       680       690
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 971745448 1245 IDVNSTVEDILsKLPNRTTPEF-DLDIFNATYLNLTGEIADLTARSESLKNTTLELKEL 1302
Cdd:cd22370   610 DDFKAELDKFF-KNHTSPDPNLgDLSGINASFVDLQKEMDTLQEVVKQLNESLIDLKEL 667
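
Each hit on this page carries a one-line summary of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`. When the page text is processed outside the browser, that line can be pulled apart with a regular expression. This is a rough sketch: the names `STATS_RE` and `parse_hit_stats` are hypothetical, and the pattern is fitted only to the summary lines visible here, not to any documented NCBI output format.

```python
import re

# Pattern fitted to the summary lines shown on this page (an assumption,
# not a documented format). The "[Multi-domain]" flag is optional.
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


def parse_hit_stats(line):
    """Return a dict of the summary fields, or None if the line does not match."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("flag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


line = "Pssm-ID: 411957 [Multi-domain]  Cd Length: 667  Bit Score: 445.39  E-value: 1.21e-140"
print(parse_hit_stats(line))
```

An E-value printed as `0e+00` parses to `0.0`, so hits can be sorted numerically by E-value or by bit score once parsed.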
alphaCoV-HKU2-like_Spike_SD1-2_S1-S2_S2 cd22371
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the CoV ...
666-1368 1.50e-119

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the CoV spike (S) glycoprotein from Rhinolophus bat coronavirus HKU2 and related alphacoronaviruses; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from Wencheng shrew coronavirus (WESV), Lucheng Rn rat coronavirus (LRNV), and two bat viruses (Rhinolophus bat coronavirus HKU2 and BtRf-AlphaCoV/YN2012). Members of this group form a distinct cluster that is separated from the other alphacoronaviruses. The CoV S protein is an envelope glycoprotein that plays a very important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids) that mediate attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV and MERS-CoV use the C-domain to bind their receptors. 
The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM). After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. In order to catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HCoV-HKU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Ro-BatCoV HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411958 [Multi-domain]  Cd Length: 686  Bit Score: 389.15  E-value: 1.50e-119
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  666 IYGMTGRGVITRSNNTF---ITGLYYtsnaGNLLAYKNsTTGVVYNVYPC-QLSSQVAVISDAIVGM------------- 728
Cdd:cd22371     1 IDGVTFQGILYETNFTFdsfYNLLYK----GSMVKYVR-ILGVVYEVEPCnEFSYSVLKNNSSSYGTlysgadcnqidtk 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  729 ASSTPNVSIDFNVTVVA--DNFYYLSNSAQPCDQPVltyaGIGICSDGSITNSTARR-----AAADPVSPVISG-NISVP 800
Cdd:cd22371    76 TFRFKARSHTGTNTSLGclFNASYTNDTYTTCLNPL----GNGFCADVNVTSPVVGNigiqkHDTDYVRPILTEqFIELP 151
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  801 TNFTFSVQVEYIQLMLKPVTVDCSVYVCNGNPRCLQLLAQYASACRTIEQALQLSarleSVEVNSMISISQEALAL---G 877
Cdd:cd22371   152 LDHQLVVKEQFLQTSMPKFDVDCERYICDVSKACRELLFKYGGFCSKITADIKGS----SILLDSQILGLYKTIAVdfsS 227
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  878 VIDNFKhDFNLTNVLPASvGAKSAVEDLLFDKVVTSGLGTVDaDYKECASrtaNTVAEVGCVQYYNGIMVLPGVVDQSLL 957
Cdd:cd22371   228 PDVDFG-DFNFSMFMSEK-NGRSFIEDLLFDKIVTTGPGFYQ-DYYDCKK---MNLQDLTCAQYYNGIMVIPPIMDDETI 301
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  958 AQYSAALTGAM---VFGGvtAGAAVPFSIAVQSRLNYLALQTDVLQRNQQQLANSFNaamgNITEafgrvndaieQTSHA 1034
Cdd:cd22371   302 GMYGGIVAASMtagLFGG--QAGMVTWNTAMAGRLNALGVTQDALVEDVNKLANGFN----NLTQ----------SVSKL 365
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1035 ISTVAQALDKVQTVVNDQGLALSQLTKQLASNFQAISSSIEDLYNRLDRVEADLQVDRLITGRLAALNAFVAQQLTKYTD 1114
Cdd:cd22371   366 AKTTSQALSAIQAVVNQNAAQVEQLVQGLSENFGAISNNFEVIAERLEKLEADQQMDRLINGRMNVLQNFVTNYKLKISE 445
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1115 VRASRQLAQDKINECVKSQSFRYGFCGNGTHVFSVVNAAPDGMMFFHSVLLPTAYMEVAAFSGLCVEGN-GYVLRDTGNV 1193
Cdd:cd22371   446 LKSTQRLVQSLINECVYAQSLRNGFCGDGLHVMSLMQNAPDGIMFFHYTLKPNNTIIVKTTPGLCLSNEvCIKPIDAKFG 525
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1194 LF--EKNGQYLITARKMFEPRvpqtsdfvQITGCDVVYlnVTRDElptvipDYIDVNSTVEDIlsKLPnrTTPEFDLDiF 1271
Cdd:cd22371   526 VLvsANDSYWHFTPRNIYNPE--------NITNSNIIA--VSGGA------NYTTVNNTIDII--EPP--QNPPIDEE-F 584
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1272 NATYLNLTGE---IADLTARSESLkNTTLE---LKELIANINATLVDLEWLNRVETYIKWPWWVWLIIVLVLILFTCLML 1345
Cdd:cd22371   585 RELYKNVTLEleqLKNITFDMSKL-NLTYEidrLNEIAENVSKLHVTVSEFNKYVQYVKWPWYVWLAIFLVLILFSFLML 663
                         730       740
                  ....*....|....*....|...
gi 971745448 1346 FCCCSTGCCGIFSCMASSCGACC 1368
Cdd:cd22371   664 WCCCATGCCGCCGCCGAACNSCC 686
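
The alignment rows above can be read programmatically, for example to count identical positions between the query and the domain model. The sketch below assumes each row ends with three whitespace-separated fields (start coordinate, aligned string, end coordinate), as in the blocks on this page; the helper names `parse_row` and `identity` are hypothetical. As a simplification, lowercase residues are compared case-insensitively and columns containing a gap (`-`) are skipped.

```python
def parse_row(line):
    """Split one alignment row into (start, aligned_text, end)."""
    parts = line.split()
    return int(parts[-3]), parts[-2], int(parts[-1])


def identity(query_row, subject_row):
    """Return (identical, aligned) counts over ungapped columns."""
    _, query, _ = parse_row(query_row)
    _, subject, _ = parse_row(subject_row)
    if len(query) != len(subject):
        raise ValueError("aligned strings differ in length")
    identical = aligned = 0
    for a, b in zip(query, subject):
        if a == "-" or b == "-":
            continue  # gap column: not counted as aligned
        aligned += 1
        if a.upper() == b.upper():
            identical += 1
    return identical, aligned


# Final block of the cd22371 alignment shown above.
q = "gi 971745448 1346 FCCCSTGCCGIFSCMASSCGACC 1368"
s = "Cdd:cd22371   664 WCCCATGCCGCCGCCGAACNSCC 686"
print(identity(q, s))  # -> (12, 23)
```

Summing the two counts over all blocks of a hit gives an overall identity for that hit; the per-block figure here only illustrates the parsing.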
bat-HKU9-CoV-like_Spike_SD1-2_S1-S2_S2 cd22381
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
666-1368 3.42e-112

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from Rousettus bat coronavirus HKU9 and related betacoronaviruses in the D lineage; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from betacoronaviruses in the nobecovirus subgenus (D lineage), including Rousettus bat coronavirus HKU9 (Ro-BatCoV HKU9). The CoV S protein is an envelope glycoprotein that plays an important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids), which mediate attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV, and MERS-CoV, use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM).
After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. To catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HCoV-HKU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Ro-BatCoV HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411968 [Multi-domain]  Cd Length: 731  Bit Score: 370.62  E-value: 3.42e-112
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  666 IYGMTGRGVITRSNNTFITG-LYYTSNAGNLLAYKNSTTgvVYNVYPCqLSSQVAVISDA-------IVGMASSTPNVSI 737
Cdd:cd22381     1 LYGYTGTGVLSTSNLTIPDSkVFSASSTGDIIAVSVNGT--VYSISPC-VSVPISVGYDPgferallFNGLSCSERARAV 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  738 -----DFNVTVVAD--------------NFYYLSN-SAQPCDQPVltyaGIGICSDGSITNSTARRA------------- 784
Cdd:cd22381    78 sepasDYWRASVSDganntfdtpsgcvyNVINRTTiTVNQCSMPL----GNSLCLVNNTTAVSARGSlsllslvtydply 153
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  785 --AADPVSPVISgnISVPTNFTFSVQVEYIQLMLKPVTVDCSVYVCNGNPRCLQLLAQYASACRTIEQALqlsARLESVE 862
Cdd:cd22381   154 dsSVTPLTPVYW--VSIPTNFTLAATTEYIQTTAPKINIDCAKYLCGDSSRCLTVLLQYGTFCDDVNKAL---ARVSTIL 228
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  863 VNSMISISQEaLALGVIDN----FKHDFNLTNV---LPASVGAK---SAVEDLLFDKVVTSGLGTVDAdYKEC-ASRTAN 931
Cdd:cd22381   229 DASLVSLVSE-LTSDVVRSenlaFDGDYNFTGLmgcLGSNCNSKsyrSALSDLLYNKVKVADPGFMQS-YQKCiDSQWGG 306
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  932 TVAEVGCVQYYNGIMVLPGVVDQSLLAQYSAALTGAMVFGG----VTAGAAVPFSIAVQSRLNYLALQTDVLQRNQQQLA 1007
Cdd:cd22381   307 NIRDLICTQTFNGISVLPPIVSPGMQALYTSLLVGAVASSGytfgITSVGVIPFATQLQFRLNGLGVTTQVLVENQKLIA 386
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1008 NSFNAAMGNITEAFGRVNdaieqtshaistvaQALDKVQTVVNDQGLALSQLTKQLASNFQAISSSIEDLYNRLDRVEAD 1087
Cdd:cd22381   387 NSFNKALVSIQKGFDATN--------------QALSKMQTVINQHAQQLQTLVQQLGNSFGAISSSINEIFSRLDGLEAN 452
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1088 LQVDRLITGRLAALNAFVAQQLTKYTDVRASRQLAQDKINECVKSQSFRYGFCGNGTHVFSVVNAAPDGMMFFHSVLLPT 1167
Cdd:cd22381   453 AEVDRLINGRMVVLNTYVTQLLIQASEVRAQAALAKQKISECVKAQSLRNDFCGNGTHVLSIPQLAPNGVLFIHYSYQPT 532
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1168 AYMEVAAFSGLCVEGNGYVLRDTGNVLFEKNGQYLITARKMFEPRVPQTSDFVQITGCDVVYLNVTRDELPTVIPDYIDV 1247
Cdd:cd22381   533 AYALVQTAAGLCFNGTGYAPRGGLFVLPNNSNLWHFTKMNFYNPVNISYSNTQVLTSCSVNYTTVNYTVLNPSEPSDFNF 612
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1248 NSTVEDILSKLPNRTTPEFDLDIFNATYLNLTGEIADLTarseslknttlelkELIANINATLVDLEWLNRVETYIKWPW 1327
Cdd:cd22381   613 QEEFDKWYKNQSSQFNNTFNPSDFNFSTVDVNEQLATLT--------------DVVKQLNESFIDLKKLNVYEQTIKWPW 678
                         730       740       750       760
                  ....*....|....*....|....*....|....*....|...
gi 971745448 1328 WVWL--IIVLVLILFTCLMLfcCCSTGCCGIFSCMAsSCGACC 1368
Cdd:cd22381   679 YVWLamIAGLVGLALAVVML--LCMTNCCSCFKGMC-SCKQCQ 718
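
Each hit on this page carries a one-line summary of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...` ahead of its alignment. The sketch below shows one way to turn such a line into a typed record for ranking or filtering hits. The regular expression is written against the lines as they appear here, not against any documented NCBI format, and the names `SUMMARY_RE` and `parse_summary` are hypothetical.

```python
import re

# Pattern inferred from the summary lines on this page (an assumption,
# not an official format definition). "[Multi-domain]" is optional.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<evalue>\S+)"
)


def parse_summary(line):
    """Parse one 'Pssm-ID: ...' summary line into a dict of typed fields."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError("not a Pssm-ID summary line")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["multi"] is not None,
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "evalue": float(match["evalue"]),
    }


line = "Pssm-ID: 411968 [Multi-domain]  Cd Length: 731  Bit Score: 370.62  E-value: 3.42e-112"
hit = parse_summary(line)
print(hit["pssm_id"], hit["cd_length"], hit["bit_score"], hit["evalue"])
```

Keeping the E-value as a float makes it straightforward to sort hits or apply a threshold, for instance to set the weaker COG matches further down the page apart from the spike-specific models.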
MERS-CoV-like_Spike_SD1-2_S1-S2_S2 cd22379
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
666-1302 1.28e-105

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from Middle East respiratory syndrome coronavirus and related betacoronaviruses in the C lineage; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from betacoronaviruses in the merbecovirus subgenus (C lineage), including Middle East respiratory syndrome coronavirus (MERS-CoV). The CoV S protein is an envelope glycoprotein that plays an important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids), which mediate attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV, and MERS-CoV, use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM).
After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. To catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HCoV-HKU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Rousettus bat coronavirus HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411966 [Multi-domain]  Cd Length: 682  Bit Score: 351.02  E-value: 1.28e-105
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  666 IYGMTGRGVITRSNNTFITGLYYTSNA-GNLLAYkNSTTGVVYNVYPCqLSSQVAVISDaivgmaSSTPNVSIDF----- 739
Cdd:cd22379     1 LYGVTGRGVFQNCTAVGIRQQRFVYDSfDNLVGY-HSDDGNYYCVRPC-VSVPVSVIYD------KSTNTHATLFgsvac 72
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  740 -NVTVVADNFYYLSNS---AQPCDQPVLTYAGigiCSDGSITNS---------------------TARRAAADPVSPVIS 794
Cdd:cd22379    73 eHISTMMSQFSRSTQSmlrRRSTNGPLQTAVG---CVIGLVNTSltvedcklplgqslcavpptlTPRSVSSVPGEQLAS 149
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  795 GN----------------ISVPTNFTFSVQVEYIQLMLKPVTVDCSVYVCNGNPRCLQLLAQYASACRTIEQALQLSARL 858
Cdd:cd22379   150 INfnhplqvdqlnssgfkVSIPTNFSFGVTQEYIQTTIQKVTVDCKQYVCNGFEKCEQLLREYGQFCSKINQALHGANLR 229
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  859 ESVEVNSMISISQEALALGVIDNFKHDFNLTNVLPASVG-----AKSAVEDLLFDKVVTSGLGTVDAdYKECASRTANTV 933
Cdd:cd22379   230 QDDSVRNLFASIKTSQSQPLIAGLGGDFNLTLLEPPSIStgsrsYRSAIEDLLFDKVTIADPGYMQG-YDECMKQGPPSA 308
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  934 AEVGCVQYYNGIMVLPGVVDQSLLAQYSAALTGAMVFGGVTAG----AAVPFSIAVQSRLNYLALQTDVLQRNQQQLANS 1009
Cdd:cd22379   309 RDLICAQYVAGYKVLPPLYDVNMEAAYTSSLLGSIAGAGWTAGlssfAAIPFAQSIFYRLNGVGITQQVLSENQKLIANK 388
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1010 FNAAMGNITEAFgrvndaieqtshaiSTVAQALDKVQTVVNDQGLALSQLTKQLASNFQAISSSIEDLYNRLDRVEADLQ 1089
Cdd:cd22379   389 FNQALGAMQTGF--------------TTTNLAFQKVQDAVNANAQALSKLASELSNTFGAISSSIGDILKRLDVLEQEAQ 454
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1090 VDRLITGRLAALNAFVAQQLTKYTDVRASRQLAQDKINECVKSQSFRYGFCGNGTHVFSVVNAAPDGMMFFHSVLLPTAY 1169
Cdd:cd22379   455 IDRLINGRLTSLNAFVAQQLVRSETAARSAQLAKDKVNECVKSQSKRNGFCGQGTHIVSFVINAPNGLYFFHVGYVPTNH 534
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1170 MEVAAFSGLCVEGN---------GY-VLRDTGNVlfekNGQYLITARKMFEPRvPQTSDFVQITGCDVVYLNVTRDELPT 1239
Cdd:cd22379   535 VNVTAAYGLCDSANptnciapvnGYfIKNNTTRI----VDEWSYTGSSFYAPE-PITSANTRYVSPDVTFQNLSNNLPPP 609
                         650       660       670       680       690       700
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 971745448 1240 VIPDYIDVNSTVE------DILSKLPNRTtpefDLDIFNATYLNLTGEIADLTARSESLKNTTLELKEL 1302
Cdd:cd22379   610 LLSNSTDIDFKDEleeffkNVSSQIPNFG----SISQINTTLLDLSDEMLSLQQVVKALNESYIDLKEL 674
HKU1-CoV-like_Spike_SD1-2_S1-S2_S2 cd22380
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
666-1302 3.81e-99

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from human HKU1 and OC43 coronaviruses and related betacoronaviruses in the A lineage; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from betacoronaviruses in the embecovirus subgenus (A lineage), including the human coronaviruses (CoVs) HKU1 and OC43, as well as murine hepatitis virus (MHV). The CoV S protein is an envelope glycoprotein that plays an important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids), which mediate attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of MHV is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV, and MERS-CoV, use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM).
After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. To catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. In the case of human-infecting coronaviruses such as SARS-CoV-2, HCoV-OC43, MERS-CoV, and HCoV-HKU1, the spike protein contains an insertion of (R/K)-(2X)n-(R/K) (furin cleavage motif) at the S1/S2 site, which is absent in SARS-CoV and other SARS-related coronaviruses, as well as Rousettus bat coronavirus HKU9. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411967 [Multi-domain]  Cd Length: 663  Bit Score: 332.51  E-value: 3.81e-99
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  666 IYGMTGRGVITRSNNTFITG---LYYTSNaGNLLAYKNSTTGVVYNVYPC----------QLSSQVA----------VIS 722
Cdd:cd22380     1 LYGITGQGIFKEVNADYYNSwqnLLYDSN-GNLYGFRDYLTNKTYMIRSCysgrvsaafhANASEPAllyrnlkcsyVFN 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  723 DAIVGMASSTPNVSIDFNVTVVADNfyYLSNSAQPCDQPVltyaGIGICSDGSiTNSTARRAAA---------------- 786
Cdd:cd22380    80 NTISREEQPLNYFDSYLGCVVNADN--STSSAVQTCDLRM----GSGYCVDYS-TSRRSRRSIStgyrfttfepftvnlv 152
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  787 -DPVSPViSG--NISVPTNFTFSVQVEYIQLMLKPVTVDCSVYVCNGNPRCLQLLAQYASACRTIEQALQlsarlesvEV 863
Cdd:cd22380   153 nDSVEPV-GGlyEIQIPTNFTIGNHEEFIQTSSPKVTIDCAAFVCGDYAACRQQLVEYGSFCDNINAILN--------EV 223
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  864 NSMISISQ----EALALGV---------IDNFKHDFNLTNVLP------ASVGAKSAVEDLLFDKVVTSGLGTVDAdYKE 924
Cdd:cd22380   224 NELLDTTQlqvaNSLMQGVtlssrlkdgINFNVDDINFSPVLGclgsdcNAASSRSAIEDLLFDKVKLSDVGFVEA-YNN 302
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  925 CASrtANTVAEVGCVQYYNGIMVLPGVVDQSLLAQYSAALTGAMVFGGVTAGAAVPFSIAVQSRLNYLALQTDVLQRNQQ 1004
Cdd:cd22380   303 CTG--GAEIRDLLCVQSFNGIKVLPPVLSENQISGYTTAATAASLFPPWSAAAGVPFSLNVQYRINGLGVTMDVLSQNQK 380
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1005 QLANSFNAAMGNITEAFGRVNdaieqtshaistvaQALDKVQTVVNDQGLALSQLTKQLASNFQAISSSIEDLYNRLDRV 1084
Cdd:cd22380   381 LIANAFNNALGAIQEGFDATN--------------SALAKIQSVVNANAEALNNLLQQLSNRFGAISASLQEILSRLDAL 446
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1085 EADLQVDRLITGRLAALNAFVAQQLTKYTDVRASRQLAQDKINECVKSQSFRYGFCGNGTHVFSVVNAAPDGMMFFHSVL 1164
Cdd:cd22380   447 EAQAQIDRLINGRLTALNAYVSQQLSDSTLVKFSAAQAIEKVNECVKSQSPRINFCGNGNHILSLVQNAPYGLYFIHFSY 526
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1165 LPTAYMEVAAFSGLCVEGN-------GYVLRDtgnvlfekNGQYLITARKMFEPRVPQTSDFVQITGCDVVYLNVTRDEL 1237
Cdd:cd22380   527 VPTSFVTAKVSPGLCIAGDrgiapksGYFVNV--------NNEWMFTGSGYYYPEPITDKNVVVMSSCAVNYTKAPDVML 598
                         650       660       670       680       690       700
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 971745448 1238 PTVIPDYIDVNSTVeDILSKLPNRTTPEFDLDIF-NATYLNLTGEIADLTARSESLKNTTLELKEL 1302
Cdd:cd22380   599 NTSIPNLPDFKEEL-DQWFKNQTSVAPDLSLDEYiNVTFLDLQDEMNRIQEAIKVLNESYINLKEI 663
SARS-CoV-like_Spike_SD1-2_S1-S2_S2 cd22378
SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) ...
668-1302 6.69e-94

SD-1 and SD-2 subdomains, the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from SARS-CoV-2 (COVID-19) and related betacoronaviruses in the B lineage; This group contains the SD-1 and SD-2 subdomains of the S1 subunit C-terminal domain (C-domain), the S1/S2 cleavage region, and the S2 fusion subunit of the spike (S) glycoprotein from betacoronaviruses in the sarbecovirus subgenus (B lineage), including highly pathogenic human CoVs such as severe acute respiratory syndrome (SARS) coronavirus (SARS-CoV) and SARS-CoV-2 (also known as 2019 novel coronavirus or 2019-nCoV). The CoV S protein is an envelope glycoprotein that plays an important role in viral attachment, fusion, and entry into host cells, and serves as a major target for the development of neutralizing antibodies, inhibitors of viral entry, and vaccines. It is synthesized as a precursor protein that is cleaved into an N-terminal S1 subunit (~700 amino acids) and a C-terminal S2 subunit (~600 amino acids), which mediate attachment and membrane fusion, respectively. Three S1/S2 heterodimers assemble to form a trimer spike protruding from the viral envelope. The S1 subunit contains a receptor-binding domain (RBD), while the S2 subunit contains the coronavirus fusion machinery and is primarily alpha-helical. S1 contains two structurally independent domains, the N-terminal domain (NTD) and the C-domain. The S1 C-domain also contains two subdomains (SD-1 and SD-2), which connect the S1 and S2 subunits. Depending on the virus, either the NTD or the C-domain can serve as the receptor-binding domain (RBD). While the RBD of mouse hepatitis virus (MHV) is located at the NTD, most CoVs, including SARS-CoV-2, SARS-CoV, and MERS-CoV, use the C-domain to bind their receptors. The S2 subunit comprises the fusion peptide (FP), a second proteolytic site (S2'), followed by an internal fusion peptide (IFP) and two heptad-repeat domains (HR1 and HR2) preceding the transmembrane domain (TM).
After binding of the S1 subunit RBD on the virion to its receptor on the target cell, the HR1 and HR2 domains interact with each other to form a six-helix bundle (6-HB) fusion core, bringing viral and cellular membranes into close proximity for fusion and infection. To catalyze the membrane fusion reaction, CoV S needs to be primed through cleavage at the S1/S2 and S2' sites. Notably, SARS-CoV-2 has a functional polybasic (furin) cleavage site through the insertion of PRRAR*SV (* indicates the cleavage site) at the S1/S2 interface, which is absent in SARS-CoV and other SARS-related coronaviruses. The region modeled in this cd (SD-1 and SD-2, the S1/S2 cleavage region, and the S2 fusion subunit) plays an essential role in viral entry by initiating fusion of the viral and cellular membranes.


Pssm-ID: 411965 [Multi-domain]  Cd Length: 662  Bit Score: 317.71  E-value: 6.69e-94
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  668 GMTGRGVITRSNNTFITGLYYTSNAGNLL-AYKNSTTGVVYNVYPCQL------------SSQVAVISDAI----VGMAS 730
Cdd:cd22378     3 GLTGTGVLTPSSKRFQPFQQFGRDVSDFTdSVRDPKTLEILDISPCSFggvsvitpgtnaSSEVAVLYQDVnctdVPTAI 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  731 STPNVSIDFNVTVVADNFYYLS----------NSAQPCDQPVltyaGIGICSD---GSITNSTARRA--------AADPV 789
Cdd:cd22378    83 HADQLTPAWRVYSTGSNVFQTQagcligaehvNTSYECDIPI----GAGICASyhtVSLLRSTSQKSivaytmslGAENS 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  790 SPVISGNISVPTNFTFSVQVEYIQLMLKPVTVDCSVYVCNGNPRCLQLLAQYASACRTIEQALQLSARLESVEVNSMISI 869
Cdd:cd22378   159 IAYSNNSIAIPTNFSISVTTEVMPVSMAKTSVDCTMYICGDSTECSNLLLQYGSFCTQLNRALSGIAVEQDKNTQEVFAQ 238
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  870 SQEALALGVIDNFKhDFNLTNVLP--ASVGAKSAVEDLLFDKVVTSGLGTVDaDYKECASRTAntVAEVGCVQYYNGIMV 947
Cdd:cd22378   239 VKQMYKTPTIKDFG-GFNFSQILPdpSKPTKRSFIEDLLFNKVTLADAGFMK-QYGDCLGDIN--ARDLICAQKFNGLTV 314
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  948 LPGVVDQSLLAQYSAALTGAMVFGGVT--AGAA--VPFSIAVQSRLNYLALQTDVLQRNQQQLANSFNAAMGNITEAfgr 1023
Cdd:cd22378   315 LPPLLTDEMIAAYTAALVSGTATAGWTfgAGAAlqIPFAMQMAYRFNGIGVTQNVLYENQKQIANQFNKAISQIQES--- 391
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1024 vndaieqtshaISTVAQALDKVQTVVNDQGLALSQLTKQLASNFQAISSSIEDLYNRLDRVEADLQVDRLITGRLAALNA 1103
Cdd:cd22378   392 -----------LTTTSTALGKLQDVVNQNAQALNTLVKQLSSNFGAISSVLNDILSRLDKVEAEVQIDRLITGRLQSLQT 460
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1104 FVAQQLTKYTDVRASRQLAQDKINECVKSQSFRYGFCGNGTHVFSVVNAAPDGMMFFHSVLLPTAYMEVAAFSGLCVEGN 1183
Cdd:cd22378   461 YVTQQLIRAAEIRASANLAATKMSECVLGQSKRVDFCGKGYHLMSFPQAAPHGVVFLHVTYVPSQERNFTTAPAICHEGK 540
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1184 GYVLRDTgnvLFEKNG-QYLITARKMFEPRVPQTSDFVQITGCDVVYLNVTRDELPTVIPdyiDVNSTVEDILSKLPNRT 1262
Cdd:cd22378   541 AYFPREG---VFVSNGtSWFITQRNFYSPQIITTDNTFVSGNCDVVIGIINNTVYDPLQP---ELDSFKEELDKYFKNHT 614
                         650       660       670       680
                  ....*....|....*....|....*....|....*....|...
gi 971745448 1263 TPEFDL-DI--FNATYLNLTGEIADLTARSESLKNTTLELKEL 1302
Cdd:cd22378   615 SPDVDLgDIsgINASVVNIQKEIDRLNEVAKNLNESLIDLQEL 657
CoV_S1_C pfam19209
Coronavirus spike glycoprotein S1, C-terminal; This entry represents a domain found at the ...
659-715 3.83e-23

Coronavirus spike glycoprotein S1, C-terminal; This entry represents a domain found at the C-terminus of the coronavirus S1 protein. It is found across a range of alpha-, beta-, and gammacoronaviruses. This small, all-beta-strand domain is known as subdomain 2 in the structure of the porcine epidemic diarrhea virus spike protein.


Pssm-ID: 437047 [Multi-domain]  Cd Length: 57  Bit Score: 93.45  E-value: 3.83e-23
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 971745448   659 DTCTTYTIYGMTGRGVITRSNNTFITGLYYTSNAGNLLAYKNSTTGVVYNVYPCQLS 715
Cdd:pfam19209    1 NVCTDYTIYGITGTGVIRETNSTIPSGLYYTSSSGDLLGFKNSTTGTVYSVTPCVSS 57
CoV_S2_C pfam19214
Coronavirus spike glycoprotein S2, intravirion; This entry represents the cysteine rich ...
1346-1387 1.21e-10

Coronavirus spike glycoprotein S2, intravirion; This entry represents the cysteine-rich intravirion region found at the C-terminus of coronavirus spike (S) proteins. These cysteine residues are targets for palmitoylation, which is necessary for efficient incorporation of S into virions and for S-mediated membrane fusion.


Pssm-ID: 465998  Cd Length: 42  Bit Score: 57.81  E-value: 1.21e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 971745448  1346 FCCCSTGCCGifSCMASSCGACCDIRG--TKLQRYEAIEKVHVQ 1387
Cdd:pfam19214    1 FCCCCTGCCG--CCFGCSCGGCCDSYDkrDDVYPAEVVEKVHVQ 42
CwlO1 COG3883
Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function ...
965-1133 1.29e-04

Uncharacterized N-terminal coiled-coil domain of peptidoglycan hydrolase CwlO [Function unknown];


Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  Bit Score: 45.98  E-value: 1.29e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448  965 TGAMVFGGVTAGAAVPFSIAVQSRLNYLALQTDVLQRNQQQLANSFNAAMGNITEAFGRVNDA---IEQTSHAISTVAQA 1041
Cdd:COG3883     1 ALALALAAPTPAFADPQIQAKQKELSELQAELEAAQAELDALQAELEELNEEYNELQAELEALqaeIDKLQAEIAEAEAE 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1042 LDKVQTVVNDQgLALSQLTKQLASNFQAI--SSSIEDLynrLDRVEAdlqVDRLITGRLAALNAFVAQQltkyTDVRASR 1119
Cdd:COG3883    81 IEERREELGER-ARALYRSGGSVSYLDVLlgSESFSDF---LDRLSA---LSKIADADADLLEELKADK----AELEAKK 149
                         170
                  ....*....|....
gi 971745448 1120 QLAQDKINECVKSQ 1133
Cdd:COG3883   150 AELEAKLAELEALK 163
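
Each hit in this list carries a statistics line of the form `Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`, optionally with a bracketed flag such as `[Multi-domain]`. The following is a minimal sketch for turning such a line into typed fields. The regular expression and the function name `parse_summary` are assumptions based only on the lines shown on this page, not an official CD-Search format specification.

```python
import re

# Pattern inferred from the statistics lines on this page (an assumption).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional flag, e.g. [Multi-domain]
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse one hit statistics line into a dict of typed values."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit statistics line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "flag": match["flag"],               # None when no bracketed flag
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


line = ("Pssm-ID: 443091 [Multi-domain]  Cd Length: 379  "
        "Bit Score: 45.98  E-value: 1.29e-04")
record = parse_summary(line)
print(record)
```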
Tar COG0840
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];
1005-1077 2.71e-04

Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];


Pssm-ID: 440602 [Multi-domain]  Cd Length: 533  Bit Score: 45.40  E-value: 2.71e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 971745448 1005 QLANSFNAAMGNITEAFGRVNDAIEQTSHAISTVAQAldkvqtvvNDQglaLSQLTKQLASNFQAISSSIEDL 1077
Cdd:COG0840   239 QLADAFNRMIENLRELVGQVRESAEQVASASEELAAS--------AEE---LAAGAEEQAASLEETAAAMEEL 300
Tar COG0840
Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];
1004-1083 2.68e-03

Methyl-accepting chemotaxis protein (MCP) [Signal transduction mechanisms];


Pssm-ID: 440602 [Multi-domain]  Cd Length: 533  Bit Score: 41.93  E-value: 2.68e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 971745448 1004 QQLANSFN---AAMGNITEAFGRVNDAIEQTSHAISTVAQALDKVQTVVNDQglalSQLTKQLASNFQAISSSIEDLYNR 1080
Cdd:COG0840   452 EEAGEALEeivEAVEEVSDLIQEIAAASEEQSAGTEEVNQAIEQIAAAAQEN----AASVEEVAAAAEELAELAEELQEL 527

                  ...
gi 971745448 1081 LDR 1083
Cdd:COG0840   528 VSR 530
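
The COG3883 hit and the two COG0840 hits above cover overlapping stretches of the query. A short sketch for measuring that overlap from the reported intervals follows; the helper name `overlap_length` is hypothetical and the coordinates are copied from the hit list. Intervals are treated as 1-based and inclusive, matching the alignment numbering.

```python
def overlap_length(a, b):
    """Residues shared by two inclusive (start, end) intervals; 0 if disjoint."""
    start = max(a[0], b[0])
    end = min(a[1], b[1])
    return max(0, end - start + 1)


# Query intervals copied from the hit list above.
cwlo1 = (965, 1133)        # COG3883
tar_first = (1005, 1077)   # COG0840, E-value 2.71e-04
tar_second = (1004, 1083)  # COG0840, E-value 2.68e-03

print(overlap_length(tar_first, tar_second))
print(overlap_length(cwlo1, tar_second))
```

Note that the two COG0840 hits align nearly the same query region to different parts of the model (239-300 and 452-530), so overlap on the query alone does not make them redundant alignments.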
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
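
The E-value threshold of 0.01 determines which hits are reported. The sketch below shows how a stricter cutoff would thin the hits listed in this part of the page. The tuple layout and the function name `filter_hits` are illustrative assumptions; accessions, intervals, and E-values are copied from the hits above, and hits at or below the threshold are treated as passing.

```python
# (accession, query_start, query_end, e_value), copied from the hits above.
hits = [
    ("pfam19209", 659, 715, 3.83e-23),
    ("pfam19214", 1346, 1387, 1.21e-10),
    ("COG3883", 965, 1133, 1.29e-04),
    ("COG0840", 1005, 1077, 2.71e-04),
    ("COG0840", 1004, 1083, 2.68e-03),
]


def filter_hits(hits, threshold):
    """Keep hits whose E-value is at or below the threshold."""
    return [hit for hit in hits if hit[3] <= threshold]


# All listed hits pass the page's threshold of 0.01.
reported = filter_hits(hits, 0.01)

# A stricter, arbitrarily chosen cutoff keeps only the strongest hits.
strict = filter_hits(hits, 1e-5)

for accession, start, end, e_value in strict:
    print(f"{accession}\t{start}-{end}\t{e_value:.2e}")
```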
