NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1034627714|ref|XP_016883996|]
View 

uromodulin-like 1 isoform X1 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Zona_pellucida pfam00100
Zona pellucida-like domain;
1017-1257 6.04e-40

Zona pellucida-like domain;


:

Pssm-ID: 459673 [Multi-domain]  Cd Length: 254  Bit Score: 148.91  E-value: 6.04e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714 1017 CEIEKVVVAIQKRFLQQeSIPESSLYL---SHPSCNVSHSNGTH--VLLEAGWSECGTLMQSNMTNTVVRTTLRNDLSQE 1091
Cdd:pfam00100    1 CTPDTMTVSISKCLLVP-SGLLSSLSLlggLDPSCKPVSNTNGSpaVLFEFPLTGCGTTVQVNGTHIIYSNTLYSSTDLR 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714 1092 GIIHHLKILS--PIYCAFQNDLLTSSGFTLEWGVYTIIEDlhGAGNFVTEMQLFIGDS----PIPQNYSVSASDDVRIEV 1165
Cdd:pfam00100   80 SGIIRRTITRrlPFSCSYPRSSLVSLLVVAPPSPVPITVS--GSGVFLVSMDLYYDSSytspYSPYPVTVLLGDPLYVEV 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714 1166 GL-YRQKSNLKVVLTECWATPSSNARDPITFSFINNSCPVPNTYTN---VIENGNSNKAQFKLRIFSFINDSI--VYLHC 1239
Cdd:pfam00100  158 SLlSRTDPNLVLVLDNCWATPSPNPTSSPQYQLIVNGCPNDGDSTYpvsSLSNGPSHYVRFSFKAFRFVGSSIsqVYLHC 237
                          250
                   ....*....|....*...
gi 1034627714 1240 KLRVCmESPGATCKINCN 1257
Cdd:pfam00100  238 SVSVC-SSDSNSCGKSCS 254
WAP pfam00095
WAP-type (Whey Acidic Protein) 'four-disulfide core'; WAP belongs to the group of Elafin or ...
13-53 1.19e-10

WAP-type (Whey Acidic Protein) 'four-disulfide core'; WAP belongs to the group of Elafin or elastase-specific inhibitors.


:

Pssm-ID: 459672 [Multi-domain]  Cd Length: 42  Bit Score: 57.82  E-value: 1.19e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 1034627714   13 RPGACPAEG-PEPSTSPCSLDIDCPGLEKCCPWSGGRYCMAP 53
Cdd:pfam00095    1 KPGCCPRLGaRGCCRSCCSSDDDCPGRQKCCSNGCGSVCVPP 42
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
458-679 3.63e-09

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 61.47  E-value: 3.63e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  458 GLSAATgvTVPGLGTGTAALGLENFTLSPSPGYPQGTPAAGQAWTPEPSPRRGGSNVVGYDRNNTGKGVEQEVP-STAPG 536
Cdd:pfam05109  447 GLPSST--HVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPnATSPT 524
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  537 LGMDQGSPSQVNPSQGSPSQGSLRQESTSQASPSQRSTSQGSPSQVNPSQRSTSHANSSQGSPSQGSpsqeSPSQGSTSQ 616
Cdd:pfam05109  525 PAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTPNAT----SPTVGETSP 600
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034627714  617 ASPSHRNTIGviGTTSSPKATGSTHSFPPGATDGPLALPGQLQGNSIMEPPSW-----PSPTEDPTGH 679
Cdd:pfam05109  601 QANTTNHTLG--GTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSIsetlsPSTSDNSTSH 666
EGF_CA pfam07645
Calcium-binding EGF domain;
403-433 5.51e-07

Calcium-binding EGF domain;


:

Pssm-ID: 429571  Cd Length: 32  Bit Score: 46.85  E-value: 5.51e-07
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1034627714  403 DWDECVDSAeHDCSPAAWCINLEGSYTCQCR 433
Cdd:pfam07645    1 DVDECATGT-HNCPANTVCVNTIGSFECRCP 30
EGF_CA smart00179
Calcium-binding EGF-like domain;
921-952 2.85e-06

Calcium-binding EGF-like domain;


:

Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 45.32  E-value: 2.85e-06
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1034627714   921 DYDECERKeDDCVPGTSCRNTLGSFTCSCEGG 952
Cdd:smart00179    1 DIDECASG-NPCQNGGTCVNTVGSYRCECPPG 31
SEA super family cl02507
SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed ...
820-907 3.16e-06

SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains. Recently a proteolytic activity has been shown for a SEA domain.


The actual alignment was detected with superfamily member smart00200:

Pssm-ID: 470595  Cd Length: 121  Bit Score: 47.41  E-value: 3.16e-06
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714   820 VRIKNVRYSESFRNASSQEYRDFLELFFRMVRGSLPATmcqHMDAGGVRMEVVSVTNGSIVVEFHLL----IIADVDVQE 895
Cdd:smart00200   14 VEGENLQYSPSLEDPSSEEYQELVRDVEKLLEQIYGKT---DLKPDFVGTEVIEFRNGSVVVDLGLLfnegVTNGQDVEE 90
                            90
                    ....*....|..
gi 1034627714   896 VSAAFLTAFQTV 907
Cdd:smart00200   91 DLLQVIKQAAYS 102
SEA pfam01390
SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed ...
294-370 5.01e-06

SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains. Recently a proteolytic activity has been shown for a SEA domain.


:

Pssm-ID: 460188  Cd Length: 100  Bit Score: 46.46  E-value: 5.01e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034627714  294 QVFEVTIKIVNHNLTEKLLNRSSVEYQDFSRQLLHEVESSFPPvvSDLyRSGKLRMQIVSL--QAGSVVVRLKLTVQDP 370
Cdd:pfam01390    1 QYYTGSFKITNLQYTPDLGNPSSQEFKSLSRRIESLLNELFRN--SSL-RKQYIKSHVLRLrpDGGSVVVDVVLVFRFP 76
EGF_CA smart00179
Calcium-binding EGF-like domain;
160-190 1.69e-03

Calcium-binding EGF-like domain;


:

Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 37.23  E-value: 1.69e-03
                            10        20        30
                    ....*....|....*....|....*....|.
gi 1034627714   160 DVNECfyEELNACSGRELCANLEGSYWCVCH 190
Cdd:smart00179    1 DIDEC--ASGNPCQNGGTCVNTVGSYRCECP 29
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
727-803 9.56e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 36.71  E-value: 9.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  727 PVSIGRIMVSNVTSTGFHLAWEADLAMDS-------TFQLTLTSMWSPAVVLETWNTSVTLSGLEPGVLHLVEIMAKACG 799
Cdd:cd00063      1 PSPPTNLRVTDVTSTSVTLSWTPPEDDGGpitgyvvEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGG 80

                   ....
gi 1034627714  800 KEGA 803
Cdd:cd00063     81 GESP 84
 
Name Accession Description Interval E-value
Zona_pellucida pfam00100
Zona pellucida-like domain;
1017-1257 6.04e-40

Zona pellucida-like domain;


Pssm-ID: 459673 [Multi-domain]  Cd Length: 254  Bit Score: 148.91  E-value: 6.04e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714 1017 CEIEKVVVAIQKRFLQQeSIPESSLYL---SHPSCNVSHSNGTH--VLLEAGWSECGTLMQSNMTNTVVRTTLRNDLSQE 1091
Cdd:pfam00100    1 CTPDTMTVSISKCLLVP-SGLLSSLSLlggLDPSCKPVSNTNGSpaVLFEFPLTGCGTTVQVNGTHIIYSNTLYSSTDLR 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714 1092 GIIHHLKILS--PIYCAFQNDLLTSSGFTLEWGVYTIIEDlhGAGNFVTEMQLFIGDS----PIPQNYSVSASDDVRIEV 1165
Cdd:pfam00100   80 SGIIRRTITRrlPFSCSYPRSSLVSLLVVAPPSPVPITVS--GSGVFLVSMDLYYDSSytspYSPYPVTVLLGDPLYVEV 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714 1166 GL-YRQKSNLKVVLTECWATPSSNARDPITFSFINNSCPVPNTYTN---VIENGNSNKAQFKLRIFSFINDSI--VYLHC 1239
Cdd:pfam00100  158 SLlSRTDPNLVLVLDNCWATPSPNPTSSPQYQLIVNGCPNDGDSTYpvsSLSNGPSHYVRFSFKAFRFVGSSIsqVYLHC 237
                          250
                   ....*....|....*...
gi 1034627714 1240 KLRVCmESPGATCKINCN 1257
Cdd:pfam00100  238 SVSVC-SSDSNSCGKSCS 254
ZP smart00241
Zona pellucida (ZP) domain; ZP proteins are responsible for sperm-adhesion fo the zona ...
1017-1257 1.12e-30

Zona pellucida (ZP) domain; ZP proteins are responsible for sperm-adhesion fo the zona pellucida. ZP domains are also present in multidomain transmembrane proteins such as glycoprotein GP2, uromodulin and TGF-beta receptor type III (betaglycan).


Pssm-ID: 214579  Cd Length: 252  Bit Score: 122.11  E-value: 1.12e-30
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  1017 CEIEKVVVAIQKRFLQQESIPESSLYLSHPSCNVSHS--NGTHVLLEAGWSECGTLMQSNMTNTVVRTTLRNDLSQEGII 1094
Cdd:smart00241    2 CGEDQMVVSVSTDLLFPGGINVKGLTLGDPSCRPQFTdaTSAFVSFEVPLNGCGTRRQVNPDGIVYSNTLVVSPFHPGFI 81
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  1095 HHLKILS-PIYCAFQNDLLTSSGFTLEWGVYTIIEDLhGAGNFVTEMQLFIGD--SPIPQNYSVSASDDVRIEVG-LYRQ 1170
Cdd:smart00241   82 TRDDRAAyHFQCFYPENEKVSLNLDVSTIPPTELSSV-SEGPLTCSYRLYKDDsfGSPYQSADYVLGDPVYHEWEcDGAD 160
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  1171 KSNLKVVLTECWATPSSNARDPITFSFINNSCPVPNTYTNVIE--NGNSNKAQFKLRIFSFINDSIVYLHCKLRVCMESP 1248
Cdd:smart00241  161 DPPLGLLVDNCYATPGPDPSSGPKYFIIDNGCPVDGYLDSTIPynSNPLHRARFSVKVFKFADRSLVYFHCQIRLCDKDD 240
                           250
                    ....*....|
gi 1034627714  1249 GATCK-INCN 1257
Cdd:smart00241  241 GSSCDgPACS 250
WAP pfam00095
WAP-type (Whey Acidic Protein) 'four-disulfide core'; WAP belongs to the group of Elafin or ...
13-53 1.19e-10

WAP-type (Whey Acidic Protein) 'four-disulfide core'; WAP belongs to the group of Elafin or elastase-specific inhibitors.


Pssm-ID: 459672 [Multi-domain]  Cd Length: 42  Bit Score: 57.82  E-value: 1.19e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 1034627714   13 RPGACPAEG-PEPSTSPCSLDIDCPGLEKCCPWSGGRYCMAP 53
Cdd:pfam00095    1 KPGCCPRLGaRGCCRSCCSSDDDCPGRQKCCSNGCGSVCVPP 42
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
458-679 3.63e-09

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 61.47  E-value: 3.63e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  458 GLSAATgvTVPGLGTGTAALGLENFTLSPSPGYPQGTPAAGQAWTPEPSPRRGGSNVVGYDRNNTGKGVEQEVP-STAPG 536
Cdd:pfam05109  447 GLPSST--HVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPnATSPT 524
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  537 LGMDQGSPSQVNPSQGSPSQGSLRQESTSQASPSQRSTSQGSPSQVNPSQRSTSHANSSQGSPSQGSpsqeSPSQGSTSQ 616
Cdd:pfam05109  525 PAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTPNAT----SPTVGETSP 600
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034627714  617 ASPSHRNTIGviGTTSSPKATGSTHSFPPGATDGPLALPGQLQGNSIMEPPSW-----PSPTEDPTGH 679
Cdd:pfam05109  601 QANTTNHTLG--GTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSIsetlsPSTSDNSTSH 666
EGF_CA pfam07645
Calcium-binding EGF domain;
403-433 5.51e-07

Calcium-binding EGF domain;


Pssm-ID: 429571  Cd Length: 32  Bit Score: 46.85  E-value: 5.51e-07
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1034627714  403 DWDECVDSAeHDCSPAAWCINLEGSYTCQCR 433
Cdd:pfam07645    1 DVDECATGT-HNCPANTVCVNTIGSFECRCP 30
EGF_CA smart00179
Calcium-binding EGF-like domain;
921-952 2.85e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 45.32  E-value: 2.85e-06
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1034627714   921 DYDECERKeDDCVPGTSCRNTLGSFTCSCEGG 952
Cdd:smart00179    1 DIDECASG-NPCQNGGTCVNTVGSYRCECPPG 31
SEA smart00200
Domain found in sea urchin sperm protein, enterokinase, agrin; Proposed function of regulating ...
820-907 3.16e-06

Domain found in sea urchin sperm protein, enterokinase, agrin; Proposed function of regulating or binding carbohydrate sidechains.


Pssm-ID: 214554  Cd Length: 121  Bit Score: 47.41  E-value: 3.16e-06
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714   820 VRIKNVRYSESFRNASSQEYRDFLELFFRMVRGSLPATmcqHMDAGGVRMEVVSVTNGSIVVEFHLL----IIADVDVQE 895
Cdd:smart00200   14 VEGENLQYSPSLEDPSSEEYQELVRDVEKLLEQIYGKT---DLKPDFVGTEVIEFRNGSVVVDLGLLfnegVTNGQDVEE 90
                            90
                    ....*....|..
gi 1034627714   896 VSAAFLTAFQTV 907
Cdd:smart00200   91 DLLQVIKQAAYS 102
SEA pfam01390
SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed ...
294-370 5.01e-06

SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains. Recently a proteolytic activity has been shown for a SEA domain.


Pssm-ID: 460188  Cd Length: 100  Bit Score: 46.46  E-value: 5.01e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034627714  294 QVFEVTIKIVNHNLTEKLLNRSSVEYQDFSRQLLHEVESSFPPvvSDLyRSGKLRMQIVSL--QAGSVVVRLKLTVQDP 370
Cdd:pfam01390    1 QYYTGSFKITNLQYTPDLGNPSSQEFKSLSRRIESLLNELFRN--SSL-RKQYIKSHVLRLrpDGGSVVVDVVLVFRFP 76
EGF_CA pfam07645
Calcium-binding EGF domain;
921-952 1.17e-05

Calcium-binding EGF domain;


Pssm-ID: 429571  Cd Length: 32  Bit Score: 43.38  E-value: 1.17e-05
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1034627714  921 DYDECERKEDDCVPGTSCRNTLGSFTCSCEGG 952
Cdd:pfam07645    1 DVDECATGTHNCPANTVCVNTIGSFECRCPDG 32
EGF_CA smart00179
Calcium-binding EGF-like domain;
405-433 1.44e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 43.00  E-value: 1.44e-05
                            10        20
                    ....*....|....*....|....*....
gi 1034627714   405 DECVDsaEHDCSPAAWCINLEGSYTCQCR 433
Cdd:smart00179    3 DECAS--GNPCQNGGTCVNTVGSYRCECP 29
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
921-952 2.65e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 42.24  E-value: 2.65e-05
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1034627714  921 DYDECERkEDDCVPGTSCRNTLGSFTCSCEGG 952
Cdd:cd00054      1 DIDECAS-GNPCQNGGTCVNTVGSYRCSCPPG 31
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
405-433 8.13e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 41.08  E-value: 8.13e-05
                           10        20
                   ....*....|....*....|....*....
gi 1034627714  405 DECVDsaEHDCSPAAWCINLEGSYTCQCR 433
Cdd:cd00054      3 DECAS--GNPCQNGGTCVNTVGSYRCSCP 29
WAP cd00199
whey acidic protein-type four-disulfide core domains. Members of the family include whey ...
13-53 1.18e-04

whey acidic protein-type four-disulfide core domains. Members of the family include whey acidic protein, elafin (elastase-specific inhibitor), caltrin-like protein (a calcium transport inhibitor) and other extracellular proteinase inhibitors. A group of proteins containing 8 characteristically-spaced cysteine residuesforming disulphide bonds, have been termed '4-disulphide core' proteins. Protease inhibition occurs by insertion of the inhibitory loop into the active site pocket and interference with the catalytic residues of the protease.


Pssm-ID: 238120 [Multi-domain]  Cd Length: 60  Bit Score: 41.28  E-value: 1.18e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 1034627714   13 RPGACPA-EGPEPSTSP--CSLDIDCPGLEKCCPWSGGRYCMAP 53
Cdd:cd00199     16 KPGRCPMvNPPSLGIPPnrCSSDSDCPGDKKCCENGCGKSCLTP 59
PHA03247 PHA03247
large tegument protein UL36; Provisional
486-727 1.56e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 1.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  486 PSPGYPQGTPAAG--QAWTPEPSPRRGGSNVVGYDRNnTGKGVEQEVPSTAPGLGMDQGSPSQVNPS------------Q 551
Cdd:PHA03247  2552 PPPLPPAAPPAAPdrSVPPPRPAPRPSEPAVTSRARR-PDAPPQSARPRAPVDDRGDPRGPAPPSPLppdthapdppppS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  552 GSPSQGSLRQESTSQASPSQRSTSQGSPSQVNPSQRSTSHANSSQGSpsqgspsqeSPSQGSTSQASPShrnTIGVIGTT 631
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQAS---------SPPQRPRRRAARP---TVGSLTSL 2698
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  632 SSPKATGSTHSFPPGATDG--PLALPGQLQGNSIMEPPSWPSPTEDPTGHFL-----WHATRSTRETLLNPTWLRNEDSG 704
Cdd:PHA03247  2699 ADPPPPPPTPEPAPHALVSatPLPPGPAAARQASPALPAAPAPPAVPAGPATpggpaRPARPPTTAGPPAPAPPAAPAAG 2778
                          250       260
                   ....*....|....*....|...
gi 1034627714  705 PSGSVDLPLTSTLTALKTPACVP 727
Cdd:PHA03247  2779 PPRRLTRPAVASLSESRESLPSP 2801
WAP smart00217
Four-disulfide core domains;
13-54 1.60e-04

Four-disulfide core domains;


Pssm-ID: 197580 [Multi-domain]  Cd Length: 47  Bit Score: 40.43  E-value: 1.60e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 1034627714    13 RPGACPAE-----GPEPSTSPCSLDIDCPGLEKCCPWSGGRYCMAPA 54
Cdd:smart00217    1 KPGSCPWPtiascPLGNPPNKCSSDSQCPGVKKCCFNGCGKSCLTPV 47
EGF_CA smart00179
Calcium-binding EGF-like domain;
160-190 1.69e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 37.23  E-value: 1.69e-03
                            10        20        30
                    ....*....|....*....|....*....|.
gi 1034627714   160 DVNECfyEELNACSGRELCANLEGSYWCVCH 190
Cdd:smart00179    1 DIDEC--ASGNPCQNGGTCVNTVGSYRCECP 29
EGF_CA pfam07645
Calcium-binding EGF domain;
160-189 2.09e-03

Calcium-binding EGF domain;


Pssm-ID: 429571  Cd Length: 32  Bit Score: 36.83  E-value: 2.09e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1034627714  160 DVNECFyEELNACSGRELCANLEGSYWCVC 189
Cdd:pfam07645    1 DVDECA-TGTHNCPANTVCVNTIGSFECRC 29
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
727-803 9.56e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 36.71  E-value: 9.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  727 PVSIGRIMVSNVTSTGFHLAWEADLAMDS-------TFQLTLTSMWSPAVVLETWNTSVTLSGLEPGVLHLVEIMAKACG 799
Cdd:cd00063      1 PSPPTNLRVTDVTSTSVTLSWTPPEDDGGpitgyvvEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGG 80

                   ....
gi 1034627714  800 KEGA 803
Cdd:cd00063     81 GESP 84
 
Name Accession Description Interval E-value
Zona_pellucida pfam00100
Zona pellucida-like domain;
1017-1257 6.04e-40

Zona pellucida-like domain;


Pssm-ID: 459673 [Multi-domain]  Cd Length: 254  Bit Score: 148.91  E-value: 6.04e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714 1017 CEIEKVVVAIQKRFLQQeSIPESSLYL---SHPSCNVSHSNGTH--VLLEAGWSECGTLMQSNMTNTVVRTTLRNDLSQE 1091
Cdd:pfam00100    1 CTPDTMTVSISKCLLVP-SGLLSSLSLlggLDPSCKPVSNTNGSpaVLFEFPLTGCGTTVQVNGTHIIYSNTLYSSTDLR 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714 1092 GIIHHLKILS--PIYCAFQNDLLTSSGFTLEWGVYTIIEDlhGAGNFVTEMQLFIGDS----PIPQNYSVSASDDVRIEV 1165
Cdd:pfam00100   80 SGIIRRTITRrlPFSCSYPRSSLVSLLVVAPPSPVPITVS--GSGVFLVSMDLYYDSSytspYSPYPVTVLLGDPLYVEV 157
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714 1166 GL-YRQKSNLKVVLTECWATPSSNARDPITFSFINNSCPVPNTYTN---VIENGNSNKAQFKLRIFSFINDSI--VYLHC 1239
Cdd:pfam00100  158 SLlSRTDPNLVLVLDNCWATPSPNPTSSPQYQLIVNGCPNDGDSTYpvsSLSNGPSHYVRFSFKAFRFVGSSIsqVYLHC 237
                          250
                   ....*....|....*...
gi 1034627714 1240 KLRVCmESPGATCKINCN 1257
Cdd:pfam00100  238 SVSVC-SSDSNSCGKSCS 254
ZP smart00241
Zona pellucida (ZP) domain; ZP proteins are responsible for sperm-adhesion fo the zona ...
1017-1257 1.12e-30

Zona pellucida (ZP) domain; ZP proteins are responsible for sperm-adhesion fo the zona pellucida. ZP domains are also present in multidomain transmembrane proteins such as glycoprotein GP2, uromodulin and TGF-beta receptor type III (betaglycan).


Pssm-ID: 214579  Cd Length: 252  Bit Score: 122.11  E-value: 1.12e-30
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  1017 CEIEKVVVAIQKRFLQQESIPESSLYLSHPSCNVSHS--NGTHVLLEAGWSECGTLMQSNMTNTVVRTTLRNDLSQEGII 1094
Cdd:smart00241    2 CGEDQMVVSVSTDLLFPGGINVKGLTLGDPSCRPQFTdaTSAFVSFEVPLNGCGTRRQVNPDGIVYSNTLVVSPFHPGFI 81
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  1095 HHLKILS-PIYCAFQNDLLTSSGFTLEWGVYTIIEDLhGAGNFVTEMQLFIGD--SPIPQNYSVSASDDVRIEVG-LYRQ 1170
Cdd:smart00241   82 TRDDRAAyHFQCFYPENEKVSLNLDVSTIPPTELSSV-SEGPLTCSYRLYKDDsfGSPYQSADYVLGDPVYHEWEcDGAD 160
                           170       180       190       200       210       220       230       240
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  1171 KSNLKVVLTECWATPSSNARDPITFSFINNSCPVPNTYTNVIE--NGNSNKAQFKLRIFSFINDSIVYLHCKLRVCMESP 1248
Cdd:smart00241  161 DPPLGLLVDNCYATPGPDPSSGPKYFIIDNGCPVDGYLDSTIPynSNPLHRARFSVKVFKFADRSLVYFHCQIRLCDKDD 240
                           250
                    ....*....|
gi 1034627714  1249 GATCK-INCN 1257
Cdd:smart00241  241 GSSCDgPACS 250
WAP pfam00095
WAP-type (Whey Acidic Protein) 'four-disulfide core'; WAP belongs to the group of Elafin or ...
13-53 1.19e-10

WAP-type (Whey Acidic Protein) 'four-disulfide core'; WAP belongs to the group of Elafin or elastase-specific inhibitors.


Pssm-ID: 459672 [Multi-domain]  Cd Length: 42  Bit Score: 57.82  E-value: 1.19e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 1034627714   13 RPGACPAEG-PEPSTSPCSLDIDCPGLEKCCPWSGGRYCMAP 53
Cdd:pfam00095    1 KPGCCPRLGaRGCCRSCCSSDDDCPGRQKCCSNGCGSVCVPP 42
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
458-679 3.63e-09

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 61.47  E-value: 3.63e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  458 GLSAATgvTVPGLGTGTAALGLENFTLSPSPGYPQGTPAAGQAWTPEPSPRRGGSNVVGYDRNNTGKGVEQEVP-STAPG 536
Cdd:pfam05109  447 GLPSST--HVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPnATSPT 524
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  537 LGMDQGSPSQVNPSQGSPSQGSLRQESTSQASPSQRSTSQGSPSQVNPSQRSTSHANSSQGSPSQGSpsqeSPSQGSTSQ 616
Cdd:pfam05109  525 PAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSPTSAVTTPTPNAT----SPTVGETSP 600
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1034627714  617 ASPSHRNTIGviGTTSSPKATGSTHSFPPGATDGPLALPGQLQGNSIMEPPSW-----PSPTEDPTGH 679
Cdd:pfam05109  601 QANTTNHTLG--GTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSIsetlsPSTSDNSTSH 666
EGF_CA pfam07645
Calcium-binding EGF domain;
403-433 5.51e-07

Calcium-binding EGF domain;


Pssm-ID: 429571  Cd Length: 32  Bit Score: 46.85  E-value: 5.51e-07
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1034627714  403 DWDECVDSAeHDCSPAAWCINLEGSYTCQCR 433
Cdd:pfam07645    1 DVDECATGT-HNCPANTVCVNTIGSFECRCP 30
EGF_CA smart00179
Calcium-binding EGF-like domain;
921-952 2.85e-06

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 45.32  E-value: 2.85e-06
                            10        20        30
                    ....*....|....*....|....*....|..
gi 1034627714   921 DYDECERKeDDCVPGTSCRNTLGSFTCSCEGG 952
Cdd:smart00179    1 DIDECASG-NPCQNGGTCVNTVGSYRCECPPG 31
SEA smart00200
Domain found in sea urchin sperm protein, enterokinase, agrin; Proposed function of regulating ...
820-907 3.16e-06

Domain found in sea urchin sperm protein, enterokinase, agrin; Proposed function of regulating or binding carbohydrate sidechains.


Pssm-ID: 214554  Cd Length: 121  Bit Score: 47.41  E-value: 3.16e-06
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714   820 VRIKNVRYSESFRNASSQEYRDFLELFFRMVRGSLPATmcqHMDAGGVRMEVVSVTNGSIVVEFHLL----IIADVDVQE 895
Cdd:smart00200   14 VEGENLQYSPSLEDPSSEEYQELVRDVEKLLEQIYGKT---DLKPDFVGTEVIEFRNGSVVVDLGLLfnegVTNGQDVEE 90
                            90
                    ....*....|..
gi 1034627714   896 VSAAFLTAFQTV 907
Cdd:smart00200   91 DLLQVIKQAAYS 102
SEA pfam01390
SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed ...
294-370 5.01e-06

SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains. Recently a proteolytic activity has been shown for a SEA domain.


Pssm-ID: 460188  Cd Length: 100  Bit Score: 46.46  E-value: 5.01e-06
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034627714  294 QVFEVTIKIVNHNLTEKLLNRSSVEYQDFSRQLLHEVESSFPPvvSDLyRSGKLRMQIVSL--QAGSVVVRLKLTVQDP 370
Cdd:pfam01390    1 QYYTGSFKITNLQYTPDLGNPSSQEFKSLSRRIESLLNELFRN--SSL-RKQYIKSHVLRLrpDGGSVVVDVVLVFRFP 76
EGF_CA pfam07645
Calcium-binding EGF domain;
921-952 1.17e-05

Calcium-binding EGF domain;


Pssm-ID: 429571  Cd Length: 32  Bit Score: 43.38  E-value: 1.17e-05
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1034627714  921 DYDECERKEDDCVPGTSCRNTLGSFTCSCEGG 952
Cdd:pfam07645    1 DVDECATGTHNCPANTVCVNTIGSFECRCPDG 32
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
435-690 1.24e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 49.91  E-value: 1.24e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  435 TRDATPSRAGRACEGDLVS-PMGGGLSAATGVTVPGLGTGTAALGlenfTLSPSPGYPQGTPAAGQAWTPEPSPRRGGSN 513
Cdd:pfam05109  531 TPNATSPTLGKTSPTSAVTtPTPNATSPTPAVTTPTPNATIPTLG----KTSPTSAVTTPTPNATSPTVGETSPQANTTN 606
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  514 vvgydrNNTGKGVEQEVPSTAPGLGMDQGSPSQVNPSQGSPSQGSLRQESTSQA---SPSQRSTSQ------GSPS---- 580
Cdd:pfam05109  607 ------HTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETlspSTSDNSTSHmplltsAHPTggen 680
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  581 --QVNPSQRSTSHANssqgspsqgsPSQESPSQGSTSQASPShrntiGVIGTTSSPKATGSTHSFPPGATDGPLALPGQL 658
Cdd:pfam05109  681 itQVTPASTSTHHVS----------TSSPAPRPGTTSQASGP-----GNSSTSTKPGEVNVTKGTPPKNATSPQAPSGQK 745
                          250       260       270
                   ....*....|....*....|....*....|..
gi 1034627714  659 QGNSIMEPPSWPSPTEDPTGHFLWHATRSTRE 690
Cdd:pfam05109  746 TAVPTVTSTGGKANSTTGGKHTTGHGARTSTE 777
EGF_CA smart00179
Calcium-binding EGF-like domain;
405-433 1.44e-05

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 43.00  E-value: 1.44e-05
                            10        20
                    ....*....|....*....|....*....
gi 1034627714   405 DECVDsaEHDCSPAAWCINLEGSYTCQCR 433
Cdd:smart00179    3 DECAS--GNPCQNGGTCVNTVGSYRCECP 29
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
410-659 2.28e-05

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 48.80  E-value: 2.28e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  410 SAEHDCSPAAWciNLEGSYTCQCRTTRDATPSRAGRACEGDLVSPMGGGLSAATGVTVPGLGTGTAALGlenfTLSPSPG 489
Cdd:pfam17823  166 SAPHAASPAPR--TAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVG----NSSPAAG 239
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  490 ypqGTPAAGQAWTPEpsprrggsnVVGydrnnTGKGVEQEVPSTAPGLGMDQGSPSQVNPSQGSPSQGSLRQESTSQASP 569
Cdd:pfam17823  240 ---TVTAAVGTVTPA---------ALA-----TLAAAAGTVASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQ 302
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  570 SQRSTSQGSPSQ--VNPSQRST-SHANSSQGSPSQGSPSQESPSQGSTSQAS---PShRNTIGVIGTTSSP--KATGSTH 641
Cdd:pfam17823  303 AQGPIIQVSTDQpvHNTAGEPTpSPSNTTLEPNTPKSVASTNLAVVTTTKAQakePS-ASPVPVLHTSMIPevEATSPTT 381
                          250
                   ....*....|....*...
gi 1034627714  642 SFPPGATDGPLALPGQLQ 659
Cdd:pfam17823  382 QPSPLLPTQGAAGPGILL 399
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
921-952 2.65e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 42.24  E-value: 2.65e-05
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1034627714  921 DYDECERkEDDCVPGTSCRNTLGSFTCSCEGG 952
Cdd:cd00054      1 DIDECAS-GNPCQNGGTCVNTVGSYRCSCPPG 31
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
405-433 8.13e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 41.08  E-value: 8.13e-05
                           10        20
                   ....*....|....*....|....*....
gi 1034627714  405 DECVDsaEHDCSPAAWCINLEGSYTCQCR 433
Cdd:cd00054      3 DECAS--GNPCQNGGTCVNTVGSYRCSCP 29
WAP cd00199
whey acidic protein-type four-disulfide core domains. Members of the family include whey ...
13-53 1.18e-04

whey acidic protein-type four-disulfide core domains. Members of the family include whey acidic protein, elafin (elastase-specific inhibitor), caltrin-like protein (a calcium transport inhibitor) and other extracellular proteinase inhibitors. A group of proteins containing 8 characteristically-spaced cysteine residuesforming disulphide bonds, have been termed '4-disulphide core' proteins. Protease inhibition occurs by insertion of the inhibitory loop into the active site pocket and interference with the catalytic residues of the protease.


Pssm-ID: 238120 [Multi-domain]  Cd Length: 60  Bit Score: 41.28  E-value: 1.18e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 1034627714   13 RPGACPA-EGPEPSTSP--CSLDIDCPGLEKCCPWSGGRYCMAP 53
Cdd:cd00199     16 KPGRCPMvNPPSLGIPPnrCSSDSDCPGDKKCCENGCGKSCLTP 59
PHA03247 PHA03247
large tegument protein UL36; Provisional
486-727 1.56e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 1.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  486 PSPGYPQGTPAAG--QAWTPEPSPRRGGSNVVGYDRNnTGKGVEQEVPSTAPGLGMDQGSPSQVNPS------------Q 551
Cdd:PHA03247  2552 PPPLPPAAPPAAPdrSVPPPRPAPRPSEPAVTSRARR-PDAPPQSARPRAPVDDRGDPRGPAPPSPLppdthapdppppS 2630
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  552 GSPSQGSLRQESTSQASPSQRSTSQGSPSQVNPSQRSTSHANSSQGSpsqgspsqeSPSQGSTSQASPShrnTIGVIGTT 631
Cdd:PHA03247  2631 PSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQAS---------SPPQRPRRRAARP---TVGSLTSL 2698
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  632 SSPKATGSTHSFPPGATDG--PLALPGQLQGNSIMEPPSWPSPTEDPTGHFL-----WHATRSTRETLLNPTWLRNEDSG 704
Cdd:PHA03247  2699 ADPPPPPPTPEPAPHALVSatPLPPGPAAARQASPALPAAPAPPAVPAGPATpggpaRPARPPTTAGPPAPAPPAAPAAG 2778
                          250       260
                   ....*....|....*....|...
gi 1034627714  705 PSGSVDLPLTSTLTALKTPACVP 727
Cdd:PHA03247  2779 PPRRLTRPAVASLSESRESLPSP 2801
WAP smart00217
Four-disulfide core domains;
13-54 1.60e-04

Four-disulfide core domains;


Pssm-ID: 197580 [Multi-domain]  Cd Length: 47  Bit Score: 40.43  E-value: 1.60e-04
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|....*..
gi 1034627714    13 RPGACPAE-----GPEPSTSPCSLDIDCPGLEKCCPWSGGRYCMAPA 54
Cdd:smart00217    1 KPGSCPWPtiascPLGNPPNKCSSDSQCPGVKKCCFNGCGKSCLTPV 47
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
418-633 3.10e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 45.06  E-value: 3.10e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  418 AAWCINLEGsytcQCRTTRDATPSRA-----------GRACEGDLVSPMGGGLSAATGVTVPGLGTGTAAlglenftLSP 486
Cdd:PRK14959   330 ACWQMTLEG----QRRVLTSLEPAMAlellllnlamlPRLMPVESLRPSGGGASAPSGSAAEGPASGGAA-------TIP 398
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  487 SPGY--PQGT-PAAG----QAWTPEPSPRRGGSNVVGYDrnntgkgveqEVPSTAPglgmDQGSPSQVNPSQGSPSQGSL 559
Cdd:PRK14959   399 TPGTqgPQGTaPAAGmtpsSAAPATPAPSAAPSPRVPWD----------DAPPAPP----RSGIPPRPAPRMPEASPVPG 464
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1034627714  560 RQESTSqaspsqrSTSQGSPSQVNPSQRSTSHANSSQGSPSQGSPSQESPSQGSTS-----QASPSHRNTIGVIGTTSS 633
Cdd:PRK14959   465 APDSVA-------SASDAPPTLGDPSDTAEHTPSGPRTWDGFLEFCQGRNGQGGRLatvlrQATPEHADGRLRLATMSS 536
PHA03378 PHA03378
EBNA-3B; Provisional
491-676 3.53e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 45.06  E-value: 3.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  491 PQGTPAAGQAWTPEPSPRRGGSNVVGYDRNNTGKGVEQEVPSTAPglgmdqGSPSQVNPSQGSPSQgslRQESTSQASPS 570
Cdd:PHA03378   727 PAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAP------GAPTPQPPPQAPPAP---QQRPRGAPTPQ 797
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  571 QRSTSQGSPSQVNPSQRSTSHANSSQGSPSQGSPSQESpsqGSTSQASPSHRNTIGVIGTTSSPKATGSTHS------FP 644
Cdd:PHA03378   798 PPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKR---GRPSLKKPAALERQAAAGPTPSPGSGTSDKIvqapvfYP 874
                          170       180       190
                   ....*....|....*....|....*....|...
gi 1034627714  645 PGATdgPLALPGQLQGNSIMEPPSWP-SPTEDP 676
Cdd:PHA03378   875 PVLQ--PIQVMRQLGSVRAAAASTVTqAPTEYT 905
EGF_CA smart00179
Calcium-binding EGF-like domain;
160-190 1.69e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 37.23  E-value: 1.69e-03
                            10        20        30
                    ....*....|....*....|....*....|.
gi 1034627714   160 DVNECfyEELNACSGRELCANLEGSYWCVCH 190
Cdd:smart00179    1 DIDEC--ASGNPCQNGGTCVNTVGSYRCECP 29
EGF_CA pfam07645
Calcium-binding EGF domain;
160-189 2.09e-03

Calcium-binding EGF domain;


Pssm-ID: 429571  Cd Length: 32  Bit Score: 36.83  E-value: 2.09e-03
                           10        20        30
                   ....*....|....*....|....*....|
gi 1034627714  160 DVNECFyEELNACSGRELCANLEGSYWCVC 189
Cdd:pfam07645    1 DVDECA-TGTHNCPANTVCVNTIGSFECRC 29
EGF_3 pfam12947
EGF domain; This family includes a variety of EGF-like domain homologs. This family includes ...
413-434 4.17e-03

EGF domain; This family includes a variety of EGF-like domain homologs. This family includes the C-terminal domain of the malaria parasite MSP1 protein.


Pssm-ID: 463759 [Multi-domain]  Cd Length: 36  Bit Score: 36.04  E-value: 4.17e-03
                           10        20
                   ....*....|....*....|..
gi 1034627714  413 HDCSPAAWCINLEGSYTCQCRT 434
Cdd:pfam12947    6 GGCHPNATCTNTGGSFTCTCND 27
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
727-803 9.56e-03

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 36.71  E-value: 9.56e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1034627714  727 PVSIGRIMVSNVTSTGFHLAWEADLAMDS-------TFQLTLTSMWSPAVVLETWNTSVTLSGLEPGVLHLVEIMAKACG 799
Cdd:cd00063      1 PSPPTNLRVTDVTSTSVTLSWTPPEDDGGpitgyvvEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGG 80

                   ....
gi 1034627714  800 KEGA 803
Cdd:cd00063     81 GESP 84
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH