Conserved domains on [gi|1381378387|ref|XP_024661135|]
protein HEG homolog 1 [Maylandia zebra]

List of domain hits

Name Accession Description Interval E-value
EGF_CA smart00179
Calcium-binding EGF-like domain;
696-736 1.07e-11

Calcium-binding EGF-like domain;



Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 60.34  E-value: 1.07e-11
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 1381378387   696 DIDECKQKQePCPKGSTCVNTNGSFSCECPLGFdlEDGRTC 736
Cdd:smart00179    1 DIDECASGN-PCQNGGTCVNTVGSYRCECPPGY--TDGRNC 38
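
The pairwise block above can be summarized numerically. The following is a minimal sketch, not part of the CD-Search output: it counts identical and gapped columns for the two aligned rows copied from the smart00179 hit. The function name `alignment_identity` is hypothetical, and the sketch assumes that lowercase letters mark residues not aligned to the model, so they are compared case-insensitively and only columns containing `-` count as gaps.

```python
def alignment_identity(query: str, subject: str) -> tuple[int, int, int]:
    """Return (identical columns, gap columns, columns with a residue in both rows)."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    identical = 0
    gaps = 0
    # Compare column by column, ignoring letter case (assumption: case only
    # marks aligned vs. unaligned residues and does not change the residue).
    for q, s in zip(query.upper(), subject.upper()):
        if q == "-" or s == "-":
            gaps += 1
        elif q == s:
            identical += 1
    paired = len(query) - gaps
    return identical, gaps, paired


# Rows copied from the smart00179 alignment (query residues 696-736).
query_row = "DIDECKQKQePCPKGSTCVNTNGSFSCECPLGFdlEDGRTC"
subject_row = "DIDECASGN-PCQNGGTCVNTVGSYRCECPPGY--TDGRNC"

identical, gaps, paired = alignment_identity(query_row, subject_row)
print(f"{identical}/{paired} identical, {gaps} gap columns")
# prints "24/38 identical, 3 gap columns"
```

Dividing by the number of paired columns rather than the full alignment length is one convention among several; the choice should be stated whenever an identity percentage is reported.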
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
659-693 2.63e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.



Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.94  E-value: 2.63e-06
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1381378387  659 NPCVS-NPCNNEGICMgDEPDAFRCSCQQGWTGRTC 693
Cdd:cd00054      3 DECASgNPCQNGGTCV-NTVGSYRCSCPPGYTGRNC 37
PTZ00382 super family cl20112
Variant-specific surface protein (VSP); Provisional
840-922 3.61e-04

Variant-specific surface protein (VSP); Provisional


The actual alignment was detected with superfamily member PTZ00382:

Pssm-ID: 173574 [Multi-domain]  Cd Length: 96  Bit Score: 40.70  E-value: 3.61e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1381378387  840 LCAAQKTQCDEERSSCTDTSGiaCCQCREGYYKHNPDDLYCLECGDGLKLENGTCVPCMFGFGGFNCGNFYKLIAVVVSP 919
Cdd:PTZ00382     1 MTPAVCTSCDSDKKPNKDGSG--CVLCSVGNCKSCVVDGVCGECNSGFSLDNGKCVSSGANRSGLSTGAIAGISVAVVAV 78

                   ...
gi 1381378387  920 AGG 922
Cdd:PTZ00382    79 VGG 81
ser_rich_anae_1 super family cl41472
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
297-451 3.78e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


The actual alignment was detected with superfamily member NF033849:

Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 41.53  E-value: 3.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1381378387  297 STWG--ISPKSTETENYTHSApSTGIT-GRDVTDSfkGVFSETGSPASSSGTQSITGHpNATEQQHTSTSEEPSDSTSSi 373
Cdd:NF033849   257 HSVGtsESHSVGTSQSQSHTT-GHGSTrGWSHTQS--TSESESTGQSSSVGTSESQSH-GTTEGTSTTDSSSHSQSSSY- 331
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1381378387  374 rvtepSTDQSQMSFSSTPPFTSPAGGPTNISGTEQGLGSSSGASTEFSTGDHSSSTQSGTEGLSSQTREGNVDLGLTA 451
Cdd:NF033849   332 -----NVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTS 404
 
Name Accession Description Interval E-value
EGF_CA smart00179
Calcium-binding EGF-like domain;
696-736 1.07e-11

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 60.34  E-value: 1.07e-11
                            10        20        30        40
                    ....*....|....*....|....*....|....*....|.
gi 1381378387   696 DIDECKQKQePCPKGSTCVNTNGSFSCECPLGFdlEDGRTC 736
Cdd:smart00179    1 DIDECASGN-PCQNGGTCVNTVGSYRCECPPGY--TDGRNC 38
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
696-736 7.25e-10

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 54.95  E-value: 7.25e-10
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|.
gi 1381378387  696 DIDECKQKQePCPKGSTCVNTNGSFSCECPLGFdleDGRTC 736
Cdd:cd00054      1 DIDECASGN-PCQNGGTCVNTVGSYRCSCPPGY---TGRNC 37
EGF_CA pfam07645
Calcium-binding EGF domain;
696-727 1.19e-08

Calcium-binding EGF domain;


Pssm-ID: 429571  Cd Length: 32  Bit Score: 51.47  E-value: 1.19e-08
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1381378387  696 DIDECKQKQEPCPKGSTCVNTNGSFSCECPLG 727
Cdd:pfam07645    1 DVDECATGTHNCPANTVCVNTIGSFECRCPDG 32
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
659-693 2.63e-06

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 44.94  E-value: 2.63e-06
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1381378387  659 NPCVS-NPCNNEGICMgDEPDAFRCSCQQGWTGRTC 693
Cdd:cd00054      3 DECASgNPCQNGGTCV-NTVGSYRCSCPPGYTGRNC 37
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
661-692 5.82e-06

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason for the discrepancy between Swiss-Prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 43.91  E-value: 5.82e-06
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1381378387  661 CVSNPCNNEGICMgDEPDAFRCSCQQGWTGRT 692
Cdd:pfam00008    1 CAPNPCSNGGTCV-DTPGGYTCICPEGYTGKR 31
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
699-736 1.74e-05

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 42.46  E-value: 1.74e-05
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 1381378387  699 ECKQKQePCPKGSTCVNTNGSFSCECPLGFDLEdgRTC 736
Cdd:cd00053      1 ECAASN-PCSNGGTCVNTPGSYRCVCPPGYTGD--RSC 35
PTZ00382 PTZ00382
Variant-specific surface protein (VSP); Provisional
840-922 3.61e-04

Variant-specific surface protein (VSP); Provisional


Pssm-ID: 173574 [Multi-domain]  Cd Length: 96  Bit Score: 40.70  E-value: 3.61e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1381378387  840 LCAAQKTQCDEERSSCTDTSGiaCCQCREGYYKHNPDDLYCLECGDGLKLENGTCVPCMFGFGGFNCGNFYKLIAVVVSP 919
Cdd:PTZ00382     1 MTPAVCTSCDSDKKPNKDGSG--CVLCSVGNCKSCVVDGVCGECNSGFSLDNGKCVSSGANRSGLSTGAIAGISVAVVAV 78

                   ...
gi 1381378387  920 AGG 922
Cdd:PTZ00382    79 VGG 81
EGF smart00181
Epidermal growth factor-like domain;
699-737 1.28e-03

Epidermal growth factor-like domain;


Pssm-ID: 214544  Cd Length: 35  Bit Score: 37.50  E-value: 1.28e-03
                            10        20        30
                    ....*....|....*....|....*....|....*....
gi 1381378387   699 ECKQKQePCPKGsTCVNTNGSFSCECPLGFdlEDGRTCS 737
Cdd:smart00181    1 ECASGG-PCSNG-TCINTPGSYTCSCPPGY--TGDKRCE 35
EGF_CA smart00179
Calcium-binding EGF-like domain;
659-693 1.50e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 37.23  E-value: 1.50e-03
                            10        20        30
                    ....*....|....*....|....*....|....*..
gi 1381378387   659 NPCVS-NPCNNEGICMgDEPDAFRCSCQQGWT-GRTC 693
Cdd:smart00179    3 DECASgNPCQNGGTCV-NTVGSYRCECPPGYTdGRNC 38
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
660-693 1.54e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 37.07  E-value: 1.54e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1381378387  660 PC-VSNPCNNEGICMgDEPDAFRCSCQQGWTG-RTC 693
Cdd:cd00053      1 ECaASNPCSNGGTCV-NTPGSYRCVCPPGYTGdRSC 35
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
297-451 3.78e-03

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 41.53  E-value: 3.78e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1381378387  297 STWG--ISPKSTETENYTHSApSTGIT-GRDVTDSfkGVFSETGSPASSSGTQSITGHpNATEQQHTSTSEEPSDSTSSi 373
Cdd:NF033849   257 HSVGtsESHSVGTSQSQSHTT-GHGSTrGWSHTQS--TSESESTGQSSSVGTSESQSH-GTTEGTSTTDSSSHSQSSSY- 331
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1381378387  374 rvtepSTDQSQMSFSSTPPFTSPAGGPTNISGTEQGLGSSSGASTEFSTGDHSSSTQSGTEGLSSQTREGNVDLGLTA 451
Cdd:NF033849   332 -----NVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGFSGGIAGGGVTS 404
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
707-728 5.23e-03

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteristic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neuregulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin and integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 35.39  E-value: 5.23e-03
                           10        20
                   ....*....|....*....|..
gi 1381378387  707 CPKGSTCVNTNGSFSCECPLGF 728
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
Nucleoporin_FG2 pfam15967
Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of ...
298-450 6.01e-03

Nucleoporin FG repeated region; Nucleoporin_FG2, or nucleoporin p58/p45, is a family of chordate nucleoporins. The proteins carry many repeats of the FG sequence motif.


Pssm-ID: 435043 [Multi-domain]  Cd Length: 586  Bit Score: 40.42  E-value: 6.01e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1381378387  298 TWGISPKSTETENYTHSAPSTGITGRDVTDSFKgvFSETGSPASSSGTQSIT-GHPNATEQQHTSTS---EEPSDSTSSI 373
Cdd:pfam15967   40 TLGAAPAATATTTTATLGLGGGLFGQKPATGFT--FGTPASSTAATGPTGLTlGTPAATTAASTGFSlgfNKPAASATPF 117
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1381378387  374 RVTEPSTDQSQMSFSSTPPFTSPAGGPTnisGTEQGLGSSSGASTEFSTGDHSSSTQSGTEGL----SSQTREGNVDLGL 449
Cdd:pfam15967  118 SLPASSTSGGGLSLGSVLTSTAAQQGAT---GFTLNLGGTPATTTAVSTGLSLGSTLTSLGGSlfqnTNSTGLGQTTLGL 194

                   .
gi 1381378387  450 T 450
Cdd:pfam15967  195 T 195
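
Several of the hits listed above cover the same stretch of the query (for example, multiple EGF-type models at 696-736). A simple way to see how many distinct regions are annotated is to merge overlapping intervals. The sketch below is an illustration under stated assumptions: `merge_intervals` is a hypothetical helper, the intervals are copied from the Interval column of the list above, and only true overlaps are merged, so regions that merely sit close together stay separate.

```python
def merge_intervals(intervals: list[tuple[int, int]]) -> list[tuple[int, int]]:
    """Merge 1-based inclusive (start, end) intervals that overlap."""
    merged: list[tuple[int, int]] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous region: extend it if this hit reaches further.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


# Intervals copied from the hit list above, in listed order.
hit_intervals = [
    (696, 736), (696, 736), (696, 727), (659, 693), (661, 692),
    (699, 736), (840, 922), (699, 737), (659, 693), (660, 693),
    (297, 451), (707, 728), (298, 450),
]

regions = merge_intervals(hit_intervals)
print(regions)
# prints [(297, 451), (659, 693), (696, 737), (840, 922)]
```

Under these assumptions the thirteen hits collapse to four regions: a serine-rich stretch, two adjacent EGF-like regions, and the region matched by PTZ00382.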
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
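
The E-value threshold of 0.01 determines which hits appear in the report. As a rough illustration of how a stricter cutoff would change the list, the sketch below filters the reported hits by E-value. The tuples are copied from the hit list on this page; the helper name `filter_hits` and the use of `<=` for the comparison are assumptions, not a description of how CD-Search applies its threshold internally.

```python
# (accession, interval, E-value) copied from the domain hits reported on this page.
hits = [
    ("smart00179", "696-736", 1.07e-11),
    ("cd00054", "696-736", 7.25e-10),
    ("pfam07645", "696-727", 1.19e-08),
    ("cd00054", "659-693", 2.63e-06),
    ("pfam00008", "661-692", 5.82e-06),
    ("cd00053", "699-736", 1.74e-05),
    ("PTZ00382", "840-922", 3.61e-04),
    ("smart00181", "699-737", 1.28e-03),
    ("smart00179", "659-693", 1.50e-03),
    ("cd00053", "660-693", 1.54e-03),
    ("NF033849", "297-451", 3.78e-03),
    ("pfam12661", "707-728", 5.23e-03),
    ("pfam15967", "298-450", 6.01e-03),
]


def filter_hits(hits, max_evalue):
    """Keep hits whose E-value does not exceed max_evalue (assumed inclusive)."""
    return [hit for hit in hits if hit[2] <= max_evalue]


default_hits = filter_hits(hits, 0.01)  # the threshold stated in the parameters
strict_hits = filter_hits(hits, 1e-3)   # a hypothetical stricter cutoff
print(len(default_hits), len(strict_hits))
# prints "13 7"
```

With the stricter cutoff, the low-complexity matches (NF033849, pfam15967) and the weaker secondary EGF hits drop out, while the strongest EGF_CA hits and PTZ00382 remain.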
