Conserved domains on  [gi|1808670485|gb|KAF2369111|]

EGF-like domain [Trinorchestia longiramus]

List of domain hits

Name Accession Description Interval E-value
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
33-112 3.55e-30

Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 114.37  E-value: 3.55e-30
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485    33 ATDLDSGSNGRLIYKMAGGNEGDRFAIDPLTGWITVQKVLDREKEAKYSLEAAATDSGSPPRTGSALITVEITDANDNPP 112
Cdd:smart00112    2 ATDADSGENGKVTYSILSGNDDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTITVLDVNDNAP 81
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
553-702 1.61e-29

Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 115.21  E-value: 1.61e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485  553 PVSFDGKGFVQYNIRSSIVDHFDFSTWFRTTHPSGNLLFISG--RIDYCILELHEGRVRYRWELGSGEGLTSVSSlPLND 630
Cdd:cd00110      1 GVSFSGSSYVRLPTLPAPRTRLSISFSFRTTSPNGLLLYAGSqnGGDFLALELEDGRLVLRYDLGSGSLVLSSKT-PLND 79
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1808670485  631 SSWHHVALSHNRRNARIVVDGKHAVSGISPGNNEKLNLEsSQIYLG---AEVKPWYydEYTSRNLVGCVDNPHFN 702
Cdd:cd00110     80 GQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLD-GPLYLGglpEDLKSPG--LPVSPGFVGCIRDLKVN 151
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.
120-212 7.49e-14

Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 68.49  E-value: 7.49e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485  120 TLFIQEDRSPGQRLVRLKVTDADGnsgGNAGPFAFELLSGNSRGDFRI-TQNGDLLTAVSFAGRGLPVYRLQVRVHDNGR 198
Cdd:cd11304      3 EVSVPENAPPGTVVLTVSATDPDS---GENGEVTYSIVSGNEDGLFSIdPSTGEITTAKPLDREEQSSYTLTVTATDGGG 79
                           90
                   ....*....|....
gi 1808670485  199 PHLHTDIWLSVQVV 212
Cdd:cd11304     80 PPLSSTATVTITVL 93
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.
237-315 2.34e-11

Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 61.56  E-value: 2.34e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485  237 GGSLGIIRASDQD--PHDQLTYEVSyNNSNSELFKIDFETGELYVVKPL---SEGEYSLDVTVSD-GKFKSSGVVKILVR 310
Cdd:cd11304     13 GTVVLTVSATDPDsgENGEVTYSIV-SGNEDGLFSIDPSTGEITTAKPLdreEQSSYTLTVTATDgGGPPLSSTATVTIT 91

                   ....*
gi 1808670485  311 sVEDE 315
Cdd:cd11304     92 -VLDV 95
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
846-881 6.87e-10

Pssm-ID: 238011  Cd Length: 38  Bit Score: 55.34  E-value: 6.87e-10
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1808670485  846 DINECDEL-PCEQGGTCINTYGGFNCICSPDFAGQFC 881
Cdd:cd00054      1 DIDECASGnPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
777-807 1.05e-05

Pssm-ID: 394967  Cd Length: 31  Bit Score: 43.53  E-value: 1.05e-05
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1808670485  777 CASSPCLNNGICIRDGYSYKCQCPAHVSGSR 807
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
739-771 3.60e-05

Pssm-ID: 238011  Cd Length: 38  Bit Score: 41.85  E-value: 3.60e-05
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1808670485  739 CSSL-PCLNGAKCSEEDGAYKCHCRSRFKGKQCE 771
Cdd:cd00054      5 CASGnPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
813-842 1.00e-04

Pssm-ID: 394967  Cd Length: 31  Bit Score: 40.44  E-value: 1.00e-04
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1808670485  813 CSPNPCLNKGTCEESSTGPICRCQ-GFKGIY 842
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPeGYTGKR 31
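
Each hit above carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". As a minimal sketch, such a line could be parsed in Python as shown below. The names PssmSummary and parse_pssm_line are hypothetical helpers introduced here for illustration and are not part of any NCBI tool; the regular expression assumes only the layout seen on this page.

```python
import re
from dataclasses import dataclass


@dataclass
class PssmSummary:
    """Fields of one 'Pssm-ID: ...' summary line (hypothetical container)."""
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Assumed layout, taken from the lines on this page:
# Pssm-ID: <int> [Multi-domain]  Cd Length: <int>  Bit Score: <float>  E-value: <float>
_PSSM_LINE = re.compile(
    r"Pssm-ID:\s*(?P<id>\d+)\s*(?P<multi>\[Multi-domain\])?\s*"
    r"Cd Length:\s*(?P<length>\d+)\s*"
    r"Bit Score:\s*(?P<bits>[\d.]+)\s*"
    r"E-value:\s*(?P<evalue>[\d.eE+-]+)"
)


def parse_pssm_line(line: str) -> PssmSummary:
    """Parse one summary line; raise ValueError if the layout differs."""
    match = _PSSM_LINE.search(line)
    if match is None:
        raise ValueError(f"not a Pssm-ID summary line: {line!r}")
    return PssmSummary(
        pssm_id=int(match["id"]),
        multi_domain=match["multi"] is not None,
        cd_length=int(match["length"]),
        bit_score=float(match["bits"]),
        e_value=float(match["evalue"]),
    )


if __name__ == "__main__":
    line = "Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 114.37  E-value: 3.55e-30"
    print(parse_pssm_line(line))
```

Sorting the parsed records by e_value would reproduce the ordering used in the longer hit list that follows.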
 
Name Accession Description Interval E-value
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
33-112 3.55e-30

Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 114.37  E-value: 3.55e-30
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485    33 ATDLDSGSNGRLIYKMAGGNEGDRFAIDPLTGWITVQKVLDREKEAKYSLEAAATDSGSPPRTGSALITVEITDANDNPP 112
Cdd:smart00112    2 ATDADSGENGKVTYSILSGNDDGLFSIDPETGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTITVLDVNDNAP 81
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.
29-110 3.62e-30

Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 115.10  E-value: 3.62e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485   29 FNVMATDLDSGSNGRLIYKMAGGNEGDRFAIDPLTGWITVQKVLDREKEAKYSLEAAATDSGSPPRTGSALITVEITDAN 108
Cdd:cd11304     17 LTVSATDPDSGENGEVTYSIVSGNEDGLFSIDPSTGEITTAKPLDREEQSSYTLTVTATDGGGPPLSSTATVTITVLDVN 96

                   ..
gi 1808670485  109 DN 110
Cdd:cd11304     97 DN 98
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.
553-702 1.61e-29

Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 115.21  E-value: 1.61e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485  553 PVSFDGKGFVQYNIRSSIVDHFDFSTWFRTTHPSGNLLFISG--RIDYCILELHEGRVRYRWELGSGEGLTSVSSlPLND 630
Cdd:cd00110      1 GVSFSGSSYVRLPTLPAPRTRLSISFSFRTTSPNGLLLYAGSqnGGDFLALELEDGRLVLRYDLGSGSLVLSSKT-PLND 79
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1808670485  631 SSWHHVALSHNRRNARIVVDGKHAVSGISPGNNEKLNLEsSQIYLG---AEVKPWYydEYTSRNLVGCVDNPHFN 702
Cdd:cd00110     80 GQWHSVSVERNGRSVTLSVDGERVVESGSPGGSALLNLD-GPLYLGglpEDLKSPG--LPVSPGFVGCIRDLKVN 151
LamG smart00282
Laminin G domain;
575-702 1.25e-27

Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 108.97  E-value: 1.25e-27
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485   575 DFSTWFRTTHPSGNLLFISG--RIDYCILELHEGRVRYRWELGSGEGLTSVSSLPLNDSSWHHVALSHNRRNARIVVDGK 652
Cdd:smart00282    1 SISFSFRTTSPNGLLLYAGSkgGGDYLALELRDGRLVLRYDLGSGPARLTSDPTPLNDGQWHRVAVERNGRSVTLSVDGG 80
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|...
gi 1808670485   653 HAVSGISPGNNEKLNLESSqIYLG---AEVKPWYydEYTSRNLVGCVDNPHFN 702
Cdd:smart00282   81 NRVSGESPGGLTILNLDGP-LYLGglpEDLKLPP--LPVTPGFRGCIRNLKVN 130
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.
580-702 8.84e-27

Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 106.35  E-value: 8.84e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485  580 FRTTHPSGNLLFISGRI-DYCILELHEGRVRYRWELGSGEGLTSVSSLPLNDSSWHHVALSHNRRNARIVVDGKHAVSGI 658
Cdd:pfam02210    1 FRTRQPNGLLLYAGGGGsDFLALELVNGRLVLRYDLGSGPESLLSSGKNLNDGQWHSVRVERNGNTLTLSVDGQTVVSSL 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 1808670485  659 SPGNNEKLNLESSqIYLGA--EVKPWYYdEYTSRNLVGCVDNPHFN 702
Cdd:pfam02210   81 PPGESLLLNLNGP-LYLGGlpPLLLLPA-LPVRAGFVGCIRDVRVN 124
Cadherin pfam00028
Cadherin domain;
11-105 5.37e-25

Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 100.07  E-value: 5.37e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485   11 QAAIDENptcttrELSKTF--NVMATDLDSGSNGRLIYKMAGGNEGDRFAIDPLTGWITVQKVLDREKEAKYSLEAAATD 88
Cdd:pfam00028    2 SASVPEN------APVGTEvlTVTATDPDLGPNGRIFYSILGGGPGGNFRIDPDTGDISTTKPLDRESIGEYELTVEATD 75
                           90
                   ....*....|....*..
gi 1808670485   89 SGSPPRTGSALITVEIT 105
Cdd:pfam00028   76 SGGPPLSSTATVTITVL 92
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.
120-212 7.49e-14

Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 68.49  E-value: 7.49e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485  120 TLFIQEDRSPGQRLVRLKVTDADGnsgGNAGPFAFELLSGNSRGDFRI-TQNGDLLTAVSFAGRGLPVYRLQVRVHDNGR 198
Cdd:cd11304      3 EVSVPENAPPGTVVLTVSATDPDS---GENGEVTYSIVSGNEDGLFSIdPSTGEITTAKPLDREEQSSYTLTVTATDGGG 79
                           90
                   ....*....|....
gi 1808670485  199 PHLHTDIWLSVQVV 212
Cdd:cd11304     80 PPLSSTATVTITVL 93
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.
237-315 2.34e-11

Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 61.56  E-value: 2.34e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485  237 GGSLGIIRASDQD--PHDQLTYEVSyNNSNSELFKIDFETGELYVVKPL---SEGEYSLDVTVSD-GKFKSSGVVKILVR 310
Cdd:cd11304     13 GTVVLTVSATDPDsgENGEVTYSIV-SGNEDGLFSIDPSTGEITTAKPLdreEQSSYTLTVTATDgGGPPLSSTATVTIT 91

                   ....*
gi 1808670485  311 sVEDE 315
Cdd:cd11304     92 -VLDV 95
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
846-881 6.87e-10

Pssm-ID: 238011  Cd Length: 38  Bit Score: 55.34  E-value: 6.87e-10
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1808670485  846 DINECDEL-PCEQGGTCINTYGGFNCICSPDFAGQFC 881
Cdd:cd00054      1 DIDECASGnPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
EGF_CA smart00179
Calcium-binding EGF-like domain;
846-881 3.17e-09

Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 53.40  E-value: 3.17e-09
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1808670485   846 DINECDEL-PCEQGGTCINTYGGFNCICSPDF-AGQFC 881
Cdd:smart00179    1 DIDECASGnPCQNGGTCVNTVGSYRCECPPGYtDGRNC 38
EGF_CA pfam07645
Calcium-binding EGF domain;
846-874 2.52e-07

Pssm-ID: 429571  Cd Length: 32  Bit Score: 48.00  E-value: 2.52e-07
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1808670485  846 DINECDELP--CEQGGTCINTYGGFNCICSP 874
Cdd:pfam07645    1 DVDECATGThnCPANTVCVNTIGSFECRCPD 31
Cadherin pfam00028
Cadherin domain;
123-211 3.20e-07

Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 49.61  E-value: 3.20e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485  123 IQEDRSPGQRLVRLKVTDADgnSGGNAGPFaFELLSGNSRGDFRI-TQNGDLLTAvsfagRGL-----PVYRLQVRVHDN 196
Cdd:pfam00028    5 VPENAPVGTEVLTVTATDPD--LGPNGRIF-YSILGGGPGGNFRIdPDTGDISTT-----KPLdresiGEYELTVEATDS 76
                           90
                   ....*....|....*
gi 1808670485  197 GRPHLHTDIWLSVQV 211
Cdd:pfam00028   77 GGPPLSSTATVTITV 91
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
244-315 5.59e-07

Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 48.50  E-value: 5.59e-07
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1808670485   244 RASDQD--PHDQLTYEVSYNNSNSeLFKIDFETGELYVVKPL---SEGEYSLDVTVSD-GKFKSSGVVKILVRsVEDE 315
Cdd:smart00112    1 SATDADsgENGKVTYSILSGNDDG-LFSIDPETGEITTTKPLdreEQPEYTLTVEATDgGGPPLSSTATVTIT-VLDV 76
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
777-807 1.05e-05

Pssm-ID: 394967  Cd Length: 31  Bit Score: 43.53  E-value: 1.05e-05
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1808670485  777 CASSPCLNNGICIRDGYSYKCQCPAHVSGSR 807
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.
137-212 1.80e-05

Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 44.26  E-value: 1.80e-05
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1808670485   137 KVTDADgnSGGNaGPFAFELLSGNSRGDFRITQN-GDLLTAVSFAGRGLPVYRLQVRVHDNGRPHLHTDIWLSVQVV 212
Cdd:smart00112    1 SATDAD--SGEN-GKVTYSILSGNDDGLFSIDPEtGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTITVL 74
Cadherin pfam00028
Cadherin domain;
237-297 1.85e-05

Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 44.60  E-value: 1.85e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1808670485  237 GGSLGIIRASDQD--PHDQLTYEVSYNNSNsELFKIDFETGELYVVKPL---SEGEYSLDVTVSDG 297
Cdd:pfam00028   12 GTEVLTVTATDPDlgPNGRIFYSILGGGPG-GNFRIDPDTGDISTTKPLdreSIGEYELTVEATDS 76
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
739-771 3.60e-05

Pssm-ID: 238011  Cd Length: 38  Bit Score: 41.85  E-value: 3.60e-05
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1808670485  739 CSSL-PCLNGAKCSEEDGAYKCHCRSRFKGKQCE 771
Cdd:cd00054      5 CASGnPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
739-769 6.39e-05

Pssm-ID: 394967  Cd Length: 31  Bit Score: 41.21  E-value: 6.39e-05
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1808670485  739 CSSLPCLNGAKCSEEDGAYKCHCRSRFKGKQ 769
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.
813-842 1.00e-04

Pssm-ID: 394967  Cd Length: 31  Bit Score: 40.44  E-value: 1.00e-04
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1808670485  813 CSPNPCLNKGTCEESSTGPICRCQ-GFKGIY 842
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPeGYTGKR 31
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
773-809 6.60e-04

Pssm-ID: 238011  Cd Length: 38  Bit Score: 38.39  E-value: 6.60e-04
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1808670485  773 DTEPCAS-SPCLNNGICI-RDGySYKCQCPAHVSGSRCQ 809
Cdd:cd00054      1 DIDECASgNPCQNGGTCVnTVG-SYRCSCPPGYTGRNCE 38
MJ1470 COG5306
Uncharacterized conserved protein MJ1470, contains DUF2341 domain, predicted component of type IV pili-like system [General function prediction only];
551-651 5.04e-03

Pssm-ID: 444105 [Multi-domain]  Cd Length: 529  Bit Score: 41.04  E-value: 5.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485  551 SSPVSFDGKGFVQYNIRSSIVDH----FDFSTWFRTTHPSGNLLFISGRIDYCILElhEGRVRYRWelGSGEGLTSVSSL 626
Cdd:COG5306    172 GSGAQFDGTSPLVVPASPSLALDagggFTFSAWIKPAQLDGNAVLYSRRDGANGLD--NGAPFVEV--GGAGGTRSAAGA 247
                           90       100
                   ....*....|....*....|....*
gi 1808670485  627 PLNDSSWHHVALSHNRRNARIVVDG 651
Cdd:COG5306    248 PLAAGTWHHLAVVADAGKVTLYVNG 272
EGF_CA smart00179
Calcium-binding EGF-like domain;
777-809 6.52e-03

Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 35.69  E-value: 6.52e-03
                            10        20        30
                    ....*....|....*....|....*....|....*.
gi 1808670485   777 CAS-SPCLNNGICI-RDGySYKCQCPA-HVSGSRCQ 809
Cdd:smart00179    5 CASgNPCQNGGTCVnTVG-SYRCECPPgYTDGRNCE 39
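
The pairwise alignments listed above can be summarised by counting identical residues. The Python sketch below does this for the EGF_CA (cd00054) hit at residues 846-881. It assumes that "-" marks a gap and that lowercase letters mark unaligned residues, and it skips both kinds of column; that reading of the display is an assumption, not something stated on this page. The function name alignment_identity is hypothetical.

```python
from typing import Tuple


def alignment_identity(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical, aligned) column counts for two aligned rows.

    Columns where either row has a gap ('-') or a lowercase (assumed
    unaligned) residue are skipped and count toward neither total.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # gap or unaligned column
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the EGF_CA cd00054 hit (query residues 846-881).
QUERY = "DINECDEL-PCEQGGTCINTYGGFNCICSPDFAGQFC"
SUBJECT = "DIDECASGnPCQNGGTCVNTVGSYRCSCPPGYTGRNC"

if __name__ == "__main__":
    identical, aligned = alignment_identity(QUERY, SUBJECT)
    # Prints: 18/36 identical (50.0%)
    print(f"{identical}/{aligned} identical ({identical / aligned:.1%})")
```

Identity computed this way is only a rough descriptor; the bit score and E-value reported with each hit come from the position-specific scoring matrix, not from residue identity.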
 
gi 1808670485  659 SPGNNEKLNLESSqIYLGA--EVKPWYYdEYTSRNLVGCVDNPHFN 702
Cdd:pfam02210   81 PPGESLLLNLNGP-LYLGGlpPLLLLPA-LPVRAGFVGCIRDVRVN 124
Cadherin pfam00028
Cadherin domain;
11-105 5.37e-25

Cadherin domain;


Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 100.07  E-value: 5.37e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485   11 QAAIDENptcttrELSKTF--NVMATDLDSGSNGRLIYKMAGGNEGDRFAIDPLTGWITVQKVLDREKEAKYSLEAAATD 88
Cdd:pfam00028    2 SASVPEN------APVGTEvlTVTATDPDLGPNGRIFYSILGGGPGGNFRIDPDTGDISTTKPLDRESIGEYELTVEATD 75
                           90
                   ....*....|....*..
gi 1808670485   89 SGSPPRTGSALITVEIT 105
Cdd:pfam00028   76 SGGPPLSSTATVTITVL 92
Laminin_G_1 pfam00054
Laminin G domain;
580-706 6.26e-20

Laminin G domain;


Pssm-ID: 395008 [Multi-domain]  Cd Length: 131  Bit Score: 86.99  E-value: 6.26e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485  580 FRTTHPSGnLLFISG---RIDYCILELHEGRVRYRWELGSGEGLTsVSSLPLNDSSWHHVALSHNRRNARIVVDGKHAVS 656
Cdd:pfam00054    1 FRTTEPSG-LLLYNGtqtERDFLALELRDGRLEVSYDLGSGAAVV-RSGDKLNDGKWHSVELERNGRSGTLSVDGEARPT 78
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1808670485  657 GISP-GNNEKLNLEsSQIYLG----AEVKPWyyDEYTSRNLVGCVDNPHFNKAAL 706
Cdd:pfam00054   79 GESPlGATTDLDVD-GPLYVGglpsLGVKKR--RLAISPSFDGCIRDVIVNGKPL 130
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
120-212 7.49e-14

Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.


Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 68.49  E-value: 7.49e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485  120 TLFIQEDRSPGQRLVRLKVTDADGnsgGNAGPFAFELLSGNSRGDFRI-TQNGDLLTAVSFAGRGLPVYRLQVRVHDNGR 198
Cdd:cd11304      3 EVSVPENAPPGTVVLTVSATDPDS---GENGEVTYSIVSGNEDGLFSIdPSTGEITTAKPLDREEQSSYTLTVTATDGGG 79
                           90
                   ....*....|....
gi 1808670485  199 PHLHTDIWLSVQVV 212
Cdd:cd11304     80 PPLSSTATVTITVL 93
Cadherin_repeat cd11304
Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
237-315 2.34e-11

Cadherin tandem repeat domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers.


Pssm-ID: 206637 [Multi-domain]  Cd Length: 98  Bit Score: 61.56  E-value: 2.34e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485  237 GGSLGIIRASDQD--PHDQLTYEVSyNNSNSELFKIDFETGELYVVKPL---SEGEYSLDVTVSD-GKFKSSGVVKILVR 310
Cdd:cd11304     13 GTVVLTVSATDPDsgENGEVTYSIV-SGNEDGLFSIDPSTGEITTAKPLdreEQSSYTLTVTATDgGGPPLSSTATVTIT 91

                   ....*
gi 1808670485  311 sVEDE 315
Cdd:cd11304     92 -VLDV 95
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
846-881 6.87e-10

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 55.34  E-value: 6.87e-10
                           10        20        30
                   ....*....|....*....|....*....|....*..
gi 1808670485  846 DINECDEL-PCEQGGTCINTYGGFNCICSPDFAGQFC 881
Cdd:cd00054      1 DIDECASGnPCQNGGTCVNTVGSYRCSCPPGYTGRNC 37
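
The description above mentions six conserved core cysteines. As an illustrative check (a sketch only; the helper name `cysteine_positions` is an assumption, and `-` is assumed to be a gap that consumes no residue), the cysteines in the query row of this hit can be located in protein coordinates:

```python
def cysteine_positions(aligned_row: str, start: int) -> list[int]:
    """Return 1-based protein coordinates of cysteines in one alignment row."""
    positions = []
    coord = start
    for residue in aligned_row:
        if residue == "-":
            continue  # gap characters do not advance the residue coordinate
        if residue.upper() == "C":
            positions.append(coord)
        coord += 1
    return positions


# Query row of the EGF_CA (cd00054) hit, residues 846-881, copied from above.
row = "DINECDEL-PCEQGGTCINTYGGFNCICSPDFAGQFC"
cys = cysteine_positions(row, start=846)
print(cys)            # coordinates of the cysteines in this segment
print(len(cys) == 6)  # True if the segment carries six cysteines
```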
EGF_CA smart00179
Calcium-binding EGF-like domain;
846-881 3.17e-09

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 53.40  E-value: 3.17e-09
                            10        20        30
                    ....*....|....*....|....*....|....*...
gi 1808670485   846 DINECDEL-PCEQGGTCINTYGGFNCICSPDF-AGQFC 881
Cdd:smart00179    1 DIDECASGnPCQNGGTCVNTVGSYRCECPPGYtDGRNC 38
EGF_CA pfam07645
Calcium-binding EGF domain;
846-874 2.52e-07

Calcium-binding EGF domain;


Pssm-ID: 429571  Cd Length: 32  Bit Score: 48.00  E-value: 2.52e-07
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1808670485  846 DINECDELP--CEQGGTCINTYGGFNCICSP 874
Cdd:pfam07645    1 DVDECATGThnCPANTVCVNTIGSFECRCPD 31
Cadherin pfam00028
Cadherin domain;
123-211 3.20e-07

Cadherin domain;


Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 49.61  E-value: 3.20e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485  123 IQEDRSPGQRLVRLKVTDADgnSGGNAGPFaFELLSGNSRGDFRI-TQNGDLLTAvsfagRGL-----PVYRLQVRVHDN 196
Cdd:pfam00028    5 VPENAPVGTEVLTVTATDPD--LGPNGRIF-YSILGGGPGGNFRIdPDTGDISTT-----KPLdresiGEYELTVEATDS 76
                           90
                   ....*....|....*
gi 1808670485  197 GRPHLHTDIWLSVQV 211
Cdd:pfam00028   77 GGPPLSSTATVTITV 91
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ...
244-315 5.59e-07

Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.


Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 48.50  E-value: 5.59e-07
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1808670485   244 RASDQD--PHDQLTYEVSYNNSNSeLFKIDFETGELYVVKPL---SEGEYSLDVTVSD-GKFKSSGVVKILVRsVEDE 315
Cdd:smart00112    1 SATDADsgENGKVTYSILSGNDDG-LFSIDPETGEITTTKPLdreEQPEYTLTVEATDgGGPPLSSTATVTIT-VLDV 76
CA_like cd00031
Cadherin repeat-like domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell ...
32-110 3.81e-06

Cadherin repeat-like domain; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. The cadherin repeat domains occur as tandem repeats in the extracellular regions, which are thought to mediate cell-cell contact when bound to calcium. They play numerous roles in cell fate, signalling, proliferation, differentiation, and migration; members include E-, N-, P-, T-, VE-, CNR-, proto-, and FAT-family cadherin, desmocollin, and desmoglein, a large variety of domain architectures with varying repeat copy numbers. Cadherin-repeat containing proteins exist as monomers, homodimers, or heterodimers. This family also includes the cadherin-like repeats of extracellular alpha-dystroglycan.


Pssm-ID: 206635  Cd Length: 98  Bit Score: 46.57  E-value: 3.81e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485   32 MATDLDSgSNGRLIYKMAGGNEGDR--FAIDPLTGWITVQKVLDREKEAKYSLEAAATDSGSPPRTGSALITVEITDAND 109
Cdd:cd00031     19 IPTDLIA-SSGEIIKISAAGKEALPswLHWEPHSGILEGLEKLDREDKGVHYISVSAASLGANVPQTSSVFSIEVYDEND 97

                   .
gi 1808670485  110 N 110
Cdd:cd00031     98 N 98
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
777-807 1.05e-05

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 43.53  E-value: 1.05e-05
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1808670485  777 CASSPCLNNGICIRDGYSYKCQCPAHVSGSR 807
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
CA smart00112
Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. ...
137-212 1.80e-05

Cadherin repeats; Cadherins are glycoproteins involved in Ca2+-mediated cell-cell adhesion. Cadherin domains occur as repeats in the extracellular regions which are thought to mediate cell-cell contact when bound to calcium.


Pssm-ID: 214520 [Multi-domain]  Cd Length: 81  Bit Score: 44.26  E-value: 1.80e-05
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1808670485   137 KVTDADgnSGGNaGPFAFELLSGNSRGDFRITQN-GDLLTAVSFAGRGLPVYRLQVRVHDNGRPHLHTDIWLSVQVV 212
Cdd:smart00112    1 SATDAD--SGEN-GKVTYSILSGNDDGLFSIDPEtGEITTTKPLDREEQPEYTLTVEATDGGGPPLSSTATVTITVL 74
Cadherin pfam00028
Cadherin domain;
237-297 1.85e-05

Cadherin domain;


Pssm-ID: 394985 [Multi-domain]  Cd Length: 92  Bit Score: 44.60  E-value: 1.85e-05
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1808670485  237 GGSLGIIRASDQD--PHDQLTYEVSYNNSNsELFKIDFETGELYVVKPL---SEGEYSLDVTVSDG 297
Cdd:pfam00028   12 GTEVLTVTATDPDlgPNGRIFYSILGGGPG-GNFRIDPDTGDISTTKPLdreSIGEYELTVEATDS 76
Laminin_G_3 pfam13385
Concanavalin A-like lectin/glucanases superfamily; This domain belongs to the Concanavalin ...
574-704 3.06e-05

Concanavalin A-like lectin/glucanases superfamily; This domain belongs to the Concanavalin A-like lectin/glucanases superfamily.


Pssm-ID: 463865 [Multi-domain]  Cd Length: 151  Bit Score: 45.45  E-value: 3.06e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485  574 FDFSTWFRTTHPSGNLLFISGRIDYCILELH-EGRVRYRWELGSGEGLTSV--SSLPLNDSSWHHVALSHNRRNARIVVD 650
Cdd:pfam13385   19 FTVSAWVKPDSLPGWARAIISSSGGGGYSLGlDGDGRLRFAVNGGNGGWDTvtSGASVPLGQWTHVAVTYDGGTLRLYVN 98
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1808670485  651 GKHAvsGISPGNNEKLNLESSQIYLGAEvkPWYydeytSRNLVGCVDNPH-FNKA 704
Cdd:pfam13385   99 GVLV--GSSTLTGGPPPGTGGPLYIGRS--PGG-----DDYFNGLIDEVRiYDRA 144
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
739-771 3.60e-05

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 41.85  E-value: 3.60e-05
                           10        20        30
                   ....*....|....*....|....*....|....
gi 1808670485  739 CSSL-PCLNGAKCSEEDGAYKCHCRSRFKGKQCE 771
Cdd:cd00054      5 CASGnPCQNGGTCVNTVGSYRCSCPPGYTGRNCE 38
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
850-878 5.10e-05

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 41.60  E-value: 5.10e-05
                           10        20
                   ....*....|....*....|....*....
gi 1808670485  850 CDELPCEQGGTCINTYGGFNCICSPDFAG 878
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTG 29
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
739-769 6.39e-05

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 41.21  E-value: 6.39e-05
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1808670485  739 CSSLPCLNGAKCSEEDGAYKCHCRSRFKGKQ 769
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPEGYTGKR 31
EGF pfam00008
EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very ...
813-842 1.00e-04

EGF-like domain; There is no clear separation between noise and signal. pfam00053 is very similar, but has 8 instead of 6 conserved cysteines. Includes some cytokine receptors. The EGF domain misses the N-terminus regions of the Ca2+ binding EGF domains (this is the main reason of discrepancy between swiss-prot domain start/end and Pfam). The family is hard to model due to many similar but different sub-types of EGF domains. Pfam certainly misses a number of EGF domains.


Pssm-ID: 394967  Cd Length: 31  Bit Score: 40.44  E-value: 1.00e-04
                           10        20        30
                   ....*....|....*....|....*....|.
gi 1808670485  813 CSPNPCLNKGTCEESSTGPICRCQ-GFKGIY 842
Cdd:pfam00008    1 CAPNPCSNGGTCVDTPGGYTCICPeGYTGKR 31
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
849-880 1.14e-04

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 40.54  E-value: 1.14e-04
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1808670485  849 ECDEL-PCEQGGTCINTYGGFNCICSPDFAGQF 880
Cdd:cd00053      1 ECAASnPCSNGGTCVNTPGSYRCVCPPGYTGDR 33
EGF_CA cd00054
Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular ...
773-809 6.60e-04

Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.


Pssm-ID: 238011  Cd Length: 38  Bit Score: 38.39  E-value: 6.60e-04
                           10        20        30
                   ....*....|....*....|....*....|....*....
gi 1808670485  773 DTEPCAS-SPCLNNGICI-RDGySYKCQCPAHVSGSRCQ 809
Cdd:cd00054      1 DIDECASgNPCQNGGTCVnTVG-SYRCSCPPGYTGRNCE 38
hEGF pfam12661
Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six ...
855-876 1.78e-03

Human growth factor-like EGF; hEGF, or human growth factor-like EGF, domains have six conserved residues disulfide-bonded into the characteristic 'ababcc' pattern. They are involved in growth and proliferation of cells, in proteins of the Notch/Delta pathway, neuregulin and selectins. hEGFs are also found in mosaic proteins with four-disulfide laminin EGFs such as aggrecan and perlecan. The core fold of the EGF domain consists of two small beta-hairpins packed against each other. Two major structural variants have been identified based on the structural context of the C-terminal Cys residue of disulfide 'c' in the C-terminal hairpin: hEGFs and cEGFs. In hEGFs the C-terminal thiol resides in the beta-turn, resulting in shorter loop-lengths between the Cys residues of disulfide 'c', typically C[8-9]XC. These shorter loop-lengths are also typical of the four-disulfide EGF domains, laminin and integrin. Tandem hEGF domains have six linking residues between terminal cysteines of adjacent domains. hEGF domains may or may not bind calcium in the linker region. hEGF domains with the consensus motif CXD4X[F,Y]XCXC are hydroxylated exclusively in the Asp residue.


Pssm-ID: 463660  Cd Length: 22  Bit Score: 36.93  E-value: 1.78e-03
                           10        20
                   ....*....|....*....|..
gi 1808670485  855 CEQGGTCINTYGGFNCICSPDF 876
Cdd:pfam12661    1 CQNGGTCVDGVNGYKCQCPPGY 22
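
The 'ababcc' pattern and the C[8-9]XC loop length in the description above can be illustrated on this protein. This is a hedged sketch: `pair_cysteines` is a hypothetical helper, "C[8-9]XC" is read here as 8 to 9 residues between the two cysteines of disulfide 'c', and because the pfam12661 hit ends at residue 876, the full six-cysteine segment is taken from the EGF_CA hit at 846-881 listed earlier on this page.

```python
PATTERN = "ababcc"  # bond label for each successive cysteine, per the description


def pair_cysteines(positions, pattern=PATTERN):
    """Group cysteine coordinates into disulfide pairs according to the label pattern."""
    if len(positions) != len(pattern):
        raise ValueError("need exactly one cysteine per pattern letter")
    bonds = {}
    for pos, label in zip(positions, pattern):
        bonds.setdefault(label, []).append(pos)
    return {label: tuple(pair) for label, pair in bonds.items()}


# Ungapped query residues 846-881 (from the EGF_CA cd00054 alignment).
segment = "DINECDELPCEQGGTCINTYGGFNCICSPDFAGQFC"
start = 846
positions = [start + i for i, aa in enumerate(segment) if aa == "C"]

bonds = pair_cysteines(positions)
c_first, c_second = bonds["c"]
c_loop = c_second - c_first - 1  # residues strictly between the 'c' cysteines

print(bonds)
print(c_loop, 8 <= c_loop <= 9)
```

The pairing is inferred from sequence order alone; it does not verify that these bonds actually form in the folded protein.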
MJ1470 COG5306
Uncharacterized conserved protein MJ1470, contains DUF2341 domain, predicted component of type ...
551-651 5.04e-03

Uncharacterized conserved protein MJ1470, contains DUF2341 domain, predicted component of type IV pili-like system [General function prediction only];


Pssm-ID: 444105 [Multi-domain]  Cd Length: 529  Bit Score: 41.04  E-value: 5.04e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1808670485  551 SSPVSFDGKGFVQYNIRSSIVDH----FDFSTWFRTTHPSGNLLFISGRIDYCILElhEGRVRYRWelGSGEGLTSVSSL 626
Cdd:COG5306    172 GSGAQFDGTSPLVVPASPSLALDagggFTFSAWIKPAQLDGNAVLYSRRDGANGLD--NGAPFVEV--GGAGGTRSAAGA 247
                           90       100
                   ....*....|....*....|....*
gi 1808670485  627 PLNDSSWHHVALSHNRRNARIVVDG 651
Cdd:COG5306    248 PLAAGTWHHLAVVADAGKVTLYVNG 272
EGF cd00053
Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large ...
776-809 5.36e-03

Epidermal growth factor domain, found in epidermal growth factor (EGF) and present in a large number of proteins, mostly animal; the list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied; the functional significance of EGF-like domains in what appear to be unrelated proteins is not yet clear; a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase); the domain includes six cysteine residues which have been shown to be involved in disulfide bonds; the main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet; subdomains between the conserved cysteines vary in length; the region between the 5th and 6th cysteine contains two conserved glycines of which at least one is present in most EGF-like domains; a subset of these bind calcium.


Pssm-ID: 238010  Cd Length: 36  Bit Score: 35.92  E-value: 5.36e-03
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1808670485  776 PCA-SSPCLNNGICIRDGYSYKCQCPAHVSGS-RCQ 809
Cdd:cd00053      1 ECAaSNPCSNGGTCVNTPGSYRCVCPPGYTGDrSCE 36
EGF_CA smart00179
Calcium-binding EGF-like domain;
777-809 6.52e-03

Calcium-binding EGF-like domain;


Pssm-ID: 214542 [Multi-domain]  Cd Length: 39  Bit Score: 35.69  E-value: 6.52e-03
                            10        20        30
                    ....*....|....*....|....*....|....*.
gi 1808670485   777 CAS-SPCLNNGICI-RDGySYKCQCPA-HVSGSRCQ 809
Cdd:smart00179    5 CASgNPCQNGGTCVnTVG-SYRCECPPgYTDGRNCE 39
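
Many of the hits listed above cover the same residues under different models. One simple way to summarize them is a greedy pass that keeps the best-scoring hit per region. The sketch below is an assumption for illustration, not the procedure CD-Search itself uses; `select_non_overlapping` is a hypothetical name, and the tuples hold a subset of the accessions, intervals, and E-values shown in the list.

```python
def overlaps(a, b):
    """True if two (accession, start, end, evalue) hits share any residue."""
    return a[1] <= b[2] and b[1] <= a[2]


def select_non_overlapping(hits):
    """Greedily keep hits in order of increasing E-value, skipping overlaps."""
    chosen = []
    for hit in sorted(hits, key=lambda h: h[3]):
        if not any(overlaps(hit, kept) for kept in chosen):
            chosen.append(hit)
    return sorted(chosen, key=lambda h: h[1])  # report in sequence order


hits = [
    ("cd11304", 29, 110, 3.62e-30),
    ("cd00110", 553, 702, 1.61e-29),
    ("smart00282", 575, 702, 1.25e-27),
    ("pfam00028", 11, 105, 5.37e-25),
    ("cd11304", 120, 212, 7.49e-14),
    ("cd11304", 237, 315, 2.34e-11),
    ("cd00054", 846, 881, 6.87e-10),
    ("pfam00008", 777, 807, 1.05e-05),
    ("cd00054", 739, 771, 3.60e-05),
    ("pfam00008", 813, 842, 1.00e-04),
]

architecture = select_non_overlapping(hits)
for accession, start, end, evalue in architecture:
    print(f"{start:>4}-{end:<4} {accession:<10} {evalue:.2e}")
```

On this subset the pass keeps three cadherin repeats, one LamG domain, and four EGF-type domains, in that order along the sequence.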
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
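
The E-value threshold above determines which hits are reported. As a minimal sketch (the `Hit` class, `parse_hit`, and `passes` are hypothetical names, and treating the threshold as inclusive is an assumption), the two-line hit summaries on this page can be parsed and re-filtered at a stricter cutoff:

```python
from dataclasses import dataclass


@dataclass
class Hit:
    name: str
    accession: str
    start: int
    end: int
    evalue: float


def parse_hit(name_line: str, interval_line: str) -> Hit:
    """Parse a 'Name Accession' line and an 'Interval E-value' line from the hit list."""
    name, accession = name_line.split()
    interval, evalue = interval_line.split()
    start, end = (int(x) for x in interval.split("-"))
    return Hit(name, accession, start, end, float(evalue))


def passes(hit: Hit, threshold: float = 0.01) -> bool:
    """True if the hit's E-value is at or below the threshold (inclusive by assumption)."""
    return hit.evalue <= threshold


# Summary lines copied from the list of domain hits.
records = [
    ("LamG cd00110", "553-702 1.61e-29"),
    ("EGF pfam00008", "813-842 1.00e-04"),
    ("EGF_CA smart00179", "777-809 6.52e-03"),
]
hits = [parse_hit(name_line, interval_line) for name_line, interval_line in records]

default = [h.accession for h in hits if passes(h)]
strict = [h.accession for h in hits if passes(h, threshold=1e-5)]
print(default)
print(strict)
```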

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.