Conserved domains on  [gi|20988875|gb|AAH30532|]

Suppression of tumorigenicity 14 (colon carcinoma) [Homo sapiens]

Protein Classification

CUB domain-containing protein (domain architecture ID 10475859)

CUB (complement C1r/C1s, Uegf, Bmp1) domain-containing protein

List of domain hits

Name Accession Description Interval E-value
Tryp_SPc cd00190
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens ...
615-852 9.02e-98

Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.


Pssm-ID: 238113 [Multi-domain]  Cd Length: 232  Bit Score: 304.58  E-value: 9.02e-98
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875 615 VVGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCyiddrgFRYSDPTQWTAFLGLHDQSQRSAPGvQERRL 694
Cdd:cd00190   1 IVGGSEAKIGSFPWQVSLQYTGGRHFCGGSLISPRWVLTAAHC------VYSSAPSNYTVRLGSHDLSSNEGGG-QVIKV 73
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875 695 KRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQT 774
Cdd:cd00190  74 KKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSSGYNLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNA 153
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875 775 TCENLLPQQ--ITPRMMCVGFLSGGVDSCQGDSGGPLsSVEADGRIFQAGVVSWGDGCAQRNKPGVYTRLPLFRDWIKEN 852
Cdd:cd00190 154 ECKRAYSYGgtITDNMLCAGGLEGGKDACQGDSGGPL-VCNDNGRGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQKT 232
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
340-444 3.16e-29

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 112.51  E-value: 3.16e-29
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875 340 CGGRLRKA-QGTFNSPYYPGHYPPNIDCTWNIEVPNNQHVKVRFKFFYLLEPgvpaGTCPKDYVEI-NGE--------KY 409
Cdd:cd00041   1 CGGTLTAStSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESS----PNCSYDYLEIyDGPstsspllgRF 76
                        90       100       110
                ....*....|....*....|....*....|....*
gi 20988875 410 CGERSQFVVTSNSNKITVRFHSDQSYTDTGFLAEY 444
Cdd:cd00041  77 CGSTLPPPIISSGNSLTVRFRSDSSVTGRGFKATY 111
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
227-332 1.06e-17

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 79.38  E-value: 1.06e-17
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875 227 RFTTPGFPDsPYPAHARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHaLVQLCGTYPPSynlTFH 306
Cdd:cd00041  12 TISSPNYPN-NYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPL-LGRFCGSTLPP---PII 86
                        90       100
                ....*....|....*....|....*.
gi 20988875 307 SSQNVLLITLITNTERRHPGFEATFF 332
Cdd:cd00041  87 SSGNSLTVRFRSDSSVTGRGFKATYS 112
LDLa cd00112
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central ...
525-559 1.35e-12

Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure


Pssm-ID: 238060  Cd Length: 35  Bit Score: 62.61  E-value: 1.35e-12
                        10        20        30
                ....*....|....*....|....*....|....*
gi 20988875 525 CPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASC 559
Cdd:cd00112   1 CPPNEFRCANGRCIPSSWVCDGEDDCGDGSDEENC 35
SEA pfam01390
SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed ...
88-162 2.34e-11

SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains. Recently a proteolytic activity has been shown for a SEA domain.


Pssm-ID: 460188  Cd Length: 100  Bit Score: 61.10  E-value: 2.34e-11
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 20988875    88 KVFNGYMRITNENFVDAYENSNSTEFVSLASKVKDALKLLYSGVPfLGPYHKESAVTAFS--EGSVIAYYWSEFSIP 162
Cdd:pfam01390   1 QYYTGSFKITNLQYTPDLGNPSSQEFKSLSRRIESLLNELFRNSS-LRKQYIKSHVLRLRpdGGSVVVDVVLVFRFP 76
LDLa cd00112
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central ...
454-486 6.93e-11

Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure


Pssm-ID: 238060  Cd Length: 35  Bit Score: 57.60  E-value: 6.93e-11
                        10        20        30
                ....*....|....*....|....*....|...
gi 20988875 454 PGQFTCRTGRCIRKELRCDGWADCTDHSDELNC 486
Cdd:cd00112   3 PNEFRCANGRCIPSSWVCDGEDDCGDGSDEENC 35
LDLa cd00112
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central ...
488-523 4.94e-10

Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure


Pssm-ID: 238060  Cd Length: 35  Bit Score: 55.29  E-value: 4.94e-10
                        10        20        30
                ....*....|....*....|....*....|....*.
gi 20988875 488 CDAGhQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGC 523
Cdd:cd00112   1 CPPN-EFRCANGRCIPSSWVCDGEDDCGDGSDEENC 35
LDLa cd00112
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central ...
567-602 1.29e-07

Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure


Pssm-ID: 238060  Cd Length: 35  Bit Score: 48.36  E-value: 1.29e-07
                        10        20        30
                ....*....|....*....|....*....|....*.
gi 20988875 567 CTKHTYRCLNGLCLSKGNpECDGKEDCSDGSDEKDC 602
Cdd:cd00112   1 CPPNEFRCANGRCIPSSW-VCDGEDDCGDGSDEENC 35
 
Name Accession Description Interval E-value
Tryp_SPc cd00190
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens ...
615-852 9.02e-98

Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.


Pssm-ID: 238113 [Multi-domain]  Cd Length: 232  Bit Score: 304.58  E-value: 9.02e-98
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875 615 VVGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCyiddrgFRYSDPTQWTAFLGLHDQSQRSAPGvQERRL 694
Cdd:cd00190   1 IVGGSEAKIGSFPWQVSLQYTGGRHFCGGSLISPRWVLTAAHC------VYSSAPSNYTVRLGSHDLSSNEGGG-QVIKV 73
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875 695 KRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYGGTGALILQKGEIRVINQT 774
Cdd:cd00190  74 KKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSSGYNLPAGTTCTVSGWGRTSEGGPLPDVLQEVNVPIVSNA 153
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875 775 TCENLLPQQ--ITPRMMCVGFLSGGVDSCQGDSGGPLsSVEADGRIFQAGVVSWGDGCAQRNKPGVYTRLPLFRDWIKEN 852
Cdd:cd00190 154 ECKRAYSYGgtITDNMLCAGGLEGGKDACQGDSGGPL-VCNDNGRGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQKT 232
Tryp_SPc smart00020
Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens ...
614-849 1.24e-94

Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.


Pssm-ID: 214473  Cd Length: 229  Bit Score: 296.13  E-value: 1.24e-94
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875    614 RVVGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCyiddrgFRYSDPTQWTAFLGLHDQSqrSAPGVQERR 693
Cdd:smart00020   1 RIVGGSEANIGSFPWQVSLQYGGGRHFCGGSLISPRWVLTAAHC------VRGSDPSNIRVRLGSHDLS--SGEEGQVIK 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875    694 LKRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYG-GTGALILQKGEIRVIN 772
Cdd:smart00020  73 VSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPICLPSSNYNVPAGTTCTVSGWGRTSEGaGSLPDTLQEVNVPIVS 152
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 20988875    773 QTTCENLLPQQ--ITPRMMCVGFLSGGVDSCQGDSGGPLssVEADGRIFQAGVVSWGDGCAQRNKPGVYTRLPLFRDWI 849
Cdd:smart00020 153 NATCRRAYSGGgaITDNMLCAGGLEGGKDACQGDSGGPL--VCNDGRWVLVGIVSWGSGCARPGKPGVYTRVSSYLDWI 229
COG5640 COG5640
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, ...
612-855 2.62e-73

Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 444365 [Multi-domain]  Cd Length: 262  Bit Score: 240.71  E-value: 2.62e-73
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875 612 QARVVGGTDADEGEWPWQVSLHALG--QGHICGASLISPNWLVSAAHCYIDDrgfrysDPTQWTAFLGLHDqsqRSAPGV 689
Cdd:COG5640  28 APAIVGGTPATVGEYPWMVALQSSNgpSGQFCGGTLIAPRWVLTAAHCVDGD------GPSDLRVVIGSTD---LSTSGG 98
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875 690 QERRLKRIISHPFFNDFTFDYDIALLELEKPAeysSMVRPICLPDASHVFPAGKAIWVTGWGHT-QYGGTGALILQKGEI 768
Cdd:COG5640  99 TVVKVARIVVHPDYDPATPGNDIALLKLATPV---PGVAPAPLATSADAAAPGTPATVAGWGRTsEGPGSQSGTLRKADV 175
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875 769 RVINQTTCeNLLPQQITPRMMCVGFLSGGVDSCQGDSGGPLSsVEADGRIFQAGVVSWGDGCAQRNKPGVYTRLPLFRDW 848
Cdd:COG5640 176 PVVSDATC-AAYGGFDGGTMLCAGYPEGGKDACQGDSGGPLV-VKDGGGWVLVGVVSWGGGPCAAGYPGVYTRVSAYRDW 253

                ....*..
gi 20988875 849 IKENTGV 855
Cdd:COG5640 254 IKSTAGG 260
Trypsin pfam00089
Trypsin;
615-849 3.61e-71

Trypsin;


Pssm-ID: 459667 [Multi-domain]  Cd Length: 219  Bit Score: 233.49  E-value: 3.61e-71
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875   615 VVGGTDADEGEWPWQVSLHALGQGHICGASLISPNWLVSAAHCYiddrgfrySDPTQWTAFLGLHDQSQRSaPGVQERRL 694
Cdd:pfam00089   1 IVGGDEAQPGSFPWQVSLQLSSGKHFCGGSLISENWVLTAAHCV--------SGASDVKVVLGAHNIVLRE-GGEQKFDV 71
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875   695 KRIISHPFFNDFTFDYDIALLELEKPAEYSSMVRPICLPDASHVFPAGKAIWVTGWGHTQYGGTgALILQKGEIRVINQT 774
Cdd:pfam00089  72 EKIIVHPNYNPDTLDNDIALLKLESPVTLGDTVRPICLPDASSDLPVGTTCTVSGWGNTKTLGP-SDTLQEVTVPVVSRE 150
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 20988875   775 TCENLLPQQITPRMMCVGFlsGGVDSCQGDSGGPLssVEADGriFQAGVVSWGDGCAQRNKPGVYTRLPLFRDWI 849
Cdd:pfam00089 151 TCRSAYGGTVTDTMICAGA--GGKDACQGDSGGPL--VCSDG--ELIGIVSWGYGCASGNYPGVYTPVSSYLDWI 219
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
340-444 3.16e-29

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 112.51  E-value: 3.16e-29
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875 340 CGGRLRKA-QGTFNSPYYPGHYPPNIDCTWNIEVPNNQHVKVRFKFFYLLEPgvpaGTCPKDYVEI-NGE--------KY 409
Cdd:cd00041   1 CGGTLTAStSGTISSPNYPNNYPNNLNCVWTIEAPPGYRIRLTFEDFDLESS----PNCSYDYLEIyDGPstsspllgRF 76
                        90       100       110
                ....*....|....*....|....*....|....*
gi 20988875 410 CGERSQFVVTSNSNKITVRFHSDQSYTDTGFLAEY 444
Cdd:cd00041  77 CGSTLPPPIISSGNSLTVRFRSDSSVTGRGFKATY 111
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
349-444 1.18e-24

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 99.00  E-value: 1.18e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875    349 GTFNSPYYPGHYPPNIDCTWNIEVPNNQHVKVRFKFFYLLepgvPAGTCPKDYVEI-NGE--------KYCG-ERSQFVV 418
Cdd:smart00042   1 GTITSPNYPQSYPNNLDCVWTIRAPPGYRIELQFTDFDLE----SSDNCEYDYVEIyDGPsasspllgRFCGsEAPPPVI 76
                           90       100
                   ....*....|....*....|....*.
gi 20988875    419 TSNSNKITVRFHSDQSYTDTGFLAEY 444
Cdd:smart00042  77 SSSSNSLTLTFVSDSSVQKRGFSARY 102
CUB pfam00431
CUB domain;
340-444 7.38e-23

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 94.28  E-value: 7.38e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875   340 CGGRLRKAQGTFNSPYYPGHYPPNIDCTWNIEVPNNQHVKVRFKFFYLlepgVPAGTCPKDYVEI---------NGEKYC 410
Cdd:pfam00431   1 CGGVLTDSSGSISSPNYPNPYPPNKDCVWLIRAPPGFRVKLTFQDFEL----EDHDECGYDYVEIrdgpsasspLLGRFC 76
                          90       100       110
                  ....*....|....*....|....*....|....
gi 20988875   411 GERSQFVVTSNSNKITVRFHSDQSYTDTGFLAEY 444
Cdd:pfam00431  77 GSGIPEDIVSSSNQMTIKFVSDASVQKRGFKATY 110
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
227-332 1.06e-17

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 79.38  E-value: 1.06e-17
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875 227 RFTTPGFPDsPYPAHARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHaLVQLCGTYPPSynlTFH 306
Cdd:cd00041  12 TISSPNYPN-NYPNNLNCVWTIEAPPGYRIRLTFEDFDLESSPNCSYDYLEIYDGPSTSSPL-LGRFCGSTLPP---PII 86
                        90       100
                ....*....|....*....|....*.
gi 20988875 307 SSQNVLLITLITNTERRHPGFEATFF 332
Cdd:cd00041  87 SSGNSLTVRFRSDSSVTGRGFKATYS 112
LDLa cd00112
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central ...
525-559 1.35e-12

Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure


Pssm-ID: 238060  Cd Length: 35  Bit Score: 62.61  E-value: 1.35e-12
                        10        20        30
                ....*....|....*....|....*....|....*
gi 20988875 525 CPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASC 559
Cdd:cd00112   1 CPPNEFRCANGRCIPSSWVCDGEDDCGDGSDEENC 35
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
227-331 3.70e-12

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 63.18  E-value: 3.70e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875    227 RFTTPGFPDsPYPAHARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPHaLVQLCGTYPPsyNLTFH 306
Cdd:smart00042   2 TITSPNYPQ-SYPNNLDCVWTIRAPPGYRIELQFTDFDLESSDNCEYDYVEIYDGPSASSPL-LGRFCGSEAP--PPVIS 77
                           90       100
                   ....*....|....*....|....*
gi 20988875    307 SSQNVLLITLITNTERRHPGFEATF 331
Cdd:smart00042  78 SSSNSLTLTFVSDSSVQKRGFSARY 102
LDLa smart00192
Low-density lipoprotein receptor domain class A; Cysteine-rich repeat in the low-density ...
525-556 1.01e-11

Low-density lipoprotein receptor domain class A; Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.


Pssm-ID: 197566  Cd Length: 33  Bit Score: 59.95  E-value: 1.01e-11
                           10        20        30
                   ....*....|....*....|....*....|..
gi 20988875    525 CPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDE 556
Cdd:smart00192   2 CPPGEFQCDNGRCIPSSWVCDGVDDCGDGSDE 33
CUB pfam00431
CUB domain;
227-331 1.14e-11

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 62.31  E-value: 1.14e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875   227 RFTTPGFPDsPYPAHARCQWALRGDADSVLSLTFRSFDLASCDERGSDLVTVYNTLSPMEPhALVQLCGTYPPsynLTFH 306
Cdd:pfam00431  11 SISSPNYPN-PYPPNKDCVWLIRAPPGFRVKLTFQDFELEDHDECGYDYVEIRDGPSASSP-LLGRFCGSGIP---EDIV 85
                          90       100
                  ....*....|....*....|....*
gi 20988875   307 SSQNVLLITLITNTERRHPGFEATF 331
Cdd:pfam00431  86 SSSNQMTIKFVSDASVQKRGFKATY 110
SEA pfam01390
SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed ...
88-162 2.34e-11

SEA domain; Domain found in Sea urchin sperm protein, Enterokinase, Agrin (SEA). Proposed function of regulating or binding carbohydrate side chains. Recently a proteolytic activity has been shown for a SEA domain.


Pssm-ID: 460188  Cd Length: 100  Bit Score: 61.10  E-value: 2.34e-11
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 20988875    88 KVFNGYMRITNENFVDAYENSNSTEFVSLASKVKDALKLLYSGVPfLGPYHKESAVTAFS--EGSVIAYYWSEFSIP 162
Cdd:pfam01390   1 QYYTGSFKITNLQYTPDLGNPSSQEFKSLSRRIESLLNELFRNSS-LRKQYIKSHVLRLRpdGGSVVVDVVLVFRFP 76
LDLa cd00112
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central ...
454-486 6.93e-11

Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure


Pssm-ID: 238060  Cd Length: 35  Bit Score: 57.60  E-value: 6.93e-11
                        10        20        30
                ....*....|....*....|....*....|...
gi 20988875 454 PGQFTCRTGRCIRKELRCDGWADCTDHSDELNC 486
Cdd:cd00112   3 PNEFRCANGRCIPSSWVCDGEDDCGDGSDEENC 35
LDLa cd00112
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central ...
488-523 4.94e-10

Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure


Pssm-ID: 238060  Cd Length: 35  Bit Score: 55.29  E-value: 4.94e-10
                        10        20        30
                ....*....|....*....|....*....|....*.
gi 20988875 488 CDAGhQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGC 523
Cdd:cd00112   1 CPPN-EFRCANGRCIPSSWVCDGEDDCGDGSDEENC 35
LDLa smart00192
Low-density lipoprotein receptor domain class A; Cysteine-rich repeat in the low-density ...
454-483 8.71e-09

Low-density lipoprotein receptor domain class A; Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.


Pssm-ID: 197566  Cd Length: 33  Bit Score: 51.48  E-value: 8.71e-09
                           10        20        30
                   ....*....|....*....|....*....|
gi 20988875    454 PGQFTCRTGRCIRKELRCDGWADCTDHSDE 483
Cdd:smart00192   4 PGEFQCDNGRCIPSSWVCDGVDDCGDGSDE 33
LDLa smart00192
Low-density lipoprotein receptor domain class A; Cysteine-rich repeat in the low-density ...
486-520 1.43e-08

Low-density lipoprotein receptor domain class A; Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.


Pssm-ID: 197566  Cd Length: 33  Bit Score: 51.09  E-value: 1.43e-08
                           10        20        30
                   ....*....|....*....|....*....|....*
gi 20988875    486 CSCDaghQFTCKNKFCKPLFWVCDSVNDCGDNSDE 520
Cdd:smart00192   2 CPPG---EFQCDNGRCIPSSWVCDGVDDCGDGSDE 33
Ldl_recept_a pfam00057
Low-density lipoprotein receptor domain class A;
525-559 1.72e-08

Low-density lipoprotein receptor domain class A;


Pssm-ID: 395011  Cd Length: 37  Bit Score: 50.71  E-value: 1.72e-08
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 20988875   525 CPAQTFRCSNGKCLSKSQQCNGKDDCGDGSDEASC 559
Cdd:pfam00057   3 CSPNEFQCGSGECIPRSWVCDGDPDCGDGSDEENC 37
LDLa cd00112
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central ...
567-602 1.29e-07

Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure


Pssm-ID: 238060  Cd Length: 35  Bit Score: 48.36  E-value: 1.29e-07
                        10        20        30
                ....*....|....*....|....*....|....*.
gi 20988875 567 CTKHTYRCLNGLCLSKGNpECDGKEDCSDGSDEKDC 602
Cdd:cd00112   1 CPPNEFRCANGRCIPSSW-VCDGEDDCGDGSDEENC 35
Ldl_recept_a pfam00057
Low-density lipoprotein receptor domain class A;
454-486 4.28e-07

Low-density lipoprotein receptor domain class A;


Pssm-ID: 395011  Cd Length: 37  Bit Score: 46.86  E-value: 4.28e-07
                          10        20        30
                  ....*....|....*....|....*....|...
gi 20988875   454 PGQFTCRTGRCIRKELRCDGWADCTDHSDELNC 486
Cdd:pfam00057   5 PNEFQCGSGECIPRSWVCDGDPDCGDGSDEENC 37
Ldl_recept_a pfam00057
Low-density lipoprotein receptor domain class A;
492-523 2.48e-06

Low-density lipoprotein receptor domain class A;


Pssm-ID: 395011  Cd Length: 37  Bit Score: 44.55  E-value: 2.48e-06
                          10        20        30
                  ....*....|....*....|....*....|..
gi 20988875   492 HQFTCKNKFCKPLFWVCDSVNDCGDNSDEQGC 523
Cdd:pfam00057   6 NEFQCGSGECIPRSWVCDGDPDCGDGSDEENC 37
LDLa smart00192
Low-density lipoprotein receptor domain class A; Cysteine-rich repeat in the low-density ...
566-599 6.73e-06

Low-density lipoprotein receptor domain class A; Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.


Pssm-ID: 197566  Cd Length: 33  Bit Score: 43.39  E-value: 6.73e-06
                           10        20        30
                   ....*....|....*....|....*....|....
gi 20988875    566 TCTKHTYRCLNGLCLSKGNpECDGKEDCSDGSDE 599
Cdd:smart00192   1 TCPPGEFQCDNGRCIPSSW-VCDGVDDCGDGSDE 33
Ldl_recept_a pfam00057
Low-density lipoprotein receptor domain class A;
565-602 5.34e-05

Low-density lipoprotein receptor domain class A;


Pssm-ID: 395011  Cd Length: 37  Bit Score: 41.08  E-value: 5.34e-05
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 20988875   565 VTCTKHTYRCLNGLCLSkGNPECDGKEDCSDGSDEKDC 602
Cdd:pfam00057   1 STCSPNEFQCGSGECIP-RSWVCDGDPDCGDGSDEENC 37
 
eMpr COG3591
V8-like Glu-specific endopeptidase [Posttranslational modification, protein turnover, ...
636-829 2.66e-10

V8-like Glu-specific endopeptidase [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 442810 [Multi-domain]  Cd Length: 194  Bit Score: 60.46  E-value: 2.66e-10
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875 636 GQGHICGASLISPNWLVSAAHCYIDDRGFRYsdPTQWTAFLGLHDQSQRSAPGVqerrlkRIISHP-FFNDFTFDYDIAL 714
Cdd:COG3591   9 GGGGVCTGTLIGPNLVLTAGHCVYDGAGGGW--ATNIVFVPGYNGGPYGTATAT------RFRVPPgWVASGDAGYDYAL 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875 715 LELEKPAEYSsmVRPICLpDASHVFPAGKAIWVTGWGhtqyGGTGALILQKGEIRVINQTTcenllpqqitprmmcvGFL 794
Cdd:COG3591  81 LRLDEPLGDT--TGWLGL-AFNDAPLAGEPVTIIGYP----GDRPKDLSLDCSGRVTGVQG----------------NRL 137
                       170       180       190
                ....*....|....*....|....*....|....*.
gi 20988875 795 SGGVDSCQGDSGGP-LSSVEADGRIFqaGVVSWGDG 829
Cdd:COG3591 138 SYDCDTTGGSSGSPvLDDSDGGGRVV--GVHSAGGA 171
CUB_2 pfam02408
CUB-like domain; This is a family of hypothetical C. elegans proteins. The aligned region has ...
350-427 2.50e-03

CUB-like domain; This is a family of hypothetical C. elegans proteins. The aligned region has no known function nor do any of the proteins which possess it. However, this domain is related to the CUB domain.


Pssm-ID: 280554  Cd Length: 120  Bit Score: 38.51  E-value: 2.50e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20988875   350 TFNSP-------YYPGH---------YPPNIDCTWNIEVPNNQHVKVRFKFFYLLEPGVPAGTCPKDYVEINGEKYcger 413
Cdd:pfam02408  12 TINKPvngsipvYYPNTwngsmeppkIPANQNCSWNINVPKGYYAKVIISAKTNDDSSITVTDSLGNSEYVTDSDN---- 87
                          90
                  ....*....|....
gi 20988875   414 SQFVVTSNSNKITV 427
Cdd:pfam02408  88 EPYFFVSPSFTINL 101
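
The paired `gi` / `Cdd:` rows in this listing can be compared column by column. The following is a minimal Python sketch, assuming that lowercase query residues and `-` characters mark unaligned columns (which is how they appear to be used in the alignments here). The helper name `identity_counts` is hypothetical and is not part of any NCBI tool.

```python
def identity_counts(query_row, model_row):
    """Return (identical, aligned) column counts for one alignment row pair.

    Assumption: a column counts as aligned only when both characters are
    uppercase letters; lowercase residues and '-' are treated as unaligned.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        # Skip gap columns and lowercase (unaligned) residues.
        if not (q.isalpha() and m.isalpha() and q.isupper() and m.isupper()):
            continue
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


# Usage with the last row pair of the CUB_2 (pfam02408) hit:
identical, aligned = identity_counts("SQFVVTSNSNKITV", "EPYFFVSPSFTINL")
print(f"{identical} identical of {aligned} aligned columns")
```

Only the residue strings are passed in; the `gi`/`Cdd:` labels and the start and end coordinates would have to be split off first.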
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
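
Each hit in the listing carries a summary line of the form `Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...`. The following is a minimal Python sketch for reading such a line and checking it against the 0.01 E-value threshold stated above. The names `parse_summary` and `passes_threshold` are hypothetical, and treating the threshold as inclusive is an assumption.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse one summary line into a dict, or return None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("flag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


hit = parse_summary(
    "Pssm-ID: 280554  Cd Length: 120  Bit Score: 38.51  E-value: 2.50e-03"
)
print(hit, passes_threshold(hit))
```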
