Conserved domains on [gi|1039750796|ref|XP_017172398|]

transmembrane protease serine 7 isoform X7 [Mus musculus]

Protein Classification

CUB domain-containing protein (domain architecture ID 11504461)

CUB (complement C1r/C1s, Uegf, Bmp1) domain-containing protein

List of domain hits

Name Accession Description Interval E-value
Tryp_SPc smart00020
Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens ...
334-564 4.59e-92

Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.


Pssm-ID: 214473  Cd Length: 229  Bit Score: 282.26  E-value: 4.59e-92
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039750796  334 RIVGGSDSQEGTWPWQVSLHF-VGSAYCGASVISREWLLSAAHCFHGNRLSDptpWTAHLGMY--VQGNAKFISPVRRIV 410
Cdd:smart00020   1 RIVGGSEANIGSFPWQVSLQYgGGRHFCGGSLISPRWVLTAAHCVRGSDPSN---IRVRLGSHdlSSGEEGQVIKVSKVI 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039750796  411 VHEYYNSQTFDYDIALLQLSiaWPETLKQLIQPICIPPAGQKVRSGEKCWVTGWGRRHEADSKGSPVLQQAEVELIDQTV 490
Cdd:smart00020  78 IHPNYNPSTYDNDIALLKLK--EPVTLSDNVRPICLPSSNYNVPAGTTCTVSGWGRTSEGAGSLPDTLQEVNVPIVSNAT 155
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1039750796  491 CVSTYG---IITSRMLCAGVMSGKSDACKGDSGGPLSCRrksDGKWILTGIVSWGHGCGRPNFPGVYTRVSSFVPWI 564
Cdd:smart00020 156 CRRAYSgggAITDNMLCAGGLEGGKDACQGDSGGPLVCN---DGRWVLVGIVSWGSGCARPGKPGVYTRVSSYLDWI 229
LDLa cd00112
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central ...
213-247 3.17e-10

Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure


Pssm-ID: 238060  Cd Length: 35  Bit Score: 55.29  E-value: 3.17e-10
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1039750796 213 CPAGSFRCSSGLCVPQAQRCDGVNDCFDESDELFC 247
Cdd:cd00112     1 CPPNEFRCANGRCIPSSWVCDGEDDCGDGSDEENC 35
CUB cd00041
CUB domain; extracellular domain; present in proteins mostly known to be involved in ...
2-88 3.39e-08

CUB domain; extracellular domain; present in proteins mostly known to be involved in development; not found in prokaryotes, plants and yeast.


Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 52.03  E-value: 3.39e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039750796   2 CHFKLVAIVGYLIRLSIESIQLEAD-NCITDSLTVYDSlLPIRSAILYRICEpTRTLMSFVSTNNLMLVILKSPYVRRLA 80
Cdd:cd00041    28 CVWTIEAPPGYRIRLTFEDFDLESSpNCSYDYLEIYDG-PSTSSPLLGRFCG-STLPPPIISSGNSLTVRFRSDSSVTGR 105

                  ....*...
gi 1039750796  81 GIRAYFEV 88
Cdd:cd00041   106 GFKATYSA 113
LDLa cd00112
Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central ...
288-323 2.82e-07

Low Density Lipoprotein Receptor Class A domain, a cysteine-rich repeat that plays a central role in mammalian cholesterol metabolism; the receptor protein binds LDL and transports it into cells by endocytosis; 7 successive cysteine-rich repeats of about 40 amino acids are present in the N-terminal of this multidomain membrane protein; other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement; the binding of calcium is required for in vitro formation of the native disulfide isomer and is necessary in establishing and maintaining the modular structure


Pssm-ID: 238060  Cd Length: 35  Bit Score: 46.82  E-value: 2.82e-07
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 1039750796 288 CTSRTFKCGNDICFRKqNAQCDGIVDCPDGSDEEGC 323
Cdd:cd00112     1 CPPNEFRCANGRCIPS-SWVCDGEDDCGDGSDEENC 35
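The summary line under each hit gives the model length (Cd Length), and the Cdd row of each alignment gives the model positions that were aligned. The following is a minimal Python sketch, assuming the line layout shown above, that parses one summary line and computes how much of the model a hit spans. The function names parse_stats and model_coverage are illustrative and are not part of any NCBI tool. For the CUB hit, the model row runs from position 28 to 113 of 113, so the alignment does not reach the first 27 model positions.

```python
import re

# Summary line copied from the CUB (cd00041) hit above.
STATS = "Pssm-ID: 238001 [Multi-domain]  Cd Length: 113  Bit Score: 52.03  E-value: 3.39e-08"


def parse_stats(line):
    """Extract the numeric fields from a 'Pssm-ID: ...' summary line."""
    pattern = (r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)\s+"
               r"Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)")
    m = re.search(pattern, line)
    if m is None:
        raise ValueError("not a Pssm-ID summary line")
    return {
        "pssm_id": int(m.group(1)),
        "cd_length": int(m.group(2)),
        "bit_score": float(m.group(3)),
        "e_value": float(m.group(4)),
    }


def model_coverage(model_from, model_to, cd_length):
    """Fraction of the domain model spanned by the aligned model positions."""
    return (model_to - model_from + 1) / cd_length


stats = parse_stats(STATS)
# The Cdd rows of the CUB alignment run from model position 28 to 113.
cub_coverage = model_coverage(28, 113, stats["cd_length"])
print(f"CUB model coverage: {cub_coverage:.2f}")
```

A coverage well below 1.0, as for the CUB hit here, may indicate a partial domain in this isoform; the trypsin and LDLa hits above span their models end to end.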
 
Name Accession Description Interval E-value
Tryp_SPc cd00190
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens ...
335-566 3.97e-91

Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.


Pssm-ID: 238113 [Multi-domain]  Cd Length: 232  Bit Score: 279.93  E-value: 3.97e-91
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039750796 335 IVGGSDSQEGTWPWQVSLHF-VGSAYCGASVISREWLLSAAHCFHGnrlSDPTPWTAHLGMYVQGNAKF---ISPVRRIV 410
Cdd:cd00190     1 IVGGSEAKIGSFPWQVSLQYtGGRHFCGGSLISPRWVLTAAHCVYS---SAPSNYTVRLGSHDLSSNEGggqVIKVKKVI 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039750796 411 VHEYYNSQTFDYDIALLQLSiaWPETLKQLIQPICIPPAGQKVRSGEKCWVTGWGRRHEaDSKGSPVLQQAEVELIDQTV 490
Cdd:cd00190    78 VHPNYNPSTYDNDIALLKLK--RPVTLSDNVRPICLPSSGYNLPAGTTCTVSGWGRTSE-GGPLPDVLQEVNVPIVSNAE 154
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1039750796 491 CVSTY---GIITSRMLCAGVMSGKSDACKGDSGGPLSCrrKSDGKWILTGIVSWGHGCGRPNFPGVYTRVSSFVPWIHK 566
Cdd:cd00190   155 CKRAYsygGTITDNMLCAGGLEGGKDACQGDSGGPLVC--NDNGRGVLVGIVSWGSGCARPNYPGVYTRVSSYLDWIQK 231
COG5640 COG5640
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, ...
334-571 2.56e-71

Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 444365 [Multi-domain]  Cd Length: 262  Bit Score: 229.54  E-value: 2.56e-71
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039750796 334 RIVGGSDSQEGTWPWQVSLHFVG---SAYCGASVISREWLLSAAHCFHGNRLSDptpWTAHLGMY-VQGNAKFISPVRRI 409
Cdd:COG5640    30 AIVGGTPATVGEYPWMVALQSSNgpsGQFCGGTLIAPRWVLTAAHCVDGDGPSD---LRVVIGSTdLSTSGGTVVKVARI 106
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039750796 410 VVHEYYNSQTFDYDIALLQLSIAWPEtlkqlIQPICIPPAGQKVRSGEKCWVTGWGRRHEADSKGSPVLQQAEVELIDQT 489
Cdd:COG5640   107 VVHPDYDPATPGNDIALLKLATPVPG-----VAPAPLATSADAAAPGTPATVAGWGRTSEGPGSQSGTLRKADVPVVSDA 181
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039750796 490 VCVSTYGIITSRMLCAGVMSGKSDACKGDSGGPLscRRKSDGKWILTGIVSWGHGCGRPNFPGVYTRVSSFVPWIHKYVP 569
Cdd:COG5640   182 TCAAYGGFDGGTMLCAGYPEGGKDACQGDSGGPL--VVKDGGGWVLVGVVSWGGGPCAAGYPGVYTRVSAYRDWIKSTAG 259

                  ..
gi 1039750796 570 SL 571
Cdd:COG5640   260 GL 261
Trypsin pfam00089
Trypsin;
335-564 9.52e-68

Trypsin;


Pssm-ID: 459667 [Multi-domain]  Cd Length: 219  Bit Score: 218.85  E-value: 9.52e-68
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039750796 335 IVGGSDSQEGTWPWQVSLHFVGSAY-CGASVISREWLLSAAHCFHGNrlSDPTPWT-AHLGMYVQGNAKFIsPVRRIVVH 412
Cdd:pfam00089   1 IVGGDEAQPGSFPWQVSLQLSSGKHfCGGSLISENWVLTAAHCVSGA--SDVKVVLgAHNIVLREGGEQKF-DVEKIIVH 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039750796 413 EYYNSQTFDYDIALLQLSIawPETLKQLIQPICIPPAGQKVRSGEKCWVTGWGRrhEADSKGSPVLQQAEVELIDQTVCV 492
Cdd:pfam00089  78 PNYNPDTLDNDIALLKLES--PVTLGDTVRPICLPDASSDLPVGTTCTVSGWGN--TKTLGPSDTLQEVTVPVVSRETCR 153
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1039750796 493 STYGI-ITSRMLCAGvmSGKSDACKGDSGGPLSCRRKsdgkwILTGIVSWGHGCGRPNFPGVYTRVSSFVPWI 564
Cdd:pfam00089 154 SAYGGtVTDTMICAG--AGGKDACQGDSGGPLVCSDG-----ELIGIVSWGYGCASGNYPGVYTPVSSYLDWI 219
LDLa smart00192
Low-density lipoprotein receptor domain class A; Cysteine-rich repeat in the low-density ...
213-244 5.24e-10

Low-density lipoprotein receptor domain class A; Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.


Pssm-ID: 197566  Cd Length: 33  Bit Score: 54.56  E-value: 5.24e-10
                           10        20        30
                   ....*....|....*....|....*....|..
gi 1039750796  213 CPAGSFRCSSGLCVPQAQRCDGVNDCFDESDE 244
Cdd:smart00192   2 CPPGEFQCDNGRCIPSSWVCDGVDDCGDGSDE 33
Ldl_recept_a pfam00057
Low-density lipoprotein receptor domain class A;
212-247 3.49e-07

Low-density lipoprotein receptor domain class A;


Pssm-ID: 395011  Cd Length: 37  Bit Score: 46.47  E-value: 3.49e-07
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 1039750796 212 PCPAGSFRCSSGLCVPQAQRCDGVNDCFDESDELFC 247
Cdd:pfam00057   2 TCSPNEFQCGSGECIPRSWVCDGDPDCGDGSDEENC 37
CUB smart00042
Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found ...
2-86 2.01e-06

Domain first found in C1r, C1s, uEGF, and bone morphogenetic protein; This domain is found mostly among developmentally-regulated proteins. Spermadhesins contain only this domain.


Pssm-ID: 214483 [Multi-domain]  Cd Length: 102  Bit Score: 46.61  E-value: 2.01e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039750796    2 CHFKLVAIVGYLIRLSIESIQLEA-DNCITDSLTVYDSlLPIRSAILYRICEPTRTLMSFVSTNNLMLVILKSPYVRRLA 80
Cdd:smart00042  18 CVWTIRAPPGYRIELQFTDFDLESsDNCEYDYVEIYDG-PSASSPLLGRFCGSEAPPPVISSSSNSLTLTFVSDSSVQKR 96

                   ....*.
gi 1039750796   81 GIRAYF 86
Cdd:smart00042  97 GFSARY 102
LDLa smart00192
Low-density lipoprotein receptor domain class A; Cysteine-rich repeat in the low-density ...
288-320 4.23e-06

Low-density lipoprotein receptor domain class A; Cysteine-rich repeat in the low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. The N-terminal type A repeats in LDL receptor bind the lipoproteins. Other homologous domains occur in related receptors, including the very low-density lipoprotein receptor and the LDL receptor-related protein/alpha 2-macroglobulin receptor, and in proteins which are functionally unrelated, such as the C9 component of complement. Mutations in the LDL receptor gene cause familial hypercholesterolemia.


Pssm-ID: 197566  Cd Length: 33  Bit Score: 43.39  E-value: 4.23e-06
                           10        20        30
                   ....*....|....*....|....*....|...
gi 1039750796  288 CTSRTFKCGNDICfRKQNAQCDGIVDCPDGSDE 320
Cdd:smart00192   2 CPPGEFQCDNGRC-IPSSWVCDGVDDCGDGSDE 33
Ldl_recept_a pfam00057
Low-density lipoprotein receptor domain class A;
287-323 4.23e-05

Low-density lipoprotein receptor domain class A;


Pssm-ID: 395011  Cd Length: 37  Bit Score: 40.69  E-value: 4.23e-05
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 1039750796 287 PCTSRTFKCGNDICFrKQNAQCDGIVDCPDGSDEEGC 323
Cdd:pfam00057   2 TCSPNEFQCGSGECI-PRSWVCDGDPDCGDGSDEENC 37
CUB pfam00431
CUB domain;
2-86 3.63e-03

CUB domain;


Pssm-ID: 395345 [Multi-domain]  Cd Length: 110  Bit Score: 37.27  E-value: 3.63e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1039750796   2 CHFKLVAIVGYLIRLSIESIQLEA-DNCITDSLTVYDSllPIRSAILY-RICEPTRTLmSFVSTNNLMLVILKSPYVRRL 79
Cdd:pfam00431  27 CVWLIRAPPGFRVKLTFQDFELEDhDECGYDYVEIRDG--PSASSPLLgRFCGSGIPE-DIVSSSNQMTIKFVSDASVQK 103

                  ....*..
gi 1039750796  80 AGIRAYF 86
Cdd:pfam00431 104 RGFKATY 110
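The pairwise alignments above can be summarised as a percent identity. The sketch below is a hedged illustration in Python: it assumes the usual CD-Search display convention that uppercase letters are aligned residues, lowercase letters are unaligned residues, and "-" is a gap, and it counts identities only over columns where both rows are uppercase. The helper name alignment_identity is hypothetical. The two rows are copied from the LDLa (cd00112) hit at residues 288-323.

```python
def alignment_identity(query_row, model_row):
    """Return (identical, aligned) column counts for two alignment rows.

    Only columns where both rows carry an uppercase residue are treated
    as aligned; lowercase residues and '-' gaps are skipped.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have equal length")
    identical = 0
    aligned = 0
    for q, m in zip(query_row, model_row):
        if q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    return identical, aligned


# Rows from the cd00112 hit covering query residues 288-323.
QUERY = "CTSRTFKCGNDICFRKqNAQCDGIVDCPDGSDEEGC"
MODEL = "CPPNEFRCANGRCIPS-SWVCDGEDDCGDGSDEENC"

identical, aligned = alignment_identity(QUERY, MODEL)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
```

For alignments that wrap over several blocks, the sequence segments of each block would need to be concatenated per row before calling the function.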
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
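
The E-value threshold of 0.01 determines which hits are reported. As a rough illustration, the Python sketch below re-applies a threshold to the hits listed on this page; the tuple layout and the function name filter_hits are assumptions made for the example, and the values are copied from the hit tables above.

```python
# (accession, query interval, E-value) for each hit reported on this page.
HITS = [
    ("smart00020", "334-564", 4.59e-92),
    ("cd00190", "335-566", 3.97e-91),
    ("COG5640", "334-571", 2.56e-71),
    ("pfam00089", "335-564", 9.52e-68),
    ("cd00112", "213-247", 3.17e-10),
    ("smart00192", "213-244", 5.24e-10),
    ("cd00041", "2-88", 3.39e-08),
    ("cd00112", "288-323", 2.82e-07),
    ("pfam00057", "212-247", 3.49e-07),
    ("smart00042", "2-86", 2.01e-06),
    ("smart00192", "288-320", 4.23e-06),
    ("pfam00057", "287-323", 4.23e-05),
    ("pfam00431", "2-86", 3.63e-03),
]


def filter_hits(hits, threshold=0.01):
    """Keep hits whose E-value is at or below the threshold, best first."""
    kept = [hit for hit in hits if hit[2] <= threshold]
    return sorted(kept, key=lambda hit: hit[2])


reported = filter_hits(HITS)           # threshold used for this search
stricter = filter_hits(HITS, 1e-05)    # a tighter, hypothetical cutoff
print(len(reported), len(stricter))
```

With the page's own threshold every listed hit is retained; the tighter cutoff is only there to show how weaker hits such as pfam00431 would drop out.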
