NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|8850219|ref|NP_059066|]
View 

haptoglobin isoform 1 preproprotein [Mus musculus]

Protein Classification

haptoglobin( domain architecture ID 10034116)

haptoglobin binds to free hemoglobin released from erythrocytes with high affinity and thereby inhibits its deleterious oxidative activity

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Tryp_SPc cd00190
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens ...
103-343 3.62e-56

Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.


:

Pssm-ID: 238113 [Multi-domain]  Cd Length: 232  Bit Score: 182.86  E-value: 3.62e-56
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219  103 IIGGSMDAKGSFPWQAKM-ISRHGLTTGATLISDQWLLTTAKNLFLNHSETASAKDITPTLTLYVGKNQLVEIEKVVLHP 181
Cdd:cd00190   1 IVGGSEAKIGSFPWQVSLqYTGGRHFCGGSLISPRWVLTAAHCVYSSAPSNYTVRLGSHDLSSNEGGGQVIKVKKVIVHP 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219  182 NHSVV----DIGLIKLKQRVLVTERVMPICLPSKDYIAP-GRVGYVSGWGRNA-NFRFTDRLKYVMLPVADQDKCVVHYE 255
Cdd:cd00190  81 NYNPStydnDIALLKLKRPVTLSDNVRPICLPSSGYNLPaGTTCTVSGWGRTSeGGPLPDVLQEVNVPIVSNAECKRAYS 160
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219  256 NstvpekknltspvgvQPILNEHTFCAGLTKYQEDTCYGDAGSAFAIHDmeEDTWYAAGILSFDKSCAVAEY-GVYVRAT 334
Cdd:cd00190 161 Y---------------GGTITDNMLCAGGLEGGKDACQGDSGGPLVCND--NGRGVLVGIVSWGSGCARPNYpGVYTRVS 223

                ....*....
gi 8850219  335 DLKDWVQET 343
Cdd:cd00190 224 SYLDWIQKT 232
CCP cd00033
Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) ...
33-87 3.45e-03

Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) have been identified in several proteins of the complement system; SUSHI repeats (short complement-like repeat, SCR) are abundant in complement control proteins. The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.


:

Pssm-ID: 153056 [Multi-domain]  Cd Length: 57  Bit Score: 35.52  E-value: 3.45e-03
                        10        20        30        40        50        60
                ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 8850219   33 CPKPPEIANGYVEHL---------VRYRCRQFYRLRaeGDGVYTLNDEKQWVNTVageklPECE 87
Cdd:cd00033   1 CPPPPVPENGTVTGSkgsysygstVTYSCNEGYTLV--GSSTITCTENGGWSPPP-----PTCE 57
 
Name Accession Description Interval E-value
Tryp_SPc cd00190
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens ...
103-343 3.62e-56

Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.


Pssm-ID: 238113 [Multi-domain]  Cd Length: 232  Bit Score: 182.86  E-value: 3.62e-56
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219  103 IIGGSMDAKGSFPWQAKM-ISRHGLTTGATLISDQWLLTTAKNLFLNHSETASAKDITPTLTLYVGKNQLVEIEKVVLHP 181
Cdd:cd00190   1 IVGGSEAKIGSFPWQVSLqYTGGRHFCGGSLISPRWVLTAAHCVYSSAPSNYTVRLGSHDLSSNEGGGQVIKVKKVIVHP 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219  182 NHSVV----DIGLIKLKQRVLVTERVMPICLPSKDYIAP-GRVGYVSGWGRNA-NFRFTDRLKYVMLPVADQDKCVVHYE 255
Cdd:cd00190  81 NYNPStydnDIALLKLKRPVTLSDNVRPICLPSSGYNLPaGTTCTVSGWGRTSeGGPLPDVLQEVNVPIVSNAECKRAYS 160
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219  256 NstvpekknltspvgvQPILNEHTFCAGLTKYQEDTCYGDAGSAFAIHDmeEDTWYAAGILSFDKSCAVAEY-GVYVRAT 334
Cdd:cd00190 161 Y---------------GGTITDNMLCAGGLEGGKDACQGDSGGPLVCND--NGRGVLVGIVSWGSGCARPNYpGVYTRVS 223

                ....*....
gi 8850219  335 DLKDWVQET 343
Cdd:cd00190 224 SYLDWIQKT 232
Tryp_SPc smart00020
Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens ...
102-340 2.94e-53

Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.


Pssm-ID: 214473  Cd Length: 229  Bit Score: 175.17  E-value: 2.94e-53
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219     102 RIIGGSMDAKGSFPWQAKM-ISRHGLTTGATLISDQWLLTTAKNLFLNHsetasAKDITPTL----TLYVGKNQLVEIEK 176
Cdd:smart00020   1 RIVGGSEANIGSFPWQVSLqYGGGRHFCGGSLISPRWVLTAAHCVRGSD-----PSNIRVRLgshdLSSGEEGQVIKVSK 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219     177 VVLHPNHSVV----DIGLIKLKQRVLVTERVMPICLPSKDY-IAPGRVGYVSGWGR--NANFRFTDRLKYVMLPVADQDK 249
Cdd:smart00020  76 VIIHPNYNPStydnDIALLKLKEPVTLSDNVRPICLPSSNYnVPAGTTCTVSGWGRtsEGAGSLPDTLQEVNVPIVSNAT 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219     250 CVVHYENstvpekknltspvgvQPILNEHTFCAGLTKYQEDTCYGDAGSAFAIHDmeeDTWYAAGILSFDKSCAVAEY-G 328
Cdd:smart00020 156 CRRAYSG---------------GGAITDNMLCAGGLEGGKDACQGDSGGPLVCND---GRWVLVGIVSWGSGCARPGKpG 217
                          250
                   ....*....|..
gi 8850219     329 VYVRATDLKDWV 340
Cdd:smart00020 218 VYTRVSSYLDWI 229
Trypsin pfam00089
Trypsin;
103-340 4.43e-46

Trypsin;


Pssm-ID: 459667 [Multi-domain]  Cd Length: 219  Bit Score: 156.06  E-value: 4.43e-46
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219    103 IIGGSMDAKGSFPWQAKM-ISRHGLTTGATLISDQWLLTtAKNLFLNHSETAsAKDITPTLTLYVGKNQLVEIEKVVLHP 181
Cdd:pfam00089   1 IVGGDEAQPGSFPWQVSLqLSSGKHFCGGSLISENWVLT-AAHCVSGASDVK-VVLGAHNIVLREGGEQKFDVEKIIVHP 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219    182 NHSVV----DIGLIKLKQRVLVTERVMPICLPSK-DYIAPGRVGYVSGWGRNANFRFTDRLKYVMLPVADQDKCVVHYEN 256
Cdd:pfam00089  79 NYNPDtldnDIALLKLESPVTLGDTVRPICLPDAsSDLPVGTTCTVSGWGNTKTLGPSDTLQEVTVPVVSRETCRSAYGG 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219    257 StvpekknltspvgvqpiLNEHTFCAGLTKyqEDTCYGDAGSAFAIHDMeedtwYAAGILSFDKSCAVAEY-GVYVRATD 335
Cdd:pfam00089 159 T-----------------VTDTMICAGAGG--KDACQGDSGGPLVCSDG-----ELIGIVSWGYGCASGNYpGVYTPVSS 214

                  ....*
gi 8850219    336 LKDWV 340
Cdd:pfam00089 215 YLDWI 219
COG5640 COG5640
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, ...
96-347 4.71e-31

Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 444365 [Multi-domain]  Cd Length: 262  Bit Score: 118.21  E-value: 4.71e-31
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219   96 PVDQVQRIIGGSMDAKGSFPWQAKMISRHGLTT---GATLISDQWLLTTAknlflnH-SETASAKDITptltLYVGKN-- 169
Cdd:COG5640  24 AADAAPAIVGGTPATVGEYPWMVALQSSNGPSGqfcGGTLIAPRWVLTAA------HcVDGDGPSDLR----VVIGSTdl 93
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219  170 -----QLVEIEKVVLHPNHSVV----DIGLIKLKQRVlvtERVMPICLP-SKDYIAPGRVGYVSGWGRNANFR--FTDRL 237
Cdd:COG5640  94 stsggTVVKVARIVVHPDYDPAtpgnDIALLKLATPV---PGVAPAPLAtSADAAAPGTPATVAGWGRTSEGPgsQSGTL 170
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219  238 KYVMLPVADQDKCVvhyenstvpekknltspvGVQPILNEHTFCAGLTKYQEDTCYGDAGS-AFAIHDmeeDTWYAAGIL 316
Cdd:COG5640 171 RKADVPVVSDATCA------------------AYGGFDGGTMLCAGYPEGGKDACQGDSGGpLVVKDG---GGWVLVGVV 229
                       250       260       270
                ....*....|....*....|....*....|..
gi 8850219  317 SF-DKSCAVAEYGVYVRATDLKDWVQETMAKN 347
Cdd:COG5640 230 SWgGGPCAAGYPGVYTRVSAYRDWIKSTAGGL 261
CCP cd00033
Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) ...
33-87 3.45e-03

Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) have been identified in several proteins of the complement system; SUSHI repeats (short complement-like repeat, SCR) are abundant in complement control proteins. The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.


Pssm-ID: 153056 [Multi-domain]  Cd Length: 57  Bit Score: 35.52  E-value: 3.45e-03
                        10        20        30        40        50        60
                ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 8850219   33 CPKPPEIANGYVEHL---------VRYRCRQFYRLRaeGDGVYTLNDEKQWVNTVageklPECE 87
Cdd:cd00033   1 CPPPPVPENGTVTGSkgsysygstVTYSCNEGYTLV--GSSTITCTENGGWSPPP-----PTCE 57
 
Name Accession Description Interval E-value
Tryp_SPc cd00190
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens ...
103-343 3.62e-56

Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.


Pssm-ID: 238113 [Multi-domain]  Cd Length: 232  Bit Score: 182.86  E-value: 3.62e-56
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219  103 IIGGSMDAKGSFPWQAKM-ISRHGLTTGATLISDQWLLTTAKNLFLNHSETASAKDITPTLTLYVGKNQLVEIEKVVLHP 181
Cdd:cd00190   1 IVGGSEAKIGSFPWQVSLqYTGGRHFCGGSLISPRWVLTAAHCVYSSAPSNYTVRLGSHDLSSNEGGGQVIKVKKVIVHP 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219  182 NHSVV----DIGLIKLKQRVLVTERVMPICLPSKDYIAP-GRVGYVSGWGRNA-NFRFTDRLKYVMLPVADQDKCVVHYE 255
Cdd:cd00190  81 NYNPStydnDIALLKLKRPVTLSDNVRPICLPSSGYNLPaGTTCTVSGWGRTSeGGPLPDVLQEVNVPIVSNAECKRAYS 160
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219  256 NstvpekknltspvgvQPILNEHTFCAGLTKYQEDTCYGDAGSAFAIHDmeEDTWYAAGILSFDKSCAVAEY-GVYVRAT 334
Cdd:cd00190 161 Y---------------GGTITDNMLCAGGLEGGKDACQGDSGGPLVCND--NGRGVLVGIVSWGSGCARPNYpGVYTRVS 223

                ....*....
gi 8850219  335 DLKDWVQET 343
Cdd:cd00190 224 SYLDWIQKT 232
Tryp_SPc smart00020
Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens ...
102-340 2.94e-53

Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.


Pssm-ID: 214473  Cd Length: 229  Bit Score: 175.17  E-value: 2.94e-53
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219     102 RIIGGSMDAKGSFPWQAKM-ISRHGLTTGATLISDQWLLTTAKNLFLNHsetasAKDITPTL----TLYVGKNQLVEIEK 176
Cdd:smart00020   1 RIVGGSEANIGSFPWQVSLqYGGGRHFCGGSLISPRWVLTAAHCVRGSD-----PSNIRVRLgshdLSSGEEGQVIKVSK 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219     177 VVLHPNHSVV----DIGLIKLKQRVLVTERVMPICLPSKDY-IAPGRVGYVSGWGR--NANFRFTDRLKYVMLPVADQDK 249
Cdd:smart00020  76 VIIHPNYNPStydnDIALLKLKEPVTLSDNVRPICLPSSNYnVPAGTTCTVSGWGRtsEGAGSLPDTLQEVNVPIVSNAT 155
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219     250 CVVHYENstvpekknltspvgvQPILNEHTFCAGLTKYQEDTCYGDAGSAFAIHDmeeDTWYAAGILSFDKSCAVAEY-G 328
Cdd:smart00020 156 CRRAYSG---------------GGAITDNMLCAGGLEGGKDACQGDSGGPLVCND---GRWVLVGIVSWGSGCARPGKpG 217
                          250
                   ....*....|..
gi 8850219     329 VYVRATDLKDWV 340
Cdd:smart00020 218 VYTRVSSYLDWI 229
Trypsin pfam00089
Trypsin;
103-340 4.43e-46

Trypsin;


Pssm-ID: 459667 [Multi-domain]  Cd Length: 219  Bit Score: 156.06  E-value: 4.43e-46
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219    103 IIGGSMDAKGSFPWQAKM-ISRHGLTTGATLISDQWLLTtAKNLFLNHSETAsAKDITPTLTLYVGKNQLVEIEKVVLHP 181
Cdd:pfam00089   1 IVGGDEAQPGSFPWQVSLqLSSGKHFCGGSLISENWVLT-AAHCVSGASDVK-VVLGAHNIVLREGGEQKFDVEKIIVHP 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219    182 NHSVV----DIGLIKLKQRVLVTERVMPICLPSK-DYIAPGRVGYVSGWGRNANFRFTDRLKYVMLPVADQDKCVVHYEN 256
Cdd:pfam00089  79 NYNPDtldnDIALLKLESPVTLGDTVRPICLPDAsSDLPVGTTCTVSGWGNTKTLGPSDTLQEVTVPVVSRETCRSAYGG 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219    257 StvpekknltspvgvqpiLNEHTFCAGLTKyqEDTCYGDAGSAFAIHDMeedtwYAAGILSFDKSCAVAEY-GVYVRATD 335
Cdd:pfam00089 159 T-----------------VTDTMICAGAGG--KDACQGDSGGPLVCSDG-----ELIGIVSWGYGCASGNYpGVYTPVSS 214

                  ....*
gi 8850219    336 LKDWV 340
Cdd:pfam00089 215 YLDWI 219
COG5640 COG5640
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, ...
96-347 4.71e-31

Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 444365 [Multi-domain]  Cd Length: 262  Bit Score: 118.21  E-value: 4.71e-31
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219   96 PVDQVQRIIGGSMDAKGSFPWQAKMISRHGLTT---GATLISDQWLLTTAknlflnH-SETASAKDITptltLYVGKN-- 169
Cdd:COG5640  24 AADAAPAIVGGTPATVGEYPWMVALQSSNGPSGqfcGGTLIAPRWVLTAA------HcVDGDGPSDLR----VVIGSTdl 93
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219  170 -----QLVEIEKVVLHPNHSVV----DIGLIKLKQRVlvtERVMPICLP-SKDYIAPGRVGYVSGWGRNANFR--FTDRL 237
Cdd:COG5640  94 stsggTVVKVARIVVHPDYDPAtpgnDIALLKLATPV---PGVAPAPLAtSADAAAPGTPATVAGWGRTSEGPgsQSGTL 170
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8850219  238 KYVMLPVADQDKCVvhyenstvpekknltspvGVQPILNEHTFCAGLTKYQEDTCYGDAGS-AFAIHDmeeDTWYAAGIL 316
Cdd:COG5640 171 RKADVPVVSDATCA------------------AYGGFDGGTMLCAGYPEGGKDACQGDSGGpLVVKDG---GGWVLVGVV 229
                       250       260       270
                ....*....|....*....|....*....|..
gi 8850219  317 SF-DKSCAVAEYGVYVRATDLKDWVQETMAKN 347
Cdd:COG5640 230 SWgGGPCAAGYPGVYTRVSAYRDWIKSTAGGL 261
CCP cd00033
Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) ...
33-87 3.45e-03

Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) have been identified in several proteins of the complement system; SUSHI repeats (short complement-like repeat, SCR) are abundant in complement control proteins. The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.


Pssm-ID: 153056 [Multi-domain]  Cd Length: 57  Bit Score: 35.52  E-value: 3.45e-03
                        10        20        30        40        50        60
                ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 8850219   33 CPKPPEIANGYVEHL---------VRYRCRQFYRLRaeGDGVYTLNDEKQWVNTVageklPECE 87
Cdd:cd00033   1 CPPPPVPENGTVTGSkgsysygstVTYSCNEGYTLV--GSSTITCTENGGWSPPP-----PTCE 57
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH