NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|14550407|ref|NP_000054|]
View 

complement C2 isoform 1 preproprotein [Homo sapiens]

Protein Classification

serine protease( domain architecture ID 13332040)

trypsin-like serine protease such as human plasminogen, the precursor of the widely distributed protease plasmin, or granzyme B, a human enzyme necessary for target cell lysis in cell-mediated immune responses

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
vWA_complement_factors cd01470
Complement factors B and C2 are two critical proteases for complement activation. They both ...
253-450 7.09e-96

Complement factors B and C2 are two critical proteases for complement activation. They both contain three CCP or Sushi domains, a trypsin-type serine protease domain and a single VWA domain with a conserved metal ion dependent adhesion site referred commonly as the MIDAS motif. Orthologues of these molecules are found from echinoderms to chordates. During complement activation, the CCP domains are cleaved off, resulting in the formation of an active protease that cleaves and activates complement C3. Complement C2 is in the classical pathway and complement B is in the alternative pathway. The interaction of C2 with C4 and of factor B with C3b are both dependent on Mg2+ binding sites within the VWA domains and the VWA domain of factor B has been shown to mediate the binding of C3. This is consistent with the common inferred function of VWA domains as magnesium-dependent protein interaction domains.


:

Pssm-ID: 238747 [Multi-domain]  Cd Length: 198  Bit Score: 295.74  E-value: 7.09e-96
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 253 LNLYLLLDCSQSVSENDFLIFKESASLMVDRIFSFEINVSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHE 332
Cdd:cd01470   1 LNIYIALDASDSIGEEDFDEAKNAIKTLIEKISSYEVSPRYEIISYASDPKEIVSIRDFNSNDADDVIKRLEDFNYDDHG 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 333 NGTGTNTYAALNSVYLMmnnqMRLLGME-TMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQK----RNDYLD 407
Cdd:cd01470  81 DKTGTNTAAALKKVYER----MALEKVRnKEAFNETRHVIILFTDGKSNMGGSPLPTVDKIKNLVYKNNKsdnpREDYLD 156
                       170       180       190       200
                ....*....|....*....|....*....|....*....|...
gi 14550407 408 IYAIGVGKlDVDWRELNELGSKKDGERHAFILQDTKALHQVFE 450
Cdd:cd01470 157 VYVFGVGD-DVNKEELNDLASKKDNERHFFKLKDYEDLQEVFD 198
Tryp_SPc cd00190
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens ...
473-742 7.95e-39

Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.


:

Pssm-ID: 238113 [Multi-domain]  Cd Length: 232  Bit Score: 143.96  E-value: 7.95e-39
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 473 SDQERTPWHVTIKPKS-QETCRGALISDQWVLTAAHCFRdGNDHSLWRVNVGD----PKSQWGKEFLIEKAVISPGFDVF 547
Cdd:cd00190   7 AKIGSFPWQVSLQYTGgRHFCGGSLISPRWVLTAAHCVY-SSAPSNYTVRLGShdlsSNEGGGQVIKVKKVIVHPNYNPS 85
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 548 akknqgileFYGDDIALLKLAQKVKMSTHARPICLPCTMEANlalrrPQGSTCR-----DHENEllnkqsvpahfVALNG 622
Cdd:cd00190  86 ---------TYDNDIALLKLKRPVTLSDNVRPICLPSSGYNL-----PAGTTCTvsgwgRTSEG-----------GPLPD 140
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 623 SKLNINLKMgVEWTSCAEVVSQektmfpnltdvREVVTDQFLCSGTQE-DESPCKGESGGAVFLERRFRFFQVGLVSWGl 701
Cdd:cd00190 141 VLQEVNVPI-VSNAECKRAYSY-----------GGTITDNMLCAGGLEgGKDACQGDSGGPLVCNDNGRGVLVGIVSWG- 207
                       250       260       270       280
                ....*....|....*....|....*....|....*....|.
gi 14550407 702 ynpcLGSADKNSrkraprskvppPRDFHiNLFRMQPWLRQH 742
Cdd:cd00190 208 ----SGCARPNY-----------PGVYT-RVSSYLDWIQKT 232
CCP cd00033
Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) ...
89-144 4.19e-16

Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) have been identified in several proteins of the complement system; SUSHI repeats (short complement-like repeat, SCR) are abundant in complement control proteins. The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.


:

Pssm-ID: 153056 [Multi-domain]  Cd Length: 57  Bit Score: 72.88  E-value: 4.19e-16
                        10        20        30        40        50
                ....*....|....*....|....*....|....*....|....*....|....*.
gi 14550407  89 CPAPVSFENGIYTPRLGSYPVGGNVSFECEDGFILRGSPVRQCRPNGMWDGETAVC 144
Cdd:cd00033   1 CPPPPVPENGTVTGSKGSYSYGSTVTYSCNEGYTLVGSSTITCTENGGWSPPPPTC 56
PHA02927 super family cl33700
secreted complement-binding protein; Provisional
24-204 7.78e-16

secreted complement-binding protein; Provisional


The actual alignment was detected with superfamily member PHA02927:

Pssm-ID: 222943 [Multi-domain]  Cd Length: 263  Bit Score: 78.16  E-value: 7.78e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   24 CPQNVNISGGTFTLShGWAPGSLLTYSCPQGLYpspasrLCKSSGQWQTPGATRSL----SKAVCKPVRCPAPVSFENGI 99
Cdd:PHA02927  86 CPSPRDIDNGQLDIG-GVDFGSSITYSCNSGYQ------LIGESKSYCELGSTGSMvwnpEAPICESVKCQSPPSISNGR 158
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407  100 YTPRLGSYPVGGNVSFECEDGFILRGSPVRQCRpNGMWDgETAVCDngAGHCPNPGISLGAVRTGFR--FGHGDKVRYRC 177
Cdd:PHA02927 159 HNGYEDFYTDGSVVTYSCNSGYSLIGNSGVLCS-GGEWS-DPPTCQ--IVKCPHPTISNGYLSSGFKrsYSYNDNVDFKC 234
                        170       180
                 ....*....|....*....|....*..
gi 14550407  178 SSNLVLTGSSERECQGNGVWSGTEPIC 204
Cdd:PHA02927 235 KYGYKLSGSSSSTCSPGNTWQPELPKC 261
 
Name Accession Description Interval E-value
vWA_complement_factors cd01470
Complement factors B and C2 are two critical proteases for complement activation. They both ...
253-450 7.09e-96

Complement factors B and C2 are two critical proteases for complement activation. They both contain three CCP or Sushi domains, a trypsin-type serine protease domain and a single VWA domain with a conserved metal ion dependent adhesion site referred commonly as the MIDAS motif. Orthologues of these molecules are found from echinoderms to chordates. During complement activation, the CCP domains are cleaved off, resulting in the formation of an active protease that cleaves and activates complement C3. Complement C2 is in the classical pathway and complement B is in the alternative pathway. The interaction of C2 with C4 and of factor B with C3b are both dependent on Mg2+ binding sites within the VWA domains and the VWA domain of factor B has been shown to mediate the binding of C3. This is consistent with the common inferred function of VWA domains as magnesium-dependent protein interaction domains.


Pssm-ID: 238747 [Multi-domain]  Cd Length: 198  Bit Score: 295.74  E-value: 7.09e-96
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 253 LNLYLLLDCSQSVSENDFLIFKESASLMVDRIFSFEINVSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHE 332
Cdd:cd01470   1 LNIYIALDASDSIGEEDFDEAKNAIKTLIEKISSYEVSPRYEIISYASDPKEIVSIRDFNSNDADDVIKRLEDFNYDDHG 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 333 NGTGTNTYAALNSVYLMmnnqMRLLGME-TMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQK----RNDYLD 407
Cdd:cd01470  81 DKTGTNTAAALKKVYER----MALEKVRnKEAFNETRHVIILFTDGKSNMGGSPLPTVDKIKNLVYKNNKsdnpREDYLD 156
                       170       180       190       200
                ....*....|....*....|....*....|....*....|...
gi 14550407 408 IYAIGVGKlDVDWRELNELGSKKDGERHAFILQDTKALHQVFE 450
Cdd:cd01470 157 VYVFGVGD-DVNKEELNDLASKKDNERHFFKLKDYEDLQEVFD 198
Tryp_SPc cd00190
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens ...
473-742 7.95e-39

Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.


Pssm-ID: 238113 [Multi-domain]  Cd Length: 232  Bit Score: 143.96  E-value: 7.95e-39
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 473 SDQERTPWHVTIKPKS-QETCRGALISDQWVLTAAHCFRdGNDHSLWRVNVGD----PKSQWGKEFLIEKAVISPGFDVF 547
Cdd:cd00190   7 AKIGSFPWQVSLQYTGgRHFCGGSLISPRWVLTAAHCVY-SSAPSNYTVRLGShdlsSNEGGGQVIKVKKVIVHPNYNPS 85
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 548 akknqgileFYGDDIALLKLAQKVKMSTHARPICLPCTMEANlalrrPQGSTCR-----DHENEllnkqsvpahfVALNG 622
Cdd:cd00190  86 ---------TYDNDIALLKLKRPVTLSDNVRPICLPSSGYNL-----PAGTTCTvsgwgRTSEG-----------GPLPD 140
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 623 SKLNINLKMgVEWTSCAEVVSQektmfpnltdvREVVTDQFLCSGTQE-DESPCKGESGGAVFLERRFRFFQVGLVSWGl 701
Cdd:cd00190 141 VLQEVNVPI-VSNAECKRAYSY-----------GGTITDNMLCAGGLEgGKDACQGDSGGPLVCNDNGRGVLVGIVSWG- 207
                       250       260       270       280
                ....*....|....*....|....*....|....*....|.
gi 14550407 702 ynpcLGSADKNSrkraprskvppPRDFHiNLFRMQPWLRQH 742
Cdd:cd00190 208 ----SGCARPNY-----------PGVYT-RVSSYLDWIQKT 232
Tryp_SPc smart00020
Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens ...
473-705 3.40e-37

Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.


Pssm-ID: 214473  Cd Length: 229  Bit Score: 138.96  E-value: 3.40e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407    473 SDQERTPWHVTIKPKS-QETCRGALISDQWVLTAAHCFRdGNDHSLWRVNVGDPKSQWGKE---FLIEKAVISPGFDvfa 548
Cdd:smart00020   8 ANIGSFPWQVSLQYGGgRHFCGGSLISPRWVLTAAHCVR-GSDPSNIRVRLGSHDLSSGEEgqvIKVSKVIIHPNYN--- 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407    549 kknqgiLEFYGDDIALLKLAQKVKMSTHARPICLPCTmeanlALRRPQGSTCRD----HENEllnkqSVPAHFVALNGSK 624
Cdd:smart00020  84 ------PSTYDNDIALLKLKEPVTLSDNVRPICLPSS-----NYNVPAGTTCTVsgwgRTSE-----GAGSLPDTLQEVN 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407    625 LNInlkmgVEWTSCAEVVSQektmfpnltdvREVVTDQFLCSGTQE-DESPCKGESGGAVFLERRfRFFQVGLVSWGlyN 703
Cdd:smart00020 148 VPI-----VSNATCRRAYSG-----------GGAITDNMLCAGGLEgGKDACQGDSGGPLVCNDG-RWVLVGIVSWG--S 208

                   ..
gi 14550407    704 PC 705
Cdd:smart00020 209 GC 210
VWA pfam00092
von Willebrand factor type A domain;
254-451 3.60e-31

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 120.07  E-value: 3.60e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   254 NLYLLLDCSQSVSENDFLIFKESASLMVDRIFSFEINVSVAIITFASEPKVLMSvLNDNsRDMTEVISSLENANYKDH-E 332
Cdd:pfam00092   1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLDIGPDGTRVGLVQYSSDVRTEFP-LNDY-SSKEELLSAVDNLRYLGGgT 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   333 NGTGTNTYAALNSVYLMMNNQMRllgmetmawqEIRHAIILLTDGKSNMgGSPKTAVDHIREiLNINqkrndyldIYAIG 412
Cdd:pfam00092  79 TNTGKALKYALENLFSSAAGARP----------GAPKVVVLLTDGRSQD-GDPEEVARELKS-AGVT--------VFAVG 138
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 14550407   413 VGklDVDWRELNELGSKKDgERHAFILQDTKALHQVFEH 451
Cdd:pfam00092 139 VG--NADDEELRKIASEPG-EGHVFTVSDFEALEDLQDQ 174
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
254-447 6.76e-27

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 107.93  E-value: 6.76e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407    254 NLYLLLDCSQSVSENDFLIFKESASLMVDRIFSFEINVSVAIITFASEPKVLMSVlnDNSRDMTEVISSLENANYKDhen 333
Cdd:smart00327   1 DVVFLLDGSGSMGGNRFELAKEFVLKLVEQLDIGPDGDRVGLVTFSDDARVLFPL--NDSRSKDALLEALASLSYKL--- 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407    334 GTGTNTYAALNSVYLMMNNQMRllgmetmAWQE-IRHAIILLTDGKSNMGGSpktavdhiREILNINQKRNDYLDIYAIG 412
Cdd:smart00327  76 GGGTNLGAALQYALENLFSKSA-------GSRRgAPKVVILITDGESNDGPK--------DLLKAAKELKRSGVKVFVVG 140
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 14550407    413 VGKlDVDWRELNELgSKKDGERHAFILQDTKALHQ 447
Cdd:smart00327 141 VGN-DVDEEELKKL-ASAPGGVYVFLPELLDLLID 173
Trypsin pfam00089
Trypsin;
479-700 2.01e-22

Trypsin;


Pssm-ID: 459667 [Multi-domain]  Cd Length: 219  Bit Score: 96.36  E-value: 2.01e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   479 PWHVTIK-PKSQETCRGALISDQWVLTAAHCFRDGNDhslWRVNVGDP----KSQWGKEFLIEKAVISPGFDVFAKKNqg 553
Cdd:pfam00089  13 PWQVSLQlSSGKHFCGGSLISENWVLTAAHCVSGASD---VKVVLGAHnivlREGGEQKFDVEKIIVHPNYNPDTLDN-- 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   554 ilefygdDIALLKLAQKVKMSTHARPICLPCTmeanlALRRPQGSTC------RDHEN---ELLNKQSVPahfvalngsk 624
Cdd:pfam00089  88 -------DIALLKLESPVTLGDTVRPICLPDA-----SSDLPVGTTCtvsgwgNTKTLgpsDTLQEVTVP---------- 145
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 14550407   625 lninlkmgvewtscaeVVSQEKtmfpNLTDVREVVTDQFLCSGTQeDESPCKGESGGAVFLERRfrfFQVGLVSWG 700
Cdd:pfam00089 146 ----------------VVSRET----CRSAYGGTVTDTMICAGAG-GKDACQGDSGGPLVCSDG---ELIGIVSWG 197
COG5640 COG5640
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, ...
471-709 1.31e-20

Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 444365 [Multi-domain]  Cd Length: 262  Bit Score: 92.02  E-value: 1.31e-20
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 471 NASDQERtPWHVTIKPKS---QETCRGALISDQWVLTAAHCFRDGNDHSLwRVNVG--DPKSQWGKEFLIEKAVISPGFD 545
Cdd:COG5640  36 PATVGEY-PWMVALQSSNgpsGQFCGGTLIAPRWVLTAAHCVDGDGPSDL-RVVIGstDLSTSGGTVVKVARIVVHPDYD 113
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 546 VFAkknqgilefYGDDIALLKLAQKVkmsTHARPICLPctmEANLALRRPQ-------GSTCRDHEN--ELLNKQSVPAh 616
Cdd:COG5640 114 PAT---------PGNDIALLKLATPV---PGVAPAPLA---TSADAAAPGTpatvagwGRTSEGPGSqsGTLRKADVPV- 177
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 617 fvalngsklninlkmgVEWTSCAevvsqektmfpnltDVREVVTDQFLCSGTQE-DESPCKGESGGAVFLERRFRFFQVG 695
Cdd:COG5640 178 ----------------VSDATCA--------------AYGGFDGGTMLCAGYPEgGKDACQGDSGGPLVVKDGGGWVLVG 227
                       250
                ....*....|....
gi 14550407 696 LVSWGlYNPCLGSA 709
Cdd:COG5640 228 VVSWG-GGPCAAGY 240
CCP cd00033
Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) ...
89-144 4.19e-16

Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) have been identified in several proteins of the complement system; SUSHI repeats (short complement-like repeat, SCR) are abundant in complement control proteins. The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.


Pssm-ID: 153056 [Multi-domain]  Cd Length: 57  Bit Score: 72.88  E-value: 4.19e-16
                        10        20        30        40        50
                ....*....|....*....|....*....|....*....|....*....|....*.
gi 14550407  89 CPAPVSFENGIYTPRLGSYPVGGNVSFECEDGFILRGSPVRQCRPNGMWDGETAVC 144
Cdd:cd00033   1 CPPPPVPENGTVTGSKGSYSYGSTVTYSCNEGYTLVGSSTITCTENGGWSPPPPTC 56
PHA02927 PHA02927
secreted complement-binding protein; Provisional
24-204 7.78e-16

secreted complement-binding protein; Provisional


Pssm-ID: 222943 [Multi-domain]  Cd Length: 263  Bit Score: 78.16  E-value: 7.78e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   24 CPQNVNISGGTFTLShGWAPGSLLTYSCPQGLYpspasrLCKSSGQWQTPGATRSL----SKAVCKPVRCPAPVSFENGI 99
Cdd:PHA02927  86 CPSPRDIDNGQLDIG-GVDFGSSITYSCNSGYQ------LIGESKSYCELGSTGSMvwnpEAPICESVKCQSPPSISNGR 158
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407  100 YTPRLGSYPVGGNVSFECEDGFILRGSPVRQCRpNGMWDgETAVCDngAGHCPNPGISLGAVRTGFR--FGHGDKVRYRC 177
Cdd:PHA02927 159 HNGYEDFYTDGSVVTYSCNSGYSLIGNSGVLCS-GGEWS-DPPTCQ--IVKCPHPTISNGYLSSGFKrsYSYNDNVDFKC 234
                        170       180
                 ....*....|....*....|....*..
gi 14550407  178 SSNLVLTGSSERECQGNGVWSGTEPIC 204
Cdd:PHA02927 235 KYGYKLSGSSSSTCSPGNTWQPELPKC 261
CCP smart00032
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat ...
89-144 7.02e-14

Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.


Pssm-ID: 214478 [Multi-domain]  Cd Length: 56  Bit Score: 66.40  E-value: 7.02e-14
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 14550407     89 CPAPVSFENGIYTPRLGSYPVGGNVSFECEDGFILRGSPVRQCRPNGMWDGETAVC 144
Cdd:smart00032   1 CPPPPDIENGTVTSSSGTYSYGDTVTYSCDPGYTLIGSSTITCLENGTWSPPPPTC 56
YfbK COG2304
Secreted protein containing bacterial Ig-like domain and vWFA domain [General function ...
242-484 2.06e-12

Secreted protein containing bacterial Ig-like domain and vWFA domain [General function prediction only];


Pssm-ID: 441879 [Multi-domain]  Cd Length: 289  Bit Score: 68.59  E-value: 2.06e-12
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 242 GRKIQIQRSGHLNLYLLLDCSQSVSENDFLIFKESASLMVDRIfsfEINVSVAIITFASEPKVLM-SVLNDNSRDMTEVI 320
Cdd:COG2304  81 PPKAAAEERPPLNLVFVIDVSGSMSGDKLELAKEAAKLLVDQL---RPGDRVSIVTFAGDARVLLpPTPATDRAKILAAI 157
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 321 SSLEnAnykdhenGTGTNTYAALNSVYlmmnNQMRllgmETMAWQEIRHaIILLTDGKSNMGgspKTAVDHIREILNINQ 400
Cdd:COG2304 158 DRLQ-A-------GGGTALGAGLELAY----ELAR----KHFIPGRVNR-VILLTDGDANVG---ITDPEELLKLAEEAR 217
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 401 KRNdyLDIYAIGVGkldVDWRE--LNELGSKKDGeRHAFIlQDTKALHQVFEhmldvsKLTDTIcGVGNMSANASDQERT 478
Cdd:COG2304 218 EEG--ITLTTLGVG---SDYNEdlLERLADAGGG-NYYYI-DDPEEAEKVFV------REFSRI-GYENRALATEDFPLP 283

                ....*.
gi 14550407 479 PWHVTI 484
Cdd:COG2304 284 YGTLKL 289
CCP cd00033
Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) ...
151-205 5.37e-12

Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) have been identified in several proteins of the complement system; SUSHI repeats (short complement-like repeat, SCR) are abundant in complement control proteins. The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.


Pssm-ID: 153056 [Multi-domain]  Cd Length: 57  Bit Score: 61.32  E-value: 5.37e-12
                        10        20        30        40        50
                ....*....|....*....|....*....|....*....|....*....|....*..
gi 14550407 151 CPNPGISLGAVRTGF--RFGHGDKVRYRCSSNLVLTGSSERECQGNGVWSGTEPICR 205
Cdd:cd00033   1 CPPPPVPENGTVTGSkgSYSYGSTVTYSCNEGYTLVGSSTITCTENGGWSPPPPTCE 57
Sushi pfam00084
Sushi repeat (SCR repeat);
89-144 7.84e-12

Sushi repeat (SCR repeat);


Pssm-ID: 459664 [Multi-domain]  Cd Length: 56  Bit Score: 60.59  E-value: 7.84e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 14550407    89 CPAPVSFENGIYTPRLGSYPVGGNVSFECEDGFILRGSPVRQCRPNGMWDGETAVC 144
Cdd:pfam00084   1 CPPPPDIPNGKVSATKNEYNYGASVSYECDPGYRLVGSPTITCQEDGTWSPPFPEC 56
CCP smart00032
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat ...
151-204 4.70e-11

Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.


Pssm-ID: 214478 [Multi-domain]  Cd Length: 56  Bit Score: 58.69  E-value: 4.70e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 14550407    151 CPNPG-ISLGAVRTGF-RFGHGDKVRYRCSSNLVLTGSSERECQGNGVWSGTEPIC 204
Cdd:smart00032   1 CPPPPdIENGTVTSSSgTYSYGDTVTYSCDPGYTLIGSSTITCLENGTWSPPPPTC 56
Sushi pfam00084
Sushi repeat (SCR repeat);
151-204 1.05e-07

Sushi repeat (SCR repeat);


Pssm-ID: 459664 [Multi-domain]  Cd Length: 56  Bit Score: 49.03  E-value: 1.05e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 14550407   151 CPNPGISLGAVRTGFRFGH--GDKVRYRCSSNLVLTGSSERECQGNGVWSGTEPIC 204
Cdd:pfam00084   1 CPPPPDIPNGKVSATKNEYnyGASVSYECDPGYRLVGSPTITCQEDGTWSPPFPEC 56
 
Name Accession Description Interval E-value
vWA_complement_factors cd01470
Complement factors B and C2 are two critical proteases for complement activation. They both ...
253-450 7.09e-96

Complement factors B and C2 are two critical proteases for complement activation. They both contain three CCP or Sushi domains, a trypsin-type serine protease domain and a single VWA domain with a conserved metal ion dependent adhesion site referred commonly as the MIDAS motif. Orthologues of these molecules are found from echinoderms to chordates. During complement activation, the CCP domains are cleaved off, resulting in the formation of an active protease that cleaves and activates complement C3. Complement C2 is in the classical pathway and complement B is in the alternative pathway. The interaction of C2 with C4 and of factor B with C3b are both dependent on Mg2+ binding sites within the VWA domains and the VWA domain of factor B has been shown to mediate the binding of C3. This is consistent with the common inferred function of VWA domains as magnesium-dependent protein interaction domains.


Pssm-ID: 238747 [Multi-domain]  Cd Length: 198  Bit Score: 295.74  E-value: 7.09e-96
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 253 LNLYLLLDCSQSVSENDFLIFKESASLMVDRIFSFEINVSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANYKDHE 332
Cdd:cd01470   1 LNIYIALDASDSIGEEDFDEAKNAIKTLIEKISSYEVSPRYEIISYASDPKEIVSIRDFNSNDADDVIKRLEDFNYDDHG 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 333 NGTGTNTYAALNSVYLMmnnqMRLLGME-TMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREILNINQK----RNDYLD 407
Cdd:cd01470  81 DKTGTNTAAALKKVYER----MALEKVRnKEAFNETRHVIILFTDGKSNMGGSPLPTVDKIKNLVYKNNKsdnpREDYLD 156
                       170       180       190       200
                ....*....|....*....|....*....|....*....|...
gi 14550407 408 IYAIGVGKlDVDWRELNELGSKKDGERHAFILQDTKALHQVFE 450
Cdd:cd01470 157 VYVFGVGD-DVNKEELNDLASKKDNERHFFKLKDYEDLQEVFD 198
Tryp_SPc cd00190
Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens ...
473-742 7.95e-39

Trypsin-like serine protease; Many of these are synthesized as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. Alignment contains also inactive enzymes that have substitutions of the catalytic triad residues.


Pssm-ID: 238113 [Multi-domain]  Cd Length: 232  Bit Score: 143.96  E-value: 7.95e-39
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 473 SDQERTPWHVTIKPKS-QETCRGALISDQWVLTAAHCFRdGNDHSLWRVNVGD----PKSQWGKEFLIEKAVISPGFDVF 547
Cdd:cd00190   7 AKIGSFPWQVSLQYTGgRHFCGGSLISPRWVLTAAHCVY-SSAPSNYTVRLGShdlsSNEGGGQVIKVKKVIVHPNYNPS 85
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 548 akknqgileFYGDDIALLKLAQKVKMSTHARPICLPCTMEANlalrrPQGSTCR-----DHENEllnkqsvpahfVALNG 622
Cdd:cd00190  86 ---------TYDNDIALLKLKRPVTLSDNVRPICLPSSGYNL-----PAGTTCTvsgwgRTSEG-----------GPLPD 140
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 623 SKLNINLKMgVEWTSCAEVVSQektmfpnltdvREVVTDQFLCSGTQE-DESPCKGESGGAVFLERRFRFFQVGLVSWGl 701
Cdd:cd00190 141 VLQEVNVPI-VSNAECKRAYSY-----------GGTITDNMLCAGGLEgGKDACQGDSGGPLVCNDNGRGVLVGIVSWG- 207
                       250       260       270       280
                ....*....|....*....|....*....|....*....|.
gi 14550407 702 ynpcLGSADKNSrkraprskvppPRDFHiNLFRMQPWLRQH 742
Cdd:cd00190 208 ----SGCARPNY-----------PGVYT-RVSSYLDWIQKT 232
Tryp_SPc smart00020
Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens ...
473-705 3.40e-37

Trypsin-like serine protease; Many of these are synthesised as inactive precursor zymogens that are cleaved during limited proteolysis to generate their active forms. A few, however, are active as single chain molecules, and others are inactive due to substitutions of the catalytic triad residues.


Pssm-ID: 214473  Cd Length: 229  Bit Score: 138.96  E-value: 3.40e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407    473 SDQERTPWHVTIKPKS-QETCRGALISDQWVLTAAHCFRdGNDHSLWRVNVGDPKSQWGKE---FLIEKAVISPGFDvfa 548
Cdd:smart00020   8 ANIGSFPWQVSLQYGGgRHFCGGSLISPRWVLTAAHCVR-GSDPSNIRVRLGSHDLSSGEEgqvIKVSKVIIHPNYN--- 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407    549 kknqgiLEFYGDDIALLKLAQKVKMSTHARPICLPCTmeanlALRRPQGSTCRD----HENEllnkqSVPAHFVALNGSK 624
Cdd:smart00020  84 ------PSTYDNDIALLKLKEPVTLSDNVRPICLPSS-----NYNVPAGTTCTVsgwgRTSE-----GAGSLPDTLQEVN 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407    625 LNInlkmgVEWTSCAEVVSQektmfpnltdvREVVTDQFLCSGTQE-DESPCKGESGGAVFLERRfRFFQVGLVSWGlyN 703
Cdd:smart00020 148 VPI-----VSNATCRRAYSG-----------GGAITDNMLCAGGLEgGKDACQGDSGGPLVCNDG-RWVLVGIVSWG--S 208

                   ..
gi 14550407    704 PC 705
Cdd:smart00020 209 GC 210
vWFA_subfamily_ECM cd01450
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
253-437 9.44e-34

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains


Pssm-ID: 238727 [Multi-domain]  Cd Length: 161  Bit Score: 127.02  E-value: 9.44e-34
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 253 LNLYLLLDCSQSVSENDFLIFKESASLMVDRIFSFEINVSVAIITFASEPKVLMSVLNDNSRDmtEVISSLENANYKDhe 332
Cdd:cd01450   1 LDIVFLLDGSESVGPENFEKVKDFIEKLVEKLDIGPDKTRVGLVQYSDDVRVEFSLNDYKSKD--DLLKAVKNLKYLG-- 76
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 333 nGTGTNTYAALNSVYLMMNNqmrllgmETMAWQEIRHAIILLTDGKSNMGGSPKTAVDHIREIlninqkrndYLDIYAIG 412
Cdd:cd01450  77 -GGGTNTGKALQYALEQLFS-------ESNARENVPKVIIVLTDGRSDDGGDPKEAAAKLKDE---------GIKVFVVG 139
                       170       180
                ....*....|....*....|....*
gi 14550407 413 VGklDVDWRELNELGSKKdGERHAF 437
Cdd:cd01450 140 VG--PADEEELREIASCP-SERHVF 161
VWA pfam00092
von Willebrand factor type A domain;
254-451 3.60e-31

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 120.07  E-value: 3.60e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   254 NLYLLLDCSQSVSENDFLIFKESASLMVDRIFSFEINVSVAIITFASEPKVLMSvLNDNsRDMTEVISSLENANYKDH-E 332
Cdd:pfam00092   1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLDIGPDGTRVGLVQYSSDVRTEFP-LNDY-SSKEELLSAVDNLRYLGGgT 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   333 NGTGTNTYAALNSVYLMMNNQMRllgmetmawqEIRHAIILLTDGKSNMgGSPKTAVDHIREiLNINqkrndyldIYAIG 412
Cdd:pfam00092  79 TNTGKALKYALENLFSSAAGARP----------GAPKVVVLLTDGRSQD-GDPEEVARELKS-AGVT--------VFAVG 138
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 14550407   413 VGklDVDWRELNELGSKKDgERHAFILQDTKALHQVFEH 451
Cdd:pfam00092 139 VG--NADDEELRKIASEPG-EGHVFTVSDFEALEDLQDQ 174
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
254-447 6.76e-27

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 107.93  E-value: 6.76e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407    254 NLYLLLDCSQSVSENDFLIFKESASLMVDRIFSFEINVSVAIITFASEPKVLMSVlnDNSRDMTEVISSLENANYKDhen 333
Cdd:smart00327   1 DVVFLLDGSGSMGGNRFELAKEFVLKLVEQLDIGPDGDRVGLVTFSDDARVLFPL--NDSRSKDALLEALASLSYKL--- 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407    334 GTGTNTYAALNSVYLMMNNQMRllgmetmAWQE-IRHAIILLTDGKSNMGGSpktavdhiREILNINQKRNDYLDIYAIG 412
Cdd:smart00327  76 GGGTNLGAALQYALENLFSKSA-------GSRRgAPKVVILITDGESNDGPK--------DLLKAAKELKRSGVKVFVVG 140
                          170       180       190
                   ....*....|....*....|....*....|....*
gi 14550407    413 VGKlDVDWRELNELgSKKDGERHAFILQDTKALHQ 447
Cdd:smart00327 141 VGN-DVDEEELKKL-ASAPGGVYVFLPELLDLLID 173
Trypsin pfam00089
Trypsin;
479-700 2.01e-22

Trypsin;


Pssm-ID: 459667 [Multi-domain]  Cd Length: 219  Bit Score: 96.36  E-value: 2.01e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   479 PWHVTIK-PKSQETCRGALISDQWVLTAAHCFRDGNDhslWRVNVGDP----KSQWGKEFLIEKAVISPGFDVFAKKNqg 553
Cdd:pfam00089  13 PWQVSLQlSSGKHFCGGSLISENWVLTAAHCVSGASD---VKVVLGAHnivlREGGEQKFDVEKIIVHPNYNPDTLDN-- 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   554 ilefygdDIALLKLAQKVKMSTHARPICLPCTmeanlALRRPQGSTC------RDHEN---ELLNKQSVPahfvalngsk 624
Cdd:pfam00089  88 -------DIALLKLESPVTLGDTVRPICLPDA-----SSDLPVGTTCtvsgwgNTKTLgpsDTLQEVTVP---------- 145
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 14550407   625 lninlkmgvewtscaeVVSQEKtmfpNLTDVREVVTDQFLCSGTQeDESPCKGESGGAVFLERRfrfFQVGLVSWG 700
Cdd:pfam00089 146 ----------------VVSRET----CRSAYGGTVTDTMICAGAG-GKDACQGDSGGPLVCSDG---ELIGIVSWG 197
COG5640 COG5640
Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, ...
471-709 1.31e-20

Secreted trypsin-like serine protease [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 444365 [Multi-domain]  Cd Length: 262  Bit Score: 92.02  E-value: 1.31e-20
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 471 NASDQERtPWHVTIKPKS---QETCRGALISDQWVLTAAHCFRDGNDHSLwRVNVG--DPKSQWGKEFLIEKAVISPGFD 545
Cdd:COG5640  36 PATVGEY-PWMVALQSSNgpsGQFCGGTLIAPRWVLTAAHCVDGDGPSDL-RVVIGstDLSTSGGTVVKVARIVVHPDYD 113
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 546 VFAkknqgilefYGDDIALLKLAQKVkmsTHARPICLPctmEANLALRRPQ-------GSTCRDHEN--ELLNKQSVPAh 616
Cdd:COG5640 114 PAT---------PGNDIALLKLATPV---PGVAPAPLA---TSADAAAPGTpatvagwGRTSEGPGSqsGTLRKADVPV- 177
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 617 fvalngsklninlkmgVEWTSCAevvsqektmfpnltDVREVVTDQFLCSGTQE-DESPCKGESGGAVFLERRFRFFQVG 695
Cdd:COG5640 178 ----------------VSDATCA--------------AYGGFDGGTMLCAGYPEgGKDACQGDSGGPLVVKDGGGWVLVG 227
                       250
                ....*....|....
gi 14550407 696 LVSWGlYNPCLGSA 709
Cdd:COG5640 228 VVSWG-GGPCAAGY 240
vWFA cd00198
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
253-434 1.66e-19

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.


Pssm-ID: 238119 [Multi-domain]  Cd Length: 161  Bit Score: 86.08  E-value: 1.66e-19
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 253 LNLYLLLDCSQSVSENDFLIFKESASLMVDRIFSFEINVSVAIITFASEPKVLMSVLNDNSRDmtEVISSLENANYKdhe 332
Cdd:cd00198   1 ADIVFLLDVSGSMGGEKLDKAKEALKALVSSLSASPPGDRVGLVTFGSNARVVLPLTTDTDKA--DLLEAIDALKKG--- 75
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 333 NGTGTNTYAALNSVYLMMNNQMRLLGmetmawqeiRHAIILLTDGKSNMGGSPKTAVdhIREIlninqkRNDYLDIYAIG 412
Cdd:cd00198  76 LGGGTNIGAALRLALELLKSAKRPNA---------RRVIILLTDGEPNDGPELLAEA--AREL------RKLGITVYTIG 138
                       170       180
                ....*....|....*....|..
gi 14550407 413 VGkLDVDWRELNELGSKKDGER 434
Cdd:cd00198 139 IG-DDANEDELKEIADKTTGGA 159
CCP cd00033
Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) ...
89-144 4.19e-16

Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) have been identified in several proteins of the complement system; SUSHI repeats (short complement-like repeat, SCR) are abundant in complement control proteins. The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.


Pssm-ID: 153056 [Multi-domain]  Cd Length: 57  Bit Score: 72.88  E-value: 4.19e-16
                        10        20        30        40        50
                ....*....|....*....|....*....|....*....|....*....|....*.
gi 14550407  89 CPAPVSFENGIYTPRLGSYPVGGNVSFECEDGFILRGSPVRQCRPNGMWDGETAVC 144
Cdd:cd00033   1 CPPPPVPENGTVTGSKGSYSYGSTVTYSCNEGYTLVGSSTITCTENGGWSPPPPTC 56
PHA02927 PHA02927
secreted complement-binding protein; Provisional
24-204 7.78e-16

secreted complement-binding protein; Provisional


Pssm-ID: 222943 [Multi-domain]  Cd Length: 263  Bit Score: 78.16  E-value: 7.78e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   24 CPQNVNISGGTFTLShGWAPGSLLTYSCPQGLYpspasrLCKSSGQWQTPGATRSL----SKAVCKPVRCPAPVSFENGI 99
Cdd:PHA02927  86 CPSPRDIDNGQLDIG-GVDFGSSITYSCNSGYQ------LIGESKSYCELGSTGSMvwnpEAPICESVKCQSPPSISNGR 158
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407  100 YTPRLGSYPVGGNVSFECEDGFILRGSPVRQCRpNGMWDgETAVCDngAGHCPNPGISLGAVRTGFR--FGHGDKVRYRC 177
Cdd:PHA02927 159 HNGYEDFYTDGSVVTYSCNSGYSLIGNSGVLCS-GGEWS-DPPTCQ--IVKCPHPTISNGYLSSGFKrsYSYNDNVDFKC 234
                        170       180
                 ....*....|....*....|....*..
gi 14550407  178 SSNLVLTGSSERECQGNGVWSGTEPIC 204
Cdd:PHA02927 235 KYGYKLSGSSSSTCSPGNTWQPELPKC 261
CCP smart00032
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat ...
89-144 7.02e-14

Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.


Pssm-ID: 214478 [Multi-domain]  Cd Length: 56  Bit Score: 66.40  E-value: 7.02e-14
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 14550407     89 CPAPVSFENGIYTPRLGSYPVGGNVSFECEDGFILRGSPVRQCRPNGMWDGETAVC 144
Cdd:smart00032   1 CPPPPDIENGTVTSSSGTYSYGDTVTYSCDPGYTLIGSSTITCLENGTWSPPPPTC 56
YfbK COG2304
Secreted protein containing bacterial Ig-like domain and vWFA domain [General function ...
242-484 2.06e-12

Secreted protein containing bacterial Ig-like domain and vWFA domain [General function prediction only];


Pssm-ID: 441879 [Multi-domain]  Cd Length: 289  Bit Score: 68.59  E-value: 2.06e-12
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 242 GRKIQIQRSGHLNLYLLLDCSQSVSENDFLIFKESASLMVDRIfsfEINVSVAIITFASEPKVLM-SVLNDNSRDMTEVI 320
Cdd:COG2304  81 PPKAAAEERPPLNLVFVIDVSGSMSGDKLELAKEAAKLLVDQL---RPGDRVSIVTFAGDARVLLpPTPATDRAKILAAI 157
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 321 SSLEnAnykdhenGTGTNTYAALNSVYlmmnNQMRllgmETMAWQEIRHaIILLTDGKSNMGgspKTAVDHIREILNINQ 400
Cdd:COG2304 158 DRLQ-A-------GGGTALGAGLELAY----ELAR----KHFIPGRVNR-VILLTDGDANVG---ITDPEELLKLAEEAR 217
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 401 KRNdyLDIYAIGVGkldVDWRE--LNELGSKKDGeRHAFIlQDTKALHQVFEhmldvsKLTDTIcGVGNMSANASDQERT 478
Cdd:COG2304 218 EEG--ITLTTLGVG---SDYNEdlLERLADAGGG-NYYYI-DDPEEAEKVFV------REFSRI-GYENRALATEDFPLP 283

                ....*.
gi 14550407 479 PWHVTI 484
Cdd:COG2304 284 YGTLKL 289
CCP cd00033
Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) ...
151-205 5.37e-12

Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) have been identified in several proteins of the complement system; SUSHI repeats (short complement-like repeat, SCR) are abundant in complement control proteins. The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.


Pssm-ID: 153056 [Multi-domain]  Cd Length: 57  Bit Score: 61.32  E-value: 5.37e-12
                        10        20        30        40        50
                ....*....|....*....|....*....|....*....|....*....|....*..
gi 14550407 151 CPNPGISLGAVRTGF--RFGHGDKVRYRCSSNLVLTGSSERECQGNGVWSGTEPICR 205
Cdd:cd00033   1 CPPPPVPENGTVTGSkgSYSYGSTVTYSCNEGYTLVGSSTITCTENGGWSPPPPTCE 57
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
241-450 5.44e-12

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 66.89  E-value: 5.44e-12
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 241 LGRKIQIQRSGHLNLYLLLDCSQSVSENDFL-IFKESASLMVDRIFSfeiNVSVAIITFASEPKVLMSVlndnSRDMTEV 319
Cdd:COG1240  81 LAPLALARPQRGRDVVLVVDASGSMAAENRLeAAKGALLDFLDDYRP---RDRVGLVAFGGEAEVLLPL----TRDREAL 153
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 320 ISSLENANYkdhenGTGTNTYAALNSVYLMMNNQMrllgmetmawQEIRHAIILLTDGKSNMG-GSPKTAVDHIREiLNI 398
Cdd:COG1240 154 KRALDELPP-----GGGTPLGDALALALELLKRAD----------PARRKVIVLLTDGRDNAGrIDPLEAAELAAA-AGI 217
                       170       180       190       200       210
                ....*....|....*....|....*....|....*....|....*....|..
gi 14550407 399 nqkrndylDIYAIGVGKLDVDWRELNELGSKKDGErhAFILQDTKALHQVFE 450
Cdd:COG1240 218 --------RIYTIGVGTEAVDEGLLREIAEATGGR--YFRADDLSELAAIYR 259
Sushi pfam00084
Sushi repeat (SCR repeat);
89-144 7.84e-12

Sushi repeat (SCR repeat);


Pssm-ID: 459664 [Multi-domain]  Cd Length: 56  Bit Score: 60.59  E-value: 7.84e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 14550407    89 CPAPVSFENGIYTPRLGSYPVGGNVSFECEDGFILRGSPVRQCRPNGMWDGETAVC 144
Cdd:pfam00084   1 CPPPPDIPNGKVSATKNEYNYGASVSYECDPGYRLVGSPTITCQEDGTWSPPFPEC 56
CCP smart00032
Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat ...
151-204 4.70e-11

Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR); The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.


Pssm-ID: 214478 [Multi-domain]  Cd Length: 56  Bit Score: 58.69  E-value: 4.70e-11
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 14550407    151 CPNPG-ISLGAVRTGF-RFGHGDKVRYRCSSNLVLTGSSERECQGNGVWSGTEPIC 204
Cdd:smart00032   1 CPPPPdIENGTVTSSSgTYSYGDTVTYSCDPGYTLIGSSTITCLENGTWSPPPPTC 56
vWA_collagen cd01472
von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This ...
254-441 6.55e-08

von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.


Pssm-ID: 238749 [Multi-domain]  Cd Length: 164  Bit Score: 52.62  E-value: 6.55e-08
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 254 NLYLLLDCSQSVSENDFLIFKESASLMVDRifsFEI---NVSVAIITFASEPKVlMSVLNdNSRDMTEVISSLENANYKd 330
Cdd:cd01472   2 DIVFLVDGSESIGLSNFNLVKDFVKRVVER---LDIgpdGVRVGVVQYSDDPRT-EFYLN-TYRSKDDVLEAVKNLRYI- 75
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 331 henGTGTNTYAALNsvYLMMNNQMRLLGMEtmawQEIRHAIILLTDGKSNMGGspktavdhireILNINQKRNDYLDIYA 410
Cdd:cd01472  76 ---GGGTNTGKALK--YVRENLFTEASGSR----EGVPKVLVVITDGKSQDDV-----------EEPAVELKQAGIEVFA 135
                       170       180       190
                ....*....|....*....|....*....|.
gi 14550407 411 IGVGKLDVDwrELNELGSkKDGERHAFILQD 441
Cdd:cd01472 136 VGVKNADEE--ELKQIAS-DPKELYVFNVAD 163
vWA_Matrilin cd01475
VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and ...
251-473 6.69e-08

VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.


Pssm-ID: 238752 [Multi-domain]  Cd Length: 224  Bit Score: 53.93  E-value: 6.69e-08
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 251 GHLNLYLLLDCSQSVSENDFLIFKESASLMVDRIFSFEINVSVAIITFASEPKVLMSVLNDNSR-DMTEVISSLENAnyk 329
Cdd:cd01475   1 GPTDLVFLIDSSRSVRPENFELVKQFLNQIIDSLDVGPDATRVGLVQYSSTVKQEFPLGRFKSKaDLKRAVRRMEYL--- 77
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 330 dhenGTGTNTYAALNsvYLMMNNQMRLLGMETMAwQEIRHAIILLTDGKSNmggspktavDHIREIlnINQKRNDYLDIY 409
Cdd:cd01475  78 ----ETGTMTGLAIQ--YAMNNAFSEAEGARPGS-ERVPRVGIVVTDGRPQ---------DDVSEV--AAKARALGIEMF 139
                       170       180       190       200       210       220
                ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 14550407 410 AIGVGKLDVDwrELNELGSKKDGErHAFILQDTKALHQVfehmldVSKLTDTICGVGNMSANAS 473
Cdd:cd01475 140 AVGVGRADEE--ELREIASEPLAD-HVFYVEDFSTIEEL------TKKFQGKICVVPDLCATLS 194
Sushi pfam00084
Sushi repeat (SCR repeat);
151-204 1.05e-07

Sushi repeat (SCR repeat);


Pssm-ID: 459664 [Multi-domain]  Cd Length: 56  Bit Score: 49.03  E-value: 1.05e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 14550407   151 CPNPGISLGAVRTGFRFGH--GDKVRYRCSSNLVLTGSSERECQGNGVWSGTEPIC 204
Cdd:pfam00084   1 CPPPPDIPNGKVSATKNEYnyGASVSYECDPGYRLVGSPTITCQEDGTWSPPFPEC 56
vWA_subgroup cd01465
VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood ...
253-438 1.85e-07

VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains. Not much is known about the function of the VWA domain in these proteins. The members do have a conserved MIDAS motif. The biochemical function however is not known.


Pssm-ID: 238742 [Multi-domain]  Cd Length: 170  Bit Score: 51.50  E-value: 1.85e-07
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 253 LNLYLLLDCSQSVSENDFLIFKESASLMVDRIfsfEINVSVAIITFASEPK-VLMSVLNDNSRDMTEVISSLenanykdh 331
Cdd:cd01465   1 LNLVFVIDRSGSMDGPKLPLVKSALKLLVDQL---RPDDRLAIVTYDGAAEtVLPATPVRDKAAILAAIDRL-------- 69
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 332 ENGTGTNTYAalnsvylmmnnqmrllGMEtMAWQEIRHA--------IILLTDGKSNMGgspktaVDHIREIL-NINQKR 402
Cdd:cd01465  70 TAGGSTAGGA----------------GIQ-LGYQEAQKHfvpggvnrILLATDGDFNVG------ETDPDELArLVAQKR 126
                       170       180       190
                ....*....|....*....|....*....|....*...
gi 14550407 403 NDYLDIYAIGVGKldvDWRE--LNELGSKKDGeRHAFI 438
Cdd:cd01465 127 ESGITLSTLGFGD---NYNEdlMEAIADAGNG-NTAYI 160
VWA_2 pfam13519
von Willebrand factor type A domain;
255-355 2.62e-07

von Willebrand factor type A domain;


Pssm-ID: 463909 [Multi-domain]  Cd Length: 103  Bit Score: 49.21  E-value: 2.62e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   255 LYLLLDCSQSVSENDFLIFK-ESASLMVDRIFSFEINVSVAIITFASEPKVLMSvLNDNSRDMTEVISSLEnanykdhEN 333
Cdd:pfam13519   1 LVFVLDTSGSMRNGDYGPTRlEAAKDAVLALLKSLPGDRVGLVTFGDGPEVLIP-LTKDRAKILRALRRLE-------PK 72
                          90       100
                  ....*....|....*....|..
gi 14550407   334 GTGTNTYAALNSVYLMMNNQMR 355
Cdd:pfam13519  73 GGGTNLAAALQLARAALKHRRK 94
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
257-441 2.65e-07

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 51.13  E-value: 2.65e-07
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 257 LLLDCSQSVSENDFlifKESASLMVDRIFSFEI---NVSVAIITFASEPKVLMSVLNDNSRDmtEVISSLENANYKdhen 333
Cdd:cd01482   5 FLVDGSWSIGRSNF---NLVRSFLSSVVEAFEIgpdGVQVGLVQYSDDPRTEFDLNAYTSKE--DVLAAIKNLPYK---- 75
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 334 GTGTNTYAALNsvYLMMNNQMRLLGMEtmawQEIRHAIILLTDGKSNmggspktavDHIREilnINQK-RNDYLDIYAIG 412
Cdd:cd01482  76 GGNTRTGKALT--HVREKNFTPDAGAR----PGVPKVVILITDGKSQ---------DDVEL---PARVlRNLGVNVFAVG 137
                       170       180
                ....*....|....*....|....*....
gi 14550407 413 VGklDVDWRELNELGSKKDgERHAFILQD 441
Cdd:cd01482 138 VK--DADESELKMIASKPS-ETHVFNVAD 163
PHA02927 PHA02927
secreted complement-binding protein; Provisional
84-205 3.58e-07

secreted complement-binding protein; Provisional


Pssm-ID: 222943 [Multi-domain]  Cd Length: 263  Bit Score: 52.35  E-value: 3.58e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   84 CKPVRCPAPVSFENGIYTprLGSYPVGGNVSFECEDGFILRGSPVRQCRPNG----MWDGETAVCDNGAGHCPnPGISLG 159
Cdd:PHA02927  81 CIKRRCPSPRDIDNGQLD--IGGVDFGSSITYSCNSGYQLIGESKSYCELGStgsmVWNPEAPICESVKCQSP-PSISNG 157
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*...
gi 14550407  160 AvRTGFR--FGHGDKVRYRCSSNLVLTGSSERECQGnGVWSgTEPICR 205
Cdd:PHA02927 158 R-HNGYEdfYTDGSVVTYSCNSGYSLIGNSGVLCSG-GEWS-DPPTCQ 202
TerY COG4245
Uncharacterized conserved protein YegL, contains vWA domain of TerY type [Function unknown];
252-428 5.04e-07

Uncharacterized conserved protein YegL, contains vWA domain of TerY type [Function unknown];


Pssm-ID: 443387 [Multi-domain]  Cd Length: 196  Bit Score: 50.69  E-value: 5.04e-07
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 252 HLNLYLLLDCSQSVSENDFLIFKESASLMVDRIFSFEI---NVSVAIITFASEPKVLMsvlndnsrDMTEViSSLENANY 328
Cdd:COG4245   5 RLPVYLLLDTSGSMSGEPIEALNEGLQALIDELRQDPYaleTVEVSVITFDGEAKVLL--------PLTDL-EDFQPPDL 75
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 329 kdhENGTGTNTYAALNSV-YLMMNNQMRLLGMETMAWQEIrhaIILLTDGKSNmGGSPKTAvdhIREILNINQKRNDYld 407
Cdd:COG4245  76 ---SASGGTPLGAALELLlDLIERRVQKYTAEGKGDWRPV---VFLITDGEPT-DSDWEAA---LQRLKDGEAAKKAN-- 143
                       170       180
                ....*....|....*....|.
gi 14550407 408 IYAIGVGKlDVDWRELNELGS 428
Cdd:COG4245 144 IFAIGVGP-DADTEVLKQLTD 163
PHA02817 PHA02817
EEV Host range protein; Provisional
84-237 2.21e-06

EEV Host range protein; Provisional


Pssm-ID: 165167 [Multi-domain]  Cd Length: 225  Bit Score: 49.55  E-value: 2.21e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   84 CKPVRCPAPVSFENGIYTPRLGSYPVGGNVSFECEDG-----FILRGSPVRQCRPNGMWDGETAVCDngAGHCPNPGISL 158
Cdd:PHA02817  19 CDLNKCCYPPSIKNGYIYNKKTEYNIGSNVTFFCGNNtrgvrYTLVGEKNIICEKDGKWNKEFPVCK--IIRCRFPALQN 96
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407  159 GAVR---TGFRFGHGDKVRYRCSSNLVLTGSSERECQGNGVWSGTEPIC-RQPYSY------------DFPEDVAPALGT 222
Cdd:PHA02817  97 GFVNgipDSKKFYYESEVSFSCKPGFVLIGTKYSVCGINSSWIPKVPICsRDNITYnkiyinkvniddNFFNQINNSNTY 176
                        170
                 ....*....|....*
gi 14550407  223 SFSHMLGATNPTQKT 237
Cdd:PHA02817 177 YFDKILQINNVNRYT 191
PHA02831 PHA02831
EEV host range protein; Provisional
47-204 2.87e-06

EEV host range protein; Provisional


Pssm-ID: 165176 [Multi-domain]  Cd Length: 268  Bit Score: 49.61  E-value: 2.87e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   47 LTYSCPQGLypSPASRLCkSSGQWQTpgatrslsKAVCKPVR-CPAPVSFENGIYTPRLGSYPVGGNVSFECE----DGF 121
Cdd:PHA02831  46 LEYKCNNNF--DKVFVTC-NNGSWST--------KNMCIGKRnCKDPVTILNGYIKNKKDQYSFGDSVTYACKvnklEKY 114
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407  122 ILRGSPVRQCrPNGMWDGETAVCDngAGHCPNPgislgAVRTGF------RFGHGDKVRYRCSSNLVLTGSSERECQGNG 195
Cdd:PHA02831 115 SIVGNETVKC-INKQWVPKYPVCK--LIRCKYP-----ALQNGFlnvfekKFYYGDIVNFKCKKGFILLGSSVSTCDINS 186

                 ....*....
gi 14550407  196 VWSGTEPIC 204
Cdd:PHA02831 187 IWYPGIPKC 195
VWA_integrin_invertebrates cd01476
VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have ...
253-438 4.97e-06

VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.


Pssm-ID: 238753 [Multi-domain]  Cd Length: 163  Bit Score: 47.39  E-value: 4.97e-06
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 253 LNLYLLLDCSQSVSEndflIFKESASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRDMTEVISSLENANyk 329
Cdd:cd01476   1 LDLLFVLDSSGSVRG----KFEKYKKYIERIVEGLEIGptaTRVALITYSGRGRQRVRFNLPKHNDGEELLEKVDNLR-- 74
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 330 dHENGTgTNTYAALN-SVYLMMNNQMRLLGMETMAwqeirhaiILLTDGKSNmgGSPKTAVDHIREILNInqkrndylDI 408
Cdd:cd01476  75 -FIGGT-TATGAAIEvALQQLDPSEGRREGIPKVV--------VVLTDGRSH--DDPEKQARILRAVPNI--------ET 134
                       170       180       190
                ....*....|....*....|....*....|.
gi 14550407 409 YAIGVG-KLDVDWRELNELGSKkdgERHAFI 438
Cdd:cd01476 135 FAVGTGdPGTVDTEELHSITGN---EDHIFT 162
eMpr COG3591
V8-like Glu-specific endopeptidase [Posttranslational modification, protein turnover, ...
492-571 1.33e-05

V8-like Glu-specific endopeptidase [Posttranslational modification, protein turnover, chaperones];


Pssm-ID: 442810 [Multi-domain]  Cd Length: 194  Bit Score: 46.59  E-value: 1.33e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 492 CRGALISDQWVLTAAHCF---RDGNDHSLWRVNVGDPKSQWGkEFLIEKAVISPGFDVFAKknqgilefYGDDIALLKLA 568
Cdd:COG3591  14 CTGTLIGPNLVLTAGHCVydgAGGGWATNIVFVPGYNGGPYG-TATATRFRVPPGWVASGD--------AGYDYALLRLD 84

                ...
gi 14550407 569 QKV 571
Cdd:COG3591  85 EPL 87
vWA_integrins_alpha_subunit cd01469
Integrins are a class of adhesion receptors that link the extracellular matrix to the ...
258-445 1.34e-05

Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions.


Pssm-ID: 238746 [Multi-domain]  Cd Length: 177  Bit Score: 46.19  E-value: 1.34e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 258 LLDCSQSVSENDFLIFKESASLMVDRIFSFEINVSVAIITFASEPKVLMSvLNDnSRDMTEVISSLENAnykdHENGTGT 337
Cdd:cd01469   6 VLDGSGSIYPDDFQKVKNFLSTVMKKLDIGPTKTQFGLVQYSESFRTEFT-LNE-YRTKEEPLSLVKHI----SQLLGLT 79
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 338 NTYAALNSVylmmnnQMRLLGMETMAWQEIRHAIILLTDGKSNmGGSPKTAVdhireilnINQKRNDYLDIYAIGVGKL- 416
Cdd:cd01469  80 NTATAIQYV------VTELFSESNGARKDATKVLVVITDGESH-DDPLLKDV--------IPQAEREGIIRYAIGVGGHf 144
                       170       180       190
                ....*....|....*....|....*....|.
gi 14550407 417 --DVDWRELNELGSKKDgERHAFILQDTKAL 445
Cdd:cd01469 145 qrENSREELKTIASKPP-EEHFFNVTDFAAL 174
vWA_micronemal_protein cd01471
Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a ...
253-414 1.73e-05

Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.


Pssm-ID: 238748 [Multi-domain]  Cd Length: 186  Bit Score: 46.22  E-value: 1.73e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 253 LNLYLLLDCSQSVSENDFliFKESASLMVDRIFSFEIN---VSVAIITFASEPKVLMSVLNDNSRD---MTEVISSLENA 326
Cdd:cd01471   1 LDLYLLVDGSGSIGYSNW--VTHVVPFLHTFVQNLNISpdeINLYLVTFSTNAKELIRLSSPNSTNkdlALNAIRALLSL 78
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 327 NYKdheNGTgTNTYAALNSVyLMMNNQMRllgmetMAWQEIRHAIILLTDGKSNmggSPKTAVDHIREILNINQKrndyl 406
Cdd:cd01471  79 YYP---NGS-TNTTSALLVV-EKHLFDTR------GNRENAPQLVIIMTDGIPD---SKFRTLKEARKLRERGVI----- 139

                ....*...
gi 14550407 407 dIYAIGVG 414
Cdd:cd01471 140 -IAVLGVG 146
PHA02817 PHA02817
EEV Host range protein; Provisional
4-144 2.65e-05

EEV Host range protein; Provisional


Pssm-ID: 165167 [Multi-domain]  Cd Length: 225  Bit Score: 46.09  E-value: 2.65e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407    4 LMVLFCLLFLYPGL----ADSAPSC--PQNVNisGGTFTLSHGWAPGSLLTYSCPQGLYPSPASRL------CKSSGQWQ 71
Cdd:PHA02817   1 MKNIHMLLILLCNKvyslCDLNKCCypPSIKN--GYIYNKKTEYNIGSNVTFFCGNNTRGVRYTLVgekniiCEKDGKWN 78
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 14550407   72 TpgatrslSKAVCKPVRCPAPV---SFENGIytPRLGSYPVGGNVSFECEDGFILRGSPVRQCRPNGMWDGETAVC 144
Cdd:PHA02817  79 K-------EFPVCKIIRCRFPAlqnGFVNGI--PDSKKFYYESEVSFSCKPGFVLIGTKYSVCGINSSWIPKVPIC 145
PHA02954 PHA02954
EEV membrane glycoprotein; Provisional
87-241 3.86e-05

EEV membrane glycoprotein; Provisional


Pssm-ID: 165263 [Multi-domain]  Cd Length: 317  Bit Score: 46.23  E-value: 3.86e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   87 VRCP----APVSFENGIYTPRLGSYPVGGNVSFECEDGFILRGSPVRQCRPNGmWDgetaVCDNGAGHCPNPGISLGAVr 162
Cdd:PHA02954 123 VTCPnaecQPLQLEHGSCQPVKEKYSFGEHITINCDVGYEVIGASYISCTANS-WN----VIPSCQQKCDIPSLSNGLI- 196
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407  163 TGFRFGHGDKVRYRCSSNLVLTGSSERECQgNGVWSGTEPIC-RQPYSYDfPEDVAPALGTSFSHMLGATNPTQKTKESL 241
Cdd:PHA02954 197 SGSTFSIGGVIHLSCKSGFTLTGSPSSTCI-DGKWNPVLPICvRSNEEFD-PVDDGPDDETDLSKLSKDVVQYEQEIESL 274
ViaA COG2425
Uncharacterized conserved protein, contains a von Willebrand factor type A (vWA) domain ...
255-414 1.11e-04

Uncharacterized conserved protein, contains a von Willebrand factor type A (vWA) domain [Function unknown];


Pssm-ID: 441973 [Multi-domain]  Cd Length: 263  Bit Score: 44.67  E-value: 1.11e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 255 LYLLLDCSQSVSENDFLIFKESASLMVDRIFSfeiNVSVAIITFASEPKVLMSVLNDNS-RDMTEVISSLENanykdhen 333
Cdd:COG2425 121 VVLCVDTSGSMAGSKEAAAKAAALALLRALRP---NRRFGVILFDTEVVEDLPLTADDGlEDAIEFLSGLFA-------- 189
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 334 GTGTNTYAALNSvylmmnnqmrllGMETMAWQEIRHA-IILLTDGKSnmGGSPKTAVDHIReilninQKRNDYlDIYAIG 412
Cdd:COG2425 190 GGGTDIAPALRA------------ALELLEEPDYRNAdIVLITDGEA--GVSPEELLREVR------AKESGV-RLFTVA 248

                ..
gi 14550407 413 VG 414
Cdd:COG2425 249 IG 250
PHA02639 PHA02639
EEV host range protein; Provisional
85-205 2.62e-04

EEV host range protein; Provisional


Pssm-ID: 165022 [Multi-domain]  Cd Length: 295  Bit Score: 43.88  E-value: 2.62e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   85 KPVRCPAPVSFENGIYTPRLGSYPVGGNVSFECEDGFILRGSPVRQC---RPNGMWDGETAVCDNGAGHCPnPGISLGAV 161
Cdd:PHA02639  18 KSIYCDKPDDISNGFITELMEKYEIGKLIEYTCNTDYALIGDRFRTCikdKNNAIWSNKAPFCMLKECNDP-PSIINGKI 96
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*....
gi 14550407  162 RTGFR-FGHGDKVRYRCSSN----LVLTGSSERECQGNGVWSGTEPICR 205
Cdd:PHA02639  97 YNKREmYKVGDEIYYVCNEHkgvqYSLVGNEKITCIQDKSWKPDPPICK 145
vWA_collagen_alpha_1-VI-type cd01480
VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable ...
251-379 6.18e-04

VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238757 [Multi-domain]  Cd Length: 186  Bit Score: 41.60  E-value: 6.18e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407 251 GHLNLYLLLDCSQSVSENDFLIFKESASLMVDRIFSFE------INVSVAIITFASEPKVlMSVLNDNSRDMTEVISSLE 324
Cdd:cd01480   1 GPVDITFVLDSSESVGLQNFDITKNFVKRVAERFLKDYyrkdpaGSWRVGVVQYSDQQEV-EAGFLRDIRNYTSLKEAVD 79
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|....*
gi 14550407 325 NANYKdhenGTGTNTYAALNSVYlmmnNQMRllgmETMAWQEIRHAiILLTDGKS 379
Cdd:cd01480  80 NLEYI----GGGTFTDCALKYAT----EQLL----EGSHQKENKFL-LVITDGHS 121
PHA02639 PHA02639
EEV host range protein; Provisional
24-204 4.75e-03

EEV host range protein; Provisional


Pssm-ID: 165022 [Multi-domain]  Cd Length: 295  Bit Score: 39.65  E-value: 4.75e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   24 CPQNVNISGGTFT-LSHGWAPGSLLTYSCPQGlYPSPASRL--C---KSSGQWqtpgatrSLSKAVCKPVRCPAPVSFEN 97
Cdd:PHA02639  22 CDKPDDISNGFITeLMEKYEIGKLIEYTCNTD-YALIGDRFrtCikdKNNAIW-------SNKAPFCMLKECNDPPSIIN 93
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 14550407   98 G-IYTPRlGSYPVGGNVSFECED----GFILRGSPVRQCRPNGMWDGETAVCDngAGHCPNPGISLG---AVRTGFRFGH 169
Cdd:PHA02639  94 GkIYNKR-EMYKVGDEIYYVCNEhkgvQYSLVGNEKITCIQDKSWKPDPPICK--MINCRFPALQNGyinGIPSNKKFYY 170
                        170       180       190
                 ....*....|....*....|....*....|....*
gi 14550407  170 GDKVRYRCSSNLVLTGSSERECQGNGVWSGTEPIC 204
Cdd:PHA02639 171 KTRVGFSCKSGFDLVGEKYSTCNINATWFPSIPTC 205
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH