Conserved domains on [gi|1720388883|ref|XP_030105319|]

complement C4-B isoform X3 [Mus musculus]


List of domain hits

Name Accession Description Interval E-value
TED_complement pfam07678
A-macroglobulin TED domain; This entry corresponds to the TED domain of the complement ...
978-1311 4.33e-129

A-macroglobulin TED domain; This entry corresponds to the TED domain of the complement components such as C3, C4 and C5. This domain contains a short, highly conserved region of proteinase-binding alpha-macroglobulins that contains the cysteine and a glutamine of a thiol-ester bond, which is cleaved at the moment of proteinase binding and mediates the covalent binding of the alpha-macroglobulin to the proteinase. The GCGEQ motif is highly conserved.
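The description above notes that the GCGEQ motif is highly conserved. As a minimal sketch (not part of the CD-Search output), the motif can be located in the first query row of the pfam07678 alignment for this protein. The helper name `find_thioester_motif` and the relaxed pattern `GC[A-Z]EQ`, which tolerates one substitution at the third position, are assumptions made for illustration only.

```python
import re

# First query row of the pfam07678 alignment on this page (residues 978-1057).
# Lowercase letters are kept as printed; they are still residues of the query.
QUERY_ROW = "ASEPLETMGSEGALSPGGVASLLRLPQGCAEQTMIYLAPTLTASNYLDRTEQWSklsPETKDHAVDLIQKGYMRIQQFRK"
ROW_START = 978  # residue number printed at the start of the row


def find_thioester_motif(row, row_start, pattern=r"GC[A-Z]EQ"):
    """Return (first_residue, last_residue, matched_text), or None if absent.

    `pattern` is a hypothetical relaxation of GCGEQ, not an official signature.
    """
    residues = row.replace("-", "").upper()  # drop gap characters, ignore case
    match = re.search(pattern, residues)
    if match is None:
        return None
    return row_start + match.start(), row_start + match.end() - 1, match.group()


print(find_thioester_motif(QUERY_ROW, ROW_START))
```

In this row the query carries GCAEQ at the position where the pfam07678 row shows GCGEQ.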



Pssm-ID: 462227 [Multi-domain]  Cd Length: 311  Bit Score: 405.53  E-value: 4.33e-129
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  978 ASEPLETMGSEGALSPGGVASLLRLPQGCAEQTMIYLAPTLTASNYLDRTEQWSklsPETKDHAVDLIQKGYMRIQQFRK 1057
Cdd:pfam07678    1 ISVVGDIMGPAIQVVPENLSSLLRLPYGCGEQNMVLFAPNVYVLRYLDKTNQLT---KLIKSKAIDYLEQGYQRQLSYKH 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1058 NDGSFGAWLHRDSSTWLTAFVLKILSLAQEQVGNSPEKLQETASWLLAQQLGDGSFHDPCPVIHRAMQGGLvgsDETVAL 1137
Cdd:pfam07678   78 PDGSYSAFGHSPGSTWLTAFVLKVFAQARKFIFIDPEEICQSLRWLLSQQKPDGSFREPGPLLHRAMKGGV---DGEVSL 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1138 TAFVVIALHHGLDVFqdddakQLKNRVEASITKANSFLGQKASAGLLGAHAAAITAYALTLTKASEDlRNVAHNSLMAMA 1217
Cdd:pfam07678  155 TAYVTIALLEALDIN------GLLQRVHPSIRKALTYLEQAQLAGLTSPYTLAILAYALALAGSPET-REELLKSLDAMA 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1218 EETGEHLYWGLVLGSQDKVVLrptaprsptEPVPQAPALWIETTAYALLHLLLReGKGKMADKAASWLTHQGSFHGAFRS 1297
Cdd:pfam07678  228 REEGNSRYWERDEKSDPQGVP---------EYPPQAPSLEVETTAYALLAYLLL-GDLTYADPIVKWLTSQRNSHGGFSS 297
                          330
                   ....*....|....
gi 1720388883 1298 TQDTVVTLDALSAY 1311
Cdd:pfam07678  298 TQDTVVALQALAEY 311
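The alignment blocks pair each query row with the corresponding domain-model row. A rough way to summarise such a pair is to count identical residues over the aligned columns. The sketch below does this for the final block above; the function name `aligned_identity` is hypothetical, and treating gap characters and lowercase letters as unaligned columns is an assumption about the display convention rather than something stated on this page.

```python
def aligned_identity(query_row, subject_row):
    """Count identical residues over aligned columns of two equal-length rows.

    Assumption: columns with a gap ('-') or a lowercase letter in either row
    are treated as unaligned and skipped.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have the same length")
    aligned = identical = 0
    for q, s in zip(query_row, subject_row):
        if q.isupper() and s.isupper():
            aligned += 1
            identical += q == s  # True counts as 1
    return identical, aligned


# Final block of the pfam07678 alignment (query residues 1298-1311).
identical, aligned = aligned_identity("TQDTVVTLDALSAY", "TQDTVVALQALAEY")
print(f"{identical}/{aligned} identical")
```

Applying the same function to every block of an alignment and summing the two counts would give an overall figure for the hit.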
NTR_like super family cl02512
NTR_like domain; a beta barrel with an oligosaccharide/oligonucleotide-binding fold found in ...
1562-1716 6.33e-71

NTR_like domain; a beta barrel with an oligosaccharide/oligonucleotide-binding fold found in netrins, complement proteins, tissue inhibitors of metalloproteases (TIMP), and procollagen C-proteinase enhancers (PCOLCE), amongst others. In netrins, the domain plays a role in controlling axon branching in neural development, while the common function of these modules in TIMPs appears to be binding to metzincins. A subset of this family is also known as the C345C domain because it occurs as a C-terminal domain in complement C3, C4 and C5. In C5, the domain interacts with various partners during the formation of the membrane attack complex.


The actual alignment was detected with superfamily member cd03584:

Pssm-ID: 470599  Cd Length: 153  Bit Score: 234.17  E-value: 6.33e-71
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1562 CQCAEGKCPRLLRSLERRVEDKDgyRMRFACYYPRVEYGFTVKVLREDGRAAFRLFESKITQVLHFRKDTMASIGQTRNF 1641
Cdd:cd03584      1 CQCAEGGCPKQKSTFSKEITKTD--RFDFACYSPRVDYAYVVKVLNISEKSNFELYETSITDVLQTTGDVSVKPEETRVF 78
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720388883 1642 LSRASCRLRLEPNKEYLIMGMDGETSDNKGDPQYLLDSNTWIEEMPSEQMCKSTRHRAACFQLKDFLMEFSSRGC 1716
Cdd:cd03584     79 LKRLSCKLELKKGKEYLIMGKDGATSDSNGHMQYLLDSKTWVEKIPSEKRCKATRNRSACKQLNEFLKEYKINGC 153
A2M pfam00207
Alpha-2-macroglobulin family; This family includes the C-terminal region of the ...
779-867 6.45e-34

Alpha-2-macroglobulin family; This family includes the C-terminal region of the alpha-2-macroglobulin family.



Pssm-ID: 459711 [Multi-domain]  Cd Length: 91  Bit Score: 125.78  E-value: 6.45e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  779 NWLWRVEPVDSS--KLLTVWLPDSMTTWEIHGVSLSKSKGLCVAKPTRVRVFRKFHLHLRLPISIRRFEQFELRPVLYNY 856
Cdd:pfam00207    1 TWLWDPVLVTDNgkASLSFTLPDSITTWRATAFALSPDTGLGVAEPPELVVFKPFFVDLNLPYSVRRGEQFELKATVFNY 80
                           90
                   ....*....|.
gi 1720388883  857 LNDDVAVSVHV 867
Cdd:pfam00207   81 LDKCLKVRVRL 91
YfaS super family cl34462
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function ...
136-1327 3.88e-31

Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function prediction only];


The actual alignment was detected with superfamily member COG2373:

Pssm-ID: 441940 [Multi-domain]  Cd Length: 1605  Bit Score: 134.05  E-value: 3.88e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  136 RGHIFvqTDQPIYNPGQRVRYRVFALDQKMRPSTDF-LTITVENSHGlRVLKKEIFTSTS--IFQDAFTIPDISEPGTWK 212
Cdd:COG2373    370 DAFLF--TDRGIYRPGETVHLKALLRDADGKAPAGLpLTLELTDPDG-KEVRRQTLTLNEfgGYSFSFPLPEDAPTGTWR 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  213 ISARfSDGLESNRSTHFEVKKYVLPNFEVKITPWKPYILMvpsnSDEIQLDIQARYIYGKPVQG--VAYT---------- 280
Cdd:COG2373    447 LELY-VDPKPALGSKSFRVEEFKPPRFKVDLTLDKEPLKP----GDPVTVTVDARYLFGAPAAGlkVEGEvtlrpartaf 521
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  281 ------RFALMDEQGKRTFLrgLETQAKL-VEGRTHISISKDQFQAAldkinigVRDLEgLRLYAAtaVIEsPGGemeea 353
Cdd:COG2373    522 pgypgyRFGDPDEEFEPEEL--DLGEGTLdADGKASLSLPLPDAPDA-------PGPLR-ATVEAS--VFE-SGG----- 583
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  354 eltswRFVSSAFSLDLSRtKRHLV-----------PGAHFLLQALVQEMSGSEASNVPVKVsaTLV------------SG 410
Cdd:COG2373    584 -----RPVTRSATVPVHP-ADFYVgirlplfdgdpEGAPATFEVVAVDPDGKPVAGKGLKV--ELYreewryvwyksdDG 655
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  411 S--------DSQVLDIQQSTNGIGQVSISFPiPPTVTELRLLVS-AGSLYPAIARLTVQAPPSRGT---GFLSIEpLDPR 478
Cdd:COG2373    656 GwryesqekEEPVAEGTLTTGADGPASLSLT-PVEWGRYRLEVKdPDGGLATSVRFYAGGNASWGAerpDRLELS-LDKE 733
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  479 SPSVGDTFILNLQPvgiPAPtfshyyymiiSRGQIMAMGREP--RKTV------TSVSVLVDHQLAPSFYFVAYFYHQGH 550
Cdd:COG2373    734 SYKPGETAKLLIQS---PFA----------GRALVTVERDGVleTQWVdvkgggTTVEIPVTEDWAPNAYVSATLVRPGD 800
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  551 PVANSL------LINIQSRDCEGKLQLKVDGAKEYRNADMMKLRIQTDS----KALVALGAVDMALYAVGGrsHKPLDMS 620
Cdd:COG2373    801 STANDMparaygVAPLPVDPPARRLKVELTAPEKLRPGETLTVTVKVKGaagkAAEVTLAAVDEGILNLTG--YKTPDPL 878
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  621 KVFEVINSYNVgcgpgggdDALQVFQDAGLAFSDGDRLTQTREDLSCPKEKKSRQKRNvNFqKAVSeklgQYSspdakrc 700
Cdd:COG2373    879 DFFYGKRALGV--------ETRDLYGRLIGAFGGAAGALRSGGDGALGRGGNPKPPRK-RF-KPVA----LFS------- 937
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  701 cqdGMTKLPMKRTceqraARVpqqacrepflscckfaedlrrnqtrsqahlarnnhnmlqeedlideddilvrtSFPenw 780
Cdd:COG2373    938 ---GPVKTDADGK-----ATV-----------------------------------------------------SFD--- 953
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  781 lwrvepvdssklltvwLPDSMTTWEIHGVSLSKSKGLCVAKPTRVRvfrkfhlhlrLPISIR----RF----EQFELRPV 852
Cdd:COG2373    954 ----------------LPDFNGTLRVMAVAWSDDRFGSAEATVTVR----------KPLVVRpslpRFlapgDRFELPVD 1007
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  853 LYNYLNDDVAVSVHVTPVEGLCLAGGGmmAQQVTVPAGSARPVAFSVVPTAAANVPLKVVARGvFDLGDAVSKILQIEKE 932
Cdd:COG2373   1008 VFNLTGKAGTVTVTLEASGGLTLEGEA--TQTVTLAAGGRATVRFPLKAPDAGDAKVTVTATG-GGESDAREVELPVRPA 1084
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  933 GAIHREELVYNLDPlnnlGRTLEIPGSSDPNIVPdGDFSSLVRVTASEPLETmgsegalsPGGVASLLRLPQGCAEQTMI 1012
Cdd:COG2373   1085 NPLVTRATSGVLAP----GESWTLPLDLPGGLRP-GTGSLTLSLSSSPPLDL--------AGLLRYLLRYPYGCTEQTTS 1151
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1013 YLAPTLtasnYLDRTEQWSKLSPETKDHAVDLIQKGYMRIQQFRKNDGSFGAW-LHRDSSTWLTAFVLKILSLAQEQvGN 1091
Cdd:COG2373   1152 RALPLL----YLSDLAEALGLKGDKDAELRARIQAAIARLLSMQNSDGGFGLWpGGSESDPWLTAYATDFLLEAREA-GY 1226
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1092 S-PEKLQEtaSWL--LAQQLGDGSFHDPCPVIHRAMQgglvgsdetvaltAFVVIALH-HG------LDVFQDDDAKQLK 1161
Cdd:COG2373   1227 AvPDDALD--RALdyLRNYLRNPWEIEYDDAYRLAVR-------------AYALYVLArAGkadlgdLRYLYDRRKDALS 1291
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1162 NRVEASITKANSFLGQKasagllgahaaaitayaltltKASEDLRnvahNSLMAMAEETGEHLYWGLVLGSQdkvvLRPT 1241
Cdd:COG2373   1292 PLAKAQLAAALALLGDK---------------------ARAEELL----AAALARLRETGARDYWYGDYGSP----LRDQ 1342
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1242 AprsptepvpqapalwiettayALLHLLLREG-KGKMADKAASWLTHQGSfHGAFRSTQDTVVTLDALSAYWIASHTTEE 1320
Cdd:COG2373   1343 A---------------------LALALLAELGpDAPLAPKLARWLAKALK-SGRWLSTQETAWALLALAAYARAAGASPD 1400

                   ....*..
gi 1720388883 1321 KALNVTL 1327
Cdd:COG2373   1401 FTATLTL 1407
A2M_recep pfam07677
A-macroglobulin receptor binding domain; This family includes the receptor binding domain ...
1455-1544 1.34e-30

A-macroglobulin receptor binding domain; This family includes the receptor binding domain region of the alpha-2-macroglobulin family.



Pssm-ID: 462226  Cd Length: 92  Bit Score: 116.52  E-value: 1.34e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1455 SGMAIADITLLSGFHALRADLEKLTSlsDRYVSHFETDGP-HVLLYFDSVPTTRECVGFGASQEVVVGLVQPSSAVLYDY 1533
Cdd:pfam07677    3 SNMAILEVGLPSGFVPDEEDLKKLGV--DPLIKRVETVDDgKVILYLDKLSGEPLCFSFRAEQTFPVANLKPAPVKVYDY 80
                           90
                   ....*....|.
gi 1720388883 1534 YSPDHKCSVFY 1544
Cdd:pfam07677   81 YEPERRATTFY 91
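Each hit above is summarised by a name/accession line and an "Interval E-value" line. As a sketch under stated assumptions, the snippet below parses those two lines for the five hits listed so far and orders them along the protein. The `DomainHit` type and `parse_hit` helper are hypothetical names introduced here, and the parsing assumes the accession is always the last whitespace-separated token of the name line.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    name: str
    accession: str
    start: int
    end: int
    evalue: float


def parse_hit(name_line, interval_line):
    """Parse e.g. ('A2M pfam00207', '779-867 6.45e-34') into a DomainHit."""
    name, accession = name_line.rsplit(maxsplit=1)  # accession is the last token
    interval, evalue = interval_line.split()
    start, end = (int(x) for x in interval.split("-"))
    return DomainHit(name, accession, start, end, float(evalue))


# Summary lines copied from the hit list above.
RAW_HITS = [
    ("TED_complement pfam07678", "978-1311 4.33e-129"),
    ("NTR_like super family cl02512", "1562-1716 6.33e-71"),
    ("A2M pfam00207", "779-867 6.45e-34"),
    ("YfaS super family cl34462", "136-1327 3.88e-31"),
    ("A2M_recep pfam07677", "1455-1544 1.34e-30"),
]

# Order hits by start position on the query protein.
hits = sorted((parse_hit(n, i) for n, i in RAW_HITS), key=lambda h: h.start)
for hit in hits:
    length = hit.end - hit.start + 1
    print(f"{hit.start:>5}-{hit.end:<5} {length:>5} aa  {hit.name} ({hit.accession})")
```

Sorting by start position makes it easier to see that the long YfaS interval spans the A2M and TED_complement intervals.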
 
complement_C3_C4_C5 cd02896
Proteins similar to C3, C4 and C5 of vertebrate complement. The vertebrate complement system, ...
992-1311 1.50e-112

Proteins similar to C3, C4 and C5 of vertebrate complement. The vertebrate complement system, comprised of a large number of distinct plasma proteins, is an effector of both the acquired and innate immune systems. The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propagating the classical and lectin pathways. C5 participates in the classical and alternative pathways. The thioester bond located within the structure of C3 and C4 is central to the function of complement. C5 does not contain an active thioester bond.


Pssm-ID: 239226 [Multi-domain]  Cd Length: 297  Bit Score: 358.89  E-value: 1.50e-112
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  992 SPGGVASLLRLPQGCAEQTMIYLAPTLTASNYLDRTEQWSKLSPETKDHAVDLIQKGYMRIQQFRKNDGSFGAWLHRDSS 1071
Cdd:cd02896      1 SPEGLEKLIRLPTGCGEQTMIKLAPTVYALRYLDTTNQWEKLGPERRDEALKYIRQGYQRQLSYRKPDGSYAAWKNRPSS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1072 TWLTAFVLKILSLAQEQVGNSPEKLQETASWLLAQQLGDGSFHDPCPVIHRAMQGGLVGSDETVALTAFVVIALHHGLDV 1151
Cdd:cd02896     81 TWLTAFVVKVFSLARKYIPVDQNVICGSVNWLISNQKPDGSFQEPSPVIHREMTGGVEGSEGDVSLTAFVLIALQEARSI 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1152 FQDDdakqlKNRVEASITKANSFLGQKASagllgaHAAAITAYALT---LTKASEDLRNVAHNSLMAMAEETGEHLYWGl 1228
Cdd:cd02896    161 CPPE-----VQNLDQSIRKAISYLENQLP------NLQRPYALAITayaLALADSPLSHAANRKLLSLAKRDGNGWYWW- 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1229 vlgsqdkvvlrptAPRSPTEPVPQAPALWIETTAYALLHLLLReGKGKMADKAASWLTHQGSFHGAFRSTQDTVVTLDAL 1308
Cdd:cd02896    229 -------------TIDSPYWPVPGPSAITVETTAYALLALLKL-GDIEYANPIARWLTEQRNYGGGFGSTQDTVVALQAL 294

                   ...
gi 1720388883 1309 SAY 1311
Cdd:cd02896    295 AEY 297
NTR_complement_C4 cd03584
NTR/C345C domain, complement C4 subfamily; The NTR domain found in complement C4 is also known ...
1562-1716 6.33e-71

NTR/C345C domain, complement C4 subfamily; The NTR domain found in complement C4 is also known as the C345C domain because it occurs at the C-terminus of complement C3, C4 and C5. Complement C4 is a key player in the activation of the classical complement pathway. C4 is cleaved by activated C1 to yield C4a anaphylatoxin, and the larger fragment C4b, an essential component of the C3- and C5-convertase enzymes. C4b binds covalently to the surface of pathogens through a reactive thioester. The role of the NTR/C345C domain in C4 (C4b) is unclear.


Pssm-ID: 239639  Cd Length: 153  Bit Score: 234.17  E-value: 6.33e-71
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1562 CQCAEGKCPRLLRSLERRVEDKDgyRMRFACYYPRVEYGFTVKVLREDGRAAFRLFESKITQVLHFRKDTMASIGQTRNF 1641
Cdd:cd03584      1 CQCAEGGCPKQKSTFSKEITKTD--RFDFACYSPRVDYAYVVKVLNISEKSNFELYETSITDVLQTTGDVSVKPEETRVF 78
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720388883 1642 LSRASCRLRLEPNKEYLIMGMDGETSDNKGDPQYLLDSNTWIEEMPSEQMCKSTRHRAACFQLKDFLMEFSSRGC 1716
Cdd:cd03584     79 LKRLSCKLELKKGKEYLIMGKDGATSDSNGHMQYLLDSKTWVEKIPSEKRCKATRNRSACKQLNEFLKEYKINGC 153
C345C smart00643
Netrin C-terminal Domain;
1588-1698 1.22e-29

Netrin C-terminal Domain;


Pssm-ID: 214759  Cd Length: 114  Bit Score: 114.39  E-value: 1.22e-29
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  1588 MRFACYYPRvEYGFTVKVLREDGRAAFRLFESKITQVLHFRKDTMAS-IGQTRNFLSRASCR--LRLEPNKEYLIMGMDG 1664
Cdd:smart00643    1 LEKACKSDV-DYVYKVKVLSVEEEGGFDKYTVKILEVIKSGTDELVRgKNKLRVFISRASCRcpLLLKLGKSYLIMGKSG 79
                            90       100       110
                    ....*....|....*....|....*....|....
gi 1720388883  1665 ETSDNKGDPQYLLDSNTWIEEMPSEQMCKSTRHR 1698
Cdd:smart00643   80 DLWDAKGRGQYVLGKNSWVEEWPTEEECRLRRLQ 113
NTR pfam01759
UNC-6/NTR/C345C module; Sequence similarity between netrin UNC-6 and C345C complement protein ...
1589-1698 1.63e-25

UNC-6/NTR/C345C module; Sequence similarity between netrin UNC-6 and C345C complement protein family members, and hence the existence of the UNC-6 module, was first reported in. Subsequently, many additional members of the family were identified on the basis of sequence similarity between the C-terminal domains of netrins, complement proteins C3, C4, C5, secreted frizzled-related proteins, and type I pro-collagen C-proteinase enhancer proteins (PCOLCEs), which are homologous with the N-terminal domains of tissue inhibitors of metalloproteinases (TIMPs). The TIMPs are classified as a separate family in Pfam (pfam00965). This expanded domain family has been named as the NTR module.


Pssm-ID: 396359  Cd Length: 106  Bit Score: 102.42  E-value: 1.63e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1589 RFACYypRVEYGFTVKVLREDGRAAFRLFESKITQVLHFRKDtMASIGQTRNFLSRASCR-LRLEPNKEYLIMGMDGets 1667
Cdd:pfam01759    1 KKACK--GSDYVYKVKVLSVEEEGSFDKYTVKVKEVLKEGTD-KIQRGKVRLFLKRGDCRcPQLRLGKEYLIMGKVG--- 74
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1720388883 1668 DNKGDPQYLLDSNTWIEEMPSEQMCKSTRHR 1698
Cdd:pfam01759   75 DLEGRGRYVLDKNSWVEPWPTKWECKLRELQ 105
ANATO cd00017
Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments ...
686-752 6.43e-25

Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to repeats in fibulins.


Pssm-ID: 237984  Cd Length: 70  Bit Score: 99.46  E-value: 6.43e-25
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720388883  686 SEKLGQYSSPDAKRCCQDGMTKLPMKRTCEQRAARV-PQQACREPFLSCCKFAEDLRRNQTRSQAHLA 752
Cdd:cd00017      3 SEKAAQYKDKELRKCCLDGMRENPMGQTCEERAAYItDGKECRKAFLECCVYAEELRDEEREDGLGLA 70
A2M_BRD pfam07703
Alpha-2-macroglobulin bait region domain; Alpha-2-macroglobulins (A2Ms) are plasma proteins ...
470-609 4.54e-24

Alpha-2-macroglobulin bait region domain; Alpha-2-macroglobulins (A2Ms) are plasma proteins that trap and inhibit a broad range of proteases and are major components of the eukaryotic innate immune system. However, A2M-like proteins were identified in pathogenically invasive bacteria and species that colonize higher eukaryotes. This domain is found in eukaryotic and bacterial proteins. In human A2Ms, this domain encompasses macroglobulin-like domain MG5 and 6 including bait region. In Salmonella enterica ser A2Ms, this domain encompasses MG7 and MG8 including the bait region. The Bait region is cleaved by proteases, followed by a large conformational change that blocks the target protease within a cage-like complex. This model of protease entrapment is recognized as the Venus flytrap mechanism.


Pssm-ID: 462235 [Multi-domain]  Cd Length: 139  Bit Score: 99.35  E-value: 4.54e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  470 LSIEPlDPRSPSVGDTFILNLQPVGIPAPTFSHYYYMIISRGQIMAMGRepRKTVTSVSVLVDHQLAPSFYFVAYFYHQG 549
Cdd:pfam07703    1 LHLST-DKTEYKPGETATVTVKSPFDGTVERDGFTYLVLSKGQIVVVGR--GGVTTSFSLPVTAEMAPSARVVAYYVRVD 77
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720388883  550 HP----VANSLLINIQSrDCEGKLQLKVDgAKEYRNADMMKLRIQTDSKALVALGAVDMALYAV 609
Cdd:pfam07703   78 LSkpevVADSVWVDVDD-TCENKLKVTLS-AEKYRPGSTVELKVKADPGAYVALAAVDKGVLLL 139
ANATO smart00104
Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments ...
700-734 1.52e-12

Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fold repeat in fibulins.


Pssm-ID: 197517  Cd Length: 35  Bit Score: 63.12  E-value: 1.52e-12
                            10        20        30
                    ....*....|....*....|....*....|....*
gi 1720388883   700 CCQDGMTKLPMKRTCEQRAARVPQQACREPFLSCC 734
Cdd:smart00104    1 CCADGMRLAPMGETCEERAARINSGDCRKAFLQCC 35
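
As a rough way to read the pairwise rows above, the fraction of identical aligned residues can be computed directly from the two sequence strings. The sketch below is illustrative Python and not part of the CD-Search output. The function name `alignment_identity` is hypothetical, and the code assumes that dashes and lowercase letters mark gap or unaligned columns, which is a reading of the display rather than a documented rule.

```python
from typing import Tuple


def alignment_identity(query: str, subject: str) -> Tuple[int, int]:
    """Return (identical_columns, aligned_columns) for two alignment rows.

    Assumption: '-' marks a gap and lowercase letters mark unaligned
    residues, so those columns are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same length")
    matches = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # skip gap / unaligned columns
        aligned += 1
        if q == s:
            matches += 1
    return matches, aligned


# Rows copied from the smart00104 (ANATO) hit, residues 700-734.
query = "CCQDGMTKLPMKRTCEQRAARVPQQACREPFLSCC"
model = "CCADGMRLAPMGETCEERAARINSGDCRKAFLQCC"

matches, aligned = alignment_identity(query, model)
print(f"{matches}/{aligned} identical ({100.0 * matches / aligned:.1f}%)")
```

For the smart00104 rows this counts 20 identical residues over 35 aligned columns. Longer hits would need their row segments concatenated before the call.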
 
Name Accession Description Interval E-value
TED_complement pfam07678
A-macroglobulin TED domain; This entry corresponds to the TED domain of the complement ...
978-1311 4.33e-129

A-macroglobulin TED domain; This entry corresponds to the TED domain of the complement components such as C3, C4 and C5. This domain contains a short, highly conserved region of proteinase-binding alpha-macroglobulins that includes the cysteine and glutamine of a thiol-ester bond. The bond is cleaved at the moment of proteinase binding and mediates the covalent binding of the alpha-macroglobulin to the proteinase. The GCGEQ motif is highly conserved.


Pssm-ID: 462227 [Multi-domain]  Cd Length: 311  Bit Score: 405.53  E-value: 4.33e-129
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  978 ASEPLETMGSEGALSPGGVASLLRLPQGCAEQTMIYLAPTLTASNYLDRTEQWSklsPETKDHAVDLIQKGYMRIQQFRK 1057
Cdd:pfam07678    1 ISVVGDIMGPAIQVVPENLSSLLRLPYGCGEQNMVLFAPNVYVLRYLDKTNQLT---KLIKSKAIDYLEQGYQRQLSYKH 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1058 NDGSFGAWLHRDSSTWLTAFVLKILSLAQEQVGNSPEKLQETASWLLAQQLGDGSFHDPCPVIHRAMQGGLvgsDETVAL 1137
Cdd:pfam07678   78 PDGSYSAFGHSPGSTWLTAFVLKVFAQARKFIFIDPEEICQSLRWLLSQQKPDGSFREPGPLLHRAMKGGV---DGEVSL 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1138 TAFVVIALHHGLDVFqdddakQLKNRVEASITKANSFLGQKASAGLLGAHAAAITAYALTLTKASEDlRNVAHNSLMAMA 1217
Cdd:pfam07678  155 TAYVTIALLEALDIN------GLLQRVHPSIRKALTYLEQAQLAGLTSPYTLAILAYALALAGSPET-REELLKSLDAMA 227
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1218 EETGEHLYWGLVLGSQDKVVLrptaprsptEPVPQAPALWIETTAYALLHLLLReGKGKMADKAASWLTHQGSFHGAFRS 1297
Cdd:pfam07678  228 REEGNSRYWERDEKSDPQGVP---------EYPPQAPSLEVETTAYALLAYLLL-GDLTYADPIVKWLTSQRNSHGGFSS 297
                          330
                   ....*....|....
gi 1720388883 1298 TQDTVVTLDALSAY 1311
Cdd:pfam07678  298 TQDTVVALQALAEY 311
complement_C3_C4_C5 cd02896
Proteins similar to C3, C4 and C5 of vertebrate complement. The vertebrate complement system, ...
992-1311 1.50e-112

Proteins similar to C3, C4 and C5 of vertebrate complement. The vertebrate complement system, comprised of a large number of distinct plasma proteins, is an effector of both the acquired and innate immune systems. The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propagating the classical and lectin pathways. C5 participates in the classical and alternative pathways. The thioester bond located within the structure of C3 and C4 is central to the function of complement. C5 does not contain an active thioester bond.


Pssm-ID: 239226 [Multi-domain]  Cd Length: 297  Bit Score: 358.89  E-value: 1.50e-112
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  992 SPGGVASLLRLPQGCAEQTMIYLAPTLTASNYLDRTEQWSKLSPETKDHAVDLIQKGYMRIQQFRKNDGSFGAWLHRDSS 1071
Cdd:cd02896      1 SPEGLEKLIRLPTGCGEQTMIKLAPTVYALRYLDTTNQWEKLGPERRDEALKYIRQGYQRQLSYRKPDGSYAAWKNRPSS 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1072 TWLTAFVLKILSLAQEQVGNSPEKLQETASWLLAQQLGDGSFHDPCPVIHRAMQGGLVGSDETVALTAFVVIALHHGLDV 1151
Cdd:cd02896     81 TWLTAFVVKVFSLARKYIPVDQNVICGSVNWLISNQKPDGSFQEPSPVIHREMTGGVEGSEGDVSLTAFVLIALQEARSI 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1152 FQDDdakqlKNRVEASITKANSFLGQKASagllgaHAAAITAYALT---LTKASEDLRNVAHNSLMAMAEETGEHLYWGl 1228
Cdd:cd02896    161 CPPE-----VQNLDQSIRKAISYLENQLP------NLQRPYALAITayaLALADSPLSHAANRKLLSLAKRDGNGWYWW- 228
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1229 vlgsqdkvvlrptAPRSPTEPVPQAPALWIETTAYALLHLLLReGKGKMADKAASWLTHQGSFHGAFRSTQDTVVTLDAL 1308
Cdd:cd02896    229 -------------TIDSPYWPVPGPSAITVETTAYALLALLKL-GDIEYANPIARWLTEQRNYGGGFGSTQDTVVALQAL 294

                   ...
gi 1720388883 1309 SAY 1311
Cdd:cd02896    295 AEY 297
A2M_like cd02891
Proteins similar to alpha2-macroglobulin (alpha (2)-M). Alpha (2)-M is a major carrier ...
992-1311 5.34e-74

Proteins similar to alpha2-macroglobulin (alpha (2)-M). Alpha (2)-M is a major carrier protein in serum. It is a broadly specific proteinase inhibitor. The structural thioester of alpha (2)-M is involved in the immobilization and entrapment of proteases. This group contains another broadly specific proteinase inhibitor: pregnancy zone protein (PZP). PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZP bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production, thereby protecting the allogeneic fetus from attack by the maternal immune system. This group also contains C3, C4 and C5 of vertebrate complement. The vertebrate complement system is an effector of both the acquired and innate immune systems. The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propagating the classical and lectin pathways. C5 participates in the classical and alternative pathways. The thioester bond located within the structure of C3 and C4 is central to the function of complement. C5 does not contain an active thioester bond.


Pssm-ID: 239221 [Multi-domain]  Cd Length: 282  Bit Score: 248.46  E-value: 5.34e-74
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  992 SPGGVASLLRLPQGCAEQTMIYLAPTLTASNYLDRTEQWSklsPETKDHAVDLIQKGYMRIQQFRKNDGSFGAWLHRD-S 1070
Cdd:cd02891      1 SLGNLDYLLRYPYGCGEQTMSRAAPNLYVLKYLDATGQLT---PEIREKALEYIRKGYQRLLTYQRSDGSFSAWGNSDsG 77
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1071 STWLTAFVLKILSLAQEQVGNSPEKLQETASWLLAQQLGDGSFHDPCPVIHRAMQGglvGSDETVALTAFVVIALHHGLD 1150
Cdd:cd02891     78 STWLTAYVVKFLSQARKYIDVDENVLARALGWLVPQQKEDGSFRELGPVIHREMKG---GVDDSVSLTAYVLIALAEAGK 154
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1151 VFQDDDAKQLkNRVEASITKANSFLGQkasagllgahaaAITAYALTLTKASEDlrnvAHNSLMAMAEETGEHLYWglvl 1230
Cdd:cd02891    155 ACDASIEKAL-AYLETQLDGLLDPYAL------------AILAYALALAGDSTR----ADEALKKLLEAAREKGGT---- 213
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1231 gsqdkvvlrptAPRSPTEPVPQAPALWIETTAYALLHLLLReGKGKMADKAASWLTHQGSFHGAFRSTQDTVVTLDALSA 1310
Cdd:cd02891    214 -----------AHWSLSWPGDYGSSLRVEATAYALLALLKL-GDLEEAGPIAKWLAQQRNSGGGFLSTQDTVVALQALAA 281

                   .
gi 1720388883 1311 Y 1311
Cdd:cd02891    282 Y 282
NTR_complement_C4 cd03584
NTR/C345C domain, complement C4 subfamily; The NTR domain found in complement C4 is also known ...
1562-1716 6.33e-71

NTR/C345C domain, complement C4 subfamily; The NTR domain found in complement C4 is also known as the C345C domain because it occurs at the C-terminus of complement C3, C4 and C5. Complement C4 is a key player in the activation of the classical complement pathway. C4 is cleaved by activated C1 to yield C4a anaphylatoxin and the larger fragment C4b, an essential component of the C3- and C5-convertase enzymes. C4b binds covalently to the surface of pathogens through a reactive thioester. The role of the NTR/C345C domain in C4 (C4b) is unclear.


Pssm-ID: 239639  Cd Length: 153  Bit Score: 234.17  E-value: 6.33e-71
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1562 CQCAEGKCPRLLRSLERRVEDKDgyRMRFACYYPRVEYGFTVKVLREDGRAAFRLFESKITQVLHFRKDTMASIGQTRNF 1641
Cdd:cd03584      1 CQCAEGGCPKQKSTFSKEITKTD--RFDFACYSPRVDYAYVVKVLNISEKSNFELYETSITDVLQTTGDVSVKPEETRVF 78
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720388883 1642 LSRASCRLRLEPNKEYLIMGMDGETSDNKGDPQYLLDSNTWIEEMPSEQMCKSTRHRAACFQLKDFLMEFSSRGC 1716
Cdd:cd03584     79 LKRLSCKLELKKGKEYLIMGKDGATSDSNGHMQYLLDSKTWVEKIPSEKRCKATRNRSACKQLNEFLKEYKINGC 153
A2M_2 cd02897
Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy ...
998-1311 2.71e-60

Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy zone protein (PZP). Alpha (2)-M and PZP are broadly specific proteinase inhibitors. Alpha (2)-M is a major carrier protein in serum. The structural thioester of alpha (2)-M is involved in the immobilization and entrapment of proteases. PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZP bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production, contributing to fetal survival. It has been suggested that thioester bond cleavage promotes the binding of PZP and alpha (2)-M to the CD91 receptor, clearing them from circulation.


Pssm-ID: 239227  Cd Length: 292  Bit Score: 209.36  E-value: 2.71e-60
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  998 SLLRLPQGCAEQTMIYLAPTLTASNYLDRTEQwskLSPETKDHAVDLIQKGYMRIQQFRKNDGSFGAWLHRD--SSTWLT 1075
Cdd:cd02897      7 NLLRMPYGCGEQNMVNFAPNIYVLDYLKATGQ---LTPEIESKALGFLRTGYQRQLTYKHSDGSYSAFGESDksGSTWLT 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1076 AFVLKILSLAQEQVGNSPEKLQETASWLLAQQLGDGSFHDPCPVIHRAMQGglvGSDETVALTAFVVIAL-HHGLDVfQD 1154
Cdd:cd02897     84 AFVLKSFAQARPFIYIDENVLQQALTWLSSHQKSNGCFREVGRVFHKAMQG---GVDDEVALTAYVLIALlEAGLPS-ER 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1155 DDAKQLKNRVEASITKANS--FLGQKAsagllgahaaaitayaLTLTKASEDLRNVAHNSLMAMAEETGEHLYWglvlgs 1232
Cdd:cd02897    160 PVVEKALSCLEAALDSISDpyTLALAA----------------YALTLAGSEKRPEALKKLDELAISEDGTKHW------ 217
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1233 qdkvvLRPTAPRSPTEPVPQAPALWIETTAYALLHLLLREGKGkmADKAAS---WLTHQGSFHGAFRSTQDTVVTLDALS 1309
Cdd:cd02897    218 -----SRPPPSEEGPSYYWQAPSAEVEMTAYALLALLSAGGED--LAEALPivkWLAKQRNSLGGFSSTQDTVVALQALA 290

                   ..
gi 1720388883 1310 AY 1311
Cdd:cd02897    291 KY 292
ISOPREN_C2_like cd00688
This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two ...
992-1311 2.39e-40

This group contains class II terpene cyclases, protein prenyltransferase beta subunits, two broadly specific proteinase inhibitors, alpha2-macroglobulin (alpha (2)-M) and pregnancy zone protein (PZP), and the C3, C4 and C5 components of vertebrate complement. Class II terpene cyclases include squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY); these integral membrane proteins catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. The protein prenyltransferases include protein farnesyltransferase (FTase) and geranylgeranyltransferase types I and II (GGTase-I and GGTase-II), which catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Alpha (2)-M is a major carrier protein in serum and is involved in the immobilization and entrapment of proteases. PZP is a pregnancy-associated protein. Alpha (2)-M and PZP are known to bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production, thereby protecting the allogeneic fetus from attack by the maternal immune system.


Pssm-ID: 238362 [Multi-domain]  Cd Length: 300  Bit Score: 151.93  E-value: 2.39e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  992 SPGGVASLLRLPQG--------CAEQTMIYLAPTLTASNYLDRTEqwsklspeTKDHAVDLIQKGYMRIQQFRKNDGSFG 1063
Cdd:cd00688      1 IEKHLKYLLRYPYGdghwyqslCGEQTWSTAWPLLALLLLLAATG--------IRDKADENIEKGIQRLLSYQLSDGGFS 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1064 AWLHRD-SSTWLTAFVLKILSLAQEQVGNSPEKLQETASWLLAQQLGDGSFHDPCPVIHRAMqgglvGSDETVALTAFVV 1142
Cdd:cd00688     73 GWGGNDyPSLWLTAYALKALLLAGDYIAVDRIDLARALNWLLSLQNEDGGFREDGPGNHRIG-----GDESDVRLTAYAL 147
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1143 IALHHGLDVFQDDDAKQLKNRVEASITKANSF-------LGQKASAGLlgahaaaitayalTLTKASEDLRNVAHNSLMA 1215
Cdd:cd00688    148 IALALLGKLDPDPLIEKALDYLLSCQNYDGGFgpggeshGYGTACAAA-------------ALALLGDLDSPDAKKALRW 214
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1216 MAEETGEHLYWGlvlgsqdkvvlrptapRSPTEPVPQAPALWIETTAYALLHLLLREGKGKmADKAASWLTHQGSFHGAF 1295
Cdd:cd00688    215 LLSRQRPDGGWG----------------EGRDRTNKLSDSCYTEWAAYALLALGKLGDLED-AEKLVKWLLSQQNEDGGF 277
                          330       340
                   ....*....|....*....|...
gi 1720388883 1296 RS-------TQDTVVTLDALSAY 1311
Cdd:cd00688    278 SSkpgksydTQHTVFALLALSLY 300
NTR_complement_C345C cd03574
NTR/C345C domain; The NTR domains that are found in the C-termini of complement C3, C4 and C5, ...
1569-1716 4.81e-35

NTR/C345C domain; The NTR domains found at the C-termini of complement C3, C4 and C5 are also called C345C domains. In C5, the domain interacts with various partners during the formation of the membrane attack complex, a fundamental process in the mammalian defense against infection. Its role in components C3 and C4 is not well understood.


Pssm-ID: 239629  Cd Length: 147  Bit Score: 131.36  E-value: 4.81e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1569 CPRLLRSLERRVEDkdgyRMRFACYYprVEYGFTVKVLREDGRAAFRLFESKITQVLHFRKDTMASIGQTRNFLSRASCR 1648
Cdd:cd03574      1 CPICKRELSDTCEN----LLDKACTS--VDYVYKVKVTSVEEEAGFRIYKARVTEVIKSGSDDVQNGNARRTFIIRESCD 74
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1720388883 1649 --LRLEPNKEYLIMGMDGETSDNK---GDPQYLLDSNTWIEEMPSEQMCKSTRHRAACFQLKDFLMEFSSRGC 1716
Cdd:cd03574     75 cpLRLKEGRHYLIMGSDGAFYDDRngeDRYQYVLDSNTWVEEWPTDSKCRNERQQAACDKLKKFEESMVLQGC 147
A2M pfam00207
Alpha-2-macroglobulin family; This family includes the C-terminal region of the ...
779-867 6.45e-34

Alpha-2-macroglobulin family; This family includes the C-terminal region of the alpha-2-macroglobulin family.


Pssm-ID: 459711 [Multi-domain]  Cd Length: 91  Bit Score: 125.78  E-value: 6.45e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  779 NWLWRVEPVDSS--KLLTVWLPDSMTTWEIHGVSLSKSKGLCVAKPTRVRVFRKFHLHLRLPISIRRFEQFELRPVLYNY 856
Cdd:pfam00207    1 TWLWDPVLVTDNgkASLSFTLPDSITTWRATAFALSPDTGLGVAEPPELVVFKPFFVDLNLPYSVRRGEQFELKATVFNY 80
                           90
                   ....*....|.
gi 1720388883  857 LNDDVAVSVHV 867
Cdd:pfam00207   81 LDKCLKVRVRL 91
YfaS COG2373
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function ...
136-1327 3.88e-31

Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function prediction only];


Pssm-ID: 441940 [Multi-domain]  Cd Length: 1605  Bit Score: 134.05  E-value: 3.88e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  136 RGHIFvqTDQPIYNPGQRVRYRVFALDQKMRPSTDF-LTITVENSHGlRVLKKEIFTSTS--IFQDAFTIPDISEPGTWK 212
Cdd:COG2373    370 DAFLF--TDRGIYRPGETVHLKALLRDADGKAPAGLpLTLELTDPDG-KEVRRQTLTLNEfgGYSFSFPLPEDAPTGTWR 446
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  213 ISARfSDGLESNRSTHFEVKKYVLPNFEVKITPWKPYILMvpsnSDEIQLDIQARYIYGKPVQG--VAYT---------- 280
Cdd:COG2373    447 LELY-VDPKPALGSKSFRVEEFKPPRFKVDLTLDKEPLKP----GDPVTVTVDARYLFGAPAAGlkVEGEvtlrpartaf 521
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  281 ------RFALMDEQGKRTFLrgLETQAKL-VEGRTHISISKDQFQAAldkinigVRDLEgLRLYAAtaVIEsPGGemeea 353
Cdd:COG2373    522 pgypgyRFGDPDEEFEPEEL--DLGEGTLdADGKASLSLPLPDAPDA-------PGPLR-ATVEAS--VFE-SGG----- 583
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  354 eltswRFVSSAFSLDLSRtKRHLV-----------PGAHFLLQALVQEMSGSEASNVPVKVsaTLV------------SG 410
Cdd:COG2373    584 -----RPVTRSATVPVHP-ADFYVgirlplfdgdpEGAPATFEVVAVDPDGKPVAGKGLKV--ELYreewryvwyksdDG 655
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  411 S--------DSQVLDIQQSTNGIGQVSISFPiPPTVTELRLLVS-AGSLYPAIARLTVQAPPSRGT---GFLSIEpLDPR 478
Cdd:COG2373    656 GwryesqekEEPVAEGTLTTGADGPASLSLT-PVEWGRYRLEVKdPDGGLATSVRFYAGGNASWGAerpDRLELS-LDKE 733
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  479 SPSVGDTFILNLQPvgiPAPtfshyyymiiSRGQIMAMGREP--RKTV------TSVSVLVDHQLAPSFYFVAYFYHQGH 550
Cdd:COG2373    734 SYKPGETAKLLIQS---PFA----------GRALVTVERDGVleTQWVdvkgggTTVEIPVTEDWAPNAYVSATLVRPGD 800
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  551 PVANSL------LINIQSRDCEGKLQLKVDGAKEYRNADMMKLRIQTDS----KALVALGAVDMALYAVGGrsHKPLDMS 620
Cdd:COG2373    801 STANDMparaygVAPLPVDPPARRLKVELTAPEKLRPGETLTVTVKVKGaagkAAEVTLAAVDEGILNLTG--YKTPDPL 878
                          570       580       590       600       610       620       630       640
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  621 KVFEVINSYNVgcgpgggdDALQVFQDAGLAFSDGDRLTQTREDLSCPKEKKSRQKRNvNFqKAVSeklgQYSspdakrc 700
Cdd:COG2373    879 DFFYGKRALGV--------ETRDLYGRLIGAFGGAAGALRSGGDGALGRGGNPKPPRK-RF-KPVA----LFS------- 937
                          650       660       670       680       690       700       710       720
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  701 cqdGMTKLPMKRTceqraARVpqqacrepflscckfaedlrrnqtrsqahlarnnhnmlqeedlideddilvrtSFPenw 780
Cdd:COG2373    938 ---GPVKTDADGK-----ATV-----------------------------------------------------SFD--- 953
                          730       740       750       760       770       780       790       800
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  781 lwrvepvdssklltvwLPDSMTTWEIHGVSLSKSKGLCVAKPTRVRvfrkfhlhlrLPISIR----RF----EQFELRPV 852
Cdd:COG2373    954 ----------------LPDFNGTLRVMAVAWSDDRFGSAEATVTVR----------KPLVVRpslpRFlapgDRFELPVD 1007
                          810       820       830       840       850       860       870       880
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  853 LYNYLNDDVAVSVHVTPVEGLCLAGGGmmAQQVTVPAGSARPVAFSVVPTAAANVPLKVVARGvFDLGDAVSKILQIEKE 932
Cdd:COG2373   1008 VFNLTGKAGTVTVTLEASGGLTLEGEA--TQTVTLAAGGRATVRFPLKAPDAGDAKVTVTATG-GGESDAREVELPVRPA 1084
                          890       900       910       920       930       940       950       960
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  933 GAIHREELVYNLDPlnnlGRTLEIPGSSDPNIVPdGDFSSLVRVTASEPLETmgsegalsPGGVASLLRLPQGCAEQTMI 1012
Cdd:COG2373   1085 NPLVTRATSGVLAP----GESWTLPLDLPGGLRP-GTGSLTLSLSSSPPLDL--------AGLLRYLLRYPYGCTEQTTS 1151
                          970       980       990      1000      1010      1020      1030      1040
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1013 YLAPTLtasnYLDRTEQWSKLSPETKDHAVDLIQKGYMRIQQFRKNDGSFGAW-LHRDSSTWLTAFVLKILSLAQEQvGN 1091
Cdd:COG2373   1152 RALPLL----YLSDLAEALGLKGDKDAELRARIQAAIARLLSMQNSDGGFGLWpGGSESDPWLTAYATDFLLEAREA-GY 1226
                         1050      1060      1070      1080      1090      1100      1110      1120
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1092 S-PEKLQEtaSWL--LAQQLGDGSFHDPCPVIHRAMQgglvgsdetvaltAFVVIALH-HG------LDVFQDDDAKQLK 1161
Cdd:COG2373   1227 AvPDDALD--RALdyLRNYLRNPWEIEYDDAYRLAVR-------------AYALYVLArAGkadlgdLRYLYDRRKDALS 1291
                         1130      1140      1150      1160      1170      1180      1190      1200
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1162 NRVEASITKANSFLGQKasagllgahaaaitayaltltKASEDLRnvahNSLMAMAEETGEHLYWGLVLGSQdkvvLRPT 1241
Cdd:COG2373   1292 PLAKAQLAAALALLGDK---------------------ARAEELL----AAALARLRETGARDYWYGDYGSP----LRDQ 1342
                         1210      1220      1230      1240      1250      1260      1270      1280
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1242 AprsptepvpqapalwiettayALLHLLLREG-KGKMADKAASWLTHQGSfHGAFRSTQDTVVTLDALSAYWIASHTTEE 1320
Cdd:COG2373   1343 A---------------------LALALLAELGpDAPLAPKLARWLAKALK-SGRWLSTQETAWALLALAAYARAAGASPD 1400

                   ....*..
gi 1720388883 1321 KALNVTL 1327
Cdd:COG2373   1401 FTATLTL 1407
A2M_recep pfam07677
A-macroglobulin receptor binding domain; This family includes the receptor binding domain ...
1455-1544 1.34e-30

A-macroglobulin receptor binding domain; This family includes the receptor binding domain region of the alpha-2-macroglobulin family.


Pssm-ID: 462226  Cd Length: 92  Bit Score: 116.52  E-value: 1.34e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1455 SGMAIADITLLSGFHALRADLEKLTSlsDRYVSHFETDGP-HVLLYFDSVPTTRECVGFGASQEVVVGLVQPSSAVLYDY 1533
Cdd:pfam07677    3 SNMAILEVGLPSGFVPDEEDLKKLGV--DPLIKRVETVDDgKVILYLDKLSGEPLCFSFRAEQTFPVANLKPAPVKVYDY 80
                           90
                   ....*....|.
gi 1720388883 1534 YSPDHKCSVFY 1544
Cdd:pfam07677   81 YEPERRATTFY 91
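
Each hit carries a one-line summary of the form "Pssm-ID: ...  Cd Length: ...  Bit Score: ...  E-value: ...". For readers who want to tabulate the hits, the following is a minimal Python sketch that parses such a line with a regular expression. The names `SUMMARY_RE` and `parse_summary` are hypothetical, and the pattern is inferred only from the lines shown on this page, so other CD-Search output may need adjustments.

```python
import re
from typing import Optional

# Pattern inferred from the summary lines on this page (an assumption,
# not an official format description).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> Optional[dict]:
    """Parse one hit summary line; return None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


# Summary line copied from the pfam07677 (A2M_recep) hit.
hit = parse_summary(
    "Pssm-ID: 462226  Cd Length: 92  Bit Score: 116.52  E-value: 1.34e-30"
)
print(hit)
```

Returning a plain dictionary keeps the sketch dependency-free. The optional bracketed flag is captured so that "[Multi-domain]" hits can be told apart from single-domain models.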
C345C smart00643
Netrin C-terminal Domain;
1588-1698 1.22e-29

Netrin C-terminal Domain;


Pssm-ID: 214759  Cd Length: 114  Bit Score: 114.39  E-value: 1.22e-29
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  1588 MRFACYYPRvEYGFTVKVLREDGRAAFRLFESKITQVLHFRKDTMAS-IGQTRNFLSRASCR--LRLEPNKEYLIMGMDG 1664
Cdd:smart00643    1 LEKACKSDV-DYVYKVKVLSVEEEGGFDKYTVKILEVIKSGTDELVRgKNKLRVFISRASCRcpLLLKLGKSYLIMGKSG 79
                            90       100       110
                    ....*....|....*....|....*....|....
gi 1720388883  1665 ETSDNKGDPQYLLDSNTWIEEMPSEQMCKSTRHR 1698
Cdd:smart00643   80 DLWDAKGRGQYVLGKNSWVEEWPTEEECRLRRLQ 113
NTR pfam01759
UNC-6/NTR/C345C module; Sequence similarity between netrin UNC-6 and C345C complement protein ...
1589-1698 1.63e-25

UNC-6/NTR/C345C module; Sequence similarity between netrin UNC-6 and C345C complement protein family members, and hence the existence of the UNC-6 module, was first reported in. Subsequently, many additional members of the family were identified on the basis of sequence similarity between the C-terminal domains of netrins, complement proteins C3, C4, C5, secreted frizzled-related proteins, and type I pro-collagen C-proteinase enhancer proteins (PCOLCEs), which are homologous with the N-terminal domains of tissue inhibitors of metalloproteinases (TIMPs). The TIMPs are classified as a separate family in Pfam (pfam00965). This expanded domain family has been named as the NTR module.


Pssm-ID: 396359  Cd Length: 106  Bit Score: 102.42  E-value: 1.63e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1589 RFACYypRVEYGFTVKVLREDGRAAFRLFESKITQVLHFRKDtMASIGQTRNFLSRASCR-LRLEPNKEYLIMGMDGets 1667
Cdd:pfam01759    1 KKACK--GSDYVYKVKVLSVEEEGSFDKYTVKVKEVLKEGTD-KIQRGKVRLFLKRGDCRcPQLRLGKEYLIMGKVG--- 74
                           90       100       110
                   ....*....|....*....|....*....|.
gi 1720388883 1668 DNKGDPQYLLDSNTWIEEMPSEQMCKSTRHR 1698
Cdd:pfam01759   75 DLEGRGRYVLDKNSWVEPWPTKWECKLRELQ 105
ANATO cd00017
Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments ...
686-752 6.43e-25

Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to repeats in fibulins.


Pssm-ID: 237984  Cd Length: 70  Bit Score: 99.46  E-value: 6.43e-25
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1720388883  686 SEKLGQYSSPDAKRCCQDGMTKLPMKRTCEQRAARV-PQQACREPFLSCCKFAEDLRRNQTRSQAHLA 752
Cdd:cd00017      3 SEKAAQYKDKELRKCCLDGMRENPMGQTCEERAAYItDGKECRKAFLECCVYAEELRDEEREDGLGLA 70
A2M_BRD pfam07703
Alpha-2-macroglobulin bait region domain; Alpha-2-macroglobulins (A2Ms) are plasma proteins ...
470-609 4.54e-24

Alpha-2-macroglobulin bait region domain; Alpha-2-macroglobulins (A2Ms) are plasma proteins that trap and inhibit a broad range of proteases and are major components of the eukaryotic innate immune system. However, A2M-like proteins have also been identified in pathogenically invasive bacteria and in species that colonize higher eukaryotes. This domain is found in eukaryotic and bacterial proteins. In human A2Ms, this domain encompasses macroglobulin-like domains MG5 and MG6, including the bait region. In Salmonella enterica ser A2Ms, this domain encompasses MG7 and MG8, including the bait region. The bait region is cleaved by proteases, followed by a large conformational change that traps the target protease within a cage-like complex. This model of protease entrapment is known as the Venus flytrap mechanism.


Pssm-ID: 462235 [Multi-domain]  Cd Length: 139  Bit Score: 99.35  E-value: 4.54e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  470 LSIEPlDPRSPSVGDTFILNLQPVGIPAPTFSHYYYMIISRGQIMAMGRepRKTVTSVSVLVDHQLAPSFYFVAYFYHQG 549
Cdd:pfam07703    1 LHLST-DKTEYKPGETATVTVKSPFDGTVERDGFTYLVLSKGQIVVVGR--GGVTTSFSLPVTAEMAPSARVVAYYVRVD 77
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1720388883  550 HP----VANSLLINIQSrDCEGKLQLKVDgAKEYRNADMMKLRIQTDSKALVALGAVDMALYAV 609
Cdd:pfam07703   78 LSkpevVADSVWVDVDD-TCENKLKVTLS-AEKYRPGSTVELKVKADPGAYVALAAVDKGVLLL 139
NTR_complement_C3 cd03583
NTR/C345C domain, complement C3 subfamily; The NTR domain found in complement C3 is also known ...
1564-1716 5.09e-20

NTR/C345C domain, complement C3 subfamily; The NTR domain found in complement C3 is also known as the C345C domain because it occurs at the C-terminus of complement C3, C4 and C5. Complement C3 plays a pivotal role in the activation of the complement system, as all pathways (classical, alternative, and lectin) result in the processing of C3 by C3 convertase. The larger fragment, activated C3b, contains the NTR/C345C domain and binds covalently, via a reactive thioester, to cell surface carbohydrates, including components of bacterial cell walls and immune aggregates. The smaller cleavage product, C3a, acts independently as a diffusible signal to mediate local inflammatory processes. The structure of C3 shows that the NTR/C345C domain is located in an exposed position relative to the rest of the molecule. The function of the domain in complement C3 is poorly understood.


Pssm-ID: 239638  Cd Length: 149  Bit Score: 88.18  E-value: 5.09e-20
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1564 CAEGKCpRLLRSLERRVEDKdgyRMRFACYyPRVEYGFTVKVLREDGRAAFRLFESKITQVLHFRKDTmASIGQTRNFLS 1643
Cdd:cd03583      1 CAEENC-SMQKKGDKVTNDE---RIDKACE-PGVDYVYKVKLVNVELSDSYDIYTMEILQVIKEGTDE-GPEGKTRTFIS 74
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720388883 1644 RASCR--LRLEPNKEYLIMGMDGETSDNKGDPQYLLDSNTWIEEMPSEQMCKSTRHRAACFQLKDFLMEFSSRGC 1716
Cdd:cd03583     75 HPKCReaLNLKEGKDYLIMGLSSDLWRIKDKYSYVIGKDTWIEYWPTEDECQDEENQKLCLDLAEFSEQLTVFGC 149
MG2 pfam01835
MG2 domain; This is the MG2 (macroglobulin) domain of alpha-2-macroglobulin in eukaryotes. ...
139-231 6.75e-19

MG2 domain; This is the MG2 (macroglobulin) domain of alpha-2-macroglobulin in eukaryotes. Alpha-2-macroglobulins (A2Ms) are plasma proteins that trap and inhibit a broad range of proteases and are major components of the eukaryotic innate immune system. However, A2M-like proteins have also been identified in pathogenically invasive bacteria and in species that colonize higher eukaryotes. This domain is found in eukaryotic and bacterial proteins. In human A2Ms, this domain is termed macroglobulin-like (MG) domain 2, and in Salmonella enterica ser A2Ms, it is domain 4.


Pssm-ID: 426464 [Multi-domain]  Cd Length: 95  Bit Score: 83.13  E-value: 6.75e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  139 IFVQTDQPIYNPGQRVRYRVFALDQKMRPSTD-FLTITVENSHGLRVLKKEI-FTSTSIFQDAFTIPDISEPGTWKISAR 216
Cdd:pfam01835    2 AFVYTDRGIYRPGETVHFKGLLRDQDLRPLAGlPVTLTVTDPDGNEVRRLPLtTDEFGGFSGSFPLPETAPTGTYTVVLR 81
                           90
                   ....*....|....*
gi 1720388883  217 FSDGlESNRSTHFEV 231
Cdd:pfam01835   82 DGAG-GSLGSGSFRV 95
MG3 pfam17791
Macroglobulin domain MG3; This entry corresponds to the MG3 domain found in complement ...
233-318 1.25e-18

Macroglobulin domain MG3; This entry corresponds to the MG3 domain found in complement components C3, C4 and C5.


Pssm-ID: 465509  Cd Length: 83  Bit Score: 81.93  E-value: 1.25e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883  233 KYVLPNFEVKIT-PwkPYILMvpsNSDEIQLDIQARYIYGKPVQGVAYTRFALMDEQGKRTFLrgLETQAKLVEGRTHIS 311
Cdd:pfam17791    1 EYVLPKFEVKVEvP--KFISV---KDEEFQVTICAKYTYGKPVKGKAYVTLCLKDDSKRKCFE--SFSKELDKDGCGSAS 73

                   ....*..
gi 1720388883  312 ISKDQFQ 318
Cdd:pfam17791   74 LSTEEFQ 80
ANATO pfam01821
Anaphylatoxin-like domain; C3a, C4a and C5a anaphylatoxins are protein fragments generated ...
700-734 3.25e-14

Anaphylatoxin-like domain; C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fold repeat in fibulins.


Pssm-ID: 460347  Cd Length: 36  Bit Score: 68.07  E-value: 3.25e-14
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 1720388883  700 CCQDGMTKLPMKRTCEQRAARV-PQQACREPFLSCC 734
Cdd:pfam01821    1 CCLDGMKRNPMGRSCEQRAARIkEGPRCRKAFLQCC 36
ANATO smart00104
Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments ...
700-734 1.52e-12

Anaphylatoxin homologous domain; C3a, C4a and C5a anaphylatoxins are protein fragments generated enzymatically in serum during activation of complement molecules C3, C4, and C5. They induce smooth muscle contraction. These fragments are homologous to a three-fold repeat in fibulins.


Pssm-ID: 197517  Cd Length: 35  Bit Score: 63.12  E-value: 1.52e-12
                            10        20        30
                    ....*....|....*....|....*....|....*
gi 1720388883   700 CCQDGMTKLPMKRTCEQRAARVPQQACREPFLSCC 734
Cdd:smart00104    1 CCADGMRLAPMGETCEERAARINSGDCRKAFLQCC 35
NTR_complement_C5 cd03582
NTR/C345C domain, complement C5 subfamily; The NTR domain found in complement C5 is also known ...
1587-1716 6.99e-08

NTR/C345C domain, complement C5 subfamily; The NTR domain found in complement C5 is also known as C345C because it occurs at the C-terminus of complement C3, C4 and C5. Complement C5 is activated by C5 convertase, which itself is a complex between C3b and C3 convertase. The small cleavage fragment, C5a, is the most important small peptide mediator of inflammation, and the larger active fragment, C5b, initiates late events of complement activation. The NTR/C345C domain is important in the function of C5 as it interacts with enzymes that convert C5 to the active form, C5b. The domain has also been found to bind to complement components C6 and C7, and may specifically interact with their factor I modules.


Pssm-ID: 239637  Cd Length: 150  Bit Score: 53.28  E-value: 6.99e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720388883 1587 RMRFACYyPRVEYGFTVKVLREDGRAAFRLFESKITQVLHfRKDTMASIGQTRNFLSRASC-RLRLEPNKEYLIMGMDG- 1664
Cdd:cd03582     22 RKSETCK-EQIAYAYKVMIKSSAAEGDFVTYKATVLDVLK-NGQAELEKDSEVTLVKKATCtSVELQEGQQYLIMGKEAl 99
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720388883 1665 ETSDNKG-DPQYLLDSNTWIEEMPSEQMCKSTRHRAAcfQLKDFLMEFSSRGC 1716
Cdd:cd03582    100 KIRLNRSfRYRYPLDSEAWIEWWPTDTGCPECQDFLN--QLDDFAEDLQLMGC 150
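
As a rough illustration of how the paired rows in these alignments can be read, the sketch below counts identical residues in the ANATO (pfam01821) alignment of residues 700-734 shown above. It assumes the usual CD-Search display convention that uppercase letters are aligned residues, lowercase letters are unaligned residues, and '-' marks a gap. The function name `alignment_identity` is hypothetical and is not part of any NCBI tool.

```python
def alignment_identity(query_row, subject_row):
    """Return (identical, aligned) column counts for one alignment block.

    Assumption: only columns where both rows carry an uppercase residue
    count as aligned; gaps ('-') and lowercase residues are skipped.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# Rows copied from the pfam01821 hit (query residues 700-734).
QUERY = "CCQDGMTKLPMKRTCEQRAARV-PQQACREPFLSCC"
SUBJECT = "CCLDGMKRNPMGRSCEQRAARIkEGPRCRKAFLQCC"

if __name__ == "__main__":
    identical, aligned = alignment_identity(QUERY, SUBJECT)
    print(f"{identical} identical residues over {aligned} aligned columns")
```

Percent identity computed this way is only a quick reading aid. The bit scores and E-values reported for each hit come from position-specific scoring, not from a simple identity count.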
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
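
To show how the E-value threshold relates to the hits listed on this page, here is a minimal sketch that parses a few of the "Pssm-ID" summary lines and checks each E-value against the 0.01 cutoff. The regular expression and the helper names `parse_summary` and `passes_threshold` are assumptions made for illustration. They reflect only the line layout visible in this text, not an official NCBI output format or API.

```python
import re

# Summary lines copied verbatim from three of the domain hits on this page.
SUMMARY_LINES = [
    "Pssm-ID: 462235 [Multi-domain]  Cd Length: 139  Bit Score: 99.35  E-value: 4.54e-24",
    "Pssm-ID: 239638  Cd Length: 149  Bit Score: 88.18  E-value: 5.09e-20",
    "Pssm-ID: 239637  Cd Length: 150  Bit Score: 53.28  E-value: 6.99e-08",
]

# Pattern inferred from the lines above (an assumption, not a specification).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse one summary line into a dict of typed fields."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"unrecognized summary line: {line!r}")
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("tag") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """True if the hit's E-value is at or below the search threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    for line in SUMMARY_LINES:
        hit = parse_summary(line)
        print(hit["pssm_id"], hit["e_value"], passes_threshold(hit))
```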

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-D388.
  • Lu S et al. (2020), "CDD/SPARCLE: the conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-D268.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-D203.