Conserved domains on  [gi|768001956|ref|XP_011526219|]

C3 and PZP-like alpha-2-macroglobulin domain-containing protein 8 isoform X1 [Homo sapiens]

Protein Classification

alpha-2-macroglobulin-like protein (domain architecture ID 13392528)

An alpha-2-macroglobulin-like protein may function as a proteinase inhibitor via a trapping mechanism. A peptide stretch serves as the bait region and contains cleavage sites for various proteinases. As soon as a proteinase cleaves the bait region, a conformational change traps the proteinase and significantly reduces its activity against high-molecular-weight substrates.

List of domain hits

Name Accession Description Interval E-value
A2M_2 cd02897
Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy ...
1176-1464 5.19e-145

Proteins similar to alpha-2-macroglobulin (alpha(2)-M). This group also contains the pregnancy zone protein (PZP). Alpha(2)-M and PZP are broadly specific proteinase inhibitors. Alpha(2)-M is a major carrier protein in serum. The structural thioester of alpha(2)-M is involved in the immobilization and entrapment of proteases. PZP is a trace protein in the plasma of non-pregnant females and of males, and it is elevated in pregnancy. Alpha(2)-M and PZP bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production, contributing to fetal survival. It has been suggested that thioester bond cleavage promotes the binding of PZP and alpha(2)-M to the CD91 receptor, clearing them from circulation.


Pssm-ID: 239227  Cd Length: 292  Bit Score: 450.11  E-value: 5.19e-145
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1176 PFGCGEQNMIHFAPNVFVLKYLQKTQQLSPEVERETTDYLVQGYQRQLTYKRQDGSYSAFGERDASGSMWLTAFVLKSFA 1255
Cdd:cd02897    12 PYGCGEQNMVNFAPNIYVLDYLKATGQLTPEIESKALGFLRTGYQRQLTYKHSDGSYSAFGESDKSGSTWLTAFVLKSFA 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1256 QARSFIFVDPRELAAAKSWIIQQQQADGSFLAVGRVLNKDIQGGIHGTVPLTAYVVVALLETGTASeeERGSTDKARHFL 1335
Cdd:cd02897    92 QARPFIYIDENVLQQALTWLSSHQKSNGCFREVGRVFHKAMQGGVDDEVALTAYVLIALLEAGLPS--ERPVVEKALSCL 169
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1336 ESAAPLAMDPYSCALTTYALTLLRSPAAPEALRKLRSLAIMRDGVTHWSLSnswdvdkgTFLSFSDRVSQSVVSAEVEMT 1415
Cdd:cd02897   170 EAALDSISDPYTLALAAYALTLAGSEKRPEALKKLDELAISEDGTKHWSRP--------PPSEEGPSYYWQAPSAEVEMT 241
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|.
gi 768001956 1416 AYALLTYTLLG--DVAAALPVVKWLSQQRNALGGFSSTQDTCVALQALAEY 1464
Cdd:cd02897   242 AYALLALLSAGgeDLAEALPIVKWLAKQRNSLGGFSSTQDTVVALQALAKY 292
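The paired rows above can be summarized as a percent identity. The following is a minimal Python sketch, assuming the usual CD-Search display convention that uppercase letters are aligned residues, lowercase letters are unaligned residues, and `-` marks a gap. The function name `percent_identity` is illustrative and is not part of any NCBI tool.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent identity over aligned columns of two alignment rows.

    Assumption: a column counts as aligned only when both rows carry an
    uppercase residue; lowercase residues and '-' gaps are skipped.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")
    aligned = 0
    identical = 0
    for q, s in zip(query, subject):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    if aligned == 0:
        raise ValueError("no aligned columns")
    return 100.0 * identical / aligned


# Final block of the cd02897 alignment (query residues 1416-1464).
query = "AYALLTYTLLG--DVAAALPVVKWLSQQRNALGGFSSTQDTCVALQALAEY"
subject = "AYALLALLSAGgeDLAEALPIVKWLAKQRNSLGGFSSTQDTVVALQALAKY"
print(f"{percent_identity(query, subject):.1f}% identity in this block")
```

Concatenating all blocks of a hit before calling the function would give the identity over the whole aligned interval rather than over one 80-column block.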
A2M pfam00207
Alpha-2-macroglobulin family; This family includes the C-terminal region of the ...
785-876 9.21e-34

Alpha-2-macroglobulin family; This family includes the C-terminal region of the alpha-2-macroglobulin family.


Pssm-ID: 459711 [Multi-domain]  Cd Length: 91  Bit Score: 125.39  E-value: 9.21e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956   785 TWIWHCLNISDpSGEGTLSVKVPDSITSWVGEAVALSTSQGLGIAEPSLLKTFKPFFVDFMLPALIIRGEQVKIPLSVYN 864
Cdd:pfam00207    1 TWLWDPVLVTD-NGKASLSFTLPDSITTWRATAFALSPDTGLGVAEPPELVVFKPFFVDLNLPYSVRRGEQFELKATVFN 79
                           90
                   ....*....|..
gi 768001956   865 YMGTCAEVYMKL 876
Cdd:pfam00207   80 YLDKCLKVRVRL 91
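Each hit carries a summary line (Pssm-ID, Cd Length, Bit Score, E-value), and the Cdd rows give the model positions that were aligned. The sketch below, in Python, parses such a summary line and computes what fraction of the domain model the alignment spans. The helper names `parse_summary` and `model_coverage` are hypothetical, and the regular expression assumes the field layout seen in this report.

```python
import re

# Assumed layout: "Pssm-ID: N [flag]  Cd Length: N  Bit Score: X  E-value: Y"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Extract the numeric fields from one hit summary line."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def model_coverage(model_start: int, model_end: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the aligned model positions."""
    return (model_end - model_start + 1) / cd_length


line = "Pssm-ID: 459711 [Multi-domain]  Cd Length: 91  Bit Score: 125.39  E-value: 9.21e-34"
hit = parse_summary(line)
# The pfam00207 rows above run from model position 1 to 91.
print(hit, model_coverage(1, 91, hit["cd_length"]))
```

A coverage well below 1 would suggest a partial domain match, which is worth checking before treating a hit as a complete domain.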
Methyltransf_FA pfam12248
Farnesoic acid O-methyl transferase; This domain family is found in bacteria and eukaryotes, ...
1020-1121 4.53e-32

Farnesoic acid O-methyl transferase; This domain family is found in bacteria and eukaryotes, and is approximately 110 amino acids in length. Farnesoic acid O-methyl transferase (FAMeT) is the enzyme that catalyzes the formation of methyl farnesoate (MF) from farnesoic acid (FA) in the biosynthetic pathway of juvenile hormone (JH).


Pssm-ID: 463505  Cd Length: 104  Bit Score: 121.21  E-value: 4.53e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  1020 VALSS--GPQDTAGMIEIVLGGHQNTRSWISTSKMG--EPVASAHTAKILSWDEFRTFWISWR-GGLIQVGHGPEpsnES 1094
Cdd:pfam12248    2 IALSSspYPYDSDPMYEIVIGGWGNTRSVIRRQKRGsaPDVVEVSTPGILSPDEPRMFWISWTdDGLISVGKGGE---EN 78
                           90       100
                   ....*....|....*....|....*..
gi 768001956  1095 VIVAWTLPRPPEVQFIGFSTgWGSMGE 1121
Cdd:pfam12248   79 PFLQWSDPNPLPVNYIGFST-WGSTGE 104
A2M_BRD pfam07703
Alpha-2-macroglobulin bait region domain; Alpha-2-macroglobulins (A2Ms) are plasma proteins ...
490-661 5.48e-28

Alpha-2-macroglobulin bait region domain; Alpha-2-macroglobulins (A2Ms) are plasma proteins that trap and inhibit a broad range of proteases and are major components of the eukaryotic innate immune system. However, A2M-like proteins have also been identified in pathogenically invasive bacteria and in species that colonize higher eukaryotes. This domain is found in eukaryotic and bacterial proteins. In human A2Ms, this domain encompasses macroglobulin-like domains MG5 and MG6, including the bait region. In Salmonella enterica ser A2Ms, this domain encompasses MG7 and MG8, including the bait region. The bait region is cleaved by proteases, followed by a large conformational change that blocks the target protease within a cage-like complex. This model of protease entrapment is known as the Venus flytrap mechanism.


Pssm-ID: 462235 [Multi-domain]  Cd Length: 139  Bit Score: 110.90  E-value: 5.48e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956   490 LQLQPPSHPLQVGEEAYFSVKSTC----PCNFTlYYEVAARGNIVLSGQQPAhttqqrskraapalekpirlthlsetep 565
Cdd:pfam07703    1 LHLSTDKTEYKPGETATVTVKSPFdgtvERDGF-TYLVLSKGQIVVVGRGGV---------------------------- 51
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956   566 ppapeaevdvcVTSLHLAVTPSMVPLGRLLVFYVRE---NGEGVADSLQFAVETFFENQVSVTYSANETQPGEVVDLRIR 642
Cdd:pfam07703   52 -----------TTSFSLPVTAEMAPSARVVAYYVRVdlsKPEVVADSVWVDVDDTCENKLKVTLSAEKYRPGSTVELKVK 120
                          170
                   ....*....|....*....
gi 768001956   643 AARGSCVCVAAVDKSVYLL 661
Cdd:pfam07703  121 ADPGAYVALAAVDKGVLLL 139
A2M_recep pfam07677
A-macroglobulin receptor binding domain; This family includes the receptor binding domain ...
1602-1695 1.05e-27

A-macroglobulin receptor binding domain; This family includes the receptor binding domain region of the alpha-2-macroglobulin family.


Pssm-ID: 462226  Cd Length: 92  Bit Score: 108.43  E-value: 1.05e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  1602 GSSNMAVLEVPLLSGFRADIESLEQLLLDkhMGMKRYE-VAGRRVLFYFDEIPSRcLTCVRFRALRECVVGRTSALPVSV 1680
Cdd:pfam07677    1 ESSNMAILEVGLPSGFVPDEEDLKKLGVD--PLIKRVEtVDDGKVILYLDKLSGE-PLCFSFRAEQTFPVANLKPAPVKV 77
                           90
                   ....*....|....*
gi 768001956  1681 YDYYEPAFEATRFYN 1695
Cdd:pfam07677   78 YDYYEPERRATTFYS 92
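Each alignment row gives a start coordinate, a sequence segment, and an end coordinate, so the number of residues in the segment should equal end minus start plus one. The Python sketch below parses a row and checks that invariant. The names `parse_row` and `is_consistent` are illustrative, and the regular expression assumes the row layout used in this report.

```python
import re

# Assumed row layout: label, start coordinate, segment, end coordinate.
ROW_RE = re.compile(
    r"^(?P<label>gi \d+|Cdd:\S+)\s+(?P<start>\d+)\s+"
    r"(?P<seq>[A-Za-z-]+)\s+(?P<end>\d+)$"
)


def parse_row(line: str):
    """Split one alignment row into (label, start, segment, end)."""
    m = ROW_RE.match(line.strip())
    if m is None:
        raise ValueError(f"not an alignment row: {line!r}")
    return m["label"], int(m["start"]), m["seq"], int(m["end"])


def is_consistent(start: int, seq: str, end: int) -> bool:
    """True when the residue count (letters only, gaps ignored) fits the coordinates."""
    residues = sum(ch.isalpha() for ch in seq)
    return residues == end - start + 1


rows = [
    "gi 768001956  1681 YDYYEPAFEATRFYN 1695",
    "Cdd:pfam07677   78 YDYYEPERRATTFYS 92",
]
for row in rows:
    label, start, seq, end = parse_row(row)
    print(label, start, end, is_consistent(start, seq, end))
```

A failed check usually points to a row that was damaged during copy and paste, for example by collapsed whitespace or a truncated segment.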
MG2 pfam01835
MG2 domain; This is the MG2 (macroglobulin) domain of alpha-2-macroglobulin in eukaryotes. ...
164-256 4.99e-12

MG2 domain; This is the MG2 (macroglobulin) domain of alpha-2-macroglobulin in eukaryotes. Alpha-2-macroglobulins (A2Ms) are plasma proteins that trap and inhibit a broad range of proteases and are major components of the eukaryotic innate immune system. However, A2M-like proteins were identified in pathogenically invasive bacteria and species that colonize higher eukaryotes. This domain is found in eukaryotic and bacterial proteins. In human A2Ms, this domain is termed macroglobulin-like (MG) domain 2 and in Salmonella enterica ser A2Ms, this is domain 4.


Pssm-ID: 426464 [Multi-domain]  Cd Length: 95  Bit Score: 63.87  E-value: 4.99e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956   164 SVFIQTDKPVYRPQHRVLISIFTVSPNLRPVNEK-LEAYILDPRGSRM--IEWRHLKPfccGITNMSFPLSDQPVLGEWF 240
Cdd:pfam01835    1 RAFVYTDRGIYRPGETVHFKGLLRDQDLRPLAGLpVTLTVTDPDGNEVrrLPLTTDEF---GGFSGSFPLPETAPTGTYT 77
                           90
                   ....*....|....*...
gi 768001956   241 I--FVEMQGHAYNKSFEV 256
Cdd:pfam01835   78 VvlRDGAGGSLGSGSFRV 95
YfaS super family cl34462
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function ...
158-925 1.22e-10

Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function prediction only];


The actual alignment was detected with superfamily member COG2373:

Pssm-ID: 441940 [Multi-domain]  Cd Length: 1605  Bit Score: 67.03  E-value: 1.22e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  158 VDGR----GASVFIQTDKPVYRPQHRVLISIFTVSPNLRPVNE-KLEAYILDPRGSRMIEWR-HLKPFccGITNMSFPLS 231
Cdd:COG2373   360 VGGRappgGLDAFLFTDRGIYRPGETVHLKALLRDADGKAPAGlPLTLELTDPDGKEVRRQTlTLNEF--GGYSFSFPLP 437
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  232 DQPVLGEWFIFVEMQGHA--YNKSFEVQKYVLPKFELLIDPPRYIQDLDACETGTVRARYTFGKPVAGAlminmTVNGV- 308
Cdd:COG2373   438 EDAPTGTWRLELYVDPKPalGSKSFRVEEFKPPRFKVDLTLDKEPLKPGDPVTVTVDARYLFGAPAAGL-----KVEGEv 512
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  309 -------------GYYSHEVGRPVLRTTKIL--------GSRDFDICVRDMIPADVPehfrGRVSIWAMVTSVDGsQQVA 367
Cdd:COG2373   513 tlrpartafpgypGYRFGDPDEEFEPEELDLgegtldadGKASLSLPLPDAPDAPGP----LRATVEASVFESGG-RPVT 587
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  368 FDDSTPVQRQ--LVDIRYSK----DTRKQFKPGLAYVGkvelsyPDGSPAEGVTVQIKAE--------LTPKDNIYTSEV 433
Cdd:COG2373   588 RSATVPVHPAdfYVGIRLPLfdgdPEGAPATFEVVAVD------PDGKPVAGKGLKVELYreewryvwYKSDDGGWRYES 661
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  434 VSQRGLVG-FEIPSIPTSAQHVWLETK--------VMALNGKPVGAQYLPSYlSLGSWYSPSQCYLQLQPPSHPLQVGEE 504
Cdd:COG2373   662 QEKEEPVAeGTLTTGADGPASLSLTPVewgryrleVKDPDGGLATSVRFYAG-GNASWGAERPDRLELSLDKESYKPGET 740
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  505 AYFSVKStcPcnftlyyeVAARGNIVLSGQQPAHTtqqrskraapalekpiRLTHLSETEpppapeaevdvcvTSLHLAV 584
Cdd:COG2373   741 AKLLIQS--P--------FAGRALVTVERDGVLET----------------QWVDVKGGG-------------TTVEIPV 781
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  585 TPSMVPLGRLLVFYVR--ENGEGVADSLQFAVETFF----ENQVSVTYSANET-QPGEVVDLRIRAA----RGSCVCVAA 653
Cdd:COG2373   782 TEDWAPNAYVSATLVRpgDSTANDMPARAYGVAPLPvdppARRLKVELTAPEKlRPGETLTVTVKVKgaagKAAEVTLAA 861
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  654 VDKSVYLLrSGFRlTPaqvfqeledyDVSDSFGvsredgpfwwagltaqRRRRSSVFPWpwgitkDS-GFAFTETGLVVM 732
Cdd:COG2373   862 VDEGILNL-TGYK-TP----------DPLDFFY----------------GKRALGVETR------DLyGRLIGAFGGAAG 907
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  733 TDRVslnhrqdGGLytdeavpafqphtGSLVAVAPSRhPPRTEkrkrtfFPETWIWHCLNISDPSGEGTLSVKVPDSITS 812
Cdd:COG2373   908 ALRS-------GGD-------------GALGRGGNPK-PPRKR------FKPVALFSGPVKTDADGKATVSFDLPDFNGT 960
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  813 WVGEAVALSTSQgLGIAEpsllKTF---KPFFVDFMLPALIIRGEQVKIPLSVYNYMGTCAEVYMKLSVPKGIQFVGhPG 889
Cdd:COG2373   961 LRVMAVAWSDDR-FGSAE----ATVtvrKPLVVRPSLPRFLAPGDRFELPVDVFNLTGKAGTVTVTLEASGGLTLEG-EA 1034
                         810       820       830
                  ....*....|....*....|....*....|....*.
gi 768001956  890 KRHVTkkmcVAPGEAEPIWVVLSFSDLGLNNITAKA 925
Cdd:COG2373  1035 TQTVT----LAAGGRATVRFPLKAPDAGDAKVTVTA 1066
KAZAL smart00280
Kazal type serine protease inhibitors; Kazal type serine protease inhibitors and ...
1747-1779 9.93e-09

Kazal type serine protease inhibitors; Kazal type serine protease inhibitors and follistatin-like domains.


Pssm-ID: 197624  Cd Length: 46  Bit Score: 52.68  E-value: 9.93e-09
                            10        20        30
                    ....*....|....*....|....*....|...
gi 768001956   1747 CDHDCGAQGNPVCGSDGVVYASACRLREAACRQ 1779
Cdd:smart00280    2 CPEACPREYDPVCGSDGVTYSNECHLCKAACES 34
PHA03247 super family cl33720
large tegument protein UL36; Provisional
1675-1879 6.55e-04

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 6.55e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1675 ALPVSVYDYYEPAFEATRFYNVSTHSPLARELCAGPACNEVERAPARGPGwfPGESGPAVAPEEGAAiarcgcdhdcGAQ 1754
Cdd:PHA03247 2731 ASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP--RRLTRPAVASLSESR----------ESL 2798
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1755 GNPVCGSDGVVYASACRLREAACRQAAPLEPAPPSCCALEQRLPASSSSTYGDDLASVAPGplqQDVKLNGaglevedsd 1834
Cdd:PHA03247 2799 PSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG---GDVRRRP--------- 2866
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 768001956 1835 pePEGEAEDRVTAGPRPPVSSGNLESSTQSASPFHRWGQTPAPQR 1879
Cdd:PHA03247 2867 --PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP 2909
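Several of the intervals listed above overlap, since the multi-domain hits span regions that also carry specific domain hits. As a rough illustration, the Python sketch below lists the overlapping pairs. The intervals are copied from the hit list above, and the function name `overlapping_pairs` is hypothetical.

```python
from itertools import combinations

# (name, start, end) on the query, taken from the hit list above.
hits = [
    ("A2M_2", 1176, 1464),
    ("A2M", 785, 876),
    ("Methyltransf_FA", 1020, 1121),
    ("A2M_BRD", 490, 661),
    ("A2M_recep", 1602, 1695),
    ("MG2", 164, 256),
    ("YfaS", 158, 925),
    ("KAZAL", 1747, 1779),
    ("PHA03247", 1675, 1879),
]


def overlapping_pairs(hits):
    """Return name pairs whose closed intervals share at least one residue."""
    pairs = []
    for (n1, s1, e1), (n2, s2, e2) in combinations(hits, 2):
        if s1 <= e2 and s2 <= e1:
            pairs.append((n1, n2))
    return pairs


for first, second in overlapping_pairs(hits):
    print(f"{first} overlaps {second}")
```

With these intervals, the A2M, A2M_BRD, and MG2 hits fall inside the YfaS hit, and the A2M_recep and KAZAL hits overlap the low-scoring PHA03247 hit.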
 
TED_complement pfam07678
A-macroglobulin TED domain; This entry corresponds to the TED domain of the complement ...
1155-1464 2.11e-130

A-macroglobulin TED domain; This entry corresponds to the TED domain of complement components such as C3, C4 and C5. A short, highly conserved region of proteinase-binding alpha-macroglobulins contains the cysteine and the glutamine of a thiol-ester bond that is cleaved at the moment of proteinase binding and mediates the covalent binding of the alpha-macroglobulin to the proteinase. The GCGEQ motif is highly conserved.


Pssm-ID: 462227 [Multi-domain]  Cd Length: 311  Bit Score: 410.54  E-value: 2.11e-130
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  1155 ASIIGDVMGPTLNHLNNLLRL----PFGCGEQNMIHFAPNVFVLKYLQKTQQLSPEVERETTDYLVQGYQRQLTYKRQDG 1230
Cdd:pfam07678    1 ISVVGDIMGPAIQVVPENLSSllrlPYGCGEQNMVLFAPNVYVLRYLDKTNQLTKLIKSKAIDYLEQGYQRQLSYKHPDG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  1231 SYSAFGERDasGSMWLTAFVLKSFAQARSFIFVDPRELAAAKSWIIQQQQADGSFLAVGRVLNKDIQGGIHGTVPLTAYV 1310
Cdd:pfam07678   81 SYSAFGHSP--GSTWLTAFVLKVFAQARKFIFIDPEEICQSLRWLLSQQKPDGSFREPGPLLHRAMKGGVDGEVSLTAYV 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  1311 VVALLETGTASE---EERGSTDKARHFLESAA-PLAMDPYSCALTTYALTLLRSPA-APEALRKLRSLAIMRDGVTHW-- 1383
Cdd:pfam07678  159 TIALLEALDINGllqRVHPSIRKALTYLEQAQlAGLTSPYTLAILAYALALAGSPEtREELLKSLDAMAREEGNSRYWer 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  1384 -SLSNSWDVdkgtflsfsDRVSQSVVSAEVEMTAYALLTYTLLGDVAAALPVVKWLSQQRNALGGFSSTQDTCVALQALA 1462
Cdd:pfam07678  239 dEKSDPQGV---------PEYPPQAPSLEVETTAYALLAYLLLGDLTYADPIVKWLTSQRNSHGGFSSTQDTVVALQALA 309

                   ..
gi 768001956  1463 EY 1464
Cdd:pfam07678  310 EY 311
MG4 pfam17789
Macroglobulin domain MG4; This domain is MG4 found in complement C3 and C5 proteins.
385-452 6.07e-10

Macroglobulin domain MG4; This domain is MG4 found in complement C3 and C5 proteins.


Pssm-ID: 465507  Cd Length: 95  Bit Score: 57.65  E-value: 6.07e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 768001956   385 KDTRKQFKPGLAYVGKVELSYPDGSPAEGVTVQIKA-ELTPKDNIYTSEvvsqRGLVGFEIPSIPTSAQ 452
Cdd:pfam17789    4 EKTPKYFKPGLPFSGQVLVVDPDGSPAPNVPVFIEAgNTEFNQNLTTDE----DGTAQFSINTPGNAAS 68
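Each hit in this listing carries a one-line summary of the form `Pssm-ID: … Cd Length: … Bit Score: … E-value: …`, sometimes with a `[Multi-domain]` flag. The following is a minimal sketch, assuming that layout holds for every hit, of how such a line could be turned into a record. The function name `parse_summary` and the dictionary keys are illustrative choices, not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line as it appears in this listing.
# The "[Multi-domain]" flag is optional.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s*"
    r"Bit Score:\s*([\d.]+)\s*"
    r"E-value:\s*(\S+)"
)


def parse_summary(line: str) -> dict:
    """Parse one summary line into a dict (hypothetical helper)."""
    match = SUMMARY.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    pssm_id, multi, cd_length, bit_score, e_value = match.groups()
    return {
        "pssm_id": int(pssm_id),
        "multi_domain": multi is not None,
        "cd_length": int(cd_length),
        "bit_score": float(bit_score),
        "e_value": float(e_value),
    }


mg4 = parse_summary("Pssm-ID: 465507  Cd Length: 95  Bit Score: 57.65  E-value: 6.07e-10")
yfas = parse_summary("Pssm-ID: 441940 [Multi-domain]  Cd Length: 1605  Bit Score: 61.64  E-value: 5.89e-09")
print(mg4)
print(yfas)
```

Keeping the E-value as a float makes it easy to sort or filter hits, for example to keep only those below a chosen threshold.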
YfaS COG2373
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function ...
1089-1542 5.89e-09

Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function prediction only];


Pssm-ID: 441940 [Multi-domain]  Cd Length: 1605  Bit Score: 61.64  E-value: 5.89e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1089 EPSNESVIVAWTLP-RPPevqfigfstgwgSMGEFRIwRKMEVDESYSEAFTLGVPHGAIPGSERATASI----IGDVMG 1163
Cdd:COG2373  1067 TGGGESDAREVELPvRPA------------NPLVTRA-TSGVLAPGESWTLPLDLPGGLRPGTGSLTLSLssspPLDLAG 1133
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1164 PTLNHLNNllrlPFGCGEQNMIHFAPNVFVLKyLQKTQQLSPEVERETTDYLVQGYQRQLTYKRQDGSYSAFGeRDASGS 1243
Cdd:COG2373  1134 LLRYLLRY----PYGCTEQTTSRALPLLYLSD-LAEALGLKGDKDAELRARIQAAIARLLSMQNSDGGFGLWP-GGSESD 1207
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1244 MWLTAFVLKSFAQARSF-IFVDPRELAAAKSWiiQQQQADGSflavgrvlnKDIQGGIHGTVPLTAYVVVALletgtaSE 1322
Cdd:COG2373  1208 PWLTAYATDFLLEAREAgYAVPDDALDRALDY--LRNYLRNP---------WEIEYDDAYRLAVRAYALYVL------AR 1270
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1323 EERGSTDKARHFLESAAPlAMDPYSCALttYALTLLRSPAAPealrklRSLAIMRDGVTHWSLSNSWDVDKGTFLSfsdr 1402
Cdd:COG2373  1271 AGKADLGDLRYLYDRRKD-ALSPLAKAQ--LAAALALLGDKA------RAEELLAAALARLRETGARDYWYGDYGS---- 1337
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1403 vsqsvvsaEVEMTAYALLTYTLLGDVAAALP-VVKWLSQQRNAlGGFSSTQDTCVALQALAEYA-ILSYAGGINLTVSLA 1480
Cdd:COG2373  1338 --------PLRDQALALALLAELGPDAPLAPkLARWLAKALKS-GRWLSTQETAWALLALAAYArAAGASPDFTATLTLD 1408
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 768001956 1481 STNLDYQETFELHRTNqkvlqTAAIPSLPTGLFVSAKGDGCCLMQIDVTYnVPDPVAKPAFQ 1542
Cdd:COG2373  1409 GKTLPLTGRGPLARVT-----LPAAELLAGPLTITNTGDGPLYYTLTLSG-YPAEGPPPAAS 1464
KAZAL smart00280
Kazal type serine protease inhibitors; Kazal type serine protease inhibitors and ...
1747-1779 9.93e-09

Kazal type serine protease inhibitors; Kazal type serine protease inhibitors and follistatin-like domains.


Pssm-ID: 197624  Cd Length: 46  Bit Score: 52.68  E-value: 9.93e-09
                            10        20        30
                    ....*....|....*....|....*....|...
gi 768001956   1747 CDHDCGAQGNPVCGSDGVVYASACRLREAACRQ 1779
Cdd:smart00280    2 CPEACPREYDPVCGSDGVTYSNECHLCKAACES 34
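The paired rows above can be compared column by column. As a rough sketch, the helper below counts identical residues over aligned columns for two rows of equal length. It assumes that gaps are written as `-` and that lowercase letters mark query residues not aligned to the model, so both are skipped. The name `alignment_identity` is hypothetical.

```python
def alignment_identity(query_row: str, subject_row: str) -> tuple[int, int]:
    """Return (identical, aligned) column counts for two aligned rows.

    Assumption: a column counts as aligned only when both rows hold an
    uppercase residue; '-' gaps and lowercase residues are skipped.
    """
    if len(query_row) != len(subject_row):
        raise ValueError("aligned rows must have the same length")
    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# Rows copied from the KAZAL (smart00280) alignment above.
query = "CDHDCGAQGNPVCGSDGVVYASACRLREAACRQ"
model = "CPEACPREYDPVCGSDGVTYSNECHLCKAACES"
identical, aligned = alignment_identity(query, model)
print(f"{identical}/{aligned} identical columns")
```

For multi-block alignments, the rows of each block would need to be concatenated before calling the helper.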
KAZAL_FS cd00104
Kazal type serine protease inhibitors and follistatin-like domains. Kazal inhibitors inhibit ...
1751-1791 2.32e-08

Kazal type serine protease inhibitors and follistatin-like domains. Kazal inhibitors, including avian ovomucoids, inhibit serine proteases such as trypsin, chymotrypsin, and elastases. The inhibitory domain has one reactive site peptide bond, which serves the cognate enzyme as substrate. The reactive site peptide bond is a combining loop which has an identical conformation in all Kazal inhibitors and in all enzyme/inhibitor complexes. These Kazal domains (small hydrophobic core of alpha/beta structure with 3 to 4 disulfide bonds) often occur in tandem arrays. Similar domains are also present in follistatin (FS) and follistatin-like family members, which play an important role in tissue-specific regulation. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a Kazal-like domain and has five disulfide bonds. Although the Kazal-like FS substructure is similar to Kazal proteinase inhibitors, no FS domain has yet been shown to be a proteinase inhibitor. Follistatin-like family members include SPARC (also known as BM-40 or osteonectin), the Gallus gallus Flik protein, and agrin, which has a long array of FS domains. The Kazal-type inhibitor domain has also been detected in an extracellular loop region of solute carrier 21 (SLC21) family members (organic anion transporters), which may regulate the specificity of anion uptake. The distant homolog, Ascidian trypsin inhibitor, is included in this CD.


Pssm-ID: 238052 [Multi-domain]  Cd Length: 41  Bit Score: 51.50  E-value: 2.32e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 768001956 1751 CGAQGNPVCGSDGVVYASACRLREAACRQAAPLEPAPPSCC 1791
Cdd:cd00104     1 CPKEYDPVCGSDGKTYSNECHLGCAACRSGRSITVAHNGPC 41
Kazal_2 pfam07648
Kazal-type serine protease inhibitor domain; Usually indicative of serine protease inhibitors. ...
1744-1779 2.26e-07

Kazal-type serine protease inhibitor domain; Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tandem arrays. Small alpha+beta fold containing three disulphides.


Pssm-ID: 400135  Cd Length: 50  Bit Score: 49.03  E-value: 2.26e-07
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 768001956  1744 RCGCDHDcgaQGNPVCGSDGVVYASACRLREAACRQ 1779
Cdd:pfam07648    3 NCQCPKT---EYEPVCGSDGVTYPSPCALCAAGCKL 35
PHA03247 PHA03247
large tegument protein UL36; Provisional
1675-1879 6.55e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 6.55e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1675 ALPVSVYDYYEPAFEATRFYNVSTHSPLARELCAGPACNEVERAPARGPGwfPGESGPAVAPEEGAAiarcgcdhdcGAQ 1754
Cdd:PHA03247 2731 ASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP--RRLTRPAVASLSESR----------ESL 2798
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1755 GNPVCGSDGVVYASACRLREAACRQAAPLEPAPPSCCALEQRLPASSSSTYGDDLASVAPGplqQDVKLNGaglevedsd 1834
Cdd:PHA03247 2799 PSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG---GDVRRRP--------- 2866
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 768001956 1835 pePEGEAEDRVTAGPRPPVSSGNLESSTQSASPFHRWGQTPAPQR 1879
Cdd:PHA03247 2867 --PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP 2909
 
Name Accession Description Interval E-value
A2M_2 cd02897
Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy ...
1176-1464 5.19e-145

Proteins similar to alpha2-macroglobulin (alpha (2)-M). This group also contains the pregnancy zone protein (PZP). Alpha (2)-M and PZP are broadly specific proteinase inhibitors. Alpha (2)-M is a major carrier protein in serum. The structural thioester of alpha (2)-M is involved in the immobilization and entrapment of proteases. PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZP bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production, contributing to fetal survival. It has been suggested that thioester bond cleavage promotes the binding of PZP and alpha (2)-M to the CD91 receptor, clearing them from circulation.


Pssm-ID: 239227  Cd Length: 292  Bit Score: 450.11  E-value: 5.19e-145
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1176 PFGCGEQNMIHFAPNVFVLKYLQKTQQLSPEVERETTDYLVQGYQRQLTYKRQDGSYSAFGERDASGSMWLTAFVLKSFA 1255
Cdd:cd02897    12 PYGCGEQNMVNFAPNIYVLDYLKATGQLTPEIESKALGFLRTGYQRQLTYKHSDGSYSAFGESDKSGSTWLTAFVLKSFA 91
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1256 QARSFIFVDPRELAAAKSWIIQQQQADGSFLAVGRVLNKDIQGGIHGTVPLTAYVVVALLETGTASeeERGSTDKARHFL 1335
Cdd:cd02897    92 QARPFIYIDENVLQQALTWLSSHQKSNGCFREVGRVFHKAMQGGVDDEVALTAYVLIALLEAGLPS--ERPVVEKALSCL 169
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1336 ESAAPLAMDPYSCALTTYALTLLRSPAAPEALRKLRSLAIMRDGVTHWSLSnswdvdkgTFLSFSDRVSQSVVSAEVEMT 1415
Cdd:cd02897   170 EAALDSISDPYTLALAAYALTLAGSEKRPEALKKLDELAISEDGTKHWSRP--------PPSEEGPSYYWQAPSAEVEMT 241
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|.
gi 768001956 1416 AYALLTYTLLG--DVAAALPVVKWLSQQRNALGGFSSTQDTCVALQALAEY 1464
Cdd:cd02897   242 AYALLALLSAGgeDLAEALPIVKWLAKQRNSLGGFSSTQDTVVALQALAKY 292
TED_complement pfam07678
A-macroglobulin TED domain; This entry corresponds to the TED domain of the complement ...
1155-1464 2.11e-130

A-macroglobulin TED domain; This entry corresponds to the TED domain of complement components such as C3, C4 and C5. This domain contains a short, highly conserved region of proteinase-binding alpha-macroglobulins that contains the cysteine and glutamine of a thiol-ester bond, which is cleaved at the moment of proteinase binding and mediates the covalent binding of the alpha-macroglobulin to the proteinase. The GCGEQ motif is highly conserved.


Pssm-ID: 462227 [Multi-domain]  Cd Length: 311  Bit Score: 410.54  E-value: 2.11e-130
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  1155 ASIIGDVMGPTLNHLNNLLRL----PFGCGEQNMIHFAPNVFVLKYLQKTQQLSPEVERETTDYLVQGYQRQLTYKRQDG 1230
Cdd:pfam07678    1 ISVVGDIMGPAIQVVPENLSSllrlPYGCGEQNMVLFAPNVYVLRYLDKTNQLTKLIKSKAIDYLEQGYQRQLSYKHPDG 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  1231 SYSAFGERDasGSMWLTAFVLKSFAQARSFIFVDPRELAAAKSWIIQQQQADGSFLAVGRVLNKDIQGGIHGTVPLTAYV 1310
Cdd:pfam07678   81 SYSAFGHSP--GSTWLTAFVLKVFAQARKFIFIDPEEICQSLRWLLSQQKPDGSFREPGPLLHRAMKGGVDGEVSLTAYV 158
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  1311 VVALLETGTASE---EERGSTDKARHFLESAA-PLAMDPYSCALTTYALTLLRSPA-APEALRKLRSLAIMRDGVTHW-- 1383
Cdd:pfam07678  159 TIALLEALDINGllqRVHPSIRKALTYLEQAQlAGLTSPYTLAILAYALALAGSPEtREELLKSLDAMAREEGNSRYWer 238
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  1384 -SLSNSWDVdkgtflsfsDRVSQSVVSAEVEMTAYALLTYTLLGDVAAALPVVKWLSQQRNALGGFSSTQDTCVALQALA 1462
Cdd:pfam07678  239 dEKSDPQGV---------PEYPPQAPSLEVETTAYALLAYLLLGDLTYADPIVKWLTSQRNSHGGFSSTQDTVVALQALA 309

                   ..
gi 768001956  1463 EY 1464
Cdd:pfam07678  310 EY 311
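The description of this hit notes that the GCGEQ motif is highly conserved, and the first query row of the alignment above contains it. The following is a small sketch, assuming gaps are written as `-` and that the number printed before a row is the residue number of its first residue, of how the motif's position in the query can be read off. The helper name `find_motif` is hypothetical.

```python
def find_motif(aligned_row: str, start: int, motif: str) -> list[tuple[int, int]]:
    """Return 1-based (first, last) residue numbers of each motif occurrence.

    `aligned_row` is one row of the alignment and `start` is the residue
    number printed before it. Gaps ('-') are dropped before searching.
    """
    residues = aligned_row.replace("-", "").upper()
    hits = []
    pos = residues.find(motif)
    while pos != -1:
        hits.append((start + pos, start + pos + len(motif) - 1))
        pos = residues.find(motif, pos + 1)
    return hits


# First query row of the pfam07678 alignment above (residues 1155-1230).
query_row = (
    "ASIIGDVMGPTLNHLNNLLRL----PFGCGEQNMIHFAPNVFVLKYLQKTQQLSPEVERETTDYLVQGYQRQLTYKRQDG"
)
print(find_motif(query_row, 1155, "GCGEQ"))  # [(1178, 1182)]
```

The result agrees with the other hits in this listing, whose query rows start at residue 1176 with `PFGCGEQ`.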
A2M_like cd02891
Proteins similar to alpha2-macroglobulin (alpha (2)-M). Alpha (2)-M is a major carrier ...
1176-1464 5.70e-109

Proteins similar to alpha2-macroglobulin (alpha (2)-M). Alpha (2)-M is a major carrier protein in serum. It is a broadly specific proteinase inhibitor. The structural thioester of alpha (2)-M is involved in the immobilization and entrapment of proteases. This group contains another broadly specific proteinase inhibitor: pregnancy zone protein (PZP). PZP is a trace protein in the plasma of non-pregnant females and males which is elevated in pregnancy. Alpha (2)-M and PZP bind to placental protein-14 and may modulate its activity in T-cell growth and cytokine production, thereby protecting the allogeneic fetus from attack by the maternal immune system. This group also contains C3, C4 and C5 of vertebrate complement. The vertebrate complement system is an effector of both the acquired and innate immune systems. The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propagating the classical and lectin pathways. C5 participates in the classical and alternative pathways. The thioester bond located within the structure of C3 and C4 is central to the function of complement. C5 does not contain an active thioester bond.


Pssm-ID: 239221 [Multi-domain]  Cd Length: 282  Bit Score: 348.61  E-value: 5.70e-109
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1176 PFGCGEQNMIHFAPNVFVLKYLQKTQQLSPEVERETTDYLVQGYQRQLTYKRQDGSYSAFGERDaSGSMWLTAFVLKSFA 1255
Cdd:cd02891    12 PYGCGEQTMSRAAPNLYVLKYLDATGQLTPEIREKALEYIRKGYQRLLTYQRSDGSFSAWGNSD-SGSTWLTAYVVKFLS 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1256 QARSFIFVDPRELAAAKSWIIQQQQADGSFLAVGRVLNKDIQGGIHGTVPLTAYVVVALLETGTASeeeRGSTDKARHFL 1335
Cdd:cd02891    91 QARKYIDVDENVLARALGWLVPQQKEDGSFRELGPVIHREMKGGVDDSVSLTAYVLIALAEAGKAC---DASIEKALAYL 167
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1336 ESAAPLAMDPYSCALTTYALTLLR-SPAAPEALRKLRSLAIMRDGVTHWSLsnSWDVDKGTflsfsdrvsqsvvSAEVEM 1414
Cdd:cd02891   168 ETQLDGLLDPYALAILAYALALAGdSTRADEALKKLLEAAREKGGTAHWSL--SWPGDYGS-------------SLRVEA 232
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|
gi 768001956 1415 TAYALLTYTLLGDVAAALPVVKWLSQQRNALGGFSSTQDTCVALQALAEY 1464
Cdd:cd02891   233 TAYALLALLKLGDLEEAGPIAKWLAQQRNSGGGFLSTQDTVVALQALAAY 282
complement_C3_C4_C5 cd02896
Proteins similar to C3, C4 and C5 of vertebrate complement. The vertebrate complement system, ...
1176-1464 8.66e-94

Proteins similar to C3, C4 and C5 of vertebrate complement. The vertebrate complement system, comprised of a large number of distinct plasma proteins, is an effector of both the acquired and innate immune systems. The point of convergence of the classical, alternative and lectin pathways of the complement system is the proteolytic activation of C3. C4 plays a key role in propagating the classical and lectin pathways. C5 participates in the classical and alternative pathways. The thioester bond located within the structure of C3 and C4 is central to the function of complement. C5 does not contain an active thioester bond.


Pssm-ID: 239226 [Multi-domain]  Cd Length: 297  Bit Score: 305.74  E-value: 8.66e-94
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1176 PFGCGEQNMIHFAPNVFVLKYLQKTQQ---LSPEVERETTDYLVQGYQRQLTYKRQDGSYSAFGERDasGSMWLTAFVLK 1252
Cdd:cd02896    12 PTGCGEQTMIKLAPTVYALRYLDTTNQwekLGPERRDEALKYIRQGYQRQLSYRKPDGSYAAWKNRP--SSTWLTAFVVK 89
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1253 SFAQARSFIFVDPRELAAAKSWIIQQQQADGSFLAVGRVLNKDIQGGIHGT---VPLTAYVVVALLET----GTASEEER 1325
Cdd:cd02896    90 VFSLARKYIPVDQNVICGSVNWLISNQKPDGSFQEPSPVIHREMTGGVEGSegdVSLTAFVLIALQEArsicPPEVQNLD 169
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1326 GSTDKARHFLESAAPLAMDPYSCALTTYALTLLRSPAAPEALRKLRSLAIMRDGVTHWSL--SNSWDVDKGTFLSfsdrv 1403
Cdd:cd02896   170 QSIRKAISYLENQLPNLQRPYALAITAYALALADSPLSHAANRKLLSLAKRDGNGWYWWTidSPYWPVPGPSAIT----- 244
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 768001956 1404 sqsvvsaeVEMTAYALLTYTLLGDVAAALPVVKWLSQQRNALGGFSSTQDTCVALQALAEY 1464
Cdd:cd02896   245 --------VETTAYALLALLKLGDIEYANPIARWLTEQRNYGGGFGSTQDTVVALQALAEY 297
ISOPREN_C2_like cd00688
This group contains class II terpene cyclases, protein prenyltransferases beta subunit, two ...
1179-1464 4.54e-64

This group contains class II terpene cyclases, the beta subunit of protein prenyltransferases, two broadly specific proteinase inhibitors, alpha2-macroglobulin (alpha (2)-M) and pregnancy zone protein (PZP), and the C3, C4 and C5 components of vertebrate complement. Class II terpene cyclases include squalene cyclase (SQCY) and 2,3-oxidosqualene cyclase (OSQCY); these integral membrane proteins catalyze a cationic cyclization cascade converting linear triterpenes to fused ring compounds. The protein prenyltransferases include protein farnesyltransferase (FTase) and geranylgeranyltransferase types I and II (GGTase-I and GGTase-II), which catalyze the carboxyl-terminal lipidation of Ras, Rab, and several other cellular signal transduction proteins, facilitating membrane associations and specific protein-protein interactions. Alpha (2)-M is a major carrier protein in serum and is involved in the immobilization and entrapment of proteases. PZP is a pregnancy-associated protein. Alpha (2)-M and PZP are known to bind to, and may modulate the activity of, placental protein-14 in T-cell growth and cytokine production, thereby protecting the allogeneic fetus from attack by the maternal immune system.


Pssm-ID: 238362 [Multi-domain]  Cd Length: 300  Bit Score: 220.50  E-value: 4.54e-64
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1179 CGEQNMIHFAPNVFVLKYLQKTqqlspEVERETTDYLVQGYQRQLTYKRQDGSYSAFGERDaSGSMWLTAFVLKSFAQAR 1258
Cdd:cd00688    23 CGEQTWSTAWPLLALLLLLAAT-----GIRDKADENIEKGIQRLLSYQLSDGGFSGWGGND-YPSLWLTAYALKALLLAG 96
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1259 SFIFVDPRELAAAKSWIIQQQQADGSFLAVGRVLNKdiQGGIHGTVPLTAYVVVALLETGTASEEErgSTDKARHFLESA 1338
Cdd:cd00688    97 DYIAVDRIDLARALNWLLSLQNEDGGFREDGPGNHR--IGGDESDVRLTAYALIALALLGKLDPDP--LIEKALDYLLSC 172
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1339 APLAM--------DPYSCALTTYALTLL---RSPAAPEALRKLRSLAIMRDGVTHWSLSNSWDVDkgtflsfsdrvsqsv 1407
Cdd:cd00688   173 QNYDGgfgpggesHGYGTACAAAALALLgdlDSPDAKKALRWLLSRQRPDGGWGEGRDRTNKLSD--------------- 237
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 768001956 1408 vSAEVEMTAYALLTYTLLGDVAAALPVVKWLSQQRNALGGFSS-------TQDTCVALQALAEY 1464
Cdd:cd00688   238 -SCYTEWAAYALLALGKLGDLEDAEKLVKWLLSQQNEDGGFSSkpgksydTQHTVFALLALSLY 300
A2M pfam00207
Alpha-2-macroglobulin family; This family includes the C-terminal region of the ...
785-876 9.21e-34

Alpha-2-macroglobulin family; This family includes the C-terminal region of the alpha-2-macroglobulin family.


Pssm-ID: 459711 [Multi-domain]  Cd Length: 91  Bit Score: 125.39  E-value: 9.21e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956   785 TWIWHCLNISDpSGEGTLSVKVPDSITSWVGEAVALSTSQGLGIAEPSLLKTFKPFFVDFMLPALIIRGEQVKIPLSVYN 864
Cdd:pfam00207    1 TWLWDPVLVTD-NGKASLSFTLPDSITTWRATAFALSPDTGLGVAEPPELVVFKPFFVDLNLPYSVRRGEQFELKATVFN 79
                           90
                   ....*....|..
gi 768001956   865 YMGTCAEVYMKL 876
Cdd:pfam00207   80 YLDKCLKVRVRL 91
Methyltransf_FA pfam12248
Farnesoic acid O-methyl transferase; This domain family is found in bacteria and eukaryotes, ...
1020-1121 4.53e-32

Farnesoic acid O-methyl transferase; This domain family is found in bacteria and eukaryotes, and is approximately 110 amino acids in length. Farnesoic acid O-methyl transferase (FAMeT) is the enzyme that catalyzes the formation of methyl farnesoate (MF) from farnesoic acid (FA) in the biosynthetic pathway of juvenile hormone (JH).


Pssm-ID: 463505  Cd Length: 104  Bit Score: 121.21  E-value: 4.53e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  1020 VALSS--GPQDTAGMIEIVLGGHQNTRSWISTSKMG--EPVASAHTAKILSWDEFRTFWISWR-GGLIQVGHGPEpsnES 1094
Cdd:pfam12248    2 IALSSspYPYDSDPMYEIVIGGWGNTRSVIRRQKRGsaPDVVEVSTPGILSPDEPRMFWISWTdDGLISVGKGGE---EN 78
                           90       100
                   ....*....|....*....|....*..
gi 768001956  1095 VIVAWTLPRPPEVQFIGFSTgWGSMGE 1121
Cdd:pfam12248   79 PFLQWSDPNPLPVNYIGFST-WGSTGE 104
A2M_BRD pfam07703
Alpha-2-macroglobulin bait region domain; Alpha-2-macroglobulins (A2Ms) are plasma proteins ...
490-661 5.48e-28

Alpha-2-macroglobulin bait region domain; Alpha-2-macroglobulins (A2Ms) are plasma proteins that trap and inhibit a broad range of proteases and are major components of the eukaryotic innate immune system. However, A2M-like proteins were identified in pathogenically invasive bacteria and species that colonize higher eukaryotes. This domain is found in eukaryotic and bacterial proteins. In human A2Ms, this domain encompasses macroglobulin-like domain MG5 and 6 including bait region. In Salmonella enterica ser A2Ms, this domain encompasses MG7 and MG8 including the bait region. The Bait region is cleaved by proteases, followed by a large conformational change that blocks the target protease within a cage-like complex. This model of protease entrapment is recognized as the Venus flytrap mechanism.


Pssm-ID: 462235 [Multi-domain]  Cd Length: 139  Bit Score: 110.90  E-value: 5.48e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956   490 LQLQPPSHPLQVGEEAYFSVKSTC----PCNFTlYYEVAARGNIVLSGQQPAhttqqrskraapalekpirlthlsetep 565
Cdd:pfam07703    1 LHLSTDKTEYKPGETATVTVKSPFdgtvERDGF-TYLVLSKGQIVVVGRGGV---------------------------- 51
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956   566 ppapeaevdvcVTSLHLAVTPSMVPLGRLLVFYVRE---NGEGVADSLQFAVETFFENQVSVTYSANETQPGEVVDLRIR 642
Cdd:pfam07703   52 -----------TTSFSLPVTAEMAPSARVVAYYVRVdlsKPEVVADSVWVDVDDTCENKLKVTLSAEKYRPGSTVELKVK 120
                          170
                   ....*....|....*....
gi 768001956   643 AARGSCVCVAAVDKSVYLL 661
Cdd:pfam07703  121 ADPGAYVALAAVDKGVLLL 139
A2M_recep pfam07677
A-macroglobulin receptor binding domain; This family includes the receptor binding domain ...
1602-1695 1.05e-27

A-macroglobulin receptor binding domain; This family includes the receptor binding domain region of the alpha-2-macroglobulin family.


Pssm-ID: 462226  Cd Length: 92  Bit Score: 108.43  E-value: 1.05e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  1602 GSSNMAVLEVPLLSGFRADIESLEQLLLDkhMGMKRYE-VAGRRVLFYFDEIPSRcLTCVRFRALRECVVGRTSALPVSV 1680
Cdd:pfam07677    1 ESSNMAILEVGLPSGFVPDEEDLKKLGVD--PLIKRVEtVDDGKVILYLDKLSGE-PLCFSFRAEQTFPVANLKPAPVKV 77
                           90
                   ....*....|....*
gi 768001956  1681 YDYYEPAFEATRFYN 1695
Cdd:pfam07677   78 YDYYEPERRATTFYS 92
MG2 pfam01835
MG2 domain; This is the MG2 (macroglobulin) domain of alpha-2-macroglobulin in eukaryotes. ...
164-256 4.99e-12

MG2 domain; This is the MG2 (macroglobulin) domain of alpha-2-macroglobulin in eukaryotes. Alpha-2-macroglobulins (A2Ms) are plasma proteins that trap and inhibit a broad range of proteases and are major components of the eukaryotic innate immune system. However, A2M-like proteins were identified in pathogenically invasive bacteria and species that colonize higher eukaryotes. This domain is found in eukaryotic and bacterial proteins. In human A2Ms, this domain is termed macroglobulin-like (MG) domain 2 and in Salmonella enterica ser A2Ms, this is domain 4.


Pssm-ID: 426464 [Multi-domain]  Cd Length: 95  Bit Score: 63.87  E-value: 4.99e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956   164 SVFIQTDKPVYRPQHRVLISIFTVSPNLRPVNEK-LEAYILDPRGSRM--IEWRHLKPfccGITNMSFPLSDQPVLGEWF 240
Cdd:pfam01835    1 RAFVYTDRGIYRPGETVHFKGLLRDQDLRPLAGLpVTLTVTDPDGNEVrrLPLTTDEF---GGFSGSFPLPETAPTGTYT 77
                           90
                   ....*....|....*...
gi 768001956   241 I--FVEMQGHAYNKSFEV 256
Cdd:pfam01835   78 VvlRDGAGGSLGSGSFRV 95
YfaS COG2373
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function ...
158-925 1.22e-10

Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function prediction only];


Pssm-ID: 441940 [Multi-domain]  Cd Length: 1605  Bit Score: 67.03  E-value: 1.22e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  158 VDGR----GASVFIQTDKPVYRPQHRVLISIFTVSPNLRPVNE-KLEAYILDPRGSRMIEWR-HLKPFccGITNMSFPLS 231
Cdd:COG2373   360 VGGRappgGLDAFLFTDRGIYRPGETVHLKALLRDADGKAPAGlPLTLELTDPDGKEVRRQTlTLNEF--GGYSFSFPLP 437
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  232 DQPVLGEWFIFVEMQGHA--YNKSFEVQKYVLPKFELLIDPPRYIQDLDACETGTVRARYTFGKPVAGAlminmTVNGV- 308
Cdd:COG2373   438 EDAPTGTWRLELYVDPKPalGSKSFRVEEFKPPRFKVDLTLDKEPLKPGDPVTVTVDARYLFGAPAAGL-----KVEGEv 512
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  309 -------------GYYSHEVGRPVLRTTKIL--------GSRDFDICVRDMIPADVPehfrGRVSIWAMVTSVDGsQQVA 367
Cdd:COG2373   513 tlrpartafpgypGYRFGDPDEEFEPEELDLgegtldadGKASLSLPLPDAPDAPGP----LRATVEASVFESGG-RPVT 587
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  368 FDDSTPVQRQ--LVDIRYSK----DTRKQFKPGLAYVGkvelsyPDGSPAEGVTVQIKAE--------LTPKDNIYTSEV 433
Cdd:COG2373   588 RSATVPVHPAdfYVGIRLPLfdgdPEGAPATFEVVAVD------PDGKPVAGKGLKVELYreewryvwYKSDDGGWRYES 661
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  434 VSQRGLVG-FEIPSIPTSAQHVWLETK--------VMALNGKPVGAQYLPSYlSLGSWYSPSQCYLQLQPPSHPLQVGEE 504
Cdd:COG2373   662 QEKEEPVAeGTLTTGADGPASLSLTPVewgryrleVKDPDGGLATSVRFYAG-GNASWGAERPDRLELSLDKESYKPGET 740
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  505 AYFSVKStcPcnftlyyeVAARGNIVLSGQQPAHTtqqrskraapalekpiRLTHLSETEpppapeaevdvcvTSLHLAV 584
Cdd:COG2373   741 AKLLIQS--P--------FAGRALVTVERDGVLET----------------QWVDVKGGG-------------TTVEIPV 781
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  585 TPSMVPLGRLLVFYVR--ENGEGVADSLQFAVETFF----ENQVSVTYSANET-QPGEVVDLRIRAA----RGSCVCVAA 653
Cdd:COG2373   782 TEDWAPNAYVSATLVRpgDSTANDMPARAYGVAPLPvdppARRLKVELTAPEKlRPGETLTVTVKVKgaagKAAEVTLAA 861
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  654 VDKSVYLLrSGFRlTPaqvfqeledyDVSDSFGvsredgpfwwagltaqRRRRSSVFPWpwgitkDS-GFAFTETGLVVM 732
Cdd:COG2373   862 VDEGILNL-TGYK-TP----------DPLDFFY----------------GKRALGVETR------DLyGRLIGAFGGAAG 907
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  733 TDRVslnhrqdGGLytdeavpafqphtGSLVAVAPSRhPPRTEkrkrtfFPETWIWHCLNISDPSGEGTLSVKVPDSITS 812
Cdd:COG2373   908 ALRS-------GGD-------------GALGRGGNPK-PPRKR------FKPVALFSGPVKTDADGKATVSFDLPDFNGT 960
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956  813 WVGEAVALSTSQgLGIAEpsllKTF---KPFFVDFMLPALIIRGEQVKIPLSVYNYMGTCAEVYMKLSVPKGIQFVGhPG 889
Cdd:COG2373   961 LRVMAVAWSDDR-FGSAE----ATVtvrKPLVVRPSLPRFLAPGDRFELPVDVFNLTGKAGTVTVTLEASGGLTLEG-EA 1034
                         810       820       830
                  ....*....|....*....|....*....|....*.
gi 768001956  890 KRHVTkkmcVAPGEAEPIWVVLSFSDLGLNNITAKA 925
Cdd:COG2373  1035 TQTVT----LAAGGRATVRFPLKAPDAGDAKVTVTA 1066
MG3 pfam17791
Macroglobulin domain MG3; This entry corresponds to the MG3 domain found in complement ...
258-324 2.45e-09

Macroglobulin domain MG3; This entry corresponds to the MG3 domain found in complement components C3, C4 and C5.


Pssm-ID: 465509  Cd Length: 83  Bit Score: 55.74  E-value: 2.45e-09
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 768001956   258 KYVLPKFELLIDPPRYIQDLDACETGTVRARYTFGKPVAGALMINmtvngVGYYSHEVGRPVLRTTK 324
Cdd:pfam17791    1 EYVLPKFEVKVEVPKFISVKDEEFQVTICAKYTYGKPVKGKAYVT-----LCLKDDSKRKCFESFSK 62
YfaS COG2373
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function ...
1089-1542 5.89e-09

Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function prediction only];


Pssm-ID: 441940 [Multi-domain]  Cd Length: 1605  Bit Score: 61.64  E-value: 5.89e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1089 EPSNESVIVAWTLP-RPPevqfigfstgwgSMGEFRIwRKMEVDESYSEAFTLGVPHGAIPGSERATASI----IGDVMG 1163
Cdd:COG2373  1067 TGGGESDAREVELPvRPA------------NPLVTRA-TSGVLAPGESWTLPLDLPGGLRPGTGSLTLSLssspPLDLAG 1133
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1164 PTLNHLNNllrlPFGCGEQNMIHFAPNVFVLKyLQKTQQLSPEVERETTDYLVQGYQRQLTYKRQDGSYSAFGeRDASGS 1243
Cdd:COG2373  1134 LLRYLLRY----PYGCTEQTTSRALPLLYLSD-LAEALGLKGDKDAELRARIQAAIARLLSMQNSDGGFGLWP-GGSESD 1207
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1244 MWLTAFVLKSFAQARSF-IFVDPRELAAAKSWiiQQQQADGSflavgrvlnKDIQGGIHGTVPLTAYVVVALletgtaSE 1322
Cdd:COG2373  1208 PWLTAYATDFLLEAREAgYAVPDDALDRALDY--LRNYLRNP---------WEIEYDDAYRLAVRAYALYVL------AR 1270
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1323 EERGSTDKARHFLESAAPlAMDPYSCALttYALTLLRSPAAPealrklRSLAIMRDGVTHWSLSNSWDVDKGTFLSfsdr 1402
Cdd:COG2373  1271 AGKADLGDLRYLYDRRKD-ALSPLAKAQ--LAAALALLGDKA------RAEELLAAALARLRETGARDYWYGDYGS---- 1337
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1403 vsqsvvsaEVEMTAYALLTYTLLGDVAAALP-VVKWLSQQRNAlGGFSSTQDTCVALQALAEYA-ILSYAGGINLTVSLA 1480
Cdd:COG2373  1338 --------PLRDQALALALLAELGPDAPLAPkLARWLAKALKS-GRWLSTQETAWALLALAAYArAAGASPDFTATLTLD 1408
                         410       420       430       440       450       460
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 768001956 1481 STNLDYQETFELHRTNqkvlqTAAIPSLPTGLFVSAKGDGCCLMQIDVTYnVPDPVAKPAFQ 1542
Cdd:COG2373  1409 GKTLPLTGRGPLARVT-----LPAAELLAGPLTITNTGDGPLYYTLTLSG-YPAEGPPPAAS 1464
KAZAL smart00280
Kazal type serine protease inhibitors; Kazal type serine protease inhibitors and ...
1747-1779 9.93e-09

Kazal type serine protease inhibitors; Kazal type serine protease inhibitors and follistatin-like domains.


Pssm-ID: 197624  Cd Length: 46  Bit Score: 52.68  E-value: 9.93e-09
                            10        20        30
                    ....*....|....*....|....*....|...
gi 768001956   1747 CDHDCGAQGNPVCGSDGVVYASACRLREAACRQ 1779
Cdd:smart00280    2 CPEACPREYDPVCGSDGVTYSNECHLCKAACES 34
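
The paired rows in each alignment block can be compared column by column to estimate how closely the query matches the domain model. The following is a minimal Python sketch, not part of the CD-Search output. It assumes that uppercase letters mark aligned residues, and that lowercase letters and dashes mark unaligned residues and gaps. The function name `alignment_identity` is hypothetical.

```python
def alignment_identity(query_row, subject_row):
    """Count identical and aligned columns in one pair of alignment rows.

    Assumption: a column counts as aligned only when both rows carry an
    uppercase residue letter; lowercase letters and '-' are skipped.
    Returns a tuple (identical_columns, aligned_columns).
    """
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have the same length")

    identical = 0
    aligned = 0
    for q, s in zip(query_row, subject_row):
        if q.isupper() and s.isupper():
            aligned += 1
            if q == s:
                identical += 1
    return identical, aligned


# Rows copied from the KAZAL (smart00280) hit above.
query = "CDHDCGAQGNPVCGSDGVVYASACRLREAACRQ"
model = "CPEACPREYDPVCGSDGVTYSNECHLCKAACES"
identical, aligned = alignment_identity(query, model)
print(f"{identical} of {aligned} aligned columns are identical")
```

Multi-line alignments such as the COG2373 hit would need their row segments concatenated before calling the function.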
KAZAL_FS cd00104
Kazal type serine protease inhibitors and follistatin-like domains. Kazal inhibitors inhibit ...
1751-1791 2.32e-08

Kazal type serine protease inhibitors and follistatin-like domains. Kazal inhibitors inhibit serine proteases, such as trypsin, chymotrypsin, avian ovomucoids, and elastases. The inhibitory domain has one reactive site peptide bond, which serves the cognate enzyme as substrate. The reactive site peptide bond is a combining loop which has an identical conformation in all Kazal inhibitors and in all enzyme/inhibitor complexes. These Kazal domains (small hydrophobic core of alpha/beta structure with 3 to 4 disulfide bonds) often occur in tandem arrays. Similar domains are also present in follistatin (FS) and follistatin-like family members, which play an important role in tissue specific regulation. The FS domain consists of an N-terminal beta hairpin (FOLN/EGF-like domain) and a Kazal-like domain and has five disulfide bonds. Although the Kazal-like FS substructure is similar to Kazal proteinase inhibitors, no FS domain has yet been shown to be a proteinase inhibitor. Follistatin-like family members include SPARC (also known as BM-40 or osteonectin), the Gallus gallus Flik protein, and agrin, which has a long array of FS domains. The kazal-type inhibitor domain has also been detected in an extracellular loop region of solute carrier 21 (SLC21) family members (organic anion transporters), which may regulate the specificity of anion uptake. The distant homolog, Ascidian trypsin inhibitor, is included in this CD.


Pssm-ID: 238052 [Multi-domain]  Cd Length: 41  Bit Score: 51.50  E-value: 2.32e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 768001956 1751 CGAQGNPVCGSDGVVYASACRLREAACRQAAPLEPAPPSCC 1791
Cdd:cd00104     1 CPKEYDPVCGSDGKTYSNECHLGCAACRSGRSITVAHNGPC 41
Kazal_2 pfam07648
Kazal-type serine protease inhibitor domain; Usually indicative of serine protease inhibitors. ...
1744-1779 2.26e-07

Kazal-type serine protease inhibitor domain; Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tandem arrays. Small alpha+beta fold containing three disulphides.


Pssm-ID: 400135  Cd Length: 50  Bit Score: 49.03  E-value: 2.26e-07
                           10        20        30
                   ....*....|....*....|....*....|....*.
gi 768001956  1744 RCGCDHDcgaQGNPVCGSDGVVYASACRLREAACRQ 1779
Cdd:pfam07648    3 NCQCPKT---EYEPVCGSDGVTYPSPCALCAAGCKL 35
PHA03247 PHA03247
large tegument protein UL36; Provisional
1675-1879 6.55e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 6.55e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1675 ALPVSVYDYYEPAFEATRFYNVSTHSPLARELCAGPACNEVERAPARGPGwfPGESGPAVAPEEGAAiarcgcdhdcGAQ 1754
Cdd:PHA03247 2731 ASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPP--RRLTRPAVASLSESR----------ESL 2798
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1755 GNPVCGSDGVVYASACRLREAACRQAAPLEPAPPSCCALEQRLPASSSSTYGDDLASVAPGplqQDVKLNGaglevedsd 1834
Cdd:PHA03247 2799 PSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG---GDVRRRP--------- 2866
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 768001956 1835 pePEGEAEDRVTAGPRPPVSSGNLESSTQSASPFHRWGQTPAPQR 1879
Cdd:PHA03247 2867 --PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPP 2909
KAZAL_SLC21 cd01330
The kazal-type serine protease inhibitor domain has been detected in an extracellular loop ...
1743-1770 1.08e-03

The kazal-type serine protease inhibitor domain has been detected in an extracellular loop region of solute carrier 21 (SLC21) family members (organic anion transporters), which may regulate the specificity of anion uptake. The KAZAL_SLC21 domain is a member of the superfamily of kazal-like proteinase inhibitors and follistatin-like proteins.


Pssm-ID: 238650 [Multi-domain]  Cd Length: 54  Bit Score: 38.82  E-value: 1.08e-03
                          10        20
                  ....*....|....*....|....*...
gi 768001956 1743 ARCGCDhdcGAQGNPVCGSDGVVYASAC 1770
Cdd:cd01330     7 SNCSCS---ESAYSPVCGENGITYFSPC 31
CAL1 COG5029
Prenyltransferase, beta subunit [Posttranslational modification, protein turnover, chaperones, ...
1224-1426 3.44e-03

Prenyltransferase, beta subunit [Posttranslational modification, protein turnover, chaperones, Lipid transport and metabolism];


Pssm-ID: 444045 [Multi-domain]  Cd Length: 259  Bit Score: 41.23  E-value: 3.44e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1224 TYKRQDGSY-SAFGErdASGSMWLTAFVLKSfAQARSFIFVDPRELAAaksWIIQQQQADGSF-LAVGRVLNKDIqggih 1301
Cdd:COG5029    76 SLRVEDGGFaKAPEG--GAGSTYHTYLATLL-AELLGRPPPDPDRLVR---FLISQQNDDGGFeISPGRRSDTNP----- 144
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 768001956 1302 gtvplTAYVVVALLETGTASEEERgsTDKARHFLESA------APLAMDPYSCALTTYALTLlrspaapeALRKLRSLAI 1375
Cdd:COG5029   145 -----TAAAIGALRALGALDDPIE--TKVIRFLRDVQspeggfAYNTRIGEADLLSTFTAIL--------TLYDLGAAPK 209
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|.
gi 768001956 1376 MRDGVTHWSLSNswDVDKGtflSFSDRVSQSVvsAEVEMTAYALLTYTLLG 1426
Cdd:COG5029   210 LVDDLQAYILSL--QLPDG---GFEGAPWDGV--EDVEYTFYGVGALALLG 253
Kazal_1 pfam00050
Kazal-type serine protease inhibitor domain; Usually indicative of serine protease inhibitors. ...
1748-1770 3.87e-03

Kazal-type serine protease inhibitor domain; Usually indicative of serine protease inhibitors. However, kazal-like domains are also seen in the extracellular part of agrins, which are not known to be protease inhibitors. Kazal domains often occur in tandem arrays. Small alpha+beta fold containing three disulphides. Alignment also includes a single domain from transporters in the OATP/PGT family.


Pssm-ID: 395004  Cd Length: 49  Bit Score: 36.88  E-value: 3.87e-03
                           10        20
                   ....*....|....*....|...
gi 768001956  1748 DHDCGAQGNPVCGSDGVVYASAC 1770
Cdd:pfam00050    6 SGACPRIYDPVCGTDGKTYSNEC 28
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
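
Each domain hit on this page carries a summary line with a fixed layout (`Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...`), so the hits can be parsed and checked against the E-value threshold listed above. The following is a minimal Python sketch that assumes this layout. The names `parse_hit_summary` and `passes_threshold` are hypothetical and do not belong to any NCBI tool, and treating the threshold as inclusive is also an assumption.

```python
import re

# Pattern for the per-hit summary line as it appears on this page.
# The optional "[Multi-domain]" tag is present on some hits (e.g. COG2373).
HIT_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Return a dict of the fields in one summary line, or None if no match."""
    m = HIT_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "multi_domain": m.group("tag") == "Multi-domain",
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


# Summary line copied from the YfaS (COG2373) hit on this page.
line = "Pssm-ID: 441940 [Multi-domain]  Cd Length: 1605  Bit Score: 61.64  E-value: 5.89e-09"
hit = parse_hit_summary(line)
print(hit["pssm_id"], hit["e_value"], passes_threshold(hit))
```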

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.