NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|93141047|ref|NP_004361|]
View 

collagen alpha-1(XII) chain long isoform precursor [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
1198-1361 6.39e-92

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


:

Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 295.35  E-value: 6.39e-92
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1198 ADIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTL 1277
Cdd:cd01482    1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAFEIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNTR 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1278 TGMALNFIRQQNFRTQAGMRPRARKIGVLITDGKSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDTHAYN 1357
Cdd:cd01482   81 TGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVFN 160

                 ....
gi 93141047 1358 VADF 1361
Cdd:cd01482  161 VADF 164
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
139-302 2.06e-86

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


:

Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 279.56  E-value: 2.06e-86
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  139 TDLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTM 218
Cdd:cd01482    1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAFEIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNTR 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  219 TGDAIDYLVKNTFTESAGARVGFPKVAIIITDGKSQDEVEIPARELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFN 298
Cdd:cd01482   81 TGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVFN 160

                 ....
gi 93141047  299 VANF 302
Cdd:cd01482  161 VADF 164
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
439-602 2.40e-86

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


:

Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 279.56  E-value: 2.40e-86
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  439 ADIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTN 518
Cdd:cd01482    1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAFEIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNTR 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  519 TGKAMTYVREKIFVPSKGSRSNVPKVMILITDGKSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSELEAIASPPAETHVFT 598
Cdd:cd01482   81 TGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVFN 160

                 ....
gi 93141047  599 VEDF 602
Cdd:cd01482  161 VADF 164
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
2322-2486 3.07e-83

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


:

Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 270.70  E-value: 3.07e-83
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2322 ADIVFLTDASWSIGDDNFNKVVKFIFNTVGGFdEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYRGGNT 2401
Cdd:cd01482    1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAF-EIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNT 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2402 RTGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQDEVKKAALVIQQSGFSVFVVGVADVDYNELANIASKPSERHVF 2481
Cdd:cd01482   80 RTGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVF 159

                 ....*
gi 93141047 2482 IVDDF 2486
Cdd:cd01482  160 NVADF 164
TSPN smart00210
Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of ...
2520-2712 3.13e-59

Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of thrombospondin


:

Pssm-ID: 214560  Cd Length: 184  Bit Score: 202.59  E-value: 3.13e-59
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    2520 GFKMLEAYNLTEKNFASVQGVSLEsgsfPSYSAYRIQKNAFVNQPTADLHPNGLPPSYTIILLFRLlpeTPSDPFAIWQI 2599
Cdd:smart00210    1 GQDLLQVFDLPSLSFAIRQVVGPE----PGSPAYRLGDPALVPQPTRDLFPSGLPEDFSLLTTFRQ---TPKSRGVLFAI 73
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    2600 TDRDYKPQVGVIADPSSKTLSFFNKDTRGEVQTVTFdteEVKTLFYGSFHKVHIVVTSKSVKIYIDCYEIIEKDIKEAG- 2678
Cdd:smart00210   74 YDAQNVRQFGLEVDGRANTLLLRYQGVDGKQHTVSF---RNLPLADGQWHKLALSVSGSSATLYVDCNEIDSRPLDRPGq 150
                           170       180       190
                    ....*....|....*....|....*....|....*
gi 93141047    2679 -NITTDGYEILGKLLKGeRKSAAFQIQSFDIVCSP 2712
Cdd:smart00210  151 pPIDTDGIEVRGAQAAD-RKPFQGDLQQLKIVCDP 184
fn3 pfam00041
Fibronectin type III domain;
1756-1834 2.51e-15

Fibronectin type III domain;


:

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 73.22  E-value: 2.51e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1756 PRNLQVYNATSNSLTVKWDPA---SGRVQKYRITYQPSTGEGNEQTTTIGGRQNSVVLQKLKPDTPYTITVSSLYPDGEG 1832
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWTPPpdgNGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82

                   ..
gi 93141047   1833 GR 1834
Cdd:pfam00041   83 PP 84
fn3 pfam00041
Fibronectin type III domain;
1387-1465 3.30e-15

Fibronectin type III domain;


:

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 73.22  E-value: 3.30e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1387 APSNLVISERTHRSFRVSWTPPSDS---VDRYKVEYYPVSGGKR-QEFYVSRMETSTVLKDLKPETEYVVNVYSVVEDEY 1462
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPPPDGngpITGYEVEYRPKNSGEPwNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGE 81

                   ...
gi 93141047   1463 SEP 1465
Cdd:pfam00041   82 GPP 84
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
2118-2197 2.89e-14

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 70.60  E-value: 2.89e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2118 PPQNIHISDEWYTRFRVSWDPSP---SPVLGYKIVYKPVGSNEPMEA--FVGEMTSYTLHNLNPSTTYDVNVYAQYDSGL 2192
Cdd:cd00063    3 PPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVevTPGSETSYTLTGLKPGTEYEFRVRAVNGGGE 82

                 ....*
gi 93141047 2193 SVPLT 2197
Cdd:cd00063   83 SPPSE 87
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
724-805 7.59e-14

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 69.45  E-value: 7.59e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  724 GAPRNLKVTDETTDSFKITWTQAP---GRVLRYRIIYRPVAGGESREVTTPPNQRRR-TLENLIPDTKYEVSVIPEYFSG 799
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVTPGSETSyTLTGLKPGTEYEFRVRAVNGGG 81

                 ....*.
gi 93141047  800 PGTPLT 805
Cdd:cd00063   82 ESPPSE 87
FN3 super family cl27307
Fibronectin type 3 domain [General function prediction only];
1419-1986 1.17e-13

Fibronectin type 3 domain [General function prediction only];


The actual alignment was detected with superfamily member COG3401:

Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 76.96  E-value: 1.17e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1419 YYPVSGGKRQEFYVSRMETSTVLKDLKPETEYVVNVYSVVEDEYSEPLKGTEKTLPVPVVSLNIYDVGPTTMHVQWQPVG 1498
Cdd:COG3401    5 YLTSLDAGIAASAAANTAVNALSKAGGSGKTILVYLAVVLSVTTKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAV 84
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1499 GATGYILSYKPVKDTEPTRPKEVRLGPTVNDMQLTDLVPNTEYAVTVqavlhdltsepVTVREVTLPLPRPQDLKLRDVT 1578
Cdd:COG3401   85 AAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATTATAV-----------AGGAATAGTYALGAGLYGVDGA 153
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1579 HSTMNVFWEPVPGKVRKYIVRYKTPEEDVKEVEVDRSETSTSLKDLFSQTLYTVSVSAVHDEGESPP---VTAQETTRPV 1655
Cdd:COG3401  154 NASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTYYYRVAATDTGGESAPsneVSVTTPTTPP 233
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1656 PAPTNLKITEVTSEGFRGTWD-HGASDVSLYRITWAPfGSSDKMETILNGDENTLVFENLNPNTIYEVSITAIYPDESES 1734
Cdd:COG3401  234 SAPTGLTATADTPGSVTLSWDpVTESDATGYRVYRSN-SGDGPFTKVATVTTTSYTDTGLTNGTTYYYRVTAVDAAGNES 312
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1735 DDligSErTLPILTTQAPKSGPRNLQVYNATSNSLTVKWDPASG-RVQKYRItYQPSTGEGNEQTTTIGGRQNSVVLQKL 1813
Cdd:COG3401  313 AP---SN-VVSVTTDLTPPAAPSGLTATAVGSSSITLSWTASSDaDVTGYNV-YRSTSGGGTYTKIAETVTTTSYTDTGL 387
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1814 KPDTPYTITVSSLYPDGEGGRMTGRGKTKPLNTVRNLRVYDPSTSTLNVRWDHAEGNPRQYKLFYAPAAGGPEELVPIPG 1893
Cdd:COG3401  388 TPGTTYYYKVTAVDAAGNESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTGNA 467
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1894 NTNYAILRNLQPDTSYTVTVVPVYTEGDGGRTSDTGR-----TLMRGLARNVQVYNPTPNSLDVRWDPAPGPVLQYRVVY 1968
Cdd:COG3401  468 VPFTTTSSTVTATTTDTTTANLSVTTGSLVGGSGASSvtnsvSVIGASAAAAVGGAPDGTPNVTGASPVTVGASTGDVLI 547
                        570
                 ....*....|....*...
gi 93141047 1969 SPVDGTRPSESIVVPGNT 1986
Cdd:COG3401  548 TDLVSLTTSASSSVSGAG 565
fn3 pfam00041
Fibronectin type III domain;
336-415 2.24e-12

Fibronectin type III domain;


:

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 65.13  E-value: 2.24e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    336 PPSNLIAMEVSSKYVKLNWNPSP---SPVTGYKVILTPMTAGSRQHALSVGPQTTTLSVRDLSADTEYQISVSAMKGMTS 412
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGE 81

                   ...
gi 93141047    413 SEP 415
Cdd:pfam00041   82 GPP 84
fn3 pfam00041
Fibronectin type III domain;
2028-2105 5.72e-12

Fibronectin type III domain;


:

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 63.97  E-value: 5.72e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   2028 PRNLRVFGETTNSLSVAW---DHADGPVQQYRIIYSPTVGDPIDEYTTVPGRRNNVILQPLQPDTPYKITVIAVYEDGDG 2104
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWtppPDGNGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82

                   .
gi 93141047   2105 G 2105
Cdd:pfam00041   83 P 83
fn3 pfam00041
Fibronectin type III domain;
817-892 7.90e-12

Fibronectin type III domain;


:

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 63.59  E-value: 7.90e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    817 PRDLRVSDPTTSTMKLSWSGAP---GKVKQYLVTYTPV-AGGETQEVTVRGDTTNTVLQGLKEGTQYALSVTALYASGAG 892
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKnSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1090-1176 3.65e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 61.74  E-value: 3.65e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1090 PRNLKTSDPTMSSFRVTWEPAP---GEVKGYKVTFHPTGDDRRLGELVVGPYDNTVVLEELRAGTTYKVNVFGMFDGGES 1166
Cdd:cd00063    4 PTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGES 83
                         90
                 ....*....|
gi 93141047 1167 SPLVGQEMTT 1176
Cdd:cd00063   84 PPSESVTVTT 93
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1002-1075 1.08e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 60.59  E-value: 1.08e-10
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 93141047 1002 LKVDEETENTMRVTWKPAP---GKVVNYRVVYRPHGRGK-QMVAKVPPTVTSTVLKRLQPQTTYDITVLPIYKMGEGK 1075
Cdd:cd00063    7 LRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDwKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGESP 84
fn3 pfam00041
Fibronectin type III domain;
26-105 1.13e-10

Fibronectin type III domain;


:

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 60.12  E-value: 1.13e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     26 DPPSDLNFKIIDENTVHMSWAKPVD---PIVGYRITVDPTTDG-PTKEFTLSASTTETLLSELVPETEYVVTITSYDEVE 101
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPDgngPITGYEVEYRPKNSGePWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ....
gi 93141047    102 ESVP 105
Cdd:pfam00041   81 EGPP 84
fn3 pfam00041
Fibronectin type III domain;
906-986 1.48e-08

Fibronectin type III domain;


:

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 53.96  E-value: 1.48e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    906 GSPQDLVTKDITDTSIGAYWTSAP---GMVRGYRVSWKSLYDDVDTGEKNLPEDAIHTMIENLQPETKYRISVFATYSSG 982
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ....
gi 93141047    983 EGEP 986
Cdd:pfam00041   81 EGPP 84
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2764-2897 2.07e-08

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 59.53  E-value: 2.07e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  2764 RGERGISGAIGPPGPRGDIGPPGPQGPPGPQGPNGLSIP-GEQGRQGMKGDAGEPGLPGRTGTPGLPGPPGPMGPPGDRG 2842
Cdd:NF038329  137 RGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPaGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDG 216
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 93141047  2843 FTGKDGAMGPRGPPGPPGspgspgvTGPSGKPGKPGDHGRPGPSGLKGEKGDRGD 2897
Cdd:NF038329  217 EAGPAGEDGPAGPAGDGQ-------QGPDGDPGPTGEDGPQGPDGPAGKDGPRGD 264
fn3 pfam00041
Fibronectin type III domain;
2207-2282 1.43e-06

Fibronectin type III domain;


:

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 48.57  E-value: 1.43e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   2207 VTDLKTYQIGWDTFCVKWSP----HRAATSYRLKLSPADGTRG-QEITVRGSETSHCFTGLSPDTDYGVTVFVQTPNLEG 2281
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWTPppdgNGPITGYEVEYRPKNSGEPwNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82

                   .
gi 93141047   2282 P 2282
Cdd:pfam00041   83 P 83
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
634-719 1.17e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 43.25  E-value: 1.17e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  634 PPKDLSFSEVTSYGFKTNWSPA---GENVFSYHITYKEAAGDDEVTVVEPASSTSVVLSSL-KPETLYLVNVTAEYEDGF 709
Cdd:cd00063    3 PPTNLRVTDVTSTSVTLSWTPPeddGGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGlKPGTEYEFRVRAVNGGGE 82
                         90
                 ....*....|
gi 93141047  710 SIPLAGEETT 719
Cdd:cd00063   83 SPPSESVTVT 92
 
Name Accession Description Interval E-value
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
1198-1361 6.39e-92

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 295.35  E-value: 6.39e-92
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1198 ADIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTL 1277
Cdd:cd01482    1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAFEIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNTR 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1278 TGMALNFIRQQNFRTQAGMRPRARKIGVLITDGKSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDTHAYN 1357
Cdd:cd01482   81 TGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVFN 160

                 ....
gi 93141047 1358 VADF 1361
Cdd:cd01482  161 VADF 164
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
139-302 2.06e-86

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 279.56  E-value: 2.06e-86
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  139 TDLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTM 218
Cdd:cd01482    1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAFEIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNTR 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  219 TGDAIDYLVKNTFTESAGARVGFPKVAIIITDGKSQDEVEIPARELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFN 298
Cdd:cd01482   81 TGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVFN 160

                 ....
gi 93141047  299 VANF 302
Cdd:cd01482  161 VADF 164
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
439-602 2.40e-86

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 279.56  E-value: 2.40e-86
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  439 ADIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTN 518
Cdd:cd01482    1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAFEIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNTR 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  519 TGKAMTYVREKIFVPSKGSRSNVPKVMILITDGKSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSELEAIASPPAETHVFT 598
Cdd:cd01482   81 TGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVFN 160

                 ....
gi 93141047  599 VEDF 602
Cdd:cd01482  161 VADF 164
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
2322-2486 3.07e-83

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 270.70  E-value: 3.07e-83
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2322 ADIVFLTDASWSIGDDNFNKVVKFIFNTVGGFdEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYRGGNT 2401
Cdd:cd01482    1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAF-EIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNT 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2402 RTGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQDEVKKAALVIQQSGFSVFVVGVADVDYNELANIASKPSERHVF 2481
Cdd:cd01482   80 RTGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVF 159

                 ....*
gi 93141047 2482 IVDDF 2486
Cdd:cd01482  160 NVADF 164
VWA pfam00092
von Willebrand factor type A domain;
140-311 3.44e-66

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 222.15  E-value: 3.44e-66
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    140 DLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTM- 218
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLDIGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTTn 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    219 TGDAIDYLVKNTFTESAGARVGFPKVAIIITDGKSQD-EVEIPARELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVF 297
Cdd:pfam00092   81 TGKALKYALENLFSSAAGARPGAPKVVVLLTDGRSQDgDPEEVARELKSAGVTVFAVGVGNADDEELRKIASEPGEGHVF 160
                          170
                   ....*....|....
gi 93141047    298 NVANFDAIVDIQNE 311
Cdd:pfam00092  161 TVSDFEALEDLQDQ 174
VWA pfam00092
von Willebrand factor type A domain;
1199-1370 1.16e-65

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 220.61  E-value: 1.16e-65
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1199 DIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTL- 1277
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLDIGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTTn 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1278 TGMALNFIRQQNFRTQAGMRPRARKIGVLITDGKSQD-DVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDTHAY 1356
Cdd:pfam00092   81 TGKALKYALENLFSSAAGARPGAPKVVVLLTDGRSQDgDPEEVARELKSAGVTVFAVGVGNADDEELRKIASEPGEGHVF 160
                          170
                   ....*....|....
gi 93141047   1357 NVADFESLSRIVDD 1370
Cdd:pfam00092  161 TVSDFEALEDLQDQ 174
VWA pfam00092
von Willebrand factor type A domain;
440-608 1.83e-61

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 208.67  E-value: 1.83e-61
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    440 DIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGST-N 518
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLDIGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTtN 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    519 TGKAMTYVREKIFVPSKGSRSNVPKVMILITDGKSSD-AFRDPAIKLRNSDVEIFAVGVKDAVRSELEAIASPPAETHVF 597
Cdd:pfam00092   81 TGKALKYALENLFSSAAGARPGAPKVVVLLTDGRSQDgDPEEVARELKSAGVTVFAVGVGNADDEELRKIASEPGEGHVF 160
                          170
                   ....*....|.
gi 93141047    598 TVEDFDAFQRI 608
Cdd:pfam00092  161 TVSDFEALEDL 171
VWA pfam00092
von Willebrand factor type A domain;
2323-2495 7.01e-61

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 207.13  E-value: 7.01e-61
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   2323 DIVFLTDASWSIGDDNFNKVVKFIFNTVGGFDeISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYRGGNTR 2402
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLD-IGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   2403 -TGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQD-EVKKAALVIQQSGFSVFVVGVADVDYNELANIASKPSERHV 2480
Cdd:pfam00092   80 nTGKALKYALENLFSSAAGARPGAPKVVVLLTDGRSQDgDPEEVARELKSAGVTVFAVGVGNADDEELRKIASEPGEGHV 159
                          170
                   ....*....|....*
gi 93141047   2481 FIVDDFESFEKIEDN 2495
Cdd:pfam00092  160 FTVSDFEALEDLQDQ 174
TSPN smart00210
Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of ...
2520-2712 3.13e-59

Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of thrombospondin


Pssm-ID: 214560  Cd Length: 184  Bit Score: 202.59  E-value: 3.13e-59
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    2520 GFKMLEAYNLTEKNFASVQGVSLEsgsfPSYSAYRIQKNAFVNQPTADLHPNGLPPSYTIILLFRLlpeTPSDPFAIWQI 2599
Cdd:smart00210    1 GQDLLQVFDLPSLSFAIRQVVGPE----PGSPAYRLGDPALVPQPTRDLFPSGLPEDFSLLTTFRQ---TPKSRGVLFAI 73
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    2600 TDRDYKPQVGVIADPSSKTLSFFNKDTRGEVQTVTFdteEVKTLFYGSFHKVHIVVTSKSVKIYIDCYEIIEKDIKEAG- 2678
Cdd:smart00210   74 YDAQNVRQFGLEVDGRANTLLLRYQGVDGKQHTVSF---RNLPLADGQWHKLALSVSGSSATLYVDCNEIDSRPLDRPGq 150
                           170       180       190
                    ....*....|....*....|....*....|....*
gi 93141047    2679 -NITTDGYEILGKLLKGeRKSAAFQIQSFDIVCSP 2712
Cdd:smart00210  151 pPIDTDGIEVRGAQAAD-RKPFQGDLQQLKIVCDP 184
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
140-309 1.38e-53

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 186.12  E-value: 1.38e-53
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     140 DLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYK-GGNTM 218
Cdd:smart00327    1 DVVFLLDGSGSMGGNRFELAKEFVLKLVEQLDIGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYKlGGGTN 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     219 TGDAIDYLVKNTFTESAGARVGFPKVAIIITDGKSQD---EVEIPARELRNVGVEVFSLGIK-AADAKELKQIASTPSLN 294
Cdd:smart00327   81 LGAALQYALENLFSKSAGSRRGAPKVVILITDGESNDgpkDLLKAAKELKRSGVKVFVVGVGnDVDEEELKKLASAPGGV 160
                           170
                    ....*....|....*
gi 93141047     295 HVFNVANFDAIVDIQ 309
Cdd:smart00327  161 YVFLPELLDLLIDLL 175
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
440-609 5.63e-52

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 181.50  E-value: 5.63e-52
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     440 DIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYR-GGSTN 518
Cdd:smart00327    1 DVVFLLDGSGSMGGNRFELAKEFVLKLVEQLDIGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYKlGGGTN 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     519 TGKAMTYVREKIFVPSKGSRSNVPKVMILITDGKSSDA---FRDPAIKLRNSDVEIFAVGVKDAV-RSELEAIASPPAET 594
Cdd:smart00327   81 LGAALQYALENLFSKSAGSRRGAPKVVILITDGESNDGpkdLLKAAKELKRSGVKVFVVGVGNDVdEEELKKLASAPGGV 160
                           170
                    ....*....|....*
gi 93141047     595 HVFTVEDFDAFQRIS 609
Cdd:smart00327  161 YVFLPELLDLLIDLL 175
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
1199-1368 2.86e-49

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 173.79  E-value: 2.86e-49
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    1199 DIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYK-GGNTL 1277
Cdd:smart00327    1 DVVFLLDGSGSMGGNRFELAKEFVLKLVEQLDIGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYKlGGGTN 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    1278 TGMALNFIRQQNFRTQAGMRPRARKIGVLITDGKSQD---DVEAPSKKLKDEGVELFAIGIKNA-DEVELKMIATDPDDT 1353
Cdd:smart00327   81 LGAALQYALENLFSKSAGSRRGAPKVVILITDGESNDgpkDLLKAAKELKRSGVKVFVVGVGNDvDEEELKKLASAPGGV 160
                           170
                    ....*....|....*
gi 93141047    1354 HAYNVADFESLSRIV 1368
Cdd:smart00327  161 YVFLPELLDLLIDLL 175
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
2323-2492 3.07e-45

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 162.24  E-value: 3.07e-45
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    2323 DIVFLTDASWSIGDDNFNKVVKFIFNTVGGFDeISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYR-GGNT 2401
Cdd:smart00327    1 DVVFLLDGSGSMGGNRFELAKEFVLKLVEQLD-IGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYKlGGGT 79
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    2402 RTGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQD---EVKKAALVIQQSGFSVFVVGV-ADVDYNELANIASKPSE 2477
Cdd:smart00327   80 NLGAALQYALENLFSKSAGSRRGAPKVVILITDGESNDgpkDLLKAAKELKRSGVKVFVVGVgNDVDEEELKKLASAPGG 159
                           170
                    ....*....|....*
gi 93141047    2478 RHVFIVDDFESFEKI 2492
Cdd:smart00327  160 VYVFLPELLDLLIDL 174
fn3 pfam00041
Fibronectin type III domain;
1756-1834 2.51e-15

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 73.22  E-value: 2.51e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1756 PRNLQVYNATSNSLTVKWDPA---SGRVQKYRITYQPSTGEGNEQTTTIGGRQNSVVLQKLKPDTPYTITVSSLYPDGEG 1832
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWTPPpdgNGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82

                   ..
gi 93141047   1833 GR 1834
Cdd:pfam00041   83 PP 84
fn3 pfam00041
Fibronectin type III domain;
1387-1465 3.30e-15

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 73.22  E-value: 3.30e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1387 APSNLVISERTHRSFRVSWTPPSDS---VDRYKVEYYPVSGGKR-QEFYVSRMETSTVLKDLKPETEYVVNVYSVVEDEY 1462
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPPPDGngpITGYEVEYRPKNSGEPwNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGE 81

                   ...
gi 93141047   1463 SEP 1465
Cdd:pfam00041   82 GPP 84
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
2118-2197 2.89e-14

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 70.60  E-value: 2.89e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2118 PPQNIHISDEWYTRFRVSWDPSP---SPVLGYKIVYKPVGSNEPMEA--FVGEMTSYTLHNLNPSTTYDVNVYAQYDSGL 2192
Cdd:cd00063    3 PPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVevTPGSETSYTLTGLKPGTEYEFRVRAVNGGGE 82

                 ....*
gi 93141047 2193 SVPLT 2197
Cdd:cd00063   83 SPPSE 87
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
439-608 6.18e-14

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 74.59  E-value: 6.18e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  439 ADIVFLVDGSYSIGIAN-FVKVRAFLEVLVKSFeisPNRVQISLVQYSRDPhteFTLKKFTK-VEDIIEAINTFPYRGGs 516
Cdd:COG1240   93 RDVVLVVDASGSMAAENrLEAAKGALLDFLDDY---RPRDRVGLVAFGGEA---EVLLPLTRdREALKRALDELPPGGG- 165
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  517 TNTGKAMTYVREKIfvpsKGSRSNVPKVMILITDGKSSDAFRDP---AIKLRNSDVEIFAVGVKDAVRSE--LEAIAspp 591
Cdd:COG1240  166 TPLGDALALALELL----KRADPARRKVIVLLTDGRDNAGRIDPleaAELAAAAGIRIYTIGVGTEAVDEglLREIA--- 238
                        170
                 ....*....|....*....
gi 93141047  592 AET--HVFTVEDFDAFQRI 608
Cdd:COG1240  239 EATggRYFRADDLSELAAI 257
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
724-805 7.59e-14

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 69.45  E-value: 7.59e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  724 GAPRNLKVTDETTDSFKITWTQAP---GRVLRYRIIYRPVAGGESREVTTPPNQRRR-TLENLIPDTKYEVSVIPEYFSG 799
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVTPGSETSyTLTGLKPGTEYEFRVRAVNGGG 81

                 ....*.
gi 93141047  800 PGTPLT 805
Cdd:cd00063   82 ESPPSE 87
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
1195-1371 1.00e-13

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 73.82  E-value: 1.00e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1195 RAEADIVLLVDGSWSIGRAN-FRTVRSFISRIVEVFdigPKRVQIALAQYSGDPrtEWQLNAHRDKKSLLQAVANLPYKG 1273
Cdd:COG1240   90 QRGRDVVLVVDASGSMAAENrLEAAKGALLDFLDDY---RPRDRVGLVAFGGEA--EVLLPLTRDREALKRALDELPPGG 164
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1274 GNTLT---GMALNFIRQQNfrtqagmrPRARKIGVLITDGK---SQDDVEAPSKKLKDEGVELFAIGI--KNADEVELKM 1345
Cdd:COG1240  165 GTPLGdalALALELLKRAD--------PARRKVIVLLTDGRdnaGRIDPLEAAELAAAAGIRIYTIGVgtEAVDEGLLRE 236
                        170       180
                 ....*....|....*....|....*...
gi 93141047 1346 IAtdpDDTHA--YNVADFESLSRIVDDL 1371
Cdd:COG1240  237 IA---EATGGryFRADDLSELAAIYREI 261
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1419-1986 1.17e-13

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 76.96  E-value: 1.17e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1419 YYPVSGGKRQEFYVSRMETSTVLKDLKPETEYVVNVYSVVEDEYSEPLKGTEKTLPVPVVSLNIYDVGPTTMHVQWQPVG 1498
Cdd:COG3401    5 YLTSLDAGIAASAAANTAVNALSKAGGSGKTILVYLAVVLSVTTKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAV 84
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1499 GATGYILSYKPVKDTEPTRPKEVRLGPTVNDMQLTDLVPNTEYAVTVqavlhdltsepVTVREVTLPLPRPQDLKLRDVT 1578
Cdd:COG3401   85 AAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATTATAV-----------AGGAATAGTYALGAGLYGVDGA 153
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1579 HSTMNVFWEPVPGKVRKYIVRYKTPEEDVKEVEVDRSETSTSLKDLFSQTLYTVSVSAVHDEGESPP---VTAQETTRPV 1655
Cdd:COG3401  154 NASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTYYYRVAATDTGGESAPsneVSVTTPTTPP 233
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1656 PAPTNLKITEVTSEGFRGTWD-HGASDVSLYRITWAPfGSSDKMETILNGDENTLVFENLNPNTIYEVSITAIYPDESES 1734
Cdd:COG3401  234 SAPTGLTATADTPGSVTLSWDpVTESDATGYRVYRSN-SGDGPFTKVATVTTTSYTDTGLTNGTTYYYRVTAVDAAGNES 312
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1735 DDligSErTLPILTTQAPKSGPRNLQVYNATSNSLTVKWDPASG-RVQKYRItYQPSTGEGNEQTTTIGGRQNSVVLQKL 1813
Cdd:COG3401  313 AP---SN-VVSVTTDLTPPAAPSGLTATAVGSSSITLSWTASSDaDVTGYNV-YRSTSGGGTYTKIAETVTTTSYTDTGL 387
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1814 KPDTPYTITVSSLYPDGEGGRMTGRGKTKPLNTVRNLRVYDPSTSTLNVRWDHAEGNPRQYKLFYAPAAGGPEELVPIPG 1893
Cdd:COG3401  388 TPGTTYYYKVTAVDAAGNESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTGNA 467
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1894 NTNYAILRNLQPDTSYTVTVVPVYTEGDGGRTSDTGR-----TLMRGLARNVQVYNPTPNSLDVRWDPAPGPVLQYRVVY 1968
Cdd:COG3401  468 VPFTTTSSTVTATTTDTTTANLSVTTGSLVGGSGASSvtnsvSVIGASAAAAVGGAPDGTPNVTGASPVTVGASTGDVLI 547
                        570
                 ....*....|....*...
gi 93141047 1969 SPVDGTRPSESIVVPGNT 1986
Cdd:COG3401  548 TDLVSLTTSASSSVSGAG 565
fn3 pfam00041
Fibronectin type III domain;
1938-2016 1.23e-13

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 68.60  E-value: 1.23e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1938 RNVQVYNPTPNSLDVRWDPAP---GPVLQYRVVYSPVDGTRPSESIVVPGNTRMVHLERLIPDTLYSVNLVALYSDGEGN 2014
Cdd:pfam00041    4 SNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEGP 83

                   ..
gi 93141047   2015 PS 2016
Cdd:pfam00041   84 PS 85
fn3 pfam00041
Fibronectin type III domain;
724-793 3.59e-13

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 67.44  E-value: 3.59e-13
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 93141047    724 GAPRNLKVTDETTDSFKITWT---QAPGRVLRYRIIYRPV-AGGESREVTTPPNQRRRTLENLIPDTKYEVSVI 793
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTpppDGNGPITGYEVEYRPKnSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQ 74
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1386-1470 4.47e-13

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 67.14  E-value: 4.47e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1386 EAPSNLVISERTHRSFRVSWTPPSDS---VDRYKVEYYPVSGGKRQEFYVSRM-ETSTVLKDLKPETEYVVNVYSVVEDE 1461
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTPPEDDggpITGYVVEYREKGSGDWKEVEVTPGsETSYTLTGLKPGTEYEFRVRAVNGGG 81

                 ....*....
gi 93141047 1462 YSEPLKGTE 1470
Cdd:cd00063   82 ESPPSESVT 90
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1754-1832 6.28e-13

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 66.75  E-value: 6.28e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1754 SGPRNLQVYNATSNSLTVKWDPAS---GRVQKYRITYQPSTGEGNEQTTTIGGRQNSVVLQKLKPDTPYTITVSSLYPDG 1830
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81

                 ..
gi 93141047 1831 EG 1832
Cdd:cd00063   82 ES 83
fn3 pfam00041
Fibronectin type III domain;
2118-2195 1.36e-12

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 65.51  E-value: 1.36e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   2118 PPQNIHISDEWYTRFRVSWDPSP---SPVLGYKIVYKPVGSNEPMEAF--VGEMTSYTLHNLNPSTTYDVNVYAQYDSGL 2192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWNEItvPGTTTSVTLTGLKPGTEYEVRVQAVNGGGE 81

                   ...
gi 93141047   2193 SVP 2195
Cdd:pfam00041   82 GPP 84
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
139-312 1.41e-12

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 70.74  E-value: 1.41e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  139 TDLVFLVDGSWSVGRNN----FKY-ILDFIAALVsafdigeEKTRVGVVQYSSDTRTEFNLNqyYQRDELLAAIKKIPYK 213
Cdd:COG1240   93 RDVVLVVDASGSMAAENrleaAKGaLLDFLDDYR-------PRDRVGLVAFGGEAEVLLPLT--RDREALKRALDELPPG 163
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  214 GGnTMTGDAIdYLVKNTFTESAGARvgfPKVAIIITDGK---SQDEVEIPARELRNVGVEVFSLGI--KAADAKELKQIA 288
Cdd:COG1240  164 GG-TPLGDAL-ALALELLKRADPAR---RKVIVLLTDGRdnaGRIDPLEAAELAAAAGIRIYTIGVgtEAVDEGLLREIA 238
                        170       180
                 ....*....|....*....|....*
gi 93141047  289 STpsLN-HVFNVANFDAIVDIQNEI 312
Cdd:COG1240  239 EA--TGgRYFRADDLSELAAIYREI 261
fn3 pfam00041
Fibronectin type III domain;
336-415 2.24e-12

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 65.13  E-value: 2.24e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    336 PPSNLIAMEVSSKYVKLNWNPSP---SPVTGYKVILTPMTAGSRQHALSVGPQTTTLSVRDLSADTEYQISVSAMKGMTS 412
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGE 81

                   ...
gi 93141047    413 SEP 415
Cdd:pfam00041   82 GPP 84
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1569-1652 3.23e-12

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 64.83  E-value: 3.23e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1569 PQDLKLRDVTHSTMNVFWEPVP---GKVRKYIVRYK-TPEEDVKEVEVDR-SETSTSLKDLFSQTLYTVSVSAVHDEGES 1643
Cdd:cd00063    4 PTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYReKGSGDWKEVEVTPgSETSYTLTGLKPGTEYEFRVRAVNGGGES 83

                 ....*....
gi 93141047 1644 PPVTAQETT 1652
Cdd:cd00063   84 PPSESVTVT 92
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1474-1548 3.80e-12

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 64.17  E-value: 3.80e-12
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 93141047    1474 PVPVVSLNIYDVGPTTMHVQWQPV--GGATGYILSYKPVKDTEPTRPKEVRLGPTVNDMQLTDLVPNTEYAVTVQAV 1548
Cdd:smart00060    1 PSPPSNLRVTDVTSTSVTLSWEPPpdDGITGYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAV 77
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
2118-2193 5.30e-12

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 63.79  E-value: 5.30e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    2118 PPQNIHISDEWYTRFRVSWDPSP-----SPVLGYKIVYKPVGSNEPMEAFVGEMTSYTLHNLNPSTTYDVNVYAQYDSGL 2192
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEPPPddgitGYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82

                    .
gi 93141047    2193 S 2193
Cdd:smart00060   83 G 83
fn3 pfam00041
Fibronectin type III domain;
2028-2105 5.72e-12

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 63.97  E-value: 5.72e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   2028 PRNLRVFGETTNSLSVAW---DHADGPVQQYRIIYSPTVGDPIDEYTTVPGRRNNVILQPLQPDTPYKITVIAVYEDGDG 2104
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWtppPDGNGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82

                   .
gi 93141047   2105 G 2105
Cdd:pfam00041   83 P 83
fn3 pfam00041
Fibronectin type III domain;
817-892 7.90e-12

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 63.59  E-value: 7.90e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    817 PRDLRVSDPTTSTMKLSWSGAP---GKVKQYLVTYTPV-AGGETQEVTVRGDTTNTVLQGLKEGTQYALSVTALYASGAG 892
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKnSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
815-893 8.89e-12

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 63.67  E-value: 8.89e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  815 GNPRDLRVSDPTTSTMKLSWSGAP---GKVKQYLVTYTPVAGGETQEVTV-RGDTTNTVLQGLKEGTQYALSVTALYASG 890
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVtPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81

                 ...
gi 93141047  891 AGD 893
Cdd:cd00063   82 ESP 84
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1754-1832 1.14e-11

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 63.02  E-value: 1.14e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    1754 SGPRNLQVYNATSNSLTVKWD-PASGRVQKYRITYQPSTGEGNEQTTTI--GGRQNSVVLQKLKPDTPYTITVSSLYPDG 1830
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEpPPDDGITGYIVGYRVEYREEGSEWKEVnvTPSSTSYTLTGLKPGTEYEFRVRAVNGAG 81

                    ..
gi 93141047    1831 EG 1832
Cdd:smart00060   82 EG 83
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
2026-2104 2.04e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 62.51  E-value: 2.04e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2026 SGPRNLRVFGETTNSLSVAWDHAD---GPVQQYRIIYSPTVGDPIDEYTTVPGRRNNVILQPLQPDTPYKITVIAVYEDG 2102
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81

                 ..
gi 93141047 2103 DG 2104
Cdd:cd00063   82 ES 83
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
724-801 3.22e-11

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 61.48  E-value: 3.22e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     724 GAPRNLKVTDETTDSFKITWTQAP-----GRVLRYRIIYRPVaGGESREVTTPPNQRRRTLENLIPDTKYEVSVIPEYFS 798
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEPPPddgitGYIVGYRVEYREE-GSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 93141047     799 GPG 801
Cdd:smart00060   81 GEG 83
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1090-1176 3.65e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 61.74  E-value: 3.65e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1090 PRNLKTSDPTMSSFRVTWEPAP---GEVKGYKVTFHPTGDDRRLGELVVGPYDNTVVLEELRAGTTYKVNVFGMFDGGES 1166
Cdd:cd00063    4 PTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGES 83
                         90
                 ....*....|
gi 93141047 1167 SPLVGQEMTT 1176
Cdd:cd00063   84 PPSESVTVTT 93
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
335-420 1.05e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 60.59  E-value: 1.05e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  335 EPPSNLIAMEVSSKYVKLNWNPSP---SPVTGYKVILTPMTAGSRQHALSVGPQTTTLSVRDLSADTEYQISVSAMKGMT 411
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81

                 ....*....
gi 93141047  412 SSEPISIME 420
Cdd:cd00063   82 ESPPSESVT 90
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1002-1075 1.08e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 60.59  E-value: 1.08e-10
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 93141047 1002 LKVDEETENTMRVTWKPAP---GKVVNYRVVYRPHGRGK-QMVAKVPPTVTSTVLKRLQPQTTYDITVLPIYKMGEGK 1075
Cdd:cd00063    7 LRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDwKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGESP 84
fn3 pfam00041
Fibronectin type III domain;
26-105 1.13e-10

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 60.12  E-value: 1.13e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     26 DPPSDLNFKIIDENTVHMSWAKPVD---PIVGYRITVDPTTDG-PTKEFTLSASTTETLLSELVPETEYVVTITSYDEVE 101
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPDgngPITGYEVEYRPKNSGePWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ....
gi 93141047    102 ESVP 105
Cdd:pfam00041   81 EGPP 84
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1387-1457 3.19e-10

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 58.78  E-value: 3.19e-10
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 93141047    1387 APSNLVISERTHRSFRVSWTPP-SDSVDRYKVEY---YPVSGGKRQEFYVSRMETSTVLKDLKPETEYVVNVYSV 1457
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEPPpDDGITGYIVGYrveYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAV 77
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
26-106 1.03e-09

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 57.89  E-value: 1.03e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   26 DPPSDLNFKIIDENTVHMSWAKPVD---PIVGYRITVDPTTDGPTKEF-TLSASTTETLLSELVPETEYVVTITSYDEVE 101
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTPPEDdggPITGYVVEYREKGSGDWKEVeVTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81

                 ....*
gi 93141047  102 ESVPV 106
Cdd:cd00063   82 ESPPS 86
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
815-892 1.38e-09

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 56.85  E-value: 1.38e-09
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     815 GNPRDLRVSDPTTSTMKLSWS-GAPGKVKQYLVTYTPV---AGGETQEVTVRGDTTNTVLQGLKEGTQYALSVTALYASG 890
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEpPPDDGITGYIVGYRVEyreEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAG 81

                    ..
gi 93141047     891 AG 892
Cdd:smart00060   82 EG 83
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
2322-2492 1.99e-09

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 61.11  E-value: 1.99e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2322 ADIVFLTDASWSIGDDNFNKVVKfifNTVGGFDEISPAGIQVSFVQYSDEVksEFKLNTYNDKALALGALQNIRYRGGnT 2401
Cdd:COG1240   93 RDVVLVVDASGSMAAENRLEAAK---GALLDFLDDYRPRDRVGLVAFGGEA--EVLLPLTRDREALKRALDELPPGGG-T 166
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2402 RTGKALtfikEKVLTWESGMRKNVPKVLVVVTDGR---SQDEVKKAALVIQQSGFSVFVVGVAD--VDYNELANIASKpS 2476
Cdd:COG1240  167 PLGDAL----ALALELLKRADPARRKVIVLLTDGRdnaGRIDPLEAAELAAAAGIRIYTIGVGTeaVDEGLLREIAEA-T 241
                        170
                 ....*....|....*.
gi 93141047 2477 ERHVFIVDDFESFEKI 2492
Cdd:COG1240  242 GGRYFRADDLSELAAI 257
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1002-1074 2.55e-09

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 56.08  E-value: 2.55e-09
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 93141047    1002 LKVDEETENTMRVTWKPAP-----GKVVNYRVVYRPHGRGKQMVaKVPPTVTSTVLKRLQPQTTYDITVLPIYKMGEG 1074
Cdd:smart00060    7 LRVTDVTSTSVTLSWEPPPddgitGYIVGYRVEYREEGSEWKEV-NVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGEG 83
fn3 pfam00041
Fibronectin type III domain;
1089-1168 6.02e-09

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 55.11  E-value: 6.02e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1089 SPRNLKTSDPTMSSFRVTWEPAP---GEVKGYKVTFHPTGDDRRLGELVVGPYDNTVVLEELRAGTTYKVNVFGMFDGGE 1165
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGE 81

                   ...
gi 93141047   1166 SSP 1168
Cdd:pfam00041   82 GPP 84
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
2026-2104 6.85e-09

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 54.93  E-value: 6.85e-09
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    2026 SGPRNLRVFGETTNSLSVAWDHADGPVQQ-YRIIYSPTVGDPIDEYTTV--PGRRNNVILQPLQPDTPYKITVIAVYEDG 2102
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEPPPDDGITgYIVGYRVEYREEGSEWKEVnvTPSSTSYTLTGLKPGTEYEFRVRAVNGAG 81

                    ..
gi 93141047    2103 DG 2104
Cdd:smart00060   82 EG 83
PTZ00441 PTZ00441
sporozoite surface protein 2 (SSP2); Provisional
1197-1402 1.47e-08

sporozoite surface protein 2 (SSP2); Provisional


Pssm-ID: 240420 [Multi-domain]  Cd Length: 576  Bit Score: 60.36  E-value: 1.47e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  1197 EADIVLLVDGSWSIGRANFRT-VRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQL--NAHRDKKSLLQAV-----AN 1268
Cdd:PTZ00441   42 EVDLYLLVDGSGSIGYHNWIThVIPMLMGLIQQLNLSDDAINLYMSLFSNNTTELIRLgsGASKDKEQALIIVkslrkTY 121
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  1269 LPYkgGNTLTGMALNFIRQQ-NFRTQagmRPRARKIGVLITDG--KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELK- 1344
Cdd:PTZ00441  122 LPY--GKTNMTDALLEVRKHlNDRVN---RENAIQLVILMTDGipNSKYRALEESRKLKDRNVKLAVIGIGQGINHQFNr 196
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 93141047  1345 -MIATDPDDTHA--YNVADFESLSRIVDDLTINLCNSVK-----GPGDLEAPSNLVISERTHRSFR 1402
Cdd:PTZ00441  197 lLAGCRPREGKCkfYSDADWEEAKNLIKPFIAKVCTEVErtascGPWDEWTPCSVTCGKGTHSRSR 262
fn3 pfam00041
Fibronectin type III domain;
906-986 1.48e-08

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 53.96  E-value: 1.48e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    906 GSPQDLVTKDITDTSIGAYWTSAP---GMVRGYRVSWKSLYDDVDTGEKNLPEDAIHTMIENLQPETKYRISVFATYSSG 982
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ....
gi 93141047    983 EGEP 986
Cdd:pfam00041   81 EGPP 84
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2764-2897 2.07e-08

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 59.53  E-value: 2.07e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  2764 RGERGISGAIGPPGPRGDIGPPGPQGPPGPQGPNGLSIP-GEQGRQGMKGDAGEPGLPGRTGTPGLPGPPGPMGPPGDRG 2842
Cdd:NF038329  137 RGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPaGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDG 216
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 93141047  2843 FTGKDGAMGPRGPPGPPGspgspgvTGPSGKPGKPGDHGRPGPSGLKGEKGDRGD 2897
Cdd:NF038329  217 EAGPAGEDGPAGPAGDGQ-------QGPDGDPGPTGEDGPQGPDGPAGKDGPRGD 264
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
906-988 3.24e-08

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 53.27  E-value: 3.24e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  906 GSPQDLVTKDITDTSIGAYWTSAP---GMVRGYRVSWKSLYDDVDTGEKNLPEDAIHTMIENLQPETKYRISVFATYSSG 982
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81

                 ....*.
gi 93141047  983 EGEPLT 988
Cdd:cd00063   82 ESPPSE 87
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
906-984 3.27e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 53.00  E-value: 3.27e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     906 GSPQDLVTKDITDTSIGAYWT-SAPGMVRGYRVSWKSLYDDVDTGEKNLPEDAIHT--MIENLQPETKYRISVFATYSSG 982
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEpPPDDGITGYIVGYRVEYREEGSEWKEVNVTPSSTsyTLTGLKPGTEYEFRVRAVNGAG 81

                    ..
gi 93141047     983 EG 984
Cdd:smart00060   82 EG 83
fn3 pfam00041
Fibronectin type III domain;
1002-1075 5.01e-08

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 52.80  E-value: 5.01e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 93141047   1002 LKVDEETENTMRVTWKPAP---GKVVNYRVVYRPHGRGKQMVAK-VPPTVTSTVLKRLQPQTTYDITVLPIYKMGEGK 1075
Cdd:pfam00041    6 LTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWNEItVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEGP 83
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2764-2896 5.01e-08

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 58.38  E-value: 5.01e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  2764 RGERGISG---AIGPPGPRGDIGPPGPQGPPGPQGPNGLsiPGEQGRQGMKGDAGEPGLPGRTGTPGLPGPPGPMGPPGD 2840
Cdd:NF038329  119 KGEPGPAGpagPAGEQGPRGDRGETGPAGPAGPPGPQGE--RGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 93141047  2841 RGFTGKDGAMGPRGPPGPPGSPGSPGVTGPSGKPGKP--GDHGRPGPSGLKGEKGDRG 2896
Cdd:NF038329  197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGqqGPDGDPGPTGEDGPQGPDG 254
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
26-103 7.50e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 52.23  E-value: 7.50e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047      26 DPPSDLNFKIIDENTVHMSWAKPVDP-----IVGYRItVDPTTDGPTKEFTLSASTTETLLSELVPETEYVVTITSYDEV 100
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEPPPDDgitgyIVGYRV-EYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 93141047     101 EES 103
Cdd:smart00060   81 GEG 83
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
336-413 1.50e-07

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.08  E-value: 1.50e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     336 PPSNLIAMEVSSKYVKLNWNPSPSPV-TGYKV--ILTPMTAGSRQHALSVGPQTTTLSVRDLSADTEYQISVSAMKGMTS 412
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEPPPDDGiTGYIVgyRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82

                    .
gi 93141047     413 S 413
Cdd:smart00060   83 G 83
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1089-1166 2.92e-07

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 50.31  E-value: 2.92e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    1089 SPRNLKTSDPTMSSFRVTWE-PAPGEVKGYKVTFHPTGDDRRLGELVV--GPYDNTVVLEELRAGTTYKVNVFGMFDGGE 1165
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEpPPDDGITGYIVGYRVEYREEGSEWKEVnvTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82

                    .
gi 93141047    1166 S 1166
Cdd:smart00060   83 G 83
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2764-2896 4.87e-07

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 55.30  E-value: 4.87e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  2764 RGERGISGAIGPPGPRGDIGPPGPQGPPGPQGPNGLsiPGEQGRQGMKGDAGEPGLPGRTGTPGLPGPPGpmgppgdrgf 2843
Cdd:NF038329  176 AGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGP--AGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGP---------- 243
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 93141047  2844 TGKDGAMGPRGPPGPPGSPGSPGVTGPSGKPGKPGDHGRPGPSGLKGEKGDRG 2896
Cdd:NF038329  244 TGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDG 296
fn3 pfam00041
Fibronectin type III domain;
2207-2282 1.43e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 48.57  E-value: 1.43e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   2207 VTDLKTYQIGWDTFCVKWSP----HRAATSYRLKLSPADGTRG-QEITVRGSETSHCFTGLSPDTDYGVTVFVQTPNLEG 2281
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWTPppdgNGPITGYEVEYRPKNSGEPwNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82

                   .
gi 93141047   2282 P 2282
Cdd:pfam00041   83 P 83
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2765-2896 2.96e-06

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 52.60  E-value: 2.96e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  2765 GERGISGAIGPPGPRGDigppgpqgppgpqgpnglsiPGEQGRQGMKGDAGEPGLPGRTGTPGLpgppgpmgppgdrgfT 2844
Cdd:NF038329  117 GEKGEPGPAGPAGPAGE--------------------QGPRGDRGETGPAGPAGPPGPQGERGE---------------K 161
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 93141047  2845 GKDGAMGPRGPPGPPGSPGSPGVTGPSGKPGKPGDHGRPGPSGLKGEKGDRG 2896
Cdd:NF038329  162 GPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAG 213
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
2869-2897 3.38e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.64  E-value: 3.38e-05
                           10        20
                   ....*....|....*....|....*....
gi 93141047   2869 GPSGKPGKPGDHGRPGPSGLKGEKGDRGD 2897
Cdd:pfam01391    4 GPPGPPGPPGPPGPPGPPGPPGPPGPPGE 32
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2765-2896 3.99e-05

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 49.13  E-value: 3.99e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  2765 GERGISGAIGPPGPRGDIGPPGPQGPP--GPQGPNG-LSIPGEQGRQGMKGDAGEPGLPGRTGTPGLPgppgpmgppgdr 2841
Cdd:NF038329  201 GPAGEQGPAGPAGPDGEAGPAGEDGPAgpAGDGQQGpDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEA------------ 268
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 93141047  2842 gftGKDGAMGPRgppgppgspgspgvtGPSGKPGKPGDHGRPGPSGLKGEKGDRG 2896
Cdd:NF038329  269 ---GPDGPDGKD---------------GERGPVGPAGKDGQNGKDGLPGKDGKDG 305
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
634-719 1.17e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 43.25  E-value: 1.17e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  634 PPKDLSFSEVTSYGFKTNWSPA---GENVFSYHITYKEAAGDDEVTVVEPASSTSVVLSSL-KPETLYLVNVTAEYEDGF 709
Cdd:cd00063    3 PPTNLRVTDVTSTSVTLSWTPPeddGGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGlKPGTEYEFRVRAVNGGGE 82
                         90
                 ....*....|
gi 93141047  710 SIPLAGEETT 719
Cdd:cd00063   83 SPPSESVTVT 92
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
2206-2291 1.37e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 43.25  E-value: 1.37e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2206 NVTDLKTYQIGWDTFCVKWSP----HRAATSYRLKLSPADGTRGQEITVRGSETSHC-FTGLSPDTDYGVTVFVQTPNLE 2280
Cdd:cd00063    3 PPTNLRVTDVTSTSVTLSWTPpeddGGPITGYVVEYREKGSGDWKEVEVTPGSETSYtLTGLKPGTEYEFRVRAVNGGGE 82
                         90
                 ....*....|.
gi 93141047 2281 GPGVSVKEHTT 2291
Cdd:cd00063   83 SPPSESVTVTT 93
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
2206-2281 5.90e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.06  E-value: 5.90e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    2206 NVTDLKTYQIGWDTFCVKWSP--HRAATSYRLKLSPADGTRG---QEITVRGSETSHCFTGLSPDTDYGVTVFVQTPNLE 2280
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEPppDDGITGYIVGYRVEYREEGsewKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82

                    .
gi 93141047    2281 G 2281
Cdd:smart00060   83 G 83
fn3 pfam00041
Fibronectin type III domain;
634-712 6.85e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 40.86  E-value: 6.85e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    634 PPKDLSFSEVTSYGFKTNWSPA---GENVFSYHITYKEA-AGDDEVTVVEPASSTSVVLSSLKPETLYLVNVTAEYEDGF 709
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPPpdgNGPITGYEVEYRPKnSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGE 81

                   ...
gi 93141047    710 SIP 712
Cdd:pfam00041   82 GPP 84
PTZ00441 PTZ00441
sporozoite surface protein 2 (SSP2); Provisional
438-575 1.38e-03

sporozoite surface protein 2 (SSP2); Provisional


Pssm-ID: 240420 [Multi-domain]  Cd Length: 576  Bit Score: 44.18  E-value: 1.38e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   438 KADIVFLVDGSYSIGIANFV-KVRAFLEVLVKSFEISPNRVQISLVQYSRDP------HTEFTLKKFTKVEDIIEAINTF 510
Cdd:PTZ00441   42 EVDLYLLVDGSGSIGYHNWItHVIPMLMGLIQQLNLSDDAINLYMSLFSNNTtelirlGSGASKDKEQALIIVKSLRKTY 121
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   511 -PYrgGSTNTGKAMTYVREKIfvPSKGSRSNVPKVMILITDGKSSDAFR----DPAIKLRNSDVEIFAVG 575
Cdd:PTZ00441  122 lPY--GKTNMTDALLEVRKHL--NDRVNRENAIQLVILMTDGIPNSKYRaleeSRKLKDRNVKLAVIGIG 187
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
634-710 2.59e-03

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 39.13  E-value: 2.59e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     634 PPKDLSFSEVTSYGFKTNWS-PAGENVFSYHITYK---EAAGDDEVTVVEPASSTSVVLSSLKPETLYLVNVTAEYEDGF 709
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEpPPDDGITGYIVGYRveyREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82

                    .
gi 93141047     710 S 710
Cdd:smart00060   83 G 83
 
Name Accession Description Interval E-value
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
1198-1361 6.39e-92

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 295.35  E-value: 6.39e-92
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1198 ADIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTL 1277
Cdd:cd01482    1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAFEIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNTR 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1278 TGMALNFIRQQNFRTQAGMRPRARKIGVLITDGKSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDTHAYN 1357
Cdd:cd01482   81 TGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVFN 160

                 ....
gi 93141047 1358 VADF 1361
Cdd:cd01482  161 VADF 164
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
139-302 2.06e-86

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 279.56  E-value: 2.06e-86
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  139 TDLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTM 218
Cdd:cd01482    1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAFEIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNTR 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  219 TGDAIDYLVKNTFTESAGARVGFPKVAIIITDGKSQDEVEIPARELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFN 298
Cdd:cd01482   81 TGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVFN 160

                 ....
gi 93141047  299 VANF 302
Cdd:cd01482  161 VADF 164
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
439-602 2.40e-86

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 279.56  E-value: 2.40e-86
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  439 ADIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTN 518
Cdd:cd01482    1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAFEIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNTR 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  519 TGKAMTYVREKIFVPSKGSRSNVPKVMILITDGKSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSELEAIASPPAETHVFT 598
Cdd:cd01482   81 TGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVFN 160

                 ....
gi 93141047  599 VEDF 602
Cdd:cd01482  161 VADF 164
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
2322-2486 3.07e-83

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 270.70  E-value: 3.07e-83
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2322 ADIVFLTDASWSIGDDNFNKVVKFIFNTVGGFdEISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYRGGNT 2401
Cdd:cd01482    1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAF-EIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGNT 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2402 RTGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQDEVKKAALVIQQSGFSVFVVGVADVDYNELANIASKPSERHVF 2481
Cdd:cd01482   80 RTGKALTHVREKNFTPDAGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHVF 159

                 ....*
gi 93141047 2482 IVDDF 2486
Cdd:cd01482  160 NVADF 164
vWA_collagen cd01472
von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This ...
1198-1361 5.24e-78

von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.


Pssm-ID: 238749 [Multi-domain]  Cd Length: 164  Bit Score: 255.62  E-value: 5.24e-78
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1198 ADIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTL 1277
Cdd:cd01472    1 ADIVFLVDGSESIGLSNFNLVKDFVKRVVERLDIGPDGVRVGVVQYSDDPRTEFYLNTYRSKDDVLEAVKNLRYIGGGTN 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1278 TGMALNFIRQQNFRTQAGMRPRARKIGVLITDGKSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDTHAYN 1357
Cdd:cd01472   81 TGKALKYVRENLFTEASGSREGVPKVLVVITDGKSQDDVEEPAVELKQAGIEVFAVGVKNADEEELKQIASDPKELYVFN 160

                 ....
gi 93141047 1358 VADF 1361
Cdd:cd01472  161 VADF 164
vWA_collagen cd01472
von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This ...
139-302 5.10e-77

von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.


Pssm-ID: 238749 [Multi-domain]  Cd Length: 164  Bit Score: 252.92  E-value: 5.10e-77
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  139 TDLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTM 218
Cdd:cd01472    1 ADIVFLVDGSESIGLSNFNLVKDFVKRVVERLDIGPDGVRVGVVQYSDDPRTEFYLNTYRSKDDVLEAVKNLRYIGGGTN 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  219 TGDAIDYLVKNTFTESAGARVGFPKVAIIITDGKSQDEVEIPARELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVFN 298
Cdd:cd01472   81 TGKALKYVRENLFTEASGSREGVPKVLVVITDGKSQDDVEEPAVELKQAGIEVFAVGVKNADEEELKQIASDPKELYVFN 160

                 ....
gi 93141047  299 VANF 302
Cdd:cd01472  161 VADF 164
vWA_collagen cd01472
von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This ...
439-602 2.54e-71

von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.


Pssm-ID: 238749 [Multi-domain]  Cd Length: 164  Bit Score: 236.36  E-value: 2.54e-71
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  439 ADIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTN 518
Cdd:cd01472    1 ADIVFLVDGSESIGLSNFNLVKDFVKRVVERLDIGPDGVRVGVVQYSDDPRTEFYLNTYRSKDDVLEAVKNLRYIGGGTN 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  519 TGKAMTYVREKIFVPSKGSRSNVPKVMILITDGKSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSELEAIASPPAETHVFT 598
Cdd:cd01472   81 TGKALKYVRENLFTEASGSREGVPKVLVVITDGKSQDDVEEPAVELKQAGIEVFAVGVKNADEEELKQIASDPKELYVFN 160

                 ....
gi 93141047  599 VEDF 602
Cdd:cd01472  161 VADF 164
vWA_collagen cd01472
von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This ...
2322-2486 2.49e-70

von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.


Pssm-ID: 238749 [Multi-domain]  Cd Length: 164  Bit Score: 233.66  E-value: 2.49e-70
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2322 ADIVFLTDASWSIGDDNFNKVVKFIFNTVGGFDeISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYRGGNT 2401
Cdd:cd01472    1 ADIVFLVDGSESIGLSNFNLVKDFVKRVVERLD-IGPDGVRVGVVQYSDDPRTEFYLNTYRSKDDVLEAVKNLRYIGGGT 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2402 RTGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQDEVKKAALVIQQSGFSVFVVGVADVDYNELANIASKPSERHVF 2481
Cdd:cd01472   80 NTGKALKYVRENLFTEASGSREGVPKVLVVITDGKSQDDVEEPAVELKQAGIEVFAVGVKNADEEELKQIASDPKELYVF 159

                 ....*
gi 93141047 2482 IVDDF 2486
Cdd:cd01472  160 NVADF 164
VWA pfam00092
von Willebrand factor type A domain;
140-311 3.44e-66

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 222.15  E-value: 3.44e-66
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    140 DLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTM- 218
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLDIGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTTn 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    219 TGDAIDYLVKNTFTESAGARVGFPKVAIIITDGKSQD-EVEIPARELRNVGVEVFSLGIKAADAKELKQIASTPSLNHVF 297
Cdd:pfam00092   81 TGKALKYALENLFSSAAGARPGAPKVVVLLTDGRSQDgDPEEVARELKSAGVTVFAVGVGNADDEELRKIASEPGEGHVF 160
                          170
                   ....*....|....
gi 93141047    298 NVANFDAIVDIQNE 311
Cdd:pfam00092  161 TVSDFEALEDLQDQ 174
VWA pfam00092
von Willebrand factor type A domain;
1199-1370 1.16e-65

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 220.61  E-value: 1.16e-65
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1199 DIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTL- 1277
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLDIGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTTn 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1278 TGMALNFIRQQNFRTQAGMRPRARKIGVLITDGKSQD-DVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDTHAY 1356
Cdd:pfam00092   81 TGKALKYALENLFSSAAGARPGAPKVVVLLTDGRSQDgDPEEVARELKSAGVTVFAVGVGNADDEELRKIASEPGEGHVF 160
                          170
                   ....*....|....
gi 93141047   1357 NVADFESLSRIVDD 1370
Cdd:pfam00092  161 TVSDFEALEDLQDQ 174
VWA pfam00092
von Willebrand factor type A domain;
440-608 1.83e-61

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 208.67  E-value: 1.83e-61
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    440 DIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGST-N 518
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLDIGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTtN 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    519 TGKAMTYVREKIFVPSKGSRSNVPKVMILITDGKSSD-AFRDPAIKLRNSDVEIFAVGVKDAVRSELEAIASPPAETHVF 597
Cdd:pfam00092   81 TGKALKYALENLFSSAAGARPGAPKVVVLLTDGRSQDgDPEEVARELKSAGVTVFAVGVGNADDEELRKIASEPGEGHVF 160
                          170
                   ....*....|.
gi 93141047    598 TVEDFDAFQRI 608
Cdd:pfam00092  161 TVSDFEALEDL 171
VWA pfam00092
von Willebrand factor type A domain;
2323-2495 7.01e-61

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 207.13  E-value: 7.01e-61
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   2323 DIVFLTDASWSIGDDNFNKVVKFIFNTVGGFDeISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYRGGNTR 2402
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLD-IGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   2403 -TGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQD-EVKKAALVIQQSGFSVFVVGVADVDYNELANIASKPSERHV 2480
Cdd:pfam00092   80 nTGKALKYALENLFSSAAGARPGAPKVVVLLTDGRSQDgDPEEVARELKSAGVTVFAVGVGNADDEELRKIASEPGEGHV 159
                          170
                   ....*....|....*
gi 93141047   2481 FIVDDFESFEKIEDN 2495
Cdd:pfam00092  160 FTVSDFEALEDLQDQ 174
TSPN smart00210
Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of ...
2520-2712 3.13e-59

Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of thrombospondin


Pssm-ID: 214560  Cd Length: 184  Bit Score: 202.59  E-value: 3.13e-59
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    2520 GFKMLEAYNLTEKNFASVQGVSLEsgsfPSYSAYRIQKNAFVNQPTADLHPNGLPPSYTIILLFRLlpeTPSDPFAIWQI 2599
Cdd:smart00210    1 GQDLLQVFDLPSLSFAIRQVVGPE----PGSPAYRLGDPALVPQPTRDLFPSGLPEDFSLLTTFRQ---TPKSRGVLFAI 73
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    2600 TDRDYKPQVGVIADPSSKTLSFFNKDTRGEVQTVTFdteEVKTLFYGSFHKVHIVVTSKSVKIYIDCYEIIEKDIKEAG- 2678
Cdd:smart00210   74 YDAQNVRQFGLEVDGRANTLLLRYQGVDGKQHTVSF---RNLPLADGQWHKLALSVSGSSATLYVDCNEIDSRPLDRPGq 150
                           170       180       190
                    ....*....|....*....|....*....|....*
gi 93141047    2679 -NITTDGYEILGKLLKGeRKSAAFQIQSFDIVCSP 2712
Cdd:smart00210  151 pPIDTDGIEVRGAQAAD-RKPFQGDLQQLKIVCDP 184
vWFA_subfamily_ECM cd01450
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
139-297 1.04e-55

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains


Pssm-ID: 238727 [Multi-domain]  Cd Length: 161  Bit Score: 191.74  E-value: 1.04e-55
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  139 TDLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGN-T 217
Cdd:cd01450    1 LDIVFLLDGSESVGPENFEKVKDFIEKLVEKLDIGPDKTRVGLVQYSDDVRVEFSLNDYKSKDDLLKAVKNLKYLGGGgT 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  218 MTGDAIDYLVKNTFTESAgARVGFPKVAIIITDGKSQD--EVEIPARELRNVGVEVFSLGIKAADAKELKQIASTPSLNH 295
Cdd:cd01450   81 NTGKALQYALEQLFSESN-ARENVPKVIIVLTDGRSDDggDPKEAAAKLKDEGIKVFVVGVGPADEEELREIASCPSERH 159

                 ..
gi 93141047  296 VF 297
Cdd:cd01450  160 VF 161
vWFA_subfamily_ECM cd01450
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
439-597 6.96e-54

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains


Pssm-ID: 238727 [Multi-domain]  Cd Length: 161  Bit Score: 186.34  E-value: 6.96e-54
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  439 ADIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGS-T 517
Cdd:cd01450    1 LDIVFLLDGSESVGPENFEKVKDFIEKLVEKLDIGPDKTRVGLVQYSDDVRVEFSLNDYKSKDDLLKAVKNLKYLGGGgT 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  518 NTGKAMTYVREKIFVPSkGSRSNVPKVMILITDGKSSD--AFRDPAIKLRNSDVEIFAVGVKDAVRSELEAIASPPAETH 595
Cdd:cd01450   81 NTGKALQYALEQLFSES-NARENVPKVIIVLTDGRSDDggDPKEAAAKLKDEGIKVFVVGVGPADEEELREIASCPSERH 159

                 ..
gi 93141047  596 VF 597
Cdd:cd01450  160 VF 161
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
140-309 1.38e-53

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 186.12  E-value: 1.38e-53
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     140 DLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYK-GGNTM 218
Cdd:smart00327    1 DVVFLLDGSGSMGGNRFELAKEFVLKLVEQLDIGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYKlGGGTN 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     219 TGDAIDYLVKNTFTESAGARVGFPKVAIIITDGKSQD---EVEIPARELRNVGVEVFSLGIK-AADAKELKQIASTPSLN 294
Cdd:smart00327   81 LGAALQYALENLFSKSAGSRRGAPKVVILITDGESNDgpkDLLKAAKELKRSGVKVFVVGVGnDVDEEELKKLASAPGGV 160
                           170
                    ....*....|....*
gi 93141047     295 HVFNVANFDAIVDIQ 309
Cdd:smart00327  161 YVFLPELLDLLIDLL 175
vWFA_subfamily_ECM cd01450
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
1198-1356 4.83e-53

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains


Pssm-ID: 238727 [Multi-domain]  Cd Length: 161  Bit Score: 184.03  E-value: 4.83e-53
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1198 ADIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGN-T 1276
Cdd:cd01450    1 LDIVFLLDGSESVGPENFEKVKDFIEKLVEKLDIGPDKTRVGLVQYSDDVRVEFSLNDYKSKDDLLKAVKNLKYLGGGgT 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1277 LTGMALNFIRQQNFrTQAGMRPRARKIGVLITDGKSQD--DVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDTH 1354
Cdd:cd01450   81 NTGKALQYALEQLF-SESNARENVPKVIIVLTDGRSDDggDPKEAAAKLKDEGIKVFVVGVGPADEEELREIASCPSERH 159

                 ..
gi 93141047 1355 AY 1356
Cdd:cd01450  160 VF 161
vWFA_subfamily_ECM cd01450
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
2322-2481 2.89e-52

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains


Pssm-ID: 238727 [Multi-domain]  Cd Length: 161  Bit Score: 181.72  E-value: 2.89e-52
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2322 ADIVFLTDASWSIGDDNFNKVVKFIFNTVGGFDeISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYRGGN- 2400
Cdd:cd01450    1 LDIVFLLDGSESVGPENFEKVKDFIEKLVEKLD-IGPDKTRVGLVQYSDDVRVEFSLNDYKSKDDLLKAVKNLKYLGGGg 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2401 TRTGKALTFIKEKVLTwESGMRKNVPKVLVVVTDGRSQD--EVKKAALVIQQSGFSVFVVGVADVDYNELANIASKPSER 2478
Cdd:cd01450   80 TNTGKALQYALEQLFS-ESNARENVPKVIIVLTDGRSDDggDPKEAAAKLKDEGIKVFVVGVGPADEEELREIASCPSER 158

                 ...
gi 93141047 2479 HVF 2481
Cdd:cd01450  159 HVF 161
vWA_Matrilin cd01475
VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and ...
139-322 3.09e-52

VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.


Pssm-ID: 238752 [Multi-domain]  Cd Length: 224  Bit Score: 184.13  E-value: 3.09e-52
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  139 TDLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTM 218
Cdd:cd01475    3 TDLVFLIDSSRSVRPENFELVKQFLNQIIDSLDVGPDATRVGLVQYSSTVKQEFPLGRFKSKADLKRAVRRMEYLETGTM 82
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  219 TGDAIDYLVKNTFTESAGARVG---FPKVAIIITDGKSQDEVEIPARELRNVGVEVFSLGIKAADAKELKQIASTPSLNH 295
Cdd:cd01475   83 TGLAIQYAMNNAFSEAEGARPGserVPRVGIVVTDGRPQDDVSEVAAKARALGIEMFAVGVGRADEEELREIASEPLADH 162
                        170       180
                 ....*....|....*....|....*..
gi 93141047  296 VFNVANFDAIVDIQNEIISQVCSGVDE 322
Cdd:cd01475  163 VFYVEDFSTIEELTKKFQGKICVVPDL 189
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
440-609 5.63e-52

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 181.50  E-value: 5.63e-52
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     440 DIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYR-GGSTN 518
Cdd:smart00327    1 DVVFLLDGSGSMGGNRFELAKEFVLKLVEQLDIGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYKlGGGTN 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     519 TGKAMTYVREKIFVPSKGSRSNVPKVMILITDGKSSDA---FRDPAIKLRNSDVEIFAVGVKDAV-RSELEAIASPPAET 594
Cdd:smart00327   81 LGAALQYALENLFSKSAGSRRGAPKVVILITDGESNDGpkdLLKAAKELKRSGVKVFVVGVGNDVdEEELKKLASAPGGV 160
                           170
                    ....*....|....*
gi 93141047     595 HVFTVEDFDAFQRIS 609
Cdd:smart00327  161 YVFLPELLDLLIDLL 175
vWA_collagen_alpha3-VI-like cd01481
VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable ...
140-302 7.08e-51

VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238758  Cd Length: 165  Bit Score: 177.90  E-value: 7.08e-51
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  140 DLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTM- 218
Cdd:cd01481    2 DIVFLIDGSDNVGSGNFPAIRDFIERIVQSLDVGPDKIRVAVVQFSDTPRPEFYLNTHSTKADVLGAVRRLRLRGGSQLn 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  219 TGDAIDYLVKNTFTESAGARV--GFPKVAIIITDGKSQDEVEIPARELRNVGVEVFSLGIKAADAKELKQIASTPSLnhV 296
Cdd:cd01481   82 TGSALDYVVKNLFTKSAGSRIeeGVPQFLVLITGGKSQDDVERPAVALKRAGIVPFAIGARNADLAELQQIAFDPSF--V 159

                 ....*.
gi 93141047  297 FNVANF 302
Cdd:cd01481  160 FQVSDF 165
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
1199-1368 2.86e-49

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 173.79  E-value: 2.86e-49
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    1199 DIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYK-GGNTL 1277
Cdd:smart00327    1 DVVFLLDGSGSMGGNRFELAKEFVLKLVEQLDIGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYKlGGGTN 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    1278 TGMALNFIRQQNFRTQAGMRPRARKIGVLITDGKSQD---DVEAPSKKLKDEGVELFAIGIKNA-DEVELKMIATDPDDT 1353
Cdd:smart00327   81 LGAALQYALENLFSKSAGSRRGAPKVVILITDGESNDgpkDLLKAAKELKRSGVKVFVVGVGNDvDEEELKKLASAPGGV 160
                           170
                    ....*....|....*
gi 93141047    1354 HAYNVADFESLSRIV 1368
Cdd:smart00327  161 YVFLPELLDLLIDLL 175
vWA_Matrilin cd01475
VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and ...
439-617 1.52e-46

VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.


Pssm-ID: 238752 [Multi-domain]  Cd Length: 224  Bit Score: 167.95  E-value: 1.52e-46
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  439 ADIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTN 518
Cdd:cd01475    3 TDLVFLIDSSRSVRPENFELVKQFLNQIIDSLDVGPDATRVGLVQYSSTVKQEFPLGRFKSKADLKRAVRRMEYLETGTM 82
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  519 TGKAMTYVREKIFVPSKGSRS---NVPKVMILITDGKSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSELEAIASPPAETH 595
Cdd:cd01475   83 TGLAIQYAMNNAFSEAEGARPgseRVPRVGIVVTDGRPQDDVSEVAAKARALGIEMFAVGVGRADEEELREIASEPLADH 162
                        170       180
                 ....*....|....*....|..
gi 93141047  596 VFTVEDFDAFQRISFELTQSIC 617
Cdd:cd01475  163 VFYVEDFSTIEELTKKFQGKIC 184
vWA_collagen_alpha3-VI-like cd01481
VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable ...
1198-1361 1.31e-45

VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238758  Cd Length: 165  Bit Score: 162.88  E-value: 1.31e-45
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1198 ADIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTL 1277
Cdd:cd01481    1 KDIVFLIDGSDNVGSGNFPAIRDFIERIVQSLDVGPDKIRVAVVQFSDTPRPEFYLNTHSTKADVLGAVRRLRLRGGSQL 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1278 -TGMALNFIRQQNFRTQAGMRPR--ARKIGVLITDGKSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDdtH 1354
Cdd:cd01481   81 nTGSALDYVVKNLFTKSAGSRIEegVPQFLVLITGGKSQDDVERPAVALKRAGIVPFAIGARNADLAELQQIAFDPS--F 158

                 ....*..
gi 93141047 1355 AYNVADF 1361
Cdd:cd01481  159 VFQVSDF 165
vWA_Matrilin cd01475
VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and ...
1198-1376 2.59e-45

VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.


Pssm-ID: 238752 [Multi-domain]  Cd Length: 224  Bit Score: 164.48  E-value: 2.59e-45
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1198 ADIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTL 1277
Cdd:cd01475    3 TDLVFLIDSSRSVRPENFELVKQFLNQIIDSLDVGPDATRVGLVQYSSTVKQEFPLGRFKSKADLKRAVRRMEYLETGTM 82
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1278 TGMALNFIRQQNFRTQAGMRPRAR---KIGVLITDGKSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIATDPDDTH 1354
Cdd:cd01475   83 TGLAIQYAMNNAFSEAEGARPGSErvpRVGIVVTDGRPQDDVSEVAAKARALGIEMFAVGVGRADEEELREIASEPLADH 162
                        170       180
                 ....*....|....*....|..
gi 93141047 1355 AYNVADFESLSRIVDDLTINLC 1376
Cdd:cd01475  163 VFYVEDFSTIEELTKKFQGKIC 184
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
2323-2492 3.07e-45

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 162.24  E-value: 3.07e-45
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    2323 DIVFLTDASWSIGDDNFNKVVKFIFNTVGGFDeISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYR-GGNT 2401
Cdd:smart00327    1 DVVFLLDGSGSMGGNRFELAKEFVLKLVEQLD-IGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYKlGGGT 79
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    2402 RTGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQD---EVKKAALVIQQSGFSVFVVGV-ADVDYNELANIASKPSE 2477
Cdd:smart00327   80 NLGAALQYALENLFSKSAGSRRGAPKVVILITDGESNDgpkDLLKAAKELKRSGVKVFVVGVgNDVDEEELKKLASAPGG 159
                           170
                    ....*....|....*
gi 93141047    2478 RHVFIVDDFESFEKI 2492
Cdd:smart00327  160 VYVFLPELLDLLIDL 174
vWA_integrins_alpha_subunit cd01469
Integrins are a class of adhesion receptors that link the extracellular matrix to the ...
440-608 1.54e-44

Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions.


Pssm-ID: 238746 [Multi-domain]  Cd Length: 177  Bit Score: 160.21  E-value: 1.54e-44
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  440 DIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGSTNT 519
Cdd:cd01469    2 DIVFVLDGSGSIYPDDFQKVKNFLSTVMKKLDIGPTKTQFGLVQYSESFRTEFTLNEYRTKEEPLSLVKHISQLLGLTNT 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  520 GKAMTYVREKIFVPSKGSRSNVPKVMILITDGKSSDAFRDPAI--KLRNSDVEIFAVGVKDAVRS-----ELEAIASPPA 592
Cdd:cd01469   82 ATAIQYVVTELFSESNGARKDATKVLVVITDGESHDDPLLKDVipQAEREGIIRYAIGVGGHFQRensreELKTIASKPP 161
                        170
                 ....*....|....*.
gi 93141047  593 ETHVFTVEDFDAFQRI 608
Cdd:cd01469  162 EEHFFNVTDFAALKDI 177
vWA_integrins_alpha_subunit cd01469
Integrins are a class of adhesion receptors that link the extracellular matrix to the ...
139-308 7.66e-43

Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions.


Pssm-ID: 238746 [Multi-domain]  Cd Length: 177  Bit Score: 155.59  E-value: 7.66e-43
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  139 TDLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYKGGNTM 218
Cdd:cd01469    1 MDIVFVLDGSGSIYPDDFQKVKNFLSTVMKKLDIGPTKTQFGLVQYSESFRTEFTLNEYRTKEEPLSLVKHISQLLGLTN 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  219 TGDAIDYLVKNTFTESAGARVGFPKVAIIITDGKSQD----EVEIPARELRNvgVEVFSLGIKAA-----DAKELKQIAS 289
Cdd:cd01469   81 TATAIQYVVTELFSESNGARKDATKVLVVITDGESHDdpllKDVIPQAEREG--IIRYAIGVGGHfqrenSREELKTIAS 158
                        170
                 ....*....|....*....
gi 93141047  290 TPSLNHVFNVANFDAIVDI 308
Cdd:cd01469  159 KPPEEHFFNVTDFAALKDI 177
vWA_collagen_alpha3-VI-like cd01481
VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable ...
439-602 7.88e-43

VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238758  Cd Length: 165  Bit Score: 154.79  E-value: 7.88e-43
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  439 ADIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGS-T 517
Cdd:cd01481    1 KDIVFLIDGSDNVGSGNFPAIRDFIERIVQSLDVGPDKIRVAVVQFSDTPRPEFYLNTHSTKADVLGAVRRLRLRGGSqL 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  518 NTGKAMTYVREKIFVPSKGSR--SNVPKVMILITDGKSSDAFRDPAIKLRNSDVEIFAVGVKDAVRSELEAIASPPaeTH 595
Cdd:cd01481   81 NTGSALDYVVKNLFTKSAGSRieEGVPQFLVLITGGKSQDDVERPAVALKRAGIVPFAIGARNADLAELQQIAFDP--SF 158

                 ....*..
gi 93141047  596 VFTVEDF 602
Cdd:cd01481  159 VFQVSDF 165
vWA_Matrilin cd01475
VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and ...
2322-2525 5.13e-42

VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.


Pssm-ID: 238752 [Multi-domain]  Cd Length: 224  Bit Score: 154.85  E-value: 5.13e-42
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2322 ADIVFLTDASWSIGDDNFNKVVKFIFNTVGGFDeISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYRGGNT 2401
Cdd:cd01475    3 TDLVFLIDSSRSVRPENFELVKQFLNQIIDSLD-VGPDATRVGLVQYSSTVKQEFPLGRFKSKADLKRAVRRMEYLETGT 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2402 RTGKALTFIKEKVLTWESGMRK---NVPKVLVVVTDGRSQDEVKKAALVIQQSGFSVFVVGVADVDYNELANIASKPSER 2478
Cdd:cd01475   82 MTGLAIQYAMNNAFSEAEGARPgseRVPRVGIVVTDGRPQDDVSEVAAKARALGIEMFAVGVGRADEEELREIASEPLAD 161
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2479 HVFIVDDFESFEKIEDNL-------------ITFVCETATSSCPLIYLDGYTSpGFKMLE 2525
Cdd:cd01475  162 HVFYVEDFSTIEELTKKFqgkicvvpdlcatLSHVCQQVCISTPGSYLCACTE-GYALLE 220
vWA_integrins_alpha_subunit cd01469
Integrins are a class of adhesion receptors that link the extracellular matrix to the ...
1199-1367 2.28e-40

Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions.


Pssm-ID: 238746 [Multi-domain]  Cd Length: 177  Bit Score: 148.27  E-value: 2.28e-40
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1199 DIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTLT 1278
Cdd:cd01469    2 DIVFVLDGSGSIYPDDFQKVKNFLSTVMKKLDIGPTKTQFGLVQYSESFRTEFTLNEYRTKEEPLSLVKHISQLLGLTNT 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1279 GMALNFIRQQNFRTQAGMRPRARKIGVLITDGKSQDDVEAPS--KKLKDEGVELFAIGI------KNADEvELKMIATDP 1350
Cdd:cd01469   82 ATAIQYVVTELFSESNGARKDATKVLVVITDGESHDDPLLKDviPQAEREGIIRYAIGVgghfqrENSRE-ELKTIASKP 160
                        170
                 ....*....|....*..
gi 93141047 1351 DDTHAYNVADFESLSRI 1367
Cdd:cd01469  161 PEEHFFNVTDFAALKDI 177
vWA_collagen_alpha3-VI-like cd01481
VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable ...
2322-2486 1.59e-38

VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238758  Cd Length: 165  Bit Score: 142.46  E-value: 1.59e-38
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2322 ADIVFLTDASWSIGDDNFNKVVKFIFNTVGGFDeISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYRGGNT 2401
Cdd:cd01481    1 KDIVFLIDGSDNVGSGNFPAIRDFIERIVQSLD-VGPDKIRVAVVQFSDTPRPEFYLNTHSTKADVLGAVRRLRLRGGSQ 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2402 -RTGKALTFIKEKVLTWESG--MRKNVPKVLVVVTDGRSQDEVKKAALVIQQSGFSVFVVGVADVDYNELANIASKPSer 2478
Cdd:cd01481   80 lNTGSALDYVVKNLFTKSAGsrIEEGVPQFLVLITGGKSQDDVERPAVALKRAGIVPFAIGARNADLAELQQIAFDPS-- 157

                 ....*...
gi 93141047 2479 HVFIVDDF 2486
Cdd:cd01481  158 FVFQVSDF 165
vWA_integrins_alpha_subunit cd01469
Integrins are a class of adhesion receptors that link the extracellular matrix to the ...
2323-2492 1.52e-37

Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions.


Pssm-ID: 238746 [Multi-domain]  Cd Length: 177  Bit Score: 140.18  E-value: 1.52e-37
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2323 DIVFLTDASWSIGDDNFNKVVKFIFNTVGGFDeISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYRGGNTR 2402
Cdd:cd01469    2 DIVFVLDGSGSIYPDDFQKVKNFLSTVMKKLD-IGPTKTQFGLVQYSESFRTEFTLNEYRTKEEPLSLVKHISQLLGLTN 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2403 TGKALTFIKEKVLTWESGMRKNVPKVLVVVTDGRSQDEVKKAAlVIQQS---GFSVFVVGVADV-----DYNELANIASK 2474
Cdd:cd01469   81 TATAIQYVVTELFSESNGARKDATKVLVVITDGESHDDPLLKD-VIPQAereGIIRYAIGVGGHfqrenSREELKTIASK 159
                        170
                 ....*....|....*...
gi 93141047 2475 PSERHVFIVDDFESFEKI 2492
Cdd:cd01469  160 PPEEHFFNVTDFAALKDI 177
vWFA cd00198
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
140-297 5.29e-32

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.


Pssm-ID: 238119 [Multi-domain]  Cd Length: 161  Bit Score: 123.83  E-value: 5.29e-32
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  140 DLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIPYK-GGNTM 218
Cdd:cd00198    2 DIVFLLDVSGSMGGEKLDKAKEALKALVSSLSASPPGDRVGLVTFGSNARVVLPLTTDTDKADLLEAIDALKKGlGGGTN 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  219 TGDAIDYLVKNTFtesAGARVGFPKVAIIITDGKSQDEVEIP---ARELRNVGVEVFSLGIK-AADAKELKQIASTPSLN 294
Cdd:cd00198   82 IGAALRLALELLK---SAKRPNARRVIILLTDGEPNDGPELLaeaARELRKLGITVYTIGIGdDANEDELKEIADKTTGG 158

                 ...
gi 93141047  295 HVF 297
Cdd:cd00198  159 AVF 161
vWFA cd00198
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
439-597 6.23e-32

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.


Pssm-ID: 238119 [Multi-domain]  Cd Length: 161  Bit Score: 123.44  E-value: 6.23e-32
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  439 ADIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYR-GGST 517
Cdd:cd00198    1 ADIVFLLDVSGSMGGEKLDKAKEALKALVSSLSASPPGDRVGLVTFGSNARVVLPLTTDTDKADLLEAIDALKKGlGGGT 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  518 NTGKAMTYVREKIFvpsKGSRSNVPKVMILITDGKSSDA---FRDPAIKLRNSDVEIFAVGVKD-AVRSELEAIASPPAE 593
Cdd:cd00198   81 NIGAALRLALELLK---SAKRPNARRVIILLTDGEPNDGpelLAEAARELRKLGITVYTIGIGDdANEDELKEIADKTTG 157

                 ....
gi 93141047  594 THVF 597
Cdd:cd00198  158 GAVF 161
vWFA cd00198
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
1198-1356 3.39e-30

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.


Pssm-ID: 238119 [Multi-domain]  Cd Length: 161  Bit Score: 118.82  E-value: 3.39e-30
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1198 ADIVLLVDGSWSIGRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYK-GGNT 1276
Cdd:cd00198    1 ADIVFLLDVSGSMGGEKLDKAKEALKALVSSLSASPPGDRVGLVTFGSNARVVLPLTTDTDKADLLEAIDALKKGlGGGT 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1277 LTGMALNFIRQQNFRTQagmRPRARKIGVLITDGKSQDD---VEAPSKKLKDEGVELFAIGIKN-ADEVELKMIATDPDD 1352
Cdd:cd00198   81 NIGAALRLALELLKSAK---RPNARRVIILLTDGEPNDGpelLAEAARELRKLGITVYTIGIGDdANEDELKEIADKTTG 157

                 ....
gi 93141047 1353 THAY 1356
Cdd:cd00198  158 GAVF 161
VWA_integrin_invertebrates cd01476
VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have ...
140-297 3.74e-30

VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.


Pssm-ID: 238753 [Multi-domain]  Cd Length: 163  Bit Score: 118.66  E-value: 3.74e-30
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  140 DLVFLVDGSWSVGRNNFKYIlDFIAALVSAFDIGEEKTRVGVVQYSSDTRT--EFNLNQYYQRDELLAAIKKIPYKGGNT 217
Cdd:cd01476    2 DLLFVLDSSGSVRGKFEKYK-KYIERIVEGLEIGPTATRVALITYSGRGRQrvRFNLPKHNDGEELLEKVDNLRFIGGTT 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  218 MTGDAIDYLVkNTFTESAGARVGFPKVAIIITDGKSQDEVEIPARELRNV-GVEVFSLGIK---AADAKELKQIASTPsl 293
Cdd:cd01476   81 ATGAAIEVAL-QQLDPSEGRREGIPKVVVVLTDGRSHDDPEKQARILRAVpNIETFAVGTGdpgTVDTEELHSITGNE-- 157

                 ....
gi 93141047  294 NHVF 297
Cdd:cd01476  158 DHIF 161
VWA_integrin_invertebrates cd01476
VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have ...
2322-2482 3.52e-28

VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.


Pssm-ID: 238753 [Multi-domain]  Cd Length: 163  Bit Score: 112.88  E-value: 3.52e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2322 ADIVFLTDASWSIgDDNFNKVVKFIFNTVGGFDeISPAGIQVSFVQYSDEVKS--EFKLNTYNDKALALGALQNIRYRGG 2399
Cdd:cd01476    1 LDLLFVLDSSGSV-RGKFEKYKKYIERIVEGLE-IGPTATRVALITYSGRGRQrvRFNLPKHNDGEELLEKVDNLRFIGG 78
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2400 NTRTGKALTFiKEKVLTWESGMRKNVPKVLVVVTDGRSQDEVKKAALVIQ-QSGFSVFVVGVAD---VDYNELANIASkp 2475
Cdd:cd01476   79 TTATGAAIEV-ALQQLDPSEGRREGIPKVVVVLTDGRSHDDPEKQARILRaVPNIETFAVGTGDpgtVDTEELHSITG-- 155

                 ....*..
gi 93141047 2476 SERHVFI 2482
Cdd:cd01476  156 NEDHIFT 162
VWA_integrin_invertebrates cd01476
VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have ...
439-598 9.34e-26

VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.


Pssm-ID: 238753 [Multi-domain]  Cd Length: 163  Bit Score: 105.94  E-value: 9.34e-26
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  439 ADIVFLVDGSYSIGiANFVKVRAFLEVLVKSFEISPNRVQISLVQYS--RDPHTEFTLKKFTKVEDIIEAINTFPYRGGS 516
Cdd:cd01476    1 LDLLFVLDSSGSVR-GKFEKYKKYIERIVEGLEIGPTATRVALITYSgrGRQRVRFNLPKHNDGEELLEKVDNLRFIGGT 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  517 TNTGKAMTYVREkIFVPSKGSRSNVPKVMILITDGKSSDAFRDPAIKLRNS-DVEIFAVGVKD---AVRSELEAIASppA 592
Cdd:cd01476   80 TATGAAIEVALQ-QLDPSEGRREGIPKVVVVLTDGRSHDDPEKQARILRAVpNIETFAVGTGDpgtVDTEELHSITG--N 156

                 ....*.
gi 93141047  593 ETHVFT 598
Cdd:cd01476  157 EDHIFT 162
VWA_integrin_invertebrates cd01476
VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have ...
1198-1353 3.01e-25

VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.


Pssm-ID: 238753 [Multi-domain]  Cd Length: 163  Bit Score: 104.40  E-value: 3.01e-25
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1198 ADIVLLVDGSWSIgRANFRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRT--EWQLNAHRDKKSLLQAVANLPYKGGN 1275
Cdd:cd01476    1 LDLLFVLDSSGSV-RGKFEKYKKYIERIVEGLEIGPTATRVALITYSGRGRQrvRFNLPKHNDGEELLEKVDNLRFIGGT 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1276 TLTGMALNFIRQQnFRTQAGMRPRARKIGVLITDGKSQDDVEAPSKKLKDE-GVELFAIGIKN---ADEVELKMIATDPD 1351
Cdd:cd01476   80 TATGAAIEVALQQ-LDPSEGRREGIPKVVVVLTDGRSHDDPEKQARILRAVpNIETFAVGTGDpgtVDTEELHSITGNED 158

                 ..
gi 93141047 1352 DT 1353
Cdd:cd01476  159 HI 160
vWFA cd00198
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
2322-2481 1.08e-23

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.


Pssm-ID: 238119 [Multi-domain]  Cd Length: 161  Bit Score: 99.95  E-value: 1.08e-23
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2322 ADIVFLTDASWSIGDDNFNKVVKFIFNTVGGFDeISPAGIQVSFVQYSDEVKSEFKLNTYNDKALALGALQNIRYR-GGN 2400
Cdd:cd00198    1 ADIVFLLDVSGSMGGEKLDKAKEALKALVSSLS-ASPPGDRVGLVTFGSNARVVLPLTTDTDKADLLEAIDALKKGlGGG 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2401 TRTGKALTFIKEkvlTWESGMRKNVPKVLVVVTDGRSQDEVKKAALVIQQ---SGFSVFVVGV-ADVDYNELANIASKPS 2476
Cdd:cd00198   80 TNIGAALRLALE---LLKSAKRPNARRVIILLTDGEPNDGPELLAEAARElrkLGITVYTIGIgDDANEDELKEIADKTT 156

                 ....*
gi 93141047 2477 ERHVF 2481
Cdd:cd00198  157 GGAVF 161
vWA_micronemal_protein cd01471
Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a ...
440-583 1.22e-22

Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.


Pssm-ID: 238748 [Multi-domain]  Cd Length: 186  Bit Score: 97.84  E-value: 1.22e-22
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  440 DIVFLVDGSYSIGIANFV-KVRAFLEVLVKSFEISPNRVQISLVQYSRDPHTEFTLKKF-----TKVEDIIEAINTFPYR 513
Cdd:cd01471    2 DLYLLVDGSGSIGYSNWVtHVVPFLHTFVQNLNISPDEINLYLVTFSTNAKELIRLSSPnstnkDLALNAIRALLSLYYP 81
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 93141047  514 GGSTNTGKAMTYVREKIFvPSKGSRSNVPKVMILITDGKSSDAFR--DPAIKLRNSDVEIFAVGVKDAVRSE 583
Cdd:cd01471   82 NGSTNTTSALLVVEKHLF-DTRGNRENAPQLVIIMTDGIPDSKFRtlKEARKLRERGVIIAVLGVGQGVNHE 152
vWA_micronemal_protein cd01471
Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a ...
1199-1335 1.18e-21

Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.


Pssm-ID: 238748 [Multi-domain]  Cd Length: 186  Bit Score: 95.14  E-value: 1.18e-21
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1199 DIVLLVDGSWSIGRAN-FRTVRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQLNAHR--DKKSLLQAVA---NLPYK 1272
Cdd:cd01471    2 DLYLLVDGSGSIGYSNwVTHVVPFLHTFVQNLNISPDEINLYLVTFSTNAKELIRLSSPNstNKDLALNAIRallSLYYP 81
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 93141047 1273 GGNTLTGMALNFIRQQNFRTqAGMRPRARKIGVLITDGKSQDDVEA--PSKKLKDEGVELFAIGI 1335
Cdd:cd01471   82 NGSTNTTSALLVVEKHLFDT-RGNRENAPQLVIIMTDGIPDSKFRTlkEARKLRERGVIIAVLGV 145
vWA_collagen_alpha_1-VI-type cd01480
VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable ...
140-296 4.18e-21

VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238757 [Multi-domain]  Cd Length: 186  Bit Score: 93.60  E-value: 4.18e-21
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  140 DLVFLVDGSWSVGRNNFKYILDFIAALVSAF------DIGEEKTRVGVVQYSSDTRTEFNLNQYYQ-RDELLAAIKKIPY 212
Cdd:cd01480    4 DITFVLDSSESVGLQNFDITKNFVKRVAERFlkdyyrKDPAGSWRVGVVQYSDQQEVEAGFLRDIRnYTSLKEAVDNLEY 83
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  213 KGGNTMTGDAIDYLVKNTFTESAGARVgfpKVAIIITDGKSQDE----VEIPARELRNVGVEVFSLGIKAADAKELKQIA 288
Cdd:cd01480   84 IGGGTFTDCALKYATEQLLEGSHQKEN---KFLLVITDGHSDGSpdggIEKAVNEADHLGIKIFFVAVGSQNEEPLSRIA 160

                 ....*...
gi 93141047  289 STPSLNHV 296
Cdd:cd01480  161 CDGKSALY 168
vWA_micronemal_protein cd01471
Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a ...
140-276 1.17e-18

Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.


Pssm-ID: 238748 [Multi-domain]  Cd Length: 186  Bit Score: 86.29  E-value: 1.17e-18
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  140 DLVFLVDGSWSVGRNN-FKYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRTEFNLNQYYQRDE-----LLAAIKKIPYK 213
Cdd:cd01471    2 DLYLLVDGSGSIGYSNwVTHVVPFLHTFVQNLNISPDEINLYLVTFSTNAKELIRLSSPNSTNKdlalnAIRALLSLYYP 81
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 93141047  214 GGNTMTGDAIDYLVKNTFtESAGARVGFPKVAIIITDGKS---QDEVEIpARELRNVGVEVFSLGI 276
Cdd:cd01471   82 NGSTNTTSALLVVEKHLF-DTRGNRENAPQLVIIMTDGIPdskFRTLKE-ARKLRERGVIIAVLGV 145
vWA_collagen_alpha_1-VI-type cd01480
VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable ...
1198-1373 2.45e-18

VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238757 [Multi-domain]  Cd Length: 186  Bit Score: 85.51  E-value: 2.45e-18
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1198 ADIVLLVDGSWSIGRANFRTVRSFISRIVEVF------DIGPKRVQIALAQYSGDPRTEWQ-LNAHRDKKSLLQAVANLP 1270
Cdd:cd01480    3 VDITFVLDSSESVGLQNFDITKNFVKRVAERFlkdyyrKDPAGSWRVGVVQYSDQQEVEAGfLRDIRNYTSLKEAVDNLE 82
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1271 YKGGNTLTGMALNFIRQQnfrTQAGMRPRARKIGVLITDGKSQ-DDVEAPSKKLKD---EGVELFAIGIKNADEVELKMI 1346
Cdd:cd01480   83 YIGGGTFTDCALKYATEQ---LLEGSHQKENKFLLVITDGHSDgSPDGGIEKAVNEadhLGIKIFFVAVGSQNEEPLSRI 159
                        170       180
                 ....*....|....*....|....*...
gi 93141047 1347 ATDPDDTH-AYNVADfESLSRIVDDLTI 1373
Cdd:cd01480  160 ACDGKSALyRENFAE-LLWSFFIDDETA 186
vWA_collagen_alpha_1-VI-type cd01480
VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable ...
2322-2480 7.78e-17

VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238757 [Multi-domain]  Cd Length: 186  Bit Score: 81.28  E-value: 7.78e-17
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2322 ADIVFLTDASWSIGDDNFNKVVKFIFNTVGGF-----DEISPAGIQVSFVQYSDEVKSEFKL--NTYNDKALAlGALQNI 2394
Cdd:cd01480    3 VDITFVLDSSESVGLQNFDITKNFVKRVAERFlkdyyRKDPAGSWRVGVVQYSDQQEVEAGFlrDIRNYTSLK-EAVDNL 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2395 RYRGGNTRTGKALTFIKEKVLtweSGMRKNVPKVLVVVTDGRSQDE----VKKAALVIQQSGFSVFVVGVADVDYNELAN 2470
Cdd:cd01480   82 EYIGGGTFTDCALKYATEQLL---EGSHQKENKFLLVITDGHSDGSpdggIEKAVNEADHLGIKIFFVAVGSQNEEPLSR 158
                        170
                 ....*....|
gi 93141047 2471 IASKPSERHV 2480
Cdd:cd01480  159 IACDGKSALY 168
vWA_collagen_alpha_1-VI-type cd01480
VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable ...
439-605 2.89e-16

VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238757 [Multi-domain]  Cd Length: 186  Bit Score: 79.35  E-value: 2.89e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  439 ADIVFLVDGSYSIGIANFVKVRAFLEVLVKSF------EISPNRVQISLVQYSRDPHTEFT-LKKFTKVEDIIEAINTFP 511
Cdd:cd01480    3 VDITFVLDSSESVGLQNFDITKNFVKRVAERFlkdyyrKDPAGSWRVGVVQYSDQQEVEAGfLRDIRNYTSLKEAVDNLE 82
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  512 YRGGSTNTGKAMTYVREKIFvpsKGSRSNVPKVMILITDGKsSDAFRDPAIK-----LRNSDVEIFAVGVKDAVRSELEA 586
Cdd:cd01480   83 YIGGGTFTDCALKYATEQLL---EGSHQKENKFLLVITDGH-SDGSPDGGIEkavneADHLGIKIFFVAVGSQNEEPLSR 158
                        170       180
                 ....*....|....*....|..
gi 93141047  587 IASPPAETHV---FTVEDFDAF 605
Cdd:cd01480  159 IACDGKSALYrenFAELLWSFF 180
vWA_ATR cd01474
ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, ...
439-620 1.26e-15

ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, the causative agent of anthrax. ATR is the cellular receptor for the anthrax protective antigen and facilitates entry of the toxin into cells. The VWA domain in ATR contains the toxin binding site and mediates interaction with protective antigen. The binding is mediated by divalent cations that binds to the MIDAS motif. These proteins are a family of vertebrate ECM receptors expressed by endothelial cells.


Pssm-ID: 238751 [Multi-domain]  Cd Length: 185  Bit Score: 77.55  E-value: 1.26e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  439 ADIVFLVDGSYSIGiANFVKVRAFLEVLVKSFeISPNrVQISLVQYSRDPHTEFTLKKFTKVED--IIEAINTFPyrGGS 516
Cdd:cd01474    5 FDLYFVLDKSGSVA-ANWIEIYDFVEQLVDRF-NSPG-LRFSFITFSTRATKILPLTDDSSAIIkgLEVLKKVTP--SGQ 79
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  517 TNTGKAMTYVREKIFVPSKGSRSNVpKVMILITDGK-SSDAFRDP---AIKLRNSDVEIFAVGVKDAVRSELEAIASppA 592
Cdd:cd01474   80 TYIHEGLENANEQIFNRNGGGRETV-SVIIALTDGQlLLNGHKYPeheAKLSRKLGAIVYCVGVTDFLKSQLINIAD--S 156
                        170       180
                 ....*....|....*....|....*....
gi 93141047  593 ETHVFTVED-FDAFQRISFELTQSICLRI 620
Cdd:cd01474  157 KEYVFPVTSgFQALSGIIESVVKKACIEI 185
vWA_ATR cd01474
ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, ...
2317-2501 1.50e-15

ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, the causative agent of anthrax. ATR is the cellular receptor for the anthrax protective antigen and facilitates entry of the toxin into cells. The VWA domain in ATR contains the toxin binding site and mediates interaction with protective antigen. The binding is mediated by divalent cations that binds to the MIDAS motif. These proteins are a family of vertebrate ECM receptors expressed by endothelial cells.


Pssm-ID: 238751 [Multi-domain]  Cd Length: 185  Bit Score: 77.55  E-value: 1.50e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2317 CKGAkADIVFLTDASWSIGDdNFNKVVKFIFNTVGGFdeISPaGIQVSFVQYSDEVKSEFKLNTYNDKAL-ALGALQNIr 2395
Cdd:cd01474    1 CAGH-FDLYFVLDKSGSVAA-NWIEIYDFVEQLVDRF--NSP-GLRFSFITFSTRATKILPLTDDSSAIIkGLEVLKKV- 74
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2396 YRGGNTRTGKALTFIKEKVLTWESGMRKNVpKVLVVVTDGRSQDEV----KKAALVIQQSGFSVFVVGVADVDYNELANI 2471
Cdd:cd01474   75 TPSGQTYIHEGLENANEQIFNRNGGGRETV-SVIIALTDGQLLLNGhkypEHEAKLSRKLGAIVYCVGVTDFLKSQLINI 153
                        170       180       190
                 ....*....|....*....|....*....|.
gi 93141047 2472 ASkpSERHVFIVDD-FESFEKIEDNLITFVC 2501
Cdd:cd01474  154 AD--SKEYVFPVTSgFQALSGIIESVVKKAC 182
fn3 pfam00041
Fibronectin type III domain;
1756-1834 2.51e-15

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 73.22  E-value: 2.51e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1756 PRNLQVYNATSNSLTVKWDPA---SGRVQKYRITYQPSTGEGNEQTTTIGGRQNSVVLQKLKPDTPYTITVSSLYPDGEG 1832
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWTPPpdgNGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82

                   ..
gi 93141047   1833 GR 1834
Cdd:pfam00041   83 PP 84
fn3 pfam00041
Fibronectin type III domain;
1387-1465 3.30e-15

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 73.22  E-value: 3.30e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1387 APSNLVISERTHRSFRVSWTPPSDS---VDRYKVEYYPVSGGKR-QEFYVSRMETSTVLKDLKPETEYVVNVYSVVEDEY 1462
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPPPDGngpITGYEVEYRPKNSGEPwNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGE 81

                   ...
gi 93141047   1463 SEP 1465
Cdd:pfam00041   82 GPP 84
vWA_micronemal_protein cd01471
Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a ...
2323-2460 7.61e-15

Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.


Pssm-ID: 238748 [Multi-domain]  Cd Length: 186  Bit Score: 75.50  E-value: 7.61e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2323 DIVFLTDASWSIGDDN-FNKVVKFIFNTVGGFDeISPAGIQVSFVQYSDEVKSEFKLNTYN--DKALALGALQNIR---Y 2396
Cdd:cd01471    2 DLYLLVDGSGSIGYSNwVTHVVPFLHTFVQNLN-ISPDEINLYLVTFSTNAKELIRLSSPNstNKDLALNAIRALLslyY 80
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 93141047 2397 RGGNTRTGKALTFIkEKVLTWESGMRKNVPKVLVVVTDGRSQDEVK--KAALVIQQSGFSVFVVGV 2460
Cdd:cd01471   81 PNGSTNTTSALLVV-EKHLFDTRGNRENAPQLVIIMTDGIPDSKFRtlKEARKLRERGVIIAVLGV 145
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
2118-2197 2.89e-14

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 70.60  E-value: 2.89e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2118 PPQNIHISDEWYTRFRVSWDPSP---SPVLGYKIVYKPVGSNEPMEA--FVGEMTSYTLHNLNPSTTYDVNVYAQYDSGL 2192
Cdd:cd00063    3 PPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVevTPGSETSYTLTGLKPGTEYEFRVRAVNGGGE 82

                 ....*
gi 93141047 2193 SVPLT 2197
Cdd:cd00063   83 SPPSE 87
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
439-608 6.18e-14

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 74.59  E-value: 6.18e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  439 ADIVFLVDGSYSIGIAN-FVKVRAFLEVLVKSFeisPNRVQISLVQYSRDPhteFTLKKFTK-VEDIIEAINTFPYRGGs 516
Cdd:COG1240   93 RDVVLVVDASGSMAAENrLEAAKGALLDFLDDY---RPRDRVGLVAFGGEA---EVLLPLTRdREALKRALDELPPGGG- 165
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  517 TNTGKAMTYVREKIfvpsKGSRSNVPKVMILITDGKSSDAFRDP---AIKLRNSDVEIFAVGVKDAVRSE--LEAIAspp 591
Cdd:COG1240  166 TPLGDALALALELL----KRADPARRKVIVLLTDGRDNAGRIDPleaAELAAAAGIRIYTIGVGTEAVDEglLREIA--- 238
                        170
                 ....*....|....*....
gi 93141047  592 AET--HVFTVEDFDAFQRI 608
Cdd:COG1240  239 EATggRYFRADDLSELAAI 257
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
724-805 7.59e-14

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 69.45  E-value: 7.59e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  724 GAPRNLKVTDETTDSFKITWTQAP---GRVLRYRIIYRPVAGGESREVTTPPNQRRR-TLENLIPDTKYEVSVIPEYFSG 799
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVTPGSETSyTLTGLKPGTEYEFRVRAVNGGG 81

                 ....*.
gi 93141047  800 PGTPLT 805
Cdd:cd00063   82 ESPPSE 87
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
1195-1371 1.00e-13

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 73.82  E-value: 1.00e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1195 RAEADIVLLVDGSWSIGRAN-FRTVRSFISRIVEVFdigPKRVQIALAQYSGDPrtEWQLNAHRDKKSLLQAVANLPYKG 1273
Cdd:COG1240   90 QRGRDVVLVVDASGSMAAENrLEAAKGALLDFLDDY---RPRDRVGLVAFGGEA--EVLLPLTRDREALKRALDELPPGG 164
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1274 GNTLT---GMALNFIRQQNfrtqagmrPRARKIGVLITDGK---SQDDVEAPSKKLKDEGVELFAIGI--KNADEVELKM 1345
Cdd:COG1240  165 GTPLGdalALALELLKRAD--------PARRKVIVLLTDGRdnaGRIDPLEAAELAAAAGIRIYTIGVgtEAVDEGLLRE 236
                        170       180
                 ....*....|....*....|....*...
gi 93141047 1346 IAtdpDDTHA--YNVADFESLSRIVDDL 1371
Cdd:COG1240  237 IA---EATGGryFRADDLSELAAIYREI 261
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1419-1986 1.17e-13

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 76.96  E-value: 1.17e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1419 YYPVSGGKRQEFYVSRMETSTVLKDLKPETEYVVNVYSVVEDEYSEPLKGTEKTLPVPVVSLNIYDVGPTTMHVQWQPVG 1498
Cdd:COG3401    5 YLTSLDAGIAASAAANTAVNALSKAGGSGKTILVYLAVVLSVTTKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAV 84
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1499 GATGYILSYKPVKDTEPTRPKEVRLGPTVNDMQLTDLVPNTEYAVTVqavlhdltsepVTVREVTLPLPRPQDLKLRDVT 1578
Cdd:COG3401   85 AAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATTATAV-----------AGGAATAGTYALGAGLYGVDGA 153
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1579 HSTMNVFWEPVPGKVRKYIVRYKTPEEDVKEVEVDRSETSTSLKDLFSQTLYTVSVSAVHDEGESPP---VTAQETTRPV 1655
Cdd:COG3401  154 NASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTYYYRVAATDTGGESAPsneVSVTTPTTPP 233
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1656 PAPTNLKITEVTSEGFRGTWD-HGASDVSLYRITWAPfGSSDKMETILNGDENTLVFENLNPNTIYEVSITAIYPDESES 1734
Cdd:COG3401  234 SAPTGLTATADTPGSVTLSWDpVTESDATGYRVYRSN-SGDGPFTKVATVTTTSYTDTGLTNGTTYYYRVTAVDAAGNES 312
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1735 DDligSErTLPILTTQAPKSGPRNLQVYNATSNSLTVKWDPASG-RVQKYRItYQPSTGEGNEQTTTIGGRQNSVVLQKL 1813
Cdd:COG3401  313 AP---SN-VVSVTTDLTPPAAPSGLTATAVGSSSITLSWTASSDaDVTGYNV-YRSTSGGGTYTKIAETVTTTSYTDTGL 387
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1814 KPDTPYTITVSSLYPDGEGGRMTGRGKTKPLNTVRNLRVYDPSTSTLNVRWDHAEGNPRQYKLFYAPAAGGPEELVPIPG 1893
Cdd:COG3401  388 TPGTTYYYKVTAVDAAGNESAPSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTGNA 467
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1894 NTNYAILRNLQPDTSYTVTVVPVYTEGDGGRTSDTGR-----TLMRGLARNVQVYNPTPNSLDVRWDPAPGPVLQYRVVY 1968
Cdd:COG3401  468 VPFTTTSSTVTATTTDTTTANLSVTTGSLVGGSGASSvtnsvSVIGASAAAAVGGAPDGTPNVTGASPVTVGASTGDVLI 547
                        570
                 ....*....|....*...
gi 93141047 1969 SPVDGTRPSESIVVPGNT 1986
Cdd:COG3401  548 TDLVSLTTSASSSVSGAG 565
fn3 pfam00041
Fibronectin type III domain;
1938-2016 1.23e-13

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 68.60  E-value: 1.23e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1938 RNVQVYNPTPNSLDVRWDPAP---GPVLQYRVVYSPVDGTRPSESIVVPGNTRMVHLERLIPDTLYSVNLVALYSDGEGN 2014
Cdd:pfam00041    4 SNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEGP 83

                   ..
gi 93141047   2015 PS 2016
Cdd:pfam00041   84 PS 85
vWA_CTRP cd01473
CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an ...
140-317 3.05e-13

CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an important phenomenon in parasite invasion and in malaria associated pathology.CTRP encodes a protein containing a putative signal sequence followed by a long extracellular region of 1990 amino acids, a transmembrane domain, and a short cytoplasmic segment. The extracellular region of CTRP contains two separated adhesive domains. The first domain contains six 210-amino acid-long homologous VWA domain repeats. The second domain contains seven repeats of 87-60 amino acids in length, which share similarities with the thrombospondin type 1 domain found in a variety of adhesive molecules. Finally, CTRP also contains consensus motifs found in the superfamily of haematopoietin receptors. The VWA domains in these proteins likely mediate protein-protein interactions.


Pssm-ID: 238750 [Multi-domain]  Cd Length: 192  Bit Score: 70.81  E-value: 3.05e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  140 DLVFLVDGSWSVGRNNF-KYILDFIAALVSAFDIGEEKTRVGVVQYSSDTRT--EFNLNQYYQRDELLAAIKKIP--YK- 213
Cdd:cd01473    2 DLTLILDESASIGYSNWrKDVIPFTEKIINNLNISKDKVHVGILLFAEKNRDvvPFSDEERYDKNELLKKINDLKnsYRs 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  214 GGNTMTGDAIDYLVKNtFTESAGARVGFPKVAIIITDGKSQDEVEipaRELRNVG-------VEVFSLGIKAADAKELKQ 286
Cdd:cd01473   82 GGETYIVEALKYGLKN-YTKHGNRRKDAPKVTMLFTDGNDTSASK---KELQDISllykeenVKLLVVGVGAASENKLKL 157
                        170       180       190
                 ....*....|....*....|....*....|....
gi 93141047  287 IA--STPSLNHVFNV-ANFDAIVDIQNEIISQVC 317
Cdd:cd01473  158 LAgcDINNDNCPNVIkTEWNNLNGISKFLTDKIC 191
fn3 pfam00041
Fibronectin type III domain;
724-793 3.59e-13

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 67.44  E-value: 3.59e-13
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 93141047    724 GAPRNLKVTDETTDSFKITWT---QAPGRVLRYRIIYRPV-AGGESREVTTPPNQRRRTLENLIPDTKYEVSVI 793
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTpppDGNGPITGYEVEYRPKnSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQ 74
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1386-1470 4.47e-13

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 67.14  E-value: 4.47e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1386 EAPSNLVISERTHRSFRVSWTPPSDS---VDRYKVEYYPVSGGKRQEFYVSRM-ETSTVLKDLKPETEYVVNVYSVVEDE 1461
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTPPEDDggpITGYVVEYREKGSGDWKEVEVTPGsETSYTLTGLKPGTEYEFRVRAVNGGG 81

                 ....*....
gi 93141047 1462 YSEPLKGTE 1470
Cdd:cd00063   82 ESPPSESVT 90
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1754-1832 6.28e-13

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 66.75  E-value: 6.28e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1754 SGPRNLQVYNATSNSLTVKWDPAS---GRVQKYRITYQPSTGEGNEQTTTIGGRQNSVVLQKLKPDTPYTITVSSLYPDG 1830
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81

                 ..
gi 93141047 1831 EG 1832
Cdd:cd00063   82 ES 83
VWA_2 pfam13519
von Willebrand factor type A domain;
441-548 6.31e-13

von Willebrand factor type A domain;


Pssm-ID: 463909 [Multi-domain]  Cd Length: 103  Bit Score: 67.32  E-value: 6.31e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    441 IVFLVDGSYSI-----GIANFVKVRAFLEVLVKSFeispNRVQISLVQYSRDPHTEFTLKKFTKveDIIEAINTFPYRGG 515
Cdd:pfam13519    1 LVFVLDTSGSMrngdyGPTRLEAAKDAVLALLKSL----PGDRVGLVTFGDGPEVLIPLTKDRA--KILRALRRLEPKGG 74
                           90       100       110
                   ....*....|....*....|....*....|...
gi 93141047    516 STNTGKAMTYVREKIfvpsKGSRSNVPKVMILI 548
Cdd:pfam13519   75 GTNLAAALQLARAAL----KHRRKNQPRRIVLI 103
fn3 pfam00041
Fibronectin type III domain;
2118-2195 1.36e-12

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 65.51  E-value: 1.36e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   2118 PPQNIHISDEWYTRFRVSWDPSP---SPVLGYKIVYKPVGSNEPMEAF--VGEMTSYTLHNLNPSTTYDVNVYAQYDSGL 2192
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWNEItvPGTTTSVTLTGLKPGTEYEVRVQAVNGGGE 81

                   ...
gi 93141047   2193 SVP 2195
Cdd:pfam00041   82 GPP 84
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
139-312 1.41e-12

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 70.74  E-value: 1.41e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  139 TDLVFLVDGSWSVGRNN----FKY-ILDFIAALVsafdigeEKTRVGVVQYSSDTRTEFNLNqyYQRDELLAAIKKIPYK 213
Cdd:COG1240   93 RDVVLVVDASGSMAAENrleaAKGaLLDFLDDYR-------PRDRVGLVAFGGEAEVLLPLT--RDREALKRALDELPPG 163
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  214 GGnTMTGDAIdYLVKNTFTESAGARvgfPKVAIIITDGK---SQDEVEIPARELRNVGVEVFSLGI--KAADAKELKQIA 288
Cdd:COG1240  164 GG-TPLGDAL-ALALELLKRADPAR---RKVIVLLTDGRdnaGRIDPLEAAELAAAAGIRIYTIGVgtEAVDEGLLREIA 238
                        170       180
                 ....*....|....*....|....*
gi 93141047  289 STpsLN-HVFNVANFDAIVDIQNEI 312
Cdd:COG1240  239 EA--TGgRYFRADDLSELAAIYREI 261
fn3 pfam00041
Fibronectin type III domain;
336-415 2.24e-12

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 65.13  E-value: 2.24e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    336 PPSNLIAMEVSSKYVKLNWNPSP---SPVTGYKVILTPMTAGSRQHALSVGPQTTTLSVRDLSADTEYQISVSAMKGMTS 412
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGE 81

                   ...
gi 93141047    413 SEP 415
Cdd:pfam00041   82 GPP 84
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1569-1652 3.23e-12

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 64.83  E-value: 3.23e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1569 PQDLKLRDVTHSTMNVFWEPVP---GKVRKYIVRYK-TPEEDVKEVEVDR-SETSTSLKDLFSQTLYTVSVSAVHDEGES 1643
Cdd:cd00063    4 PTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYReKGSGDWKEVEVTPgSETSYTLTGLKPGTEYEFRVRAVNGGGES 83

                 ....*....
gi 93141047 1644 PPVTAQETT 1652
Cdd:cd00063   84 PPSESVTVT 92
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1938-2017 3.49e-12

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 64.83  E-value: 3.49e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1938 RNVQVYNPTPNSLDVRWDPAP---GPVLQYRVVYSPVDGTRPSESIVVPGNTRMVHLERLIPDTLYSVNLVALYSDGEGN 2014
Cdd:cd00063    5 TNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGESP 84

                 ...
gi 93141047 2015 PSP 2017
Cdd:cd00063   85 PSE 87
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1474-1548 3.80e-12

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 64.17  E-value: 3.80e-12
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 93141047    1474 PVPVVSLNIYDVGPTTMHVQWQPV--GGATGYILSYKPVKDTEPTRPKEVRLGPTVNDMQLTDLVPNTEYAVTVQAV 1548
Cdd:smart00060    1 PSPPSNLRVTDVTSTSVTLSWEPPpdDGITGYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAV 77
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
2118-2193 5.30e-12

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 63.79  E-value: 5.30e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    2118 PPQNIHISDEWYTRFRVSWDPSP-----SPVLGYKIVYKPVGSNEPMEAFVGEMTSYTLHNLNPSTTYDVNVYAQYDSGL 2192
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEPPPddgitGYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82

                    .
gi 93141047    2193 S 2193
Cdd:smart00060   83 G 83
fn3 pfam00041
Fibronectin type III domain;
2028-2105 5.72e-12

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 63.97  E-value: 5.72e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   2028 PRNLRVFGETTNSLSVAW---DHADGPVQQYRIIYSPTVGDPIDEYTTVPGRRNNVILQPLQPDTPYKITVIAVYEDGDG 2104
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWtppPDGNGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82

                   .
gi 93141047   2105 G 2105
Cdd:pfam00041   83 P 83
vWA_CTRP cd01473
CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an ...
440-617 6.64e-12

CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an important phenomenon in parasite invasion and in malaria associated pathology.CTRP encodes a protein containing a putative signal sequence followed by a long extracellular region of 1990 amino acids, a transmembrane domain, and a short cytoplasmic segment. The extracellular region of CTRP contains two separated adhesive domains. The first domain contains six 210-amino acid-long homologous VWA domain repeats. The second domain contains seven repeats of 87-60 amino acids in length, which share similarities with the thrombospondin type 1 domain found in a variety of adhesive molecules. Finally, CTRP also contains consensus motifs found in the superfamily of haematopoietin receptors. The VWA domains in these proteins likely mediate protein-protein interactions.


Pssm-ID: 238750 [Multi-domain]  Cd Length: 192  Bit Score: 66.96  E-value: 6.64e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  440 DIVFLVDGSYSIGIANFVK-VRAFLEVLVKSFEISPNRVQISLVQYS---RDpHTEFTLKKFTKVEDIIEAINTFP--YR 513
Cdd:cd01473    2 DLTLILDESASIGYSNWRKdVIPFTEKIINNLNISKDKVHVGILLFAeknRD-VVPFSDEERYDKNELLKKINDLKnsYR 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  514 -GGSTNTGKAMTYVREKIFvPSKGSRSNVPKVMILITDGKSSDA----FRDPAIKLRNSDVEIFAVGVKDAVRSELEAIA 588
Cdd:cd01473   81 sGGETYIVEALKYGLKNYT-KHGNRRKDAPKVTMLFTDGNDTSAskkeLQDISLLYKEENVKLLVVGVGAASENKLKLLA 159
                        170       180       190
                 ....*....|....*....|....*....|..
gi 93141047  589 --SPPAETHVFTVE-DFDAFQRISFELTQSIC 617
Cdd:cd01473  160 gcDINNDNCPNVIKtEWNNLNGISKFLTDKIC 191
fn3 pfam00041
Fibronectin type III domain;
817-892 7.90e-12

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 63.59  E-value: 7.90e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    817 PRDLRVSDPTTSTMKLSWSGAP---GKVKQYLVTYTPV-AGGETQEVTVRGDTTNTVLQGLKEGTQYALSVTALYASGAG 892
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKnSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
815-893 8.89e-12

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 63.67  E-value: 8.89e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  815 GNPRDLRVSDPTTSTMKLSWSGAP---GKVKQYLVTYTPVAGGETQEVTV-RGDTTNTVLQGLKEGTQYALSVTALYASG 890
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVtPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81

                 ...
gi 93141047  891 AGD 893
Cdd:cd00063   82 ESP 84
VWA_2 pfam13519
von Willebrand factor type A domain;
141-248 9.43e-12

von Willebrand factor type A domain;


Pssm-ID: 463909 [Multi-domain]  Cd Length: 103  Bit Score: 63.85  E-value: 9.43e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    141 LVFLVDGSWS-----VGRNNFKYILDFIAALVSAFDIgeekTRVGVVQYSSDTRTEFNLNQyyQRDELLAAIKKIPYKGG 215
Cdd:pfam13519    1 LVFVLDTSGSmrngdYGPTRLEAAKDAVLALLKSLPG----DRVGLVTFGDGPEVLIPLTK--DRAKILRALRRLEPKGG 74
                           90       100       110
                   ....*....|....*....|....*....|...
gi 93141047    216 NTMTGDAIDYLVKNTFTESAGArvgfPKVAIII 248
Cdd:pfam13519   75 GTNLAAALQLARAALKHRRKNQ----PRRIVLI 103
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1754-1832 1.14e-11

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 63.02  E-value: 1.14e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    1754 SGPRNLQVYNATSNSLTVKWD-PASGRVQKYRITYQPSTGEGNEQTTTI--GGRQNSVVLQKLKPDTPYTITVSSLYPDG 1830
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEpPPDDGITGYIVGYRVEYREEGSEWKEVnvTPSSTSYTLTGLKPGTEYEFRVRAVNGAG 81

                    ..
gi 93141047    1831 EG 1832
Cdd:smart00060   82 EG 83
vWA_ATR cd01474
ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, ...
1195-1379 1.52e-11

ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, the causative agent of anthrax. ATR is the cellular receptor for the anthrax protective antigen and facilitates entry of the toxin into cells. The VWA domain in ATR contains the toxin binding site and mediates interaction with protective antigen. The binding is mediated by divalent cations that binds to the MIDAS motif. These proteins are a family of vertebrate ECM receptors expressed by endothelial cells.


Pssm-ID: 238751 [Multi-domain]  Cd Length: 185  Bit Score: 65.61  E-value: 1.52e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1195 RAEADIVLLVDGSWSIGrANFRTVRSFISRIVEVFdIGPKrVQIALAQYSGDPRTEWQLNAHRDK--KSLLQAVANLPyk 1272
Cdd:cd01474    2 AGHFDLYFVLDKSGSVA-ANWIEIYDFVEQLVDRF-NSPG-LRFSFITFSTRATKILPLTDDSSAiiKGLEVLKKVTP-- 76
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1273 GGNTLTGMALNFIRQQNFRTQAGMRpRARKIGVLITDGKSQDDV----EAPSKKLKDEGVELFAIGIKNADEVELKMIAT 1348
Cdd:cd01474   77 SGQTYIHEGLENANEQIFNRNGGGR-ETVSVIIALTDGQLLLNGhkypEHEAKLSRKLGAIVYCVGVTDFLKSQLINIAD 155
                        170       180       190
                 ....*....|....*....|....*....|..
gi 93141047 1349 DPDdtHAYNVAD-FESLSRIVDDLTINLCNSV 1379
Cdd:cd01474  156 SKE--YVFPVTSgFQALSGIIESVVKKACIEI 185
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
2026-2104 2.04e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 62.51  E-value: 2.04e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2026 SGPRNLRVFGETTNSLSVAWDHAD---GPVQQYRIIYSPTVGDPIDEYTTVPGRRNNVILQPLQPDTPYKITVIAVYEDG 2102
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81

                 ..
gi 93141047 2103 DG 2104
Cdd:cd00063   82 ES 83
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
724-801 3.22e-11

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 61.48  E-value: 3.22e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     724 GAPRNLKVTDETTDSFKITWTQAP-----GRVLRYRIIYRPVaGGESREVTTPPNQRRRTLENLIPDTKYEVSVIPEYFS 798
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEPPPddgitGYIVGYRVEYREE-GSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 93141047     799 GPG 801
Cdd:smart00060   81 GEG 83
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1474-1563 3.35e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 62.13  E-value: 3.35e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1474 PVPVVSLNIYDVGPTTMHVQWQPV----GGATGYILSYKPVKDTEPTRPKEVRlgPTVNDMQLTDLVPNTEYAVTVQAVL 1549
Cdd:cd00063    1 PSPPTNLRVTDVTSTSVTLSWTPPeddgGPITGYVVEYREKGSGDWKEVEVTP--GSETSYTLTGLKPGTEYEFRVRAVN 78
                         90
                 ....*....|....
gi 93141047 1550 HDLTSEPVTVREVT 1563
Cdd:cd00063   79 GGGESPPSESVTVT 92
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1090-1176 3.65e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 61.74  E-value: 3.65e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1090 PRNLKTSDPTMSSFRVTWEPAP---GEVKGYKVTFHPTGDDRRLGELVVGPYDNTVVLEELRAGTTYKVNVFGMFDGGES 1166
Cdd:cd00063    4 PTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGES 83
                         90
                 ....*....|
gi 93141047 1167 SPLVGQEMTT 1176
Cdd:cd00063   84 PPSESVTVTT 93
fn3 pfam00041
Fibronectin type III domain;
1567-1645 4.44e-11

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 61.28  E-value: 4.44e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1567 PRPQDLKLRDVTHSTMNVFWEPVP---GKVRKYIVRYKTP--EEDVKEVEVDRSETSTSLKDLFSQTLYTVSVSAVHDEG 1641
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKnsGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ....
gi 93141047   1642 ESPP 1645
Cdd:pfam00041   81 EGPP 84
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1569-1643 6.50e-11

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 60.71  E-value: 6.50e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    1569 PQDLKLRDVTHSTMNVFWEPVP-----GKVRKYIVRYKTPEEDVKEVEVDRSETSTSLKDLFSQTLYTVSVSAVHDEGES 1643
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSWEPPPddgitGYIVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGEG 83
vWA_ATR cd01474
ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, ...
140-317 7.38e-11

ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, the causative agent of anthrax. ATR is the cellular receptor for the anthrax protective antigen and facilitates entry of the toxin into cells. The VWA domain in ATR contains the toxin binding site and mediates interaction with protective antigen. The binding is mediated by divalent cations that binds to the MIDAS motif. These proteins are a family of vertebrate ECM receptors expressed by endothelial cells.


Pssm-ID: 238751 [Multi-domain]  Cd Length: 185  Bit Score: 63.68  E-value: 7.38e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  140 DLVFLVDGSWSVGRNNFKyILDFIAALVSAFDigEEKTRVGVVQYSSDTRTEFNLNQYY-QRDELLAAIKKIpYKGGNTM 218
Cdd:cd01474    6 DLYFVLDKSGSVAANWIE-IYDFVEQLVDRFN--SPGLRFSFITFSTRATKILPLTDDSsAIIKGLEVLKKV-TPSGQTY 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  219 TGDAIDYLVKNTFTESAGARVgFPKVAIIITDGKSQDEV----EIPARELRNVGVEVFSLGIKAADAKELKQIASTPslN 294
Cdd:cd01474   82 IHEGLENANEQIFNRNGGGRE-TVSVIIALTDGQLLLNGhkypEHEAKLSRKLGAIVYCVGVTDFLKSQLINIADSK--E 158
                        170       180
                 ....*....|....*....|....
gi 93141047  295 HVFNV-ANFDAIVDIQNEIISQVC 317
Cdd:cd01474  159 YVFPVtSGFQALSGIIESVVKKAC 182
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
335-420 1.05e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 60.59  E-value: 1.05e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  335 EPPSNLIAMEVSSKYVKLNWNPSP---SPVTGYKVILTPMTAGSRQHALSVGPQTTTLSVRDLSADTEYQISVSAMKGMT 411
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81

                 ....*....
gi 93141047  412 SSEPISIME 420
Cdd:cd00063   82 ESPPSESVT 90
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1002-1075 1.08e-10

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 60.59  E-value: 1.08e-10
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 93141047 1002 LKVDEETENTMRVTWKPAP---GKVVNYRVVYRPHGRGK-QMVAKVPPTVTSTVLKRLQPQTTYDITVLPIYKMGEGK 1075
Cdd:cd00063    7 LRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDwKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGESP 84
fn3 pfam00041
Fibronectin type III domain;
26-105 1.13e-10

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 60.12  E-value: 1.13e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     26 DPPSDLNFKIIDENTVHMSWAKPVD---PIVGYRITVDPTTDG-PTKEFTLSASTTETLLSELVPETEYVVTITSYDEVE 101
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPDgngPITGYEVEYRPKNSGePWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ....
gi 93141047    102 ESVP 105
Cdd:pfam00041   81 EGPP 84
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1386-1637 1.30e-10

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 66.95  E-value: 1.30e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1386 EAPSNLVISERTHRSFRVSWTPPSDS-VDRYKVEYYPVSGGKRQEFYVSRmETSTVLKDLKPETEYVVNVYSV----VED 1460
Cdd:COG3401  234 SAPTGLTATADTPGSVTLSWDPVTESdATGYRVYRSNSGDGPFTKVATVT-TTSYTDTGLTNGTTYYYRVTAVdaagNES 312
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1461 EYSEPLKGT-EKTLPVPVVSLNIYDVGPTTMHVQWQPV--GGATGYILSYKPVKDTEPTRPKEVRLGPTVNDmqlTDLVP 1537
Cdd:COG3401  313 APSNVVSVTtDLTPPAAPSGLTATAVGSSSITLSWTASsdADVTGYNVYRSTSGGGTYTKIAETVTTTSYTD---TGLTP 389
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1538 NTEYAVTVQAVLHDLT----SEPVTVREVTLP----LPRPQDLKLRDVTHSTMNVFWEPVPGKVRKYIVRYKTPEEDVKE 1609
Cdd:COG3401  390 GTTYYYKVTAVDAAGNesapSEEVSATTASAAsgesLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTGNAVP 469
                        250       260
                 ....*....|....*....|....*...
gi 93141047 1610 VEVDRSETSTSLKDLFSQTLYTVSVSAV 1637
Cdd:COG3401  470 FTTTSSTVTATTTDTTTANLSVTTGSLV 497
vWA_complement_factors cd01470
Complement factors B and C2 are two critical proteases for complement activation. They both ...
440-611 2.69e-10

Complement factors B and C2 are two critical proteases for complement activation. They both contain three CCP or Sushi domains, a trypsin-type serine protease domain and a single VWA domain with a conserved metal ion dependent adhesion site referred commonly as the MIDAS motif. Orthologues of these molecules are found from echinoderms to chordates. During complement activation, the CCP domains are cleaved off, resulting in the formation of an active protease that cleaves and activates complement C3. Complement C2 is in the classical pathway and complement B is in the alternative pathway. The interaction of C2 with C4 and of factor B with C3b are both dependent on Mg2+ binding sites within the VWA domains and the VWA domain of factor B has been shown to mediate the binding of C3. This is consistent with the common inferred function of VWA domains as magnesium-dependent protein interaction domains.


Pssm-ID: 238747 [Multi-domain]  Cd Length: 198  Bit Score: 62.31  E-value: 2.69e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  440 DIVFLVDGSYSIGIANFVKVRAFLEVLVK---SFEISPNrvqISLVQYSRDPHTEFTLKKF--TKVEDIIEAINTFPYR- 513
Cdd:cd01470    2 NIYIALDASDSIGEEDFDEAKNAIKTLIEkisSYEVSPR---YEIISYASDPKEIVSIRDFnsNDADDVIKRLEDFNYDd 78
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  514 ---GGSTNTGKAMTYVREKIFVPSKGSRSNVPK---VMILITDGKSS------------DAFRDPAIKLRNS-----DVE 570
Cdd:cd01470   79 hgdKTGTNTAAALKKVYERMALEKVRNKEAFNEtrhVIILFTDGKSNmggsplptvdkiKNLVYKNNKSDNPredylDVY 158
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|..
gi 93141047  571 IFAVGvKDAVRSELEAIAS-PPAETHVFTVEDFDAFQRiSFE 611
Cdd:cd01470  159 VFGVG-DDVNKEELNDLASkKDNERHFFKLKDYEDLQE-VFD 198
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1387-1457 3.19e-10

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 58.78  E-value: 3.19e-10
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 93141047    1387 APSNLVISERTHRSFRVSWTPP-SDSVDRYKVEY---YPVSGGKRQEFYVSRMETSTVLKDLKPETEYVVNVYSV 1457
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEPPpDDGITGYIVGYrveYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAV 77
fn3 pfam00041
Fibronectin type III domain;
1847-1925 4.89e-10

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 58.20  E-value: 4.89e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1847 VRNLRVYDPSTSTLNVRWDHAE---GNPRQYKLFYAPAAGGPEEL-VPIPGNTNYAILRNLQPDTSYTVTVVPVYTEGDG 1922
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWNeITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82

                   ...
gi 93141047   1923 GRT 1925
Cdd:pfam00041   83 PPS 85
ViaA COG2425
Uncharacterized conserved protein, contains a von Willebrand factor type A (vWA) domain ...
440-589 7.90e-10

Uncharacterized conserved protein, contains a von Willebrand factor type A (vWA) domain [Function unknown];


Pssm-ID: 441973 [Multi-domain]  Cd Length: 263  Bit Score: 62.39  E-value: 7.90e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  440 DIVFLVDGSYSIGIANFVKVRAFLEVLVKSFeiSPNRvQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYRGGsTNT 519
Cdd:COG2425  120 PVVLCVDTSGSMAGSKEAAAKAAALALLRAL--RPNR-RFGVILFDTEVVEDLPLTADDGLEDAIEFLSGLFAGGG-TDI 195
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 93141047  520 GKAMTYVREKIFVPSKGSRsnvpkVMILITDGK---SSDAFRDpAIKLRNSDVEIFAVGVKDAVRSELEAIAS 589
Cdd:COG2425  196 APALRAALELLEEPDYRNA-----DIVLITDGEagvSPEELLR-EVRAKESGVRLFTVAIGDAGNPGLLEALA 262
fn3 pfam00041
Fibronectin type III domain;
1476-1556 9.98e-10

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 57.42  E-value: 9.98e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1476 PVVSLNIYDVGPTTMHVQWQPV----GGATGYILSYKPVKDTEPtrPKEVRLGPTVNDMQLTDLVPNTEYAVTVQAVLHD 1551
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPPpdgnGPITGYEVEYRPKNSGEP--WNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGG 79

                   ....*
gi 93141047   1552 LTSEP 1556
Cdd:pfam00041   80 GEGPP 84
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
26-106 1.03e-09

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 57.89  E-value: 1.03e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   26 DPPSDLNFKIIDENTVHMSWAKPVD---PIVGYRITVDPTTDGPTKEF-TLSASTTETLLSELVPETEYVVTITSYDEVE 101
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTPPEDdggPITGYVVEYREKGSGDWKEVeVTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81

                 ....*
gi 93141047  102 ESVPV 106
Cdd:cd00063   82 ESPPS 86
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
815-892 1.38e-09

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 56.85  E-value: 1.38e-09
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     815 GNPRDLRVSDPTTSTMKLSWS-GAPGKVKQYLVTYTPV---AGGETQEVTVRGDTTNTVLQGLKEGTQYALSVTALYASG 890
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEpPPDDGITGYIVGYRVEyreEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAG 81

                    ..
gi 93141047     891 AG 892
Cdd:smart00060   82 EG 83
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
2322-2492 1.99e-09

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 61.11  E-value: 1.99e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2322 ADIVFLTDASWSIGDDNFNKVVKfifNTVGGFDEISPAGIQVSFVQYSDEVksEFKLNTYNDKALALGALQNIRYRGGnT 2401
Cdd:COG1240   93 RDVVLVVDASGSMAAENRLEAAK---GALLDFLDDYRPRDRVGLVAFGGEA--EVLLPLTRDREALKRALDELPPGGG-T 166
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2402 RTGKALtfikEKVLTWESGMRKNVPKVLVVVTDGR---SQDEVKKAALVIQQSGFSVFVVGVAD--VDYNELANIASKpS 2476
Cdd:COG1240  167 PLGDAL----ALALELLKRADPARRKVIVLLTDGRdnaGRIDPLEAAELAAAAGIRIYTIGVGTeaVDEGLLREIAEA-T 241
                        170
                 ....*....|....*.
gi 93141047 2477 ERHVFIVDDFESFEKI 2492
Cdd:COG1240  242 GGRYFRADDLSELAAI 257
fn3 pfam00041
Fibronectin type III domain;
1656-1728 2.31e-09

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 56.27  E-value: 2.31e-09
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 93141047   1656 PAPTNLKITEVTSEGFRGTW---DHGASDVSLYRITWAPFGSSDKM-ETILNGDENTLVFENLNPNTIYEVSITAIY 1728
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWtppPDGNGPITGYEVEYRPKNSGEPWnEITVPGTTTSVTLTGLKPGTEYEVRVQAVN 77
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1002-1074 2.55e-09

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 56.08  E-value: 2.55e-09
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 93141047    1002 LKVDEETENTMRVTWKPAP-----GKVVNYRVVYRPHGRGKQMVaKVPPTVTSTVLKRLQPQTTYDITVLPIYKMGEG 1074
Cdd:smart00060    7 LRVTDVTSTSVTLSWEPPPddgitGYIVGYRVEYREEGSEWKEV-NVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGEG 83
TerY COG4245
Uncharacterized conserved protein YegL, contains vWA domain of TerY type [Function unknown];
440-616 3.61e-09

Uncharacterized conserved protein YegL, contains vWA domain of TerY type [Function unknown];


Pssm-ID: 443387 [Multi-domain]  Cd Length: 196  Bit Score: 59.17  E-value: 3.61e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  440 DIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEISPN---RVQISLVQYSRDPHTeftLKKFTKVEDIieAINTFPYRGGs 516
Cdd:COG4245    7 PVYLLLDTSGSMSGEPIEALNEGLQALIDELRQDPYaleTVEVSVITFDGEAKV---LLPLTDLEDF--QPPDLSASGG- 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  517 TNTGKAMTYVREKI-----FVPSKGSRSNVPkVMILITDGKSSD-AFRDPAIKLRNSD----VEIFAVGV-KDAVRSELE 585
Cdd:COG4245   81 TPLGAALELLLDLIerrvqKYTAEGKGDWRP-VVFLITDGEPTDsDWEAALQRLKDGEaakkANIFAIGVgPDADTEVLK 159
                        170       180       190
                 ....*....|....*....|....*....|..
gi 93141047  586 AIASPpaeTHVFTVEDFDAFQRIsFE-LTQSI 616
Cdd:COG4245  160 QLTDP---VRALDALDGLDFREF-FKwLSASV 187
fn3 pfam00041
Fibronectin type III domain;
1089-1168 6.02e-09

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 55.11  E-value: 6.02e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1089 SPRNLKTSDPTMSSFRVTWEPAP---GEVKGYKVTFHPTGDDRRLGELVVGPYDNTVVLEELRAGTTYKVNVFGMFDGGE 1165
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGE 81

                   ...
gi 93141047   1166 SSP 1168
Cdd:pfam00041   82 GPP 84
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
2026-2104 6.85e-09

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 54.93  E-value: 6.85e-09
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    2026 SGPRNLRVFGETTNSLSVAWDHADGPVQQ-YRIIYSPTVGDPIDEYTTV--PGRRNNVILQPLQPDTPYKITVIAVYEDG 2102
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEPPPDDGITgYIVGYRVEYREEGSEWKEVnvTPSSTSYTLTGLKPGTEYEFRVRAVNGAG 81

                    ..
gi 93141047    2103 DG 2104
Cdd:smart00060   82 EG 83
PTZ00441 PTZ00441
sporozoite surface protein 2 (SSP2); Provisional
1197-1402 1.47e-08

sporozoite surface protein 2 (SSP2); Provisional


Pssm-ID: 240420 [Multi-domain]  Cd Length: 576  Bit Score: 60.36  E-value: 1.47e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  1197 EADIVLLVDGSWSIGRANFRT-VRSFISRIVEVFDIGPKRVQIALAQYSGDPRTEWQL--NAHRDKKSLLQAV-----AN 1268
Cdd:PTZ00441   42 EVDLYLLVDGSGSIGYHNWIThVIPMLMGLIQQLNLSDDAINLYMSLFSNNTTELIRLgsGASKDKEQALIIVkslrkTY 121
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  1269 LPYkgGNTLTGMALNFIRQQ-NFRTQagmRPRARKIGVLITDG--KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELK- 1344
Cdd:PTZ00441  122 LPY--GKTNMTDALLEVRKHlNDRVN---RENAIQLVILMTDGipNSKYRALEESRKLKDRNVKLAVIGIGQGINHQFNr 196
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 93141047  1345 -MIATDPDDTHA--YNVADFESLSRIVDDLTINLCNSVK-----GPGDLEAPSNLVISERTHRSFR 1402
Cdd:PTZ00441  197 lLAGCRPREGKCkfYSDADWEEAKNLIKPFIAKVCTEVErtascGPWDEWTPCSVTCGKGTHSRSR 262
fn3 pfam00041
Fibronectin type III domain;
906-986 1.48e-08

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 53.96  E-value: 1.48e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    906 GSPQDLVTKDITDTSIGAYWTSAP---GMVRGYRVSWKSLYDDVDTGEKNLPEDAIHTMIENLQPETKYRISVFATYSSG 982
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ....
gi 93141047    983 EGEP 986
Cdd:pfam00041   81 EGPP 84
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2764-2897 2.07e-08

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 59.53  E-value: 2.07e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  2764 RGERGISGAIGPPGPRGDIGPPGPQGPPGPQGPNGLSIP-GEQGRQGMKGDAGEPGLPGRTGTPGLPGPPGPMGPPGDRG 2842
Cdd:NF038329  137 RGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPaGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDG 216
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 93141047  2843 FTGKDGAMGPRGPPGPPGspgspgvTGPSGKPGKPGDHGRPGPSGLKGEKGDRGD 2897
Cdd:NF038329  217 EAGPAGEDGPAGPAGDGQ-------QGPDGDPGPTGEDGPQGPDGPAGKDGPRGD 264
VWA_2 pfam13519
von Willebrand factor type A domain;
1200-1307 2.29e-08

von Willebrand factor type A domain;


Pssm-ID: 463909 [Multi-domain]  Cd Length: 103  Bit Score: 54.22  E-value: 2.29e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   1200 IVLLVDGSWSI-----GRANFRTVRSFISRIVEVFdigpKRVQIALAQYSGDPRTEWQLNahRDKKSLLQAVANLPYKGG 1274
Cdd:pfam13519    1 LVFVLDTSGSMrngdyGPTRLEAAKDAVLALLKSL----PGDRVGLVTFGDGPEVLIPLT--KDRAKILRALRRLEPKGG 74
                           90       100       110
                   ....*....|....*....|....*....|...
gi 93141047   1275 NTLTGMALNFIRQqnfrTQAGMRPRARKIGVLI 1307
Cdd:pfam13519   75 GTNLAAALQLARA----ALKHRRKNQPRRIVLI 103
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1847-1922 2.61e-08

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 53.65  E-value: 2.61e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1847 VRNLRVYDPSTSTLNVRWDHAE---GNPRQYKLFYAPA-AGGPEELVPIPGNTNYAILRNLQPDTSYTVTVVPVYTEGDG 1922
Cdd:cd00063    4 PTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKgSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGES 83
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1938-2013 2.74e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 53.39  E-value: 2.74e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    1938 RNVQVYNPTPNSLDVRWDPAP-----GPVLQYRVVYSPVDGTrpSESIVVPGNTRMVHLERLIPDTLYSVNLVALYSDGE 2012
Cdd:smart00060    5 SNLRVTDVTSTSVTLSWEPPPddgitGYIVGYRVEYREEGSE--WKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82

                    .
gi 93141047    2013 G 2013
Cdd:smart00060   83 G 83
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
906-988 3.24e-08

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 53.27  E-value: 3.24e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  906 GSPQDLVTKDITDTSIGAYWTSAP---GMVRGYRVSWKSLYDDVDTGEKNLPEDAIHTMIENLQPETKYRISVFATYSSG 982
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81

                 ....*.
gi 93141047  983 EGEPLT 988
Cdd:cd00063   82 ESPPSE 87
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
906-984 3.27e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 53.00  E-value: 3.27e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     906 GSPQDLVTKDITDTSIGAYWT-SAPGMVRGYRVSWKSLYDDVDTGEKNLPEDAIHT--MIENLQPETKYRISVFATYSSG 982
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEpPPDDGITGYIVGYRVEYREEGSEWKEVNVTPSSTsyTLTGLKPGTEYEFRVRAVNGAG 81

                    ..
gi 93141047     983 EG 984
Cdd:smart00060   82 EG 83
fn3 pfam00041
Fibronectin type III domain;
1002-1075 5.01e-08

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 52.80  E-value: 5.01e-08
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 93141047   1002 LKVDEETENTMRVTWKPAP---GKVVNYRVVYRPHGRGKQMVAK-VPPTVTSTVLKRLQPQTTYDITVLPIYKMGEGK 1075
Cdd:pfam00041    6 LTVTDVTSTSLTVSWTPPPdgnGPITGYEVEYRPKNSGEPWNEItVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEGP 83
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2764-2896 5.01e-08

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 58.38  E-value: 5.01e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  2764 RGERGISG---AIGPPGPRGDIGPPGPQGPPGPQGPNGLsiPGEQGRQGMKGDAGEPGLPGRTGTPGLPGPPGPMGPPGD 2840
Cdd:NF038329  119 KGEPGPAGpagPAGEQGPRGDRGETGPAGPAGPPGPQGE--RGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 93141047  2841 RGFTGKDGAMGPRGPPGPPGSPGSPGVTGPSGKPGKP--GDHGRPGPSGLKGEKGDRG 2896
Cdd:NF038329  197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGqqGPDGDPGPTGEDGPQGPDG 254
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
26-103 7.50e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 52.23  E-value: 7.50e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047      26 DPPSDLNFKIIDENTVHMSWAKPVDP-----IVGYRItVDPTTDGPTKEFTLSASTTETLLSELVPETEYVVTITSYDEV 100
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEPPPDDgitgyIVGYRV-EYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 93141047     101 EES 103
Cdd:smart00060   81 GEG 83
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
634-939 1.08e-07

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 57.70  E-value: 1.08e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  634 PPKDLSFSEVTSYGFKTNWSPAGE-NVFSYHItYKEAAGDDEVTVVEPASSTSVVLSSLKPETLYLVNVTAEYEDGFSIP 712
Cdd:COG3401  235 APTGLTATADTPGSVTLSWDPVTEsDATGYRV-YRSNSGDGPFTKVATVTTTSYTDTGLTNGTTYYYRVTAVDAAGNESA 313
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  713 LAGEETTEEVK---GAPRNLKVTDETTDSFKITWTQAPGRVLRYRIIYR-PVAGGESREVTTPPNQRRRTLENLIPDTKY 788
Cdd:COG3401  314 PSNVVSVTTDLtppAAPSGLTATAVGSSSITLSWTASSDADVTGYNVYRsTSGGGTYTKIAETVTTTSYTDTGLTPGTTY 393
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  789 EVSVIPEYFSGPGTPLTGN--AATEEVRGNPRDLRVSDPTTSTMKLSWSGAPGKVKQYLVTYTPVAGGETQEVTVRGDTT 866
Cdd:COG3401  394 YYKVTAVDAAGNESAPSEEvsATTASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTGNAVPFTTT 473
                        250       260       270       280       290       300       310
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 93141047  867 NTVLQGLKEGTQYALSVTALYASGAGDALFGEGTTLEERG--SPQDLVTKDITDTSIGAYWTSAPGMVRGYRVSW 939
Cdd:COG3401  474 SSTVTATTTDTTTANLSVTTGSLVGGSGASSVTNSVSVIGasAAAAVGGAPDGTPNVTGASPVTVGASTGDVLIT 548
vWA_CTRP cd01473
CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an ...
1199-1377 1.30e-07

CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an important phenomenon in parasite invasion and in malaria associated pathology.CTRP encodes a protein containing a putative signal sequence followed by a long extracellular region of 1990 amino acids, a transmembrane domain, and a short cytoplasmic segment. The extracellular region of CTRP contains two separated adhesive domains. The first domain contains six 210-amino acid-long homologous VWA domain repeats. The second domain contains seven repeats of 87-60 amino acids in length, which share similarities with the thrombospondin type 1 domain found in a variety of adhesive molecules. Finally, CTRP also contains consensus motifs found in the superfamily of haematopoietin receptors. The VWA domains in these proteins likely mediate protein-protein interactions.


Pssm-ID: 238750 [Multi-domain]  Cd Length: 192  Bit Score: 54.25  E-value: 1.30e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1199 DIVLLVDGSWSIGRANFRT-VRSFISRIVEVFDIGPKRVQIALAQYSGDPRT--EWQLNAHRDKKSLLQAVANLP--YK- 1272
Cdd:cd01473    2 DLTLILDESASIGYSNWRKdVIPFTEKIINNLNISKDKVHVGILLFAEKNRDvvPFSDEERYDKNELLKKINDLKnsYRs 81
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1273 GGNTLTGMALNFIRQQNFRtQAGMRPRARKIGVLITDG----KSQDDVEAPSKKLKDEGVELFAIGIKNADEVELKMIAt 1348
Cdd:cd01473   82 GGETYIVEALKYGLKNYTK-HGNRRKDAPKVTMLFTDGndtsASKKELQDISLLYKEENVKLLVVGVGAASENKLKLLA- 159
                        170       180       190
                 ....*....|....*....|....*....|....*.
gi 93141047 1349 dpdDTHAYNV-------ADFESLSRIVDDLTINLCN 1377
Cdd:cd01473  160 ---GCDINNDncpnvikTEWNNLNGISKFLTDKICD 192
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
336-413 1.50e-07

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 51.08  E-value: 1.50e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     336 PPSNLIAMEVSSKYVKLNWNPSPSPV-TGYKV--ILTPMTAGSRQHALSVGPQTTTLSVRDLSADTEYQISVSAMKGMTS 412
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEPPPDDGiTGYIVgyRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82

                    .
gi 93141047     413 S 413
Cdd:smart00060   83 G 83
vWA_complement_factors cd01470
Complement factors B and C2 are two critical proteases for complement activation. They both ...
2323-2494 1.72e-07

Complement factors B and C2 are two critical proteases for complement activation. They both contain three CCP or Sushi domains, a trypsin-type serine protease domain and a single VWA domain with a conserved metal ion dependent adhesion site referred commonly as the MIDAS motif. Orthologues of these molecules are found from echinoderms to chordates. During complement activation, the CCP domains are cleaved off, resulting in the formation of an active protease that cleaves and activates complement C3. Complement C2 is in the classical pathway and complement B is in the alternative pathway. The interaction of C2 with C4 and of factor B with C3b are both dependent on Mg2+ binding sites within the VWA domains and the VWA domain of factor B has been shown to mediate the binding of C3. This is consistent with the common inferred function of VWA domains as magnesium-dependent protein interaction domains.


Pssm-ID: 238747 [Multi-domain]  Cd Length: 198  Bit Score: 54.22  E-value: 1.72e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2323 DIVFLTDASWSIGDDNFNKVVKFIFNTVggfDEISPAGIQVSF--VQYSDEVKSEFKLNTY--NDKALALGALQNIRYRG 2398
Cdd:cd01470    2 NIYIALDASDSIGEEDFDEAKNAIKTLI---EKISSYEVSPRYeiISYASDPKEIVSIRDFnsNDADDVIKRLEDFNYDD 78
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2399 GNTRTGkalTFIKEKVLTWESGMR--KNVPK--------VLVVVTDGRSQ---------DEVKkAALVIQQSGFS----- 2454
Cdd:cd01470   79 HGDKTG---TNTAAALKKVYERMAleKVRNKeafnetrhVIILFTDGKSNmggsplptvDKIK-NLVYKNNKSDNpredy 154
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....
gi 93141047 2455 --VFVVGV-ADVDYNELANIAS-KPSERHVFIVDDFESFEKIED 2494
Cdd:cd01470  155 ldVYVFGVgDDVNKEELNDLASkKDNERHFFKLKDYEDLQEVFD 198
ViaA COG2425
Uncharacterized conserved protein, contains a von Willebrand factor type A (vWA) domain ...
1195-1347 1.83e-07

Uncharacterized conserved protein, contains a von Willebrand factor type A (vWA) domain [Function unknown];


Pssm-ID: 441973 [Multi-domain]  Cd Length: 263  Bit Score: 55.07  E-value: 1.83e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1195 RAEADIVLLVDGSWSIGRANFRTVRSFISRIVEVFdigPKRVQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGG 1274
Cdd:COG2425  116 LLEGPVVLCVDTSGSMAGSKEAAAKAAALALLRAL---RPNRRFGVILFDTEVVEDLPLTADDGLEDAIEFLSGLFAGGG 192
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 93141047 1275 NTLTG---MALNFIRQQNFRtqagmrpraRKIGVLITDGKSQDDVEAPSKKL--KDEGVELFAIGIKNADEVEL-KMIA 1347
Cdd:COG2425  193 TDIAPalrAALELLEEPDYR---------NADIVLITDGEAGVSPEELLREVraKESGVRLFTVAIGDAGNPGLlEALA 262
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1847-1922 2.83e-07

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 50.31  E-value: 2.83e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    1847 VRNLRVYDPSTSTLNVRWDHAEGNPR-----QYKLFYAPAaGGPEELVPIPGNTNYAILRNLQPDTSYTVTVVPVYTEGD 1921
Cdd:smart00060    4 PSNLRVTDVTSTSVTLSWEPPPDDGItgyivGYRVEYREE-GSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82

                    .
gi 93141047    1922 G 1922
Cdd:smart00060   83 G 83
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1089-1166 2.92e-07

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 50.31  E-value: 2.92e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    1089 SPRNLKTSDPTMSSFRVTWE-PAPGEVKGYKVTFHPTGDDRRLGELVV--GPYDNTVVLEELRAGTTYKVNVFGMFDGGE 1165
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEpPPDDGITGYIVGYRVEYREEGSEWKEVnvTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82

                    .
gi 93141047    1166 S 1166
Cdd:smart00060   83 G 83
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2764-2896 4.87e-07

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 55.30  E-value: 4.87e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  2764 RGERGISGAIGPPGPRGDIGPPGPQGPPGPQGPNGLsiPGEQGRQGMKGDAGEPGLPGRTGTPGLPGPPGpmgppgdrgf 2843
Cdd:NF038329  176 AGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGP--AGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGP---------- 243
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 93141047  2844 TGKDGAMGPRGPPGPPGSPGSPGVTGPSGKPGKPGDHGRPGPSGLKGEKGDRG 2896
Cdd:NF038329  244 TGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDG 296
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
1656-1728 1.21e-06

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 49.03  E-value: 1.21e-06
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 93141047 1656 PAPTNLKITEVTSEGFRGTWD---HGASDVSLYRITWAPFGSSDKME-TILNGDENTLVFENLNPNTIYEVSITAIY 1728
Cdd:cd00063    2 SPPTNLRVTDVTSTSVTLSWTppeDDGGPITGYVVEYREKGSGDWKEvEVTPGSETSYTLTGLKPGTEYEFRVRAVN 78
fn3 pfam00041
Fibronectin type III domain;
2207-2282 1.43e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 48.57  E-value: 1.43e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   2207 VTDLKTYQIGWDTFCVKWSP----HRAATSYRLKLSPADGTRG-QEITVRGSETSHCFTGLSPDTDYGVTVFVQTPNLEG 2281
Cdd:pfam00041    3 PSNLTVTDVTSTSLTVSWTPppdgNGPITGYEVEYRPKNSGEPwNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82

                   .
gi 93141047   2282 P 2282
Cdd:pfam00041   83 P 83
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
1717-2233 1.43e-06

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 53.85  E-value: 1.43e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1717 NTIYEVSITAIYPDESESDDLIGSERTLPILTTQAPKSGPRNLQVYNATSNSLTVKWDPASGR-VQKYRITYQPSTGEGN 1795
Cdd:COG3401   15 ASAAANTAVNALSKAGGSGKTILVYLAVVLSVTTKESPGTLLVAAGLSSGGGLGTGGRAGTTSgVAAVAVAAAPPTATGL 94
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1796 EQTTTIGGRQNSVVLQKLKPDTPYTITVSSLYPDGEGGRMTGRGKTKPLNTVRNLRVYDPSTSTLNVRWDHAEGNPRQYK 1875
Cdd:COG3401   95 TTLTGSGSVGGATNTGLTSSDEVPSPAVGTATTATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDT 174
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1876 LFYAPAAGGPeelVPIPGNTNYAILRNLQPDTSYTVTVVPVYTEGDGGRTSDTGRTLMRGLAR---NVQVYNPTPNSLDV 1952
Cdd:COG3401  175 SATAAVATTS---LTVTSTTLVDGGGDIEPGTTYYYRVAATDTGGESAPSNEVSVTTPTTPPSaptGLTATADTPGSVTL 251
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1953 RWDPAPGPVLQ-YRVVYSPVDGTrPSESIVVPGNTRMVHLErLIPDTLYSVNLVALYSDGEG----NPSPAQGRTLPRSG 2027
Cdd:COG3401  252 SWDPVTESDATgYRVYRSNSGDG-PFTKVATVTTTSYTDTG-LTNGTTYYYRVTAVDAAGNEsapsNVVSVTTDLTPPAA 329
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2028 PRNLRVFGETTNSLSVAWDHADGP-VQQYRIIYSPTVGDPIDEYTTVPgRRNNVILQPLQPDTPYKITVIAVYEDGDGGH 2106
Cdd:COG3401  330 PSGLTATAVGSSSITLSWTASSDAdVTGYNVYRSTSGGGTYTKIAETV-TTTSYTDTGLTPGTTYYYKVTAVDAAGNESA 408
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2107 LTGNGRTVGLLPPQNIHISDEWYTRFRVSWDPSPSPVL--------------GYKIVYKPVGSNEPMEAFVGEMTSYTL- 2171
Cdd:COG3401  409 PSEEVSATTASAASGESLTASVDAVPLTDVAGATAAASaasnpgvsaavladGGDTGNAVPFTTTSSTVTATTTDTTTAn 488
                        490       500       510       520       530       540
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 93141047 2172 ------HNLNPSTTYDVNVYAQYDSGLSVPLTDQGTTLYLNVTDLKTYQIGWDTFCVKWSPHRAATSY 2233
Cdd:COG3401  489 lsvttgSLVGGSGASSVTNSVSVIGASAAAAVGGAPDGTPNVTGASPVTVGASTGDVLITDLVSLTTS 556
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2765-2896 2.96e-06

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 52.60  E-value: 2.96e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  2765 GERGISGAIGPPGPRGDigppgpqgppgpqgpnglsiPGEQGRQGMKGDAGEPGLPGRTGTPGLpgppgpmgppgdrgfT 2844
Cdd:NF038329  117 GEKGEPGPAGPAGPAGE--------------------QGPRGDRGETGPAGPAGPPGPQGERGE---------------K 161
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|..
gi 93141047  2845 GKDGAMGPRGPPGPPGSPGSPGVTGPSGKPGKPGDHGRPGPSGLKGEKGDRG 2896
Cdd:NF038329  162 GPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAG 213
VWA_2 pfam13519
von Willebrand factor type A domain;
2324-2432 4.53e-06

von Willebrand factor type A domain;


Pssm-ID: 463909 [Multi-domain]  Cd Length: 103  Bit Score: 47.67  E-value: 4.53e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   2324 IVFLTDASWSIGDD-----NFNKVVKFIFNTVGGFdeispAGIQVSFVQYSDEVKSEFKLNTynDKALALGALQNIRYRG 2398
Cdd:pfam13519    1 LVFVLDTSGSMRNGdygptRLEAAKDAVLALLKSL-----PGDRVGLVTFGDGPEVLIPLTK--DRAKILRALRRLEPKG 73
                           90       100       110
                   ....*....|....*....|....*....|....
gi 93141047   2399 GNTRTGKALtfikEKVLTWESGMRKNVPKVLVVV 2432
Cdd:pfam13519   74 GGTNLAAAL----QLARAALKHRRKNQPRRIVLI 103
YfbK COG2304
Secreted protein containing bacterial Ig-like domain and vWFA domain [General function ...
140-288 7.09e-06

Secreted protein containing bacterial Ig-like domain and vWFA domain [General function prediction only];


Pssm-ID: 441879 [Multi-domain]  Cd Length: 289  Bit Score: 50.49  E-value: 7.09e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  140 DLVFLVDGSWSVGRNNFKYILDFIAALVSAFDigeEKTRVGVVQYSSDTRTEFNLNQYYQRDELLAAIKKIpYKGGNTMT 219
Cdd:COG2304   93 NLVFVIDVSGSMSGDKLELAKEAAKLLVDQLR---PGDRVSIVTFAGDARVLLPPTPATDRAKILAAIDRL-QAGGGTAL 168
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 93141047  220 GDAIDY---LVKNTFTESAGARVgfpkvaIIITDGKSQDEVEIP------ARELRNVGVEVFSLGI-KAADAKELKQIA 288
Cdd:COG2304  169 GAGLELayeLARKHFIPGRVNRV------ILLTDGDANVGITDPeellklAEEAREEGITLTTLGVgSDYNEDLLERLA 241
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
784-1189 8.14e-06

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 51.54  E-value: 8.14e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  784 PDTKYEVSVIPEYFSGPGTPLTGNAATEEVRGNPRDLRVSDPTTSTMKLSWSGAPGKVKQYLVTYTPVAGGetqevtvrG 863
Cdd:COG3401   33 GKTILVYLAVVLSVTTKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSV--------G 104
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  864 DTTNTVLQGLKEGTQYALSVTALYASGAGDALFGEGTTLEERGSPQDLVTKDITDTSIGAYWTSAPGMVRGYRVSWKSLY 943
Cdd:COG3401  105 GATNTGLTSSDEVPSPAVGTATTATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTS 184
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  944 DDVDTGEKNLPEDAIhtmienlQPETKYRISVFATYSSGEGEPLTG-DATTELSQDSK--TLKVDEETENTMRVTWKPAP 1020
Cdd:COG3401  185 LTVTSTTLVDGGGDI-------EPGTTYYYRVAATDTGGESAPSNEvSVTTPTTPPSAptGLTATADTPGSVTLSWDPVT 257
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1021 GK-VVNYRVVYRPHGRGK-QMVAKVppTVTSTVLKRLQPQTTYDITVLPIYKMG-EGKLRQGSGTTASRFK--SPRNLKT 1095
Cdd:COG3401  258 ESdATGYRVYRSNSGDGPfTKVATV--TTTSYTDTGLTNGTTYYYRVTAVDAAGnESAPSNVVSVTTDLTPpaAPSGLTA 335
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1096 SDPTMSSFRVTWEPAPGE-VKGYKV--TFHPTGDDRRLGELVVGpydNTVVLEELRAGTTYKVNVFGMFDGGESSPLVGQ 1172
Cdd:COG3401  336 TAVGSSSITLSWTASSDAdVTGYNVyrSTSGGGTYTKIAETVTT---TSYTDTGLTPGTTYYYKVTAVDAAGNESAPSEE 412
                        410
                 ....*....|....*..
gi 93141047 1173 EMTTLSDTTVMPILSSG 1189
Cdd:COG3401  413 VSATTASAASGESLTAS 429
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
1656-1728 1.84e-05

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 45.30  E-value: 1.84e-05
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 93141047    1656 PAPTNLKITEVTSEGFRGTWDHGASDVSL-----YRITWAPfGSSDKMETILNGDENTLVFENLNPNTIYEVSITAIY 1728
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEPPPDDGITgyivgYRVEYRE-EGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVN 78
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
2869-2897 3.38e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.64  E-value: 3.38e-05
                           10        20
                   ....*....|....*....|....*....
gi 93141047   2869 GPSGKPGKPGDHGRPGPSGLKGEKGDRGD 2897
Cdd:pfam01391    4 GPPGPPGPPGPPGPPGPPGPPGPPGPPGE 32
vWA_CTRP cd01473
CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an ...
2323-2472 3.60e-05

CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an important phenomenon in parasite invasion and in malaria associated pathology.CTRP encodes a protein containing a putative signal sequence followed by a long extracellular region of 1990 amino acids, a transmembrane domain, and a short cytoplasmic segment. The extracellular region of CTRP contains two separated adhesive domains. The first domain contains six 210-amino acid-long homologous VWA domain repeats. The second domain contains seven repeats of 87-60 amino acids in length, which share similarities with the thrombospondin type 1 domain found in a variety of adhesive molecules. Finally, CTRP also contains consensus motifs found in the superfamily of haematopoietin receptors. The VWA domains in these proteins likely mediate protein-protein interactions.


Pssm-ID: 238750 [Multi-domain]  Cd Length: 192  Bit Score: 47.31  E-value: 3.60e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2323 DIVFLTDASWSIGDDNFNK-VVKFIFNTVGGFdEISPAGIQVSFVQYSDEVKSEFKL---NTYNDKAL--ALGALQNIRY 2396
Cdd:cd01473    2 DLTLILDESASIGYSNWRKdVIPFTEKIINNL-NISKDKVHVGILLFAEKNRDVVPFsdeERYDKNELlkKINDLKNSYR 80
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2397 RGGNTRTGKALTFIKEKVlTWESGMRKNVPKVLVVVTDGRSQDEVKKA----ALVIQQSGFSVFVVGVADVDYNELANIA 2472
Cdd:cd01473   81 SGGETYIVEALKYGLKNY-TKHGNRRKDAPKVTMLFTDGNDTSASKKElqdiSLLYKEENVKLLVVGVGAASENKLKLLA 159
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2765-2896 3.99e-05

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 49.13  E-value: 3.99e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  2765 GERGISGAIGPPGPRGDIGPPGPQGPP--GPQGPNG-LSIPGEQGRQGMKGDAGEPGLPGRTGTPGLPgppgpmgppgdr 2841
Cdd:NF038329  201 GPAGEQGPAGPAGPDGEAGPAGEDGPAgpAGDGQQGpDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEA------------ 268
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 93141047  2842 gftGKDGAMGPRgppgppgspgspgvtGPSGKPGKPGDHGRPGPSGLKGEKGDRG 2896
Cdd:NF038329  269 ---GPDGPDGKD---------------GERGPVGPAGKDGQNGKDGLPGKDGKDG 305
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
2765-2891 6.21e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.87  E-value: 6.21e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   2765 GERGISGAIGPPGPRGDigppgpqgppgpqgpnglsiPGEQGRQGMKGDAGEPGLPGrtgtpglpgppgpmgppgdrgft 2844
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGP--------------------PGPPGPPGPPGPPGEPGPPG----------------------- 37
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 93141047   2845 gkdgamgprgppgPpgspgspgvtgpsgkPGKPGDHGRPGPSGLKGE 2891
Cdd:pfam01391   38 -------------P---------------PGPPGPPGPPGAPGAPGP 56
vWA_F09G8-8_type cd01477
VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood ...
138-289 6.29e-05

VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains. The members of this subgroup lack the MIDAS motif. This subgroup is found only in C. elegans and the members identified thus far are always found fused to a C-Lectin type domain. Biochemical function thus far has not be attributed to any of the members of this subgroup.


Pssm-ID: 238754 [Multi-domain]  Cd Length: 193  Bit Score: 46.65  E-value: 6.29e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  138 WTDLVFLVDGSWSVGRNNFKYILDFIAALVSAFDIG------EEKTRVGVVQYSSDTRTEFNLNQYYQRDEL----LAAI 207
Cdd:cd01477   19 WLDIVFVVDNSKGMTQGGLWQVRATISSLFGSSSQIgtdyddPRSTRVGLVTYNSNATVVADLNDLQSFDDLysqiQGSL 98
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  208 KKIPYKGGNTM-TG-DAIDYLVKNTFtesAGARVGFPKVAIIIT----DGKSQDEVEIpARELRNVGVEV----FSLGIK 277
Cdd:cd01477   99 TDVSSTNASYLdTGlQAAEQMLAAGK---RTSRENYKKVVIVFAsdynDEGSNDPRPI-AARLKSTGIAIitvaFTQDES 174
                        170
                 ....*....|..
gi 93141047  278 AADAKELKQIAS 289
Cdd:cd01477  175 SNLLDKLGKIAS 186
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
2764-2822 1.12e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.10  E-value: 1.12e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 93141047   2764 RGERGISGAIGPPGPRGDigppgpqgppgpqgpnglsiPGEQGRQGMKGDAGEPGLPGR 2822
Cdd:pfam01391   12 PGPPGPPGPPGPPGPPGP--------------------PGEPGPPGPPGPPGPPGPPGA 50
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
634-719 1.17e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 43.25  E-value: 1.17e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  634 PPKDLSFSEVTSYGFKTNWSPA---GENVFSYHITYKEAAGDDEVTVVEPASSTSVVLSSL-KPETLYLVNVTAEYEDGF 709
Cdd:cd00063    3 PPTNLRVTDVTSTSVTLSWTPPeddGGPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGlKPGTEYEFRVRAVNGGGE 82
                         90
                 ....*....|
gi 93141047  710 SIPLAGEETT 719
Cdd:cd00063   83 SPPSESVTVT 92
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
2206-2291 1.37e-04

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 43.25  E-value: 1.37e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2206 NVTDLKTYQIGWDTFCVKWSP----HRAATSYRLKLSPADGTRGQEITVRGSETSHC-FTGLSPDTDYGVTVFVQTPNLE 2280
Cdd:cd00063    3 PPTNLRVTDVTSTSVTLSWTPpeddGGPITGYVVEYREKGSGDWKEVEVTPGSETSYtLTGLKPGTEYEFRVRAVNGGGE 82
                         90
                 ....*....|.
gi 93141047 2281 GPGVSVKEHTT 2291
Cdd:cd00063   83 SPPSESVTVTT 93
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
781-1164 1.51e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 47.30  E-value: 1.51e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  781 NLIPDTKYEVSVIPEYFSGPGTPLTGNAATEEVRGNPRDLRVSDPTTSTMKLSWSGAPGKVKQYLVTYTPVAGGETQEVT 860
Cdd:COG3401  105 GATNTGLTSSDEVPSPAVGTATTATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTS 184
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  861 VRGDTTNTVLQGL--KEGTQYALSVTALYASGAGDA--LFGEGTTLEERGSPQDLVTKDITDTSIGAYWTSAPGM-VRGY 935
Cdd:COG3401  185 LTVTSTTLVDGGGdiEPGTTYYYRVAATDTGGESAPsnEVSVTTPTTPPSAPTGLTATADTPGSVTLSWDPVTESdATGY 264
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  936 RVSWKSLYDD----VDTGEKNLPEDAihtmieNLQPETKYRISVFATYSSGEGEPLTGDATTELSQDSKT----LKVDEE 1007
Cdd:COG3401  265 RVYRSNSGDGpftkVATVTTTSYTDT------GLTNGTTYYYRVTAVDAAGNESAPSNVVSVTTDLTPPAapsgLTATAV 338
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1008 TENTMRVTWKPAPGKVVNYRVVYRPHGRGKQmVAKVPPTVTST--VLKRLQPQTTYDITVLPIYKMGEGKL--RQGSGTT 1083
Cdd:COG3401  339 GSSSITLSWTASSDADVTGYNVYRSTSGGGT-YTKIAETVTTTsyTDTGLTPGTTYYYKVTAVDAAGNESApsEEVSATT 417
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1084 ASRFKSPRNLKTSDPTMSSFRVTWEPAPGEVKGYKVTFHPTGDDRRLGELVVGPYDNTVVLEELRAGTTYKVNVFGMFDG 1163
Cdd:COG3401  418 ASAASGESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTGNAVPFTTTSSTVTATTTDTTTANLSVTTGSLV 497

                 .
gi 93141047 1164 G 1164
Cdd:COG3401  498 G 498
vWA_F09G8-8_type cd01477
VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood ...
440-590 3.59e-04

VWA F09G8.8 type: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains. The members of this subgroup lack the MIDAS motif. This subgroup is found only in C. elegans and the members identified thus far are always found fused to a C-Lectin type domain. Biochemical function thus far has not be attributed to any of the members of this subgroup.


Pssm-ID: 238754 [Multi-domain]  Cd Length: 193  Bit Score: 44.34  E-value: 3.59e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  440 DIVFLVDGSYSIGIANFVKVRAFLEVLVKSFEI------SPNRVQISLVQYSRDPHTEFTLKKFTKVEDIIEAINTFPYR 513
Cdd:cd01477   21 DIVFVVDNSKGMTQGGLWQVRATISSLFGSSSQigtdydDPRSTRVGLVTYNSNATVVADLNDLQSFDDLYSQIQGSLTD 100
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047  514 GGSTNT---GKAMTYVREKIFVPSKGSRSNVPKVMILIT----DGKSSDAfRDPAIKLRNSDVEIFAV-----GVKDAVr 581
Cdd:cd01477  101 VSSTNAsylDTGLQAAEQMLAAGKRTSRENYKKVVIVFAsdynDEGSNDP-RPIAARLKSTGIAIITVaftqdESSNLL- 178

                 ....*....
gi 93141047  582 SELEAIASP 590
Cdd:cd01477  179 DKLGKIASP 187
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
2206-2281 5.90e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 41.06  E-value: 5.90e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    2206 NVTDLKTYQIGWDTFCVKWSP--HRAATSYRLKLSPADGTRG---QEITVRGSETSHCFTGLSPDTDYGVTVFVQTPNLE 2280
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEPppDDGITGYIVGYRVEYREEGsewKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82

                    .
gi 93141047    2281 G 2281
Cdd:smart00060   83 G 83
fn3 pfam00041
Fibronectin type III domain;
634-712 6.85e-04

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 40.86  E-value: 6.85e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047    634 PPKDLSFSEVTSYGFKTNWSPA---GENVFSYHITYKEA-AGDDEVTVVEPASSTSVVLSSLKPETLYLVNVTAEYEDGF 709
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPPpdgNGPITGYEVEYRPKnSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGE 81

                   ...
gi 93141047    710 SIP 712
Cdd:pfam00041   82 GPP 84
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
2026-2272 8.33e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 44.99  E-value: 8.33e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2026 SGPRNLRVFGETTNSLSVAWDHADGPVQQYRIIYSPTVGDPIDEYTTVPGRRNNVILQPLQPDTPYKITVIAVYEDGDG- 2104
Cdd:COG3401  140 TYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTYYYRVAATDTGGESa 219
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2105 --GHLTGNGRTVGLLPPQNIHISDEWYTRFRVSWDPSPSP-VLGYKIVYKPVGSNEPMEAFVGEMTSYTLHNLNPSTTYD 2181
Cdd:COG3401  220 psNEVSVTTPTTPPSAPTGLTATADTPGSVTLSWDPVTESdATGYRVYRSNSGDGPFTKVATVTTTSYTDTGLTNGTTYY 299
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 2182 VNVYAQYD----SGLSVPLTDQGTTLYLNV-TDLKTYQIGWDTFCVKW--SPHRAATSYRLKLSPADGTRGQEITVRGSE 2254
Cdd:COG3401  300 YRVTAVDAagneSAPSNVVSVTTDLTPPAApSGLTATAVGSSSITLSWtaSSDADVTGYNVYRSTSGGGTYTKIAETVTT 379
                        250
                 ....*....|....*...
gi 93141047 2255 TSHCFTGLSPDTDYGVTV 2272
Cdd:COG3401  380 TSYTDTGLTPGTTYYYKV 397
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
2761-2821 1.06e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.40  E-value: 1.06e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 93141047   2761 KGPRGERGISGAIGPPGPRGDigppgpqgppgpqgpnglsiPGEQGRQGMKGDAGEPGLPG 2821
Cdd:pfam01391    9 PGPPGPPGPPGPPGPPGPPGP--------------------PGEPGPPGPPGPPGPPGPPG 49
PTZ00441 PTZ00441
sporozoite surface protein 2 (SSP2); Provisional
438-575 1.38e-03

sporozoite surface protein 2 (SSP2); Provisional


Pssm-ID: 240420 [Multi-domain]  Cd Length: 576  Bit Score: 44.18  E-value: 1.38e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   438 KADIVFLVDGSYSIGIANFV-KVRAFLEVLVKSFEISPNRVQISLVQYSRDP------HTEFTLKKFTKVEDIIEAINTF 510
Cdd:PTZ00441   42 EVDLYLLVDGSGSIGYHNWItHVIPMLMGLIQQLNLSDDAINLYMSLFSNNTtelirlGSGASKDKEQALIIVKSLRKTY 121
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047   511 -PYrgGSTNTGKAMTYVREKIfvPSKGSRSNVPKVMILITDGKSSDAFR----DPAIKLRNSDVEIFAVG 575
Cdd:PTZ00441  122 lPY--GKTNMTDALLEVRKHL--NDRVNRENAIQLVILMTDGIPNSKYRaleeSRKLKDRNVKLAVIGIG 187
GATase1_PfpI_like cd03134
A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus ...
243-306 2.22e-03

A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus; A type 1 glutamine amidotransferase (GATase1)-like domain found in PfpI from Pyrococcus furiosus. This group includes proteins similar to PfpI from P. furiosus. and PH1704 from Pyrococcus horikoshii. These enzymes are ATP-independent intracellular proteases and may hydrolyze small peptides to provide a nutritional source. Only Cys of the catalytic triad typical of GATase1 domains is conserved in this group. This Cys residue is found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. For PH1704, it is believed that this Cys together with a different His in one monomer and Glu (from an adjacent monomer) forms a different catalytic triad from the typical GATase1domain. PfpI is homooligomeric. Protease activity is only found for oligomeric forms of PH1704.


Pssm-ID: 153228 [Multi-domain]  Cd Length: 165  Bit Score: 41.38  E-value: 2.22e-03
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 93141047  243 KVAIIITDGKSQDEVEIPARELRNVGVEVFSLGIKAADA----KELKQIASTPSLNHVfNVANFDAIV 306
Cdd:cd03134    1 KVAILAADGFEDVELTYPLYRLREAGAEVVVAGPEAGGEiqgkHGYDTVTVDLTIADV-DADDYDALV 67
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
634-710 2.59e-03

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 39.13  E-value: 2.59e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047     634 PPKDLSFSEVTSYGFKTNWS-PAGENVFSYHITYK---EAAGDDEVTVVEPASSTSVVLSSLKPETLYLVNVTAEYEDGF 709
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEpPPDDGITGYIVGYRveyREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82

                    .
gi 93141047     710 S 710
Cdd:smart00060   83 G 83
COG4733 COG4733
Phage-related protein, tail protein J [Mobilome: prophages, transposons];
1631-1819 3.81e-03

Phage-related protein, tail protein J [Mobilome: prophages, transposons];


Pssm-ID: 443767 [Multi-domain]  Cd Length: 978  Bit Score: 43.01  E-value: 3.81e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1631 TVSVSAV-HDEGESPPVT--AQETTRPVPAPTNLKITEVTSEGFRG--------TWDHGASDVSlYRITWAPFGSS--DK 1697
Cdd:COG4733  503 TYTITAVqHAPEKYAAIDagAFDDVPPQWPPVNVTTSESLSVVAQGtavttltvSWDAPAGAVA-YEVEWRRDDGNwvSV 581
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1698 METILNGDEntlvFENLNPNTiYEVSITAIYPDESESDDLIGSERTLPILTTQAPKsgPRNLQVyNATSNSLTVKWDPAS 1777
Cdd:COG4733  582 PRTSGTSFE----VPGIYAGD-YEVRVRAINALGVSSAWAASSETTVTGKTAPPPA--PTGLTA-TGGLGGITLSWSFPV 653
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|...
gi 93141047 1778 G-RVQKYRITYQPSTGEGNEQTTTIGGRQNSVVLQKLKPDTPY 1819
Cdd:COG4733  654 DaDTLRTEIRYSTTGDWASATVAQALYPGNTYTLAGLKAGQTY 696
vWA_subgroup cd01465
VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood ...
1198-1356 5.12e-03

VWA subgroup: Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains. Not much is known about the function of the VWA domain in these proteins. The members do have a conserved MIDAS motif. The biochemical function however is not known.


Pssm-ID: 238742 [Multi-domain]  Cd Length: 170  Bit Score: 40.33  E-value: 5.12e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1198 ADIVLLVDGSWSIGRANFRTVRSFISRIVEvfDIGPKRvQIALAQYSGDPRTEWQLNAHRDKKSLLQAVANLPYKGGNTL 1277
Cdd:cd01465    1 LNLVFVIDRSGSMDGPKLPLVKSALKLLVD--QLRPDD-RLAIVTYDGAAETVLPATPVRDKAAILAAIDRLTAGGSTAG 77
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1278 -TGMALNfIRQqnfrTQAGMRPRARKIGVLITDGK------SQDDVEAPSKKLKDEGVELFAIGI-KNADEVELKMIATD 1349
Cdd:cd01465   78 gAGIQLG-YQE----AQKHFVPGGVNRILLATDGDfnvgetDPDELARLVAQKRESGITLSTLGFgDNYNEDLMEAIADA 152

                 ....*..
gi 93141047 1350 PDDTHAY 1356
Cdd:cd01465  153 GNGNTAY 159
TerY COG4245
Uncharacterized conserved protein YegL, contains vWA domain of TerY type [Function unknown];
1199-1372 5.21e-03

Uncharacterized conserved protein YegL, contains vWA domain of TerY type [Function unknown];


Pssm-ID: 443387 [Multi-domain]  Cd Length: 196  Bit Score: 40.68  E-value: 5.21e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1199 DIVLLVDGSWS-----IGRANfRTVRSFISRIVEVfDIGPKRVQIALAQYSGDPRTewqlnaHRDKKSLLQ-AVANLPYk 1272
Cdd:COG4245    7 PVYLLLDTSGSmsgepIEALN-EGLQALIDELRQD-PYALETVEVSVITFDGEAKV------LLPLTDLEDfQPPDLSA- 77
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 93141047 1273 GGNTLTG----MALNFIRQQNFRTQAGMRPRARKIGVLITDGKSQD-DVEAPSKKLKD----EGVELFAIGI-KNADEVE 1342
Cdd:COG4245   78 SGGTPLGaaleLLLDLIERRVQKYTAEGKGDWRPVVFLITDGEPTDsDWEAALQRLKDgeaaKKANIFAIGVgPDADTEV 157
                        170       180       190
                 ....*....|....*....|....*....|
gi 93141047 1343 LKMIAtdpDDTHAYNVADFESLSRIVDDLT 1372
Cdd:COG4245  158 LKQLT---DPVRALDALDGLDFREFFKWLS 184
YajL COG0693
Protein/nucleotide deglycase, PfpI/YajL/DJ-1 family (repair of methylglyoxal-glycated proteins ...
243-306 7.33e-03

Protein/nucleotide deglycase, PfpI/YajL/DJ-1 family (repair of methylglyoxal-glycated proteins and nucleic acids) [Defense mechanisms];


Pssm-ID: 440457 [Multi-domain]  Cd Length: 170  Bit Score: 40.09  E-value: 7.33e-03
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 93141047  243 KVAIIITDGKSQDEVEIPARELRNVGVEVFSLGIKAAD---AKELKQIASTPSLNHVfNVANFDAIV 306
Cdd:COG0693    4 KVLILLTDGFEDEELTVPYDALREAGAEVDVASPEGGPpvtSKHGITVTADKTLDDV-DPDDYDALV 69
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH