NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2462586997|ref|XP_054201148|]
View 

collagen alpha-1(VII) chain isoform X6 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
37-201 1.12e-82

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


:

Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 268.77  E-value: 1.12e-82
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   37 ADIVFLLDGSSSIGRSNFREVRSFLEGLVLPFsgAASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGN 116
Cdd:cd01482      1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAF--EIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGN 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  117 TRTGAAILHVADHVFLPQ-LARPGVPKVCILITDGKSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFF 195
Cdd:cd01482     79 TRTGKALTHVREKNFTPDaGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHV 158

                   ....*.
gi 2462586997  196 FFVNDF 201
Cdd:cd01482    159 FNVADF 164
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1554-1850 1.45e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 139.66  E-value: 1.45e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1554 GEKGDVGPAGPRGATGVQGERGPPGLVlpGDPGPKGDPGDRGPIGLTGRAgppgdsgppgekgdpgrpgppgpvgprgrd 1633
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGET--GPAGPAGPPGPQGERGEKGPA------------------------------ 164
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1634 GEVGEKGdegppgdpglpgKAGERGLRGAPGVRGPVGEKGDQGDPGEDGRNGSPGSSGPKGDRGEPGPPGPPGRLVDTGP 1713
Cdd:NF038329   165 GPQGEAG------------PQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD 232
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1714 GAR-EKGEPGDRGQEGPRGPKGDPGLPGAPGERGIEGfrgppgpqgdpgVRGPAGEKGDRGPPGLDGRSGLDGKPGAAGP 1792
Cdd:NF038329   233 GQQgPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAG------------PDGPDGKDGERGPVGPAGKDGQNGKDGLPGK 300
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462586997 1793 SGPNGAAGKAGDPGRDGLPGLRGeqglpgpsgppglpgkpgEDGKPGLNGKNGEPGDP 1850
Cdd:NF038329   301 DGKDGQNGKDGLPGKDGKDGQPG------------------KDGLPGKDGKDGQPGKP 340
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2022-2365 1.79e-32

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 133.49  E-value: 1.79e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2022 GERGPPGPSGLAGEPGKPGIPGLPGRAGGVGEAGRPGERGERGEKGERGEQGrdgppglpgtpgppgppgpkvsvdepgp 2101
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQG---------------------------- 168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2102 glsgEQGPPGLKGAKGEPGSNGDQGPKGDRGvpgikgDRGEPGPRGQDGNPGLPGERGMAGPEGKPGLQGPRGPPGpvgg 2181
Cdd:NF038329   169 ----EAGPQGPAGKDGEAGAKGPAGEKGPQG------PRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQ---- 234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2182 hgdpgppgapglagpagpQGPSGLKGEPGETGPPG-RGLTGPTGAVglpgppgpsglvgpqgspglpGQVGETGKPGAPG 2260
Cdd:NF038329   235 ------------------QGPDGDPGPTGEDGPQGpDGPAGKDGPR---------------------GDRGEAGPDGPDG 275
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2261 RDGASGKDGDRGSPGVPGSPGLPGPVGPKGEPGPTGAPGQavvglpgakgekgapgglagdlvgepgakgdrglpgpRGE 2340
Cdd:NF038329   276 KDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGK-------------------------------------DGK 318
                          330       340
                   ....*....|....*....|....*
gi 2462586997 2341 KGEAGRAGEPGDPGEDGQKGAPGPK 2365
Cdd:NF038329   319 DGQPGKDGLPGKDGKDGQPGKPAPK 343
VWA pfam00092
von Willebrand factor type A domain;
1054-1227 3.41e-29

von Willebrand factor type A domain;


:

Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 115.84  E-value: 3.41e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1054 DVVFLPHATQD-NAHRAEATRRVLERLVLALGpLGPQAVQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGN 1132
Cdd:pfam00092    1 DIVFLLDGSGSiGGDNFEKVKEFLKKLVESLD-IGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1133 NLGTAVVTAHRYMLAPDApGRRQHVPGVMVLLVD-EPLRGDIFSPIREAQASGLNVVMLGMAGADPEQLRRLApGMDSVQ 1211
Cdd:pfam00092   80 NTGKALKYALENLFSSAA-GARPGAPKVVVLLTDgRSQDGDPEEVARELKSAGVTVFAVGVGNADDEELRKIA-SEPGEG 157
                          170
                   ....*....|....*.
gi 2462586997 1212 TFFAVDDGPSLDQAVS 1227
Cdd:pfam00092  158 HVFTVSDFEALEDLQD 173
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1783-2075 4.83e-27

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 116.93  E-value: 4.83e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1783 LDGKPGAAGPSGPNGAAGKAGDPGRDGlpglrgeqglpgpsgppglpgkpgEDGKPGLNGKNGEPGDPGEDGRKGEKGDS 1862
Cdd:NF038329   115 GDGEKGEPGPAGPAGPAGEQGPRGDRG------------------------ETGPAGPAGPPGPQGERGEKGPAGPQGEA 170
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1863 GASGREGRDGPKGERGAPGilgpqgppglpgpvgppgqgfpgvpgGTGPKGDRGETGSKGEQGLPGERGLRGEPGSvpnv 1942
Cdd:NF038329   171 GPQGPAGKDGEAGAKGPAG--------------------------EKGPQGPRGETGPAGEQGPAGPAGPDGEAGP---- 220
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1943 drlletagikasalreivetwDESSGSFLPVPERRRGPKGDSGEQGPPGKEGPIGFPGERGLKGDRGDPGPQGPPGLAlG 2022
Cdd:NF038329   221 ---------------------AGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKD-G 278
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2462586997 2023 ERGPPGPSGLAGEPGKPGIPGLPGRAGGVGEAGRPGERGERGEKGERGEQGRD 2075
Cdd:NF038329   279 ERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKD 331
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1304-1585 2.92e-25

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 111.54  E-value: 2.92e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1304 DGRPGSPGRAGNPGTPGAPGLKGSPGLPGPRGDPGERGPRGPKGEPGAPGQViGGEGP-GLPGRKGDPgpsgppgprgpl 1382
Cdd:NF038329   116 DGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQ-GEAGPqGPAGKDGEA------------ 182
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1383 gdpgprgppglpgtamkGDKGDRGERGPPGPGEGGIAPGEPGLPGLPGSPGPQGPVGPPGKKGEKGDSEDGAPGLPGQPG 1462
Cdd:NF038329   183 -----------------GAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTG 245
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1463 SPGEQGPRGPPGAIGPKGDRGFPGPLGEAGEKGERGPPGPAGSRGLPGVAGrpgakgpegppgPTGRQGEKGEPGRPGDP 1542
Cdd:NF038329   246 EDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDG------------LPGKDGKDGQNGKDGLP 313
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|...
gi 2462586997 1543 avvgpavagpkGEKGDVGPAGPRGATGVQGERGPPGLVLPGDP 1585
Cdd:NF038329   314 -----------GKDGKDGQPGKDGLPGKDGKDGQPGKPAPKTP 345
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
301-813 2.68e-21

Fibronectin type 3 domain [General function prediction only];


:

Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 101.23  E-value: 2.68e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  301 TEYQVTVIALYANSIGEAVSGTARTTALEGPELTIQNTTAHSLLVAWRSVPGATGYRVTWRVLSGGPTQQQELGPGQGSV 380
Cdd:COG3401     39 YLAVVLSVTTKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVP 118
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  381 LLRDLEPGTDYEVTVSTLFGRSVGPATSLMARTDASVEQTLRPVILGPTSILLSWNLVPEARGYRLewrretgleppqkV 460
Cdd:COG3401    119 SPAVGTATTATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTS-------------L 185
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  461 VLPSDVTRYQLDGLQPGTEYRLTLYTLLEGHEVATPATVVPTGPELPVSPVTDLQATELPGQRVRVSWSPVP--GATQYR 538
Cdd:COG3401    186 TVTSTTLVDGGGDIEPGTTYYYRVAATDTGGESAPSNEVSVTTPTTPPSAPTGLTATADTPGSVTLSWDPVTesDATGYR 265
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  539 IIVRSTQGVERTLVLPGSQTAFDLDDVQAGLSYTVRVSA--RVGPREGSASVLTVRRELETPLAVPGLRVVVSDATRVRV 616
Cdd:COG3401    266 VYRSNSGDGPFTKVATVTTTSYTDTGLTNGTTYYYRVTAvdAAGNESAPSNVVSVTTDLTPPAAPSGLTATAVGSSSITL 345
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  617 AWGPVPG--ASGFRISWSTGSGPESSQ--TLPPDSTATDiTGLQPGTTYQVAVSVL--RGREEGPAAVIVART------- 683
Cdd:COG3401    346 SWTASSDadVTGYNVYRSTSGGGTYTKiaETVTTTSYTD-TGLTPGTTYYYKVTAVdaAGNESAPSEEVSATTasaasge 424
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  684 DPLGPVRTVHVTQASSSSVTITWTRVPG--ATGYRVSWHSAHGPEKSQL-VSGEATVAELDGLEPDTEYTVHVRAHVAGV 760
Cdd:COG3401    425 SLTASVDAVPLTDVAGATAAASAASNPGvsAAVLADGGDTGNAVPFTTTsSTVTATTTDTTTANLSVTTGSLVGGSGASS 504
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2462586997  761 DGPPASVVVRTAPEPVGRVSRLQILNASSDVLRITWVGVTGATAYRLAWGRSE 813
Cdd:COG3401    505 VTNSVSVIGASAAAAVGGAPDGTPNVTGASPVTVGASTGDVLITDLVSLTTSA 557
fn3 pfam00041
Fibronectin type III domain;
233-317 3.90e-16

Fibronectin type III domain;


:

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 75.53  E-value: 3.90e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  233 SAPRDLVLSEPSSQSLRVQWTAA---SGPVTGYKVQYTPLTGLGQPlpserQEVNVPAGETSVRLRGLRPLTEYQVTVIA 309
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPpdgNGPITGYEVEYRPKNSGEPW-----NEITVPGTTTSVTLTGLKPGTEYEVRVQA 75

                   ....*...
gi 2462586997  310 LYANSIGE 317
Cdd:pfam00041   76 VNGGGEGP 83
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
867-952 5.93e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 60.97  E-value: 5.93e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  867 PPALGTLHVVQRGEHSLRLRWEPVP----RAQGFLLHWQPEGGQEQSRVLGPELS--SYHLDGLEPATQYRVRLSVLGPA 940
Cdd:cd00063      1 PSPPTNLRVTDVTSTSVTLSWTPPEddggPITGYVVEYREKGSGDWKEVEVTPGSetSYTLTGLKPGTEYEFRVRAVNGG 80
                           90
                   ....*....|...
gi 2462586997  941 GEG-PSAEVTART 952
Cdd:cd00063     81 GESpPSESVTVTT 93
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2287-2440 5.85e-09

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 61.07  E-value: 5.85e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2287 GPKGEPGPTGAPGQAvvglpgakgekgapgglagdlvGEPGAKGDRGLPGPRGEKGEAGRAGEPGDPGEDGQKGAPGPKG 2366
Cdd:NF038329   117 GEKGEPGPAGPAGPA----------------------GEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQG 174
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462586997 2367 fkgdpgvgvpgspgppgPPGVKGDLGLPGLPGAPGVVGFPGQTGPRGEMGSGRPPRERRNPRTPGATWTTGVSG 2440
Cdd:NF038329   175 -----------------PAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG 231
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
955-1044 1.02e-07

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


:

Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 51.73  E-value: 1.02e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  955 PRVPSiELRVVDTSIDSVTLAWTPV----SRASSYILSWRPL-RGPGQEVPGSPQTlpgiSSSQRVTGLEPGVSYIFSLT 1029
Cdd:cd00063      1 PSPPT-NLRVTDVTSTSVTLSWTPPeddgGPITGYVVEYREKgSGDWKEVEVTPGS----ETSYTLTGLKPGTEYEFRVR 75
                           90
                   ....*....|....*
gi 2462586997 1030 PVLDGVRGPEASVTQ 1044
Cdd:cd00063     76 AVNGGGESPPSESVT 90
fn3 pfam00041
Fibronectin type III domain;
777-855 1.03e-07

Fibronectin type III domain;


:

Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 51.65  E-value: 1.03e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  777 GRVSRLQILNASSDVLRITWVGVTGA----TAYRLA-WGRSEGGPMRHQILPGNTDSAEIRGLEGGVSYSVRVTALVGDR 851
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPDGngpiTGYEVEyRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ....
gi 2462586997  852 EGTP 855
Cdd:pfam00041   81 EGPP 84
 
Name Accession Description Interval E-value
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
37-201 1.12e-82

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 268.77  E-value: 1.12e-82
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   37 ADIVFLLDGSSSIGRSNFREVRSFLEGLVLPFsgAASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGN 116
Cdd:cd01482      1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAF--EIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGN 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  117 TRTGAAILHVADHVFLPQ-LARPGVPKVCILITDGKSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFF 195
Cdd:cd01482     79 TRTGKALTHVREKNFTPDaGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHV 158

                   ....*.
gi 2462586997  196 FFVNDF 201
Cdd:cd01482    159 FNVADF 164
VWA pfam00092
von Willebrand factor type A domain;
38-207 2.52e-56

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 193.65  E-value: 2.52e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   38 DIVFLLDGSSSIGRSNFREVRSFLEGLVLPFSgaASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNT 117
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLD--IGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGT 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  118 R-TGAAILHVADHVFLP-QLARPGVPKVCILITDGKSQDL-VDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDF 194
Cdd:pfam00092   79 TnTGKALKYALENLFSSaAGARPGAPKVVVLLTDGRSQDGdPEEVARELKSAGVTVFAVGVGNADDEELRKIASEPGEGH 158
                          170
                   ....*....|...
gi 2462586997  195 FFFVNDFSILRTL 207
Cdd:pfam00092  159 VFTVSDFEALEDL 171
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
38-202 1.21e-43

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 157.62  E-value: 1.21e-43
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997    38 DIVFLLDGSSSIGRSNFREVRSFLEGLVLPFSgaASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYK-GGN 116
Cdd:smart00327    1 DVVFLLDGSGSMGGNRFELAKEFVLKLVEQLD--IGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYKlGGG 78
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   117 TRTGAAILHVADHVFLPQL-ARPGVPKVCILITDGKSQDL---VDTAAQRLKGQGVKLFAVGIKNA-DPEELKRVASQPT 191
Cdd:smart00327   79 TNLGAALQYALENLFSKSAgSRRGAPKVVILITDGESNDGpkdLLKAAKELKRSGVKVFVVGVGNDvDEEELKKLASAPG 158
                           170
                    ....*....|.
gi 2462586997   192 SDFFFFVNDFS 202
Cdd:smart00327  159 GVYVFLPELLD 169
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1554-1850 1.45e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 139.66  E-value: 1.45e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1554 GEKGDVGPAGPRGATGVQGERGPPGLVlpGDPGPKGDPGDRGPIGLTGRAgppgdsgppgekgdpgrpgppgpvgprgrd 1633
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGET--GPAGPAGPPGPQGERGEKGPA------------------------------ 164
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1634 GEVGEKGdegppgdpglpgKAGERGLRGAPGVRGPVGEKGDQGDPGEDGRNGSPGSSGPKGDRGEPGPPGPPGRLVDTGP 1713
Cdd:NF038329   165 GPQGEAG------------PQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD 232
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1714 GAR-EKGEPGDRGQEGPRGPKGDPGLPGAPGERGIEGfrgppgpqgdpgVRGPAGEKGDRGPPGLDGRSGLDGKPGAAGP 1792
Cdd:NF038329   233 GQQgPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAG------------PDGPDGKDGERGPVGPAGKDGQNGKDGLPGK 300
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462586997 1793 SGPNGAAGKAGDPGRDGLPGLRGeqglpgpsgppglpgkpgEDGKPGLNGKNGEPGDP 1850
Cdd:NF038329   301 DGKDGQNGKDGLPGKDGKDGQPG------------------KDGLPGKDGKDGQPGKP 340
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1447-1693 5.36e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 137.73  E-value: 5.36e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1447 KGDSEDGAPGLPGQPGSPGEQGPRGPPGAIGPKGDRGFPGPLGEAGEKGERGPPGPAGSRGLPGVAGRpgakgpegppgp 1526
Cdd:NF038329   114 KGDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGE------------ 181
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1527 tgrQGEKGEPGRPGDPAVVGPAvaGPKGEKGDVGPAGPRGATGVQGERGPPGLVLPGDPGPKGDPGDRGPIGLTGRAGPP 1606
Cdd:NF038329   182 ---AGAKGPAGEKGPQGPRGET--GPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPA 256
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1607 GDSGPPGEKGDPGRPGPPGPVGPRGRDGEVGEKGDEGPPGDPGLPGKAGERGLRGAPGVRGPVGEKGDQGDPGEDGRNGS 1686
Cdd:NF038329   257 GKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQ 336

                   ....*..
gi 2462586997 1687 PGSSGPK 1693
Cdd:NF038329   337 PGKPAPK 343
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2022-2365 1.79e-32

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 133.49  E-value: 1.79e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2022 GERGPPGPSGLAGEPGKPGIPGLPGRAGGVGEAGRPGERGERGEKGERGEQGrdgppglpgtpgppgppgpkvsvdepgp 2101
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQG---------------------------- 168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2102 glsgEQGPPGLKGAKGEPGSNGDQGPKGDRGvpgikgDRGEPGPRGQDGNPGLPGERGMAGPEGKPGLQGPRGPPGpvgg 2181
Cdd:NF038329   169 ----EAGPQGPAGKDGEAGAKGPAGEKGPQG------PRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQ---- 234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2182 hgdpgppgapglagpagpQGPSGLKGEPGETGPPG-RGLTGPTGAVglpgppgpsglvgpqgspglpGQVGETGKPGAPG 2260
Cdd:NF038329   235 ------------------QGPDGDPGPTGEDGPQGpDGPAGKDGPR---------------------GDRGEAGPDGPDG 275
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2261 RDGASGKDGDRGSPGVPGSPGLPGPVGPKGEPGPTGAPGQavvglpgakgekgapgglagdlvgepgakgdrglpgpRGE 2340
Cdd:NF038329   276 KDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGK-------------------------------------DGK 318
                          330       340
                   ....*....|....*....|....*
gi 2462586997 2341 KGEAGRAGEPGDPGEDGQKGAPGPK 2365
Cdd:NF038329   319 DGQPGKDGLPGKDGKDGQPGKPAPK 343
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1979-2298 1.28e-29

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 124.63  E-value: 1.28e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1979 GPKGDSGEQGPPGKEGPIGFPGERGLKGDRGDPGPQGPPglalGERGPPGPSGLAGEPGKPGIPGLPGRAGGVGEAGRPG 2058
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQ----GERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKG 192
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2059 ERGERGEKGERGEQGRDGPPGLPGTPGPPGPPGpkvsvdEPGPGLSGEQGPpglkgaKGEPGSNGDQGPKGDRGVPGIKG 2138
Cdd:NF038329   193 PQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDG------PAGPAGDGQQGP------DGDPGPTGEDGPQGPDGPAGKDG 260
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2139 DRGEPGPRGQDGNPGLPGERGMAGPEGKpglqgprgppgpvgghgdpgppgapglagpagpQGPSGLKGEPGETGPpgrg 2218
Cdd:NF038329   261 PRGDRGEAGPDGPDGKDGERGPVGPAGK---------------------------------DGQNGKDGLPGKDGK---- 303
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2219 ltgptgavglpgppgpsglvgpqgspglPGQVGETGKPGAPGRDGASGKDGDRGSPGVPGSPGLPGPVGPKGEPGPTGAP 2298
Cdd:NF038329   304 ----------------------------DGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPKTPEVPQKPDTAP 355
VWA pfam00092
von Willebrand factor type A domain;
1054-1227 3.41e-29

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 115.84  E-value: 3.41e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1054 DVVFLPHATQD-NAHRAEATRRVLERLVLALGpLGPQAVQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGN 1132
Cdd:pfam00092    1 DIVFLLDGSGSiGGDNFEKVKEFLKKLVESLD-IGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1133 NLGTAVVTAHRYMLAPDApGRRQHVPGVMVLLVD-EPLRGDIFSPIREAQASGLNVVMLGMAGADPEQLRRLApGMDSVQ 1211
Cdd:pfam00092   80 NTGKALKYALENLFSSAA-GARPGAPKVVVLLTDgRSQDGDPEEVARELKSAGVTVFAVGVGNADDEELRKIA-SEPGEG 157
                          170
                   ....*....|....*.
gi 2462586997 1212 TFFAVDDGPSLDQAVS 1227
Cdd:pfam00092  158 HVFTVSDFEALEDLQD 173
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2103-2416 1.49e-28

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 121.55  E-value: 1.49e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2103 LSGEQGPPGlkgAKGEPGSNGDQGPKGDRGVPGIKGDRGEPGPRGQDGNPGLPGERGMAGPegkpglqgprgppgpvggh 2182
Cdd:NF038329   115 GDGEKGEPG---PAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGP------------------- 172
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2183 gdpgppgapglagpagpQGPSGLKGEPGETGPPGRglTGPtgavglpgppgpsglvgpQGSPGLPGQVGETGKPGAPGRD 2262
Cdd:NF038329   173 -----------------QGPAGKDGEAGAKGPAGE--KGP------------------QGPRGETGPAGEQGPAGPAGPD 215
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2263 GASGKDGDRGSPGVPGSpglpGPVGPKGEPGPTGAPGQAvvglpgakgekgAPGGLAGdlvgEPGAKGDRGLPGPRGEKG 2342
Cdd:NF038329   216 GEAGPAGEDGPAGPAGD----GQQGPDGDPGPTGEDGPQ------------GPDGPAG----KDGPRGDRGEAGPDGPDG 275
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462586997 2343 EAGRAGEPGDPGEDGQKGAPGPKGFKGDPGVGvpgspgppgppgvkgdlGLPGLPGAPGVVGFPGQTGPRGEMG 2416
Cdd:NF038329   276 KDGERGPVGPAGKDGQNGKDGLPGKDGKDGQN-----------------GKDGLPGKDGKDGQPGKDGLPGKDG 332
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1783-2075 4.83e-27

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 116.93  E-value: 4.83e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1783 LDGKPGAAGPSGPNGAAGKAGDPGRDGlpglrgeqglpgpsgppglpgkpgEDGKPGLNGKNGEPGDPGEDGRKGEKGDS 1862
Cdd:NF038329   115 GDGEKGEPGPAGPAGPAGEQGPRGDRG------------------------ETGPAGPAGPPGPQGERGEKGPAGPQGEA 170
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1863 GASGREGRDGPKGERGAPGilgpqgppglpgpvgppgqgfpgvpgGTGPKGDRGETGSKGEQGLPGERGLRGEPGSvpnv 1942
Cdd:NF038329   171 GPQGPAGKDGEAGAKGPAG--------------------------EKGPQGPRGETGPAGEQGPAGPAGPDGEAGP---- 220
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1943 drlletagikasalreivetwDESSGSFLPVPERRRGPKGDSGEQGPPGKEGPIGFPGERGLKGDRGDPGPQGPPGLAlG 2022
Cdd:NF038329   221 ---------------------AGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKD-G 278
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2462586997 2023 ERGPPGPSGLAGEPGKPGIPGLPGRAGGVGEAGRPGERGERGEKGERGEQGRD 2075
Cdd:NF038329   279 ERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKD 331
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1871-2166 3.96e-26

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 114.23  E-value: 3.96e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1871 DGPKGERGAPGilgpqgppglpgpvgppgqgFPGVPGGTGPKGDRGETGSKGEQGLPGERGLRGEPGSvpnvdrlletag 1950
Cdd:NF038329   116 DGEKGEPGPAG--------------------PAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGP------------ 163
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1951 ikasalreivetwdessgsflpvperrRGPKGDSGEQGPPGKEGPIGFPGERGLKGDRGDPgpqgppglalGERGPPGPS 2030
Cdd:NF038329   164 ---------------------------AGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPR----------GETGPAGEQ 206
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2031 GLAGEPGKPGIPGLPGRAGGVGEAGRpGERGERGEKGERGEQGRDGPPGLpgtpgppgppgpkvsvdepgPGLSGEQGPP 2110
Cdd:NF038329   207 GPAGPAGPDGEAGPAGEDGPAGPAGD-GQQGPDGDPGPTGEDGPQGPDGP--------------------AGKDGPRGDR 265
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2462586997 2111 GLKGAKGEPGSNGDQGPKGDRGVPGIKGDRGEP---GPRGQDGNPGLPGERGMAGPEGK 2166
Cdd:NF038329   266 GEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPgkdGKDGQNGKDGLPGKDGKDGQPGK 324
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1743-2045 1.11e-25

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 113.08  E-value: 1.11e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1743 GERGIEGFRGPPGPQGDPGVRGPAGEKGDRG---PPGLDGRSGLDGKPGAAGPSGPNGAAGKAGDPGRDGLPGLRGEQGL 1819
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGpagPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1820 PGPSGPPGLPGKPGEDGKPGLNGKNGEPGDPGEDGRkGEKGDSGASGREGRDGPKGERGAPGilgpqgppglpgpvgppg 1899
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD-GQQGPDGDPGPTGEDGPQGPDGPAG------------------ 257
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1900 qgfpgvpgGTGPKGDRGETGSKGEQGLPGERGLRGEPGSvpnvdrlletagikasalreivetwdessgsflpvperrrg 1979
Cdd:NF038329   258 --------KDGPRGDRGEAGPDGPDGKDGERGPVGPAGK----------------------------------------- 288
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462586997 1980 pKGDSGEQGPPGKEGPIGFPGERGLKGDRgdpgpqgppglalGERGPPGPSGLAGEPGKPGIPGLP 2045
Cdd:NF038329   289 -DGQNGKDGLPGKDGKDGQNGKDGLPGKD-------------GKDGQPGKDGLPGKDGKDGQPGKP 340
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1304-1585 2.92e-25

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 111.54  E-value: 2.92e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1304 DGRPGSPGRAGNPGTPGAPGLKGSPGLPGPRGDPGERGPRGPKGEPGAPGQViGGEGP-GLPGRKGDPgpsgppgprgpl 1382
Cdd:NF038329   116 DGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQ-GEAGPqGPAGKDGEA------------ 182
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1383 gdpgprgppglpgtamkGDKGDRGERGPPGPGEGGIAPGEPGLPGLPGSPGPQGPVGPPGKKGEKGDSEDGAPGLPGQPG 1462
Cdd:NF038329   183 -----------------GAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTG 245
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1463 SPGEQGPRGPPGAIGPKGDRGFPGPLGEAGEKGERGPPGPAGSRGLPGVAGrpgakgpegppgPTGRQGEKGEPGRPGDP 1542
Cdd:NF038329   246 EDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDG------------LPGKDGKDGQNGKDGLP 313
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|...
gi 2462586997 1543 avvgpavagpkGEKGDVGPAGPRGATGVQGERGPPGLVLPGDP 1585
Cdd:NF038329   314 -----------GKDGKDGQPGKDGLPGKDGKDGQPGKPAPKTP 345
vWFA_subfamily_ECM cd01450
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
1053-1204 2.20e-21

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains


Pssm-ID: 238727 [Multi-domain]  Cd Length: 161  Bit Score: 93.12  E-value: 2.20e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1053 ADVVFLPHATQ--DNAHRAEAtRRVLERLVLALgPLGPQAVQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPS 1130
Cdd:cd01450      1 LDIVFLLDGSEsvGPENFEKV-KDFIEKLVEKL-DIGPDKTRVGLVQYSDDVRVEFSLNDYKSKDDLLKAVKNLKYLGGG 78
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462586997 1131 GNNLGTAVVTAHRYMLAPDApgRRQHVPGVMVLLVDEPL--RGDIFSPIREAQASGLNVVMLGMAGADPEQLRRLA 1204
Cdd:cd01450     79 GTNTGKALQYALEQLFSESN--ARENVPKVIIVLTDGRSddGGDPKEAAAKLKDEGIKVFVVGVGPADEEELREIA 152
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
301-813 2.68e-21

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 101.23  E-value: 2.68e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  301 TEYQVTVIALYANSIGEAVSGTARTTALEGPELTIQNTTAHSLLVAWRSVPGATGYRVTWRVLSGGPTQQQELGPGQGSV 380
Cdd:COG3401     39 YLAVVLSVTTKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVP 118
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  381 LLRDLEPGTDYEVTVSTLFGRSVGPATSLMARTDASVEQTLRPVILGPTSILLSWNLVPEARGYRLewrretgleppqkV 460
Cdd:COG3401    119 SPAVGTATTATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTS-------------L 185
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  461 VLPSDVTRYQLDGLQPGTEYRLTLYTLLEGHEVATPATVVPTGPELPVSPVTDLQATELPGQRVRVSWSPVP--GATQYR 538
Cdd:COG3401    186 TVTSTTLVDGGGDIEPGTTYYYRVAATDTGGESAPSNEVSVTTPTTPPSAPTGLTATADTPGSVTLSWDPVTesDATGYR 265
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  539 IIVRSTQGVERTLVLPGSQTAFDLDDVQAGLSYTVRVSA--RVGPREGSASVLTVRRELETPLAVPGLRVVVSDATRVRV 616
Cdd:COG3401    266 VYRSNSGDGPFTKVATVTTTSYTDTGLTNGTTYYYRVTAvdAAGNESAPSNVVSVTTDLTPPAAPSGLTATAVGSSSITL 345
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  617 AWGPVPG--ASGFRISWSTGSGPESSQ--TLPPDSTATDiTGLQPGTTYQVAVSVL--RGREEGPAAVIVART------- 683
Cdd:COG3401    346 SWTASSDadVTGYNVYRSTSGGGTYTKiaETVTTTSYTD-TGLTPGTTYYYKVTAVdaAGNESAPSEEVSATTasaasge 424
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  684 DPLGPVRTVHVTQASSSSVTITWTRVPG--ATGYRVSWHSAHGPEKSQL-VSGEATVAELDGLEPDTEYTVHVRAHVAGV 760
Cdd:COG3401    425 SLTASVDAVPLTDVAGATAAASAASNPGvsAAVLADGGDTGNAVPFTTTsSTVTATTTDTTTANLSVTTGSLVGGSGASS 504
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2462586997  761 DGPPASVVVRTAPEPVGRVSRLQILNASSDVLRITWVGVTGATAYRLAWGRSE 813
Cdd:COG3401    505 VTNSVSVIGASAAAAVGGAPDGTPNVTGASPVTVGASTGDVLITDLVSLTTSA 557
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1254-1565 3.14e-21

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 99.21  E-value: 3.14e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1254 GQKGEPGEMGLRGQVGPPGDPGLPGRTGAPGPQGPPGSATAKGERGFPGADGRPGSPGRAGNPGTPGAPGLKGSPGLPGP 1333
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1334 RGDPGERGPRGPKGEPGAPGQVIGGEGPGLPGRKGDPGpsgppgprgplgdpgprgppglpgtamKGDKGDrgergppgp 1413
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQ---------------------------QGPDGD--------- 240
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1414 geggiaPGEPGLPGLPGSPgpqgpvGPPGKKGEKGDseDGAPGLPGQPGSPGEQGPRGPPGAIGPKGDRGFPGPLGEAGE 1493
Cdd:NF038329   241 ------PGPTGEDGPQGPD------GPAGKDGPRGD--RGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQ 306
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462586997 1494 KGERGPPGPAGSRGLPGVAGRPgakgpegppgptgrqGEKGEPGRPGDPAVVGPAVAgpkgEKGDVGPAGPR 1565
Cdd:NF038329   307 NGKDGLPGKDGKDGQPGKDGLP---------------GKDGKDGQPGKPAPKTPEVP----QKPDTAPHTPK 359
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
1054-1214 1.39e-19

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 88.28  E-value: 1.39e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  1054 DVVFLPHATQDNAH-RAEATRRVLERLVLALgPLGPQAVQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGN 1132
Cdd:smart00327    1 DVVFLLDGSGSMGGnRFELAKEFVLKLVEQL-DIGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYKLGGGT 79
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  1133 NLGTAVVTAHRYMLAPDAPGRRQhVPGVMVLLVD---EPLRGDIFSPIREAQASGLNVVMLGM-AGADPEQLRRLAPGMD 1208
Cdd:smart00327   80 NLGAALQYALENLFSKSAGSRRG-APKVVILITDgesNDGPKDLLKAAKELKRSGVKVFVVGVgNDVDEEELKKLASAPG 158

                    ....*.
gi 2462586997  1209 SVQTFF 1214
Cdd:smart00327  159 GVYVFL 164
fn3 pfam00041
Fibronectin type III domain;
233-317 3.90e-16

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 75.53  E-value: 3.90e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  233 SAPRDLVLSEPSSQSLRVQWTAA---SGPVTGYKVQYTPLTGLGQPlpserQEVNVPAGETSVRLRGLRPLTEYQVTVIA 309
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPpdgNGPITGYEVEYRPKNSGEPW-----NEITVPGTTTSVTLTGLKPGTEYEVRVQA 75

                   ....*...
gi 2462586997  310 LYANSIGE 317
Cdd:pfam00041   76 VNGGGEGP 83
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
687-771 1.78e-13

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 68.29  E-value: 1.78e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  687 GPVRTVHVTQASSSSVTITWTRVPGA----TGYRVSWHSAHGPEKSQL--VSGEATVAELDGLEPDTEYTVHVRAHVAGV 760
Cdd:cd00063      2 SPPTNLRVTDVTSTSVTLSWTPPEDDggpiTGYVVEYREKGSGDWKEVevTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81
                           90
                   ....*....|..
gi 2462586997  761 DGPPA-SVVVRT 771
Cdd:cd00063     82 ESPPSeSVTVTT 93
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
233-326 3.32e-13

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 67.52  E-value: 3.32e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  233 SAPRDLVLSEPSSQSLRVQWTAAS---GPVTGYKVQYTPLTGlgqplpSERQEVNVPAG-ETSVRLRGLRPLTEYQVTVI 308
Cdd:cd00063      2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGS------GDWKEVEVTPGsETSYTLTGLKPGTEYEFRVR 75
                           90
                   ....*....|....*...
gi 2462586997  309 ALYANSIGEAVSGTARTT 326
Cdd:cd00063     76 AVNGGGESPPSESVTVTT 93
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
233-316 1.86e-12

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 64.94  E-value: 1.86e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   233 SAPRDLVLSEPSSQSLRVQWTAASGP-VTGYKVQYTPLtglGQPLPSERQEVNVPAGETSVRLRGLRPLTEYQVTVIALY 311
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEPPPDDgITGYIVGYRVE---YREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVN 78

                    ....*
gi 2462586997   312 ANSIG 316
Cdd:smart00060   79 GAGEG 83
fn3 pfam00041
Fibronectin type III domain;
687-764 4.98e-12

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 63.59  E-value: 4.98e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  687 GPVRTVHVTQASSSSVTITWTRVPGA----TGYRVSWHSAHGPE--KSQLVSGEATVAELDGLEPDTEYTVHVRAHVAGV 760
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPDGngpiTGYEVEYRPKNSGEpwNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ....
gi 2462586997  761 DGPP 764
Cdd:pfam00041   81 EGPP 84
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
687-756 2.80e-11

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 61.48  E-value: 2.80e-11
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462586997   687 GPVRTVHVTQASSSSVTITWTRVPGA------TGYRVSWHSAHGPEKSQLVSGEATVAELDGLEPDTEYTVHVRAH 756
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEPPPDDgitgyiVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAV 77
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
3-189 3.49e-11

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 66.12  E-value: 3.49e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997    3 LRLLVAALCAGILAEAPRVRAQHRERVtctrlyAADIVFLLDGSSSIGRSN-FREVRSFLEGLVlpfsgAASAQGVRFAT 81
Cdd:COG1240     65 ALLLLLAVLLLLLALALAPLALARPQR------GRDVVLVVDASGSMAAENrLEAAKGALLDFL-----DDYRPRDRVGL 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   82 VQYSDDPRTEFGLDAlgSGGDVIRAIRELSYKGGnTRTGAAILHVADHVflpQLARPGVPKVCILITDGK---SQDLVDT 158
Cdd:COG1240    134 VAFGGEAEVLLPLTR--DREALKRALDELPPGGG-TPLGDALALALELL---KRADPARRKVIVLLTDGRdnaGRIDPLE 207
                          170       180       190
                   ....*....|....*....|....*....|...
gi 2462586997  159 AAQRLKGQGVKLFAVGI--KNADPEELKRVASQ 189
Cdd:COG1240    208 AAELAAAAGIRIYTIGVgtEAVDEGLLREIAEA 240
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
867-952 5.93e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 60.97  E-value: 5.93e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  867 PPALGTLHVVQRGEHSLRLRWEPVP----RAQGFLLHWQPEGGQEQSRVLGPELS--SYHLDGLEPATQYRVRLSVLGPA 940
Cdd:cd00063      1 PSPPTNLRVTDVTSTSVTLSWTPPEddggPITGYVVEYREKGSGDWKEVEVTPGSetSYTLTGLKPGTEYEFRVRAVNGG 80
                           90
                   ....*....|...
gi 2462586997  941 GEG-PSAEVTART 952
Cdd:cd00063     81 GESpPSESVTVTT 93
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2287-2440 5.85e-09

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 61.07  E-value: 5.85e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2287 GPKGEPGPTGAPGQAvvglpgakgekgapgglagdlvGEPGAKGDRGLPGPRGEKGEAGRAGEPGDPGEDGQKGAPGPKG 2366
Cdd:NF038329   117 GEKGEPGPAGPAGPA----------------------GEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQG 174
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462586997 2367 fkgdpgvgvpgspgppgPPGVKGDLGLPGLPGAPGVVGFPGQTGPRGEMGSGRPPRERRNPRTPGATWTTGVSG 2440
Cdd:NF038329   175 -----------------PAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG 231
fn3 pfam00041
Fibronectin type III domain;
868-944 6.24e-09

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 55.11  E-value: 6.24e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  868 PALGTLHVVQRGEHSLRLRWEPVPRAQG----FLLHWQPEGGQE--QSRVLGPELSSYHLDGLEPATQYRVRLSVLGPAG 941
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPDGNGpitgYEVEYRPKNSGEpwNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ...
gi 2462586997  942 EGP 944
Cdd:pfam00041   81 EGP 83
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
867-943 1.41e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 53.77  E-value: 1.41e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   867 PPALGTLHVVQRGEHSLRLRWEPVPRAQ--GFLLHWQPEGGQEQSRVL----GPELSSYHLDGLEPATQYRVRLSVLGPA 940
Cdd:smart00060    1 PSPPSNLRVTDVTSTSVTLSWEPPPDDGitGYIVGYRVEYREEGSEWKevnvTPSSTSYTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 2462586997   941 GEG 943
Cdd:smart00060   81 GEG 83
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1191-1367 2.86e-08

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 58.76  E-value: 2.86e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1191 GMAGADPEQLRRLAPGMDSVQTFfAVDDGPSLDQAVSGLATALCQASFTTQPRPEPCPVYCPKGQKGEPGEMGLRGQVGP 1270
Cdd:NF038329   174 GPAGKDGEAGAKGPAGEKGPQGP-RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGP 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1271 PGDPGLPGRTGAPGPQGPPGSATAKGERGFPGADGRPGSPGRAGNPGTPGAPGLKGSPGLPGPRGDPGERGPRGPKGEPG 1350
Cdd:NF038329   253 DGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDG 332
                          170
                   ....*....|....*..
gi 2462586997 1351 APGQviggegPGLPGRK 1367
Cdd:NF038329   333 KDGQ------PGKPAPK 343
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1299-1355 3.67e-08

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 51.73  E-value: 3.67e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462586997 1299 GFPGADGRPGSPGRAGNPGTPGAPGLKGSPGLPGPRGDPGERGPRGPKGEPGAPGQV 1355
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
955-1044 1.02e-07

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 51.73  E-value: 1.02e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  955 PRVPSiELRVVDTSIDSVTLAWTPV----SRASSYILSWRPL-RGPGQEVPGSPQTlpgiSSSQRVTGLEPGVSYIFSLT 1029
Cdd:cd00063      1 PSPPT-NLRVTDVTSTSVTLSWTPPeddgGPITGYVVEYREKgSGDWKEVEVTPGS----ETSYTLTGLKPGTEYEFRVR 75
                           90
                   ....*....|....*
gi 2462586997 1030 PVLDGVRGPEASVTQ 1044
Cdd:cd00063     76 AVNGGGESPPSESVT 90
fn3 pfam00041
Fibronectin type III domain;
777-855 1.03e-07

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 51.65  E-value: 1.03e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  777 GRVSRLQILNASSDVLRITWVGVTGA----TAYRLA-WGRSEGGPMRHQILPGNTDSAEIRGLEGGVSYSVRVTALVGDR 851
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPDGngpiTGYEVEyRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ....
gi 2462586997  852 EGTP 855
Cdd:pfam00041   81 EGPP 84
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
2242-2298 1.06e-07

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 50.57  E-value: 1.06e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462586997 2242 GSPGLPGQVGETGKPGAPGRDGASGKDGDRGSPGVPGSPGLPGPVGPKGEPGPTGAP 2298
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
fn3 pfam00041
Fibronectin type III domain;
962-1039 2.75e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 47.41  E-value: 2.75e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  962 LRVVDTSIDSVTLAWTPVSRASS----YILSWRPLRGPGQEVPgspQTLPGISSSQRVTGLEPGVSYIFSLTPVLDGVRG 1037
Cdd:pfam00041    6 LTVTDVTSTSLTVSWTPPPDGNGpitgYEVEYRPKNSGEPWNE---ITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82

                   ..
gi 2462586997 1038 PE 1039
Cdd:pfam00041   83 PP 84
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
2022-2316 5.23e-06

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 51.57  E-value: 5.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2022 GERGPPGPSGLAGEPGKPGIPGLPGRAGGVGEAGRPGERGERGEKGERGEQGRDGPPGLPGTPGPPGPpgpkvsvdepgP 2101
Cdd:COG5164      7 GKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGP-----------A 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2102 GLSGEQGPPGLKGAKGEPGSNGDQGPKGDRGVPGIKGDRGEPGPRGQDGNPGlPGERGMAGPEGkpGLQGPRGPPGPVGG 2181
Cdd:COG5164     76 QNQGGTTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTT-PPSGGSTTPPG--DGGSTPPGPGSTGP 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2182 HGDPGPPGAPGLAGPAGPQGPSGLKGEPGETGPPGRGLTGPTGavglpgPPGPSGLVGPQGSPGLPGQVGETGKPgaPGR 2261
Cdd:COG5164    153 GGSTTPPGDGGSTTPPGPGGSTTPPDDGGSTTPPNKGETGTDI------PTGGTPRQGPDGPVKKDDKNGKGNPP--DDR 224
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462586997 2262 DGASGKDGDRGSPGVPGSPGLPGPVGPKGEPGPTGAPGQAVVGLPGAKGEKGAPG 2316
Cdd:COG5164    225 GGKTGPKDQRPKTNPIERRGPERPEAAALPAELTALEAENRAANPEPATKTIPET 279
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
2304-2364 5.50e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 45.56  E-value: 5.50e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462586997 2304 GLPGAKGEKGAPGGlagdlVGEPGAKGDRGLPGPRGEKGEAGRAGEPGDPGEDGQKGAPGP 2364
Cdd:pfam01391    1 GPPGPPGPPGPPGP-----PGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
958-1037 8.33e-06

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 46.07  E-value: 8.33e-06
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   958 PSIELRVVDTSIDSVTLAWTPVSRAS--SYILSWRPLRGPGQEvPGSPQTLPGISSSQRVTGLEPGVSYIFSLTPVLDGV 1035
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEPPPDDGitGYIVGYRVEYREEGS-EWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAG 81

                    ..
gi 2462586997  1036 RG 1037
Cdd:smart00060   82 EG 83
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
777-862 5.06e-05

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 44.02  E-value: 5.06e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  777 GRVSRLQILNASSDVLRITWVGVTGA----TAYRLAWGR-SEGGPMRHQILPGNTDSAEIRGLEGGVSYSVRVTALVGDR 851
Cdd:cd00063      2 SPPTNLRVTDVTSTSVTLSWTPPEDDggpiTGYVVEYREkGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81
                           90
                   ....*....|.
gi 2462586997  852 EGTPVSIVVTT 862
Cdd:cd00063     82 ESPPSESVTVT 92
PHA03169 PHA03169
hypothetical protein; Provisional
1709-1876 2.62e-04

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 46.12  E-value: 2.62e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1709 VDTGPGAREK-GEPGDRGQEGPRGPKGDPGLPGAPGERGIEGFRGPPGPQGDPGVRGPAGEKGDRGPPG--LDGRSGLDG 1785
Cdd:PHA03169    71 SDTETAEESRhGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSppSHPGPHEPA 150
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1786 KPGAAGPSGPNGAAGKAGDPGRDGLPGLRGEQGLPGPSGPPGLPGKPGEDGKPGLNGKNgEPGDPGEDGRKGEKGDSGAS 1865
Cdd:PHA03169   151 PPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPD-EPGEPQSPTPQQAPSPNTQQ 229
                          170
                   ....*....|.
gi 2462586997 1866 GREGRDGPKGE 1876
Cdd:PHA03169   230 AVEHEDEPTEP 240
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
1711-1936 3.65e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 45.79  E-value: 3.65e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1711 TGPGAreKGEPGDRGQEGPRGPKGDPGLPGAPGERGIEGFRGPPGPQGDPGVRGPAGEKGDRGPPGLDGRSGLDGKPGAA 1790
Cdd:COG5164      4 YGPGK--TGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1791 GPSGPNGAAGKAGDPGRDGLPGLRGEQGLPGPSGPPGLPGKPGEDGkPGLNGKNGEPGD----PGEDGRKGEKGDSGASG 1866
Cdd:COG5164     82 TPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTT-PPSGGSTTPPGDggstPPGPGSTGPGGSTTPPG 160
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1867 REGRDGPKGERGAPGILGPQGPPGLPGPVGPPGQGFPGVPGGTGPKGDRGETGSKGEQGLPGERGLRGEP 1936
Cdd:COG5164    161 DGGSTTPPGPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKTGP 230
PHA03169 PHA03169
hypothetical protein; Provisional
2242-2366 4.24e-04

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 45.35  E-value: 4.24e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2242 GSPGLPGQVGETGKPGAPGRDGASGKDGDRGSPGVPGSPGLPGPVGPKGEPGPTGAPGQAVVGLPGAKGEKGAPGGLAGD 2321
Cdd:PHA03169   103 PTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPT 182
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 2462586997 2322 LVGEPGAKGDRGLPGPRGEKGEAGRAGEPGDPGEDGQKGAPGPKG 2366
Cdd:PHA03169   183 SEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNT 227
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1767-1816 7.26e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.78  E-value: 7.26e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1767 GEKGDRGPPGLDGRSGLDGKPGAAGPSGPNGAAGKAGDPGRDGLPGLRGE 1816
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGA 50
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
777-853 9.42e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 40.29  E-value: 9.42e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   777 GRVSRLQILNASSDVLRITW--VGVTGATAYRLAW---GRSEGGPMRHQILPGNTDSAEIRGLEGGVSYSVRVTALVGDR 851
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWepPPDDGITGYIVGYrveYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAG 81

                    ..
gi 2462586997   852 EG 853
Cdd:smart00060   82 EG 83
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1450-1596 5.47e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.28  E-value: 5.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1450 SEDGAPGLPGQPGSPGEQGPRGPPGAigPKGDRGFPGPLGEAGEKGERGPPGPAGSRGLPGVAGRPGAKGPEGPPGPTGR 1529
Cdd:PRK07764   610 EEAARPAAPAAPAAPAAPAPAGAAAA--PAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAP 687
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462586997 1530 QGEKGEPGRPGDPAVVGPAVAGPKGEKGDVGPAGPRGATGV--QGERGPPGLVLPGDPGPKGDPGDRGP 1596
Cdd:PRK07764   688 AAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGAsaPSPAADDPVPLPPEPDDPPDPAGAPA 756
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1979-2045 9.96e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.32  E-value: 9.96e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462586997 1979 GPKGDSGEQGPPGKEGPIGFPGERGLKGDRGDPGPqgppglalgeRGPPGPSGLAGEPGKPGIPGLP 2045
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGP----------PGPPGPPGPPGPPGAPGAPGPP 57
 
Name Accession Description Interval E-value
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
37-201 1.12e-82

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 268.77  E-value: 1.12e-82
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   37 ADIVFLLDGSSSIGRSNFREVRSFLEGLVLPFsgAASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGN 116
Cdd:cd01482      1 ADIVFLVDGSWSIGRSNFNLVRSFLSSVVEAF--EIGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYKGGN 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  117 TRTGAAILHVADHVFLPQ-LARPGVPKVCILITDGKSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFF 195
Cdd:cd01482     79 TRTGKALTHVREKNFTPDaGARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIASKPSETHV 158

                   ....*.
gi 2462586997  196 FFVNDF 201
Cdd:cd01482    159 FNVADF 164
vWA_collagen cd01472
von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This ...
37-201 8.63e-69

von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.


Pssm-ID: 238749 [Multi-domain]  Cd Length: 164  Bit Score: 229.04  E-value: 8.63e-69
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   37 ADIVFLLDGSSSIGRSNFREVRSFLEGLVLPFSgaASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGN 116
Cdd:cd01472      1 ADIVFLVDGSESIGLSNFNLVKDFVKRVVERLD--IGPDGVRVGVVQYSDDPRTEFYLNTYRSKDDVLEAVKNLRYIGGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  117 TRTGAAILHVADHVFLPQL-ARPGVPKVCILITDGKSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDFF 195
Cdd:cd01472     79 TNTGKALKYVRENLFTEASgSREGVPKVLVVITDGKSQDDVEEPAVELKQAGIEVFAVGVKNADEEELKQIASDPKELYV 158

                   ....*.
gi 2462586997  196 FFVNDF 201
Cdd:cd01472    159 FNVADF 164
VWA pfam00092
von Willebrand factor type A domain;
38-207 2.52e-56

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 193.65  E-value: 2.52e-56
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   38 DIVFLLDGSSSIGRSNFREVRSFLEGLVLPFSgaASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNT 117
Cdd:pfam00092    1 DIVFLLDGSGSIGGDNFEKVKEFLKKLVESLD--IGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGT 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  118 R-TGAAILHVADHVFLP-QLARPGVPKVCILITDGKSQDL-VDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSDF 194
Cdd:pfam00092   79 TnTGKALKYALENLFSSaAGARPGAPKVVVLLTDGRSQDGdPEEVARELKSAGVTVFAVGVGNADDEELRKIASEPGEGH 158
                          170
                   ....*....|...
gi 2462586997  195 FFFVNDFSILRTL 207
Cdd:pfam00092  159 VFTVSDFEALEDL 171
vWFA_subfamily_ECM cd01450
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
37-196 1.58e-51

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains


Pssm-ID: 238727 [Multi-domain]  Cd Length: 161  Bit Score: 179.41  E-value: 1.58e-51
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   37 ADIVFLLDGSSSIGRSNFREVRSFLEGLVLPFSgaASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGN 116
Cdd:cd01450      1 LDIVFLLDGSESVGPENFEKVKDFIEKLVEKLD--IGPDKTRVGLVQYSDDVRVEFSLNDYKSKDDLLKAVKNLKYLGGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  117 -TRTGAAILHVADHVFLPQLARPGVPKVCILITDGKSQDLVD--TAAQRLKGQGVKLFAVGIKNADPEELKRVASQPTSD 193
Cdd:cd01450     79 gTNTGKALQYALEQLFSESNARENVPKVIIVLTDGRSDDGGDpkEAAAKLKDEGIKVFVVGVGPADEEELREIASCPSER 158

                   ...
gi 2462586997  194 FFF 196
Cdd:cd01450    159 HVF 161
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
38-202 1.21e-43

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 157.62  E-value: 1.21e-43
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997    38 DIVFLLDGSSSIGRSNFREVRSFLEGLVLPFSgaASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYK-GGN 116
Cdd:smart00327    1 DVVFLLDGSGSMGGNRFELAKEFVLKLVEQLD--IGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYKlGGG 78
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   117 TRTGAAILHVADHVFLPQL-ARPGVPKVCILITDGKSQDL---VDTAAQRLKGQGVKLFAVGIKNA-DPEELKRVASQPT 191
Cdd:smart00327   79 TNLGAALQYALENLFSKSAgSRRGAPKVVILITDGESNDGpkdLLKAAKELKRSGVKVFVVGVGNDvDEEELKKLASAPG 158
                           170
                    ....*....|.
gi 2462586997   192 SDFFFFVNDFS 202
Cdd:smart00327  159 GVYVFLPELLD 169
vWA_Matrilin cd01475
VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and ...
36-218 6.77e-40

VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.


Pssm-ID: 238752 [Multi-domain]  Cd Length: 224  Bit Score: 148.69  E-value: 6.77e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   36 AADIVFLLDGSSSIGRSNFREVRSFLEGLVLPFSgaASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGG 115
Cdd:cd01475      2 PTDLVFLIDSSRSVRPENFELVKQFLNQIIDSLD--VGPDATRVGLVQYSSTVKQEFPLGRFKSKADLKRAVRRMEYLET 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  116 NTRTGAAILHVADHVFLP-QLARPG---VPKVCILITDGKSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPT 191
Cdd:cd01475     80 GTMTGLAIQYAMNNAFSEaEGARPGserVPRVGIVVTDGRPQDDVSEVAAKARALGIEMFAVGVGRADEEELREIASEPL 159
                          170       180
                   ....*....|....*....|....*..
gi 2462586997  192 SDFFFFVNDFSILRTLLPLVSRRVCTT 218
Cdd:cd01475    160 ADHVFYVEDFSTIEELTKKFQGKICVV 186
vWA_collagen_alpha3-VI-like cd01481
VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable ...
37-201 6.34e-37

VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238758  Cd Length: 165  Bit Score: 137.84  E-value: 6.34e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   37 ADIVFLLDGSSSIGRSNFREVRSFLEGLVLpfSGAASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGN 116
Cdd:cd01481      1 KDIVFLIDGSDNVGSGNFPAIRDFIERIVQ--SLDVGPDKIRVAVVQFSDTPRPEFYLNTHSTKADVLGAVRRLRLRGGS 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  117 T-RTGAAILHVADHVFLPQL---ARPGVPKVCILITDGKSQDLVDTAAQRLKGQGVKLFAVGIKNADPEELKRVASQPts 192
Cdd:cd01481     79 QlNTGSALDYVVKNLFTKSAgsrIEEGVPQFLVLITGGKSQDDVERPAVALKRAGIVPFAIGARNADLAELQQIAFDP-- 156

                   ....*....
gi 2462586997  193 DFFFFVNDF 201
Cdd:cd01481    157 SFVFQVSDF 165
vWA_integrins_alpha_subunit cd01469
Integrins are a class of adhesion receptors that link the extracellular matrix to the ...
38-207 1.20e-35

Integrins are a class of adhesion receptors that link the extracellular matrix to the cytoskeleton and cooperate with growth factor receptors to promote celll survival, cell cycle progression and cell migration. Integrins consist of an alpha and a beta sub-unit. Each sub-unit has a large extracellular portion, a single transmembrane segment and a short cytoplasmic domain. The N-terminal domains of the alpha and beta subunits associate to form the integrin headpiece, which contains the ligand binding site, whereas the C-terminal segments traverse the plasma membrane and mediate interaction with the cytoskeleton and with signalling proteins.The VWA domains present in the alpha subunits of integrins seem to be a chordate specific radiation of the gene family being found only in vertebrates. They mediate protein-protein interactions.


Pssm-ID: 238746 [Multi-domain]  Cd Length: 177  Bit Score: 134.41  E-value: 1.20e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   38 DIVFLLDGSSSIGRSNFREVRSFLEGLVLPFSGAASAqgVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGNT 117
Cdd:cd01469      2 DIVFVLDGSGSIYPDDFQKVKNFLSTVMKKLDIGPTK--TQFGLVQYSESFRTEFTLNEYRTKEEPLSLVKHISQLLGLT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  118 RTGAAILHVADHVFLPQL-ARPGVPKVCILITDGKSQD--LVDTAAQRLKGQGVKLFAVGI-----KNADPEELKRVASQ 189
Cdd:cd01469     80 NTATAIQYVVTELFSESNgARKDATKVLVVITDGESHDdpLLKDVIPQAEREGIIRYAIGVgghfqRENSREELKTIASK 159
                          170
                   ....*....|....*...
gi 2462586997  190 PTSDFFFFVNDFSILRTL 207
Cdd:cd01469    160 PPEEHFFNVTDFAALKDI 177
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1554-1850 1.45e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 139.66  E-value: 1.45e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1554 GEKGDVGPAGPRGATGVQGERGPPGLVlpGDPGPKGDPGDRGPIGLTGRAgppgdsgppgekgdpgrpgppgpvgprgrd 1633
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGET--GPAGPAGPPGPQGERGEKGPA------------------------------ 164
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1634 GEVGEKGdegppgdpglpgKAGERGLRGAPGVRGPVGEKGDQGDPGEDGRNGSPGSSGPKGDRGEPGPPGPPGRLVDTGP 1713
Cdd:NF038329   165 GPQGEAG------------PQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD 232
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1714 GAR-EKGEPGDRGQEGPRGPKGDPGLPGAPGERGIEGfrgppgpqgdpgVRGPAGEKGDRGPPGLDGRSGLDGKPGAAGP 1792
Cdd:NF038329   233 GQQgPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAG------------PDGPDGKDGERGPVGPAGKDGQNGKDGLPGK 300
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462586997 1793 SGPNGAAGKAGDPGRDGLPGLRGeqglpgpsgppglpgkpgEDGKPGLNGKNGEPGDP 1850
Cdd:NF038329   301 DGKDGQNGKDGLPGKDGKDGQPG------------------KDGLPGKDGKDGQPGKP 340
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1447-1693 5.36e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 137.73  E-value: 5.36e-34
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1447 KGDSEDGAPGLPGQPGSPGEQGPRGPPGAIGPKGDRGFPGPLGEAGEKGERGPPGPAGSRGLPGVAGRpgakgpegppgp 1526
Cdd:NF038329   114 KGDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGE------------ 181
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1527 tgrQGEKGEPGRPGDPAVVGPAvaGPKGEKGDVGPAGPRGATGVQGERGPPGLVLPGDPGPKGDPGDRGPIGLTGRAGPP 1606
Cdd:NF038329   182 ---AGAKGPAGEKGPQGPRGET--GPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPA 256
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1607 GDSGPPGEKGDPGRPGPPGPVGPRGRDGEVGEKGDEGPPGDPGLPGKAGERGLRGAPGVRGPVGEKGDQGDPGEDGRNGS 1686
Cdd:NF038329   257 GKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQ 336

                   ....*..
gi 2462586997 1687 PGSSGPK 1693
Cdd:NF038329   337 PGKPAPK 343
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2022-2365 1.79e-32

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 133.49  E-value: 1.79e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2022 GERGPPGPSGLAGEPGKPGIPGLPGRAGGVGEAGRPGERGERGEKGERGEQGrdgppglpgtpgppgppgpkvsvdepgp 2101
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQG---------------------------- 168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2102 glsgEQGPPGLKGAKGEPGSNGDQGPKGDRGvpgikgDRGEPGPRGQDGNPGLPGERGMAGPEGKPGLQGPRGPPGpvgg 2181
Cdd:NF038329   169 ----EAGPQGPAGKDGEAGAKGPAGEKGPQG------PRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQ---- 234
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2182 hgdpgppgapglagpagpQGPSGLKGEPGETGPPG-RGLTGPTGAVglpgppgpsglvgpqgspglpGQVGETGKPGAPG 2260
Cdd:NF038329   235 ------------------QGPDGDPGPTGEDGPQGpDGPAGKDGPR---------------------GDRGEAGPDGPDG 275
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2261 RDGASGKDGDRGSPGVPGSPGLPGPVGPKGEPGPTGAPGQavvglpgakgekgapgglagdlvgepgakgdrglpgpRGE 2340
Cdd:NF038329   276 KDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGK-------------------------------------DGK 318
                          330       340
                   ....*....|....*....|....*
gi 2462586997 2341 KGEAGRAGEPGDPGEDGQKGAPGPK 2365
Cdd:NF038329   319 DGQPGKDGLPGKDGKDGQPGKPAPK 343
vWFA cd00198
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
37-196 3.11e-32

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.


Pssm-ID: 238119 [Multi-domain]  Cd Length: 161  Bit Score: 124.22  E-value: 3.11e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   37 ADIVFLLDGSSSIGRSNFREVRSFLEGLVLPFSgaASAQGVRFATVQYSDDPRTEFGLDALGSGGDVIRAIRELSYK-GG 115
Cdd:cd00198      1 ADIVFLLDVSGSMGGEKLDKAKEALKALVSSLS--ASPPGDRVGLVTFGSNARVVLPLTTDTDKADLLEAIDALKKGlGG 78
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  116 NTRTGAAILHVADHvfLPQLARPGVPKVCILITDGKSQD---LVDTAAQRLKGQGVKLFAVGIKN-ADPEELKRVASQPT 191
Cdd:cd00198     79 GTNIGAALRLALEL--LKSAKRPNARRVIILLTDGEPNDgpeLLAEAARELRKLGITVYTIGIGDdANEDELKEIADKTT 156

                   ....*
gi 2462586997  192 SDFFF 196
Cdd:cd00198    157 GGAVF 161
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1979-2298 1.28e-29

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 124.63  E-value: 1.28e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1979 GPKGDSGEQGPPGKEGPIGFPGERGLKGDRGDPGPQGPPglalGERGPPGPSGLAGEPGKPGIPGLPGRAGGVGEAGRPG 2058
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQ----GERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKG 192
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2059 ERGERGEKGERGEQGRDGPPGLPGTPGPPGPPGpkvsvdEPGPGLSGEQGPpglkgaKGEPGSNGDQGPKGDRGVPGIKG 2138
Cdd:NF038329   193 PQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDG------PAGPAGDGQQGP------DGDPGPTGEDGPQGPDGPAGKDG 260
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2139 DRGEPGPRGQDGNPGLPGERGMAGPEGKpglqgprgppgpvgghgdpgppgapglagpagpQGPSGLKGEPGETGPpgrg 2218
Cdd:NF038329   261 PRGDRGEAGPDGPDGKDGERGPVGPAGK---------------------------------DGQNGKDGLPGKDGK---- 303
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2219 ltgptgavglpgppgpsglvgpqgspglPGQVGETGKPGAPGRDGASGKDGDRGSPGVPGSPGLPGPVGPKGEPGPTGAP 2298
Cdd:NF038329   304 ----------------------------DGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPKTPEVPQKPDTAP 355
VWA pfam00092
von Willebrand factor type A domain;
1054-1227 3.41e-29

von Willebrand factor type A domain;


Pssm-ID: 459670 [Multi-domain]  Cd Length: 174  Bit Score: 115.84  E-value: 3.41e-29
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1054 DVVFLPHATQD-NAHRAEATRRVLERLVLALGpLGPQAVQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGN 1132
Cdd:pfam00092    1 DIVFLLDGSGSiGGDNFEKVKEFLKKLVESLD-IGPDGTRVGLVQYSSDVRTEFPLNDYSSKEELLSAVDNLRYLGGGTT 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1133 NLGTAVVTAHRYMLAPDApGRRQHVPGVMVLLVD-EPLRGDIFSPIREAQASGLNVVMLGMAGADPEQLRRLApGMDSVQ 1211
Cdd:pfam00092   80 NTGKALKYALENLFSSAA-GARPGAPKVVVLLTDgRSQDGDPEEVARELKSAGVTVFAVGVGNADDEELRKIA-SEPGEG 157
                          170
                   ....*....|....*.
gi 2462586997 1212 TFFAVDDGPSLDQAVS 1227
Cdd:pfam00092  158 HVFTVSDFEALEDLQD 173
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2103-2416 1.49e-28

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 121.55  E-value: 1.49e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2103 LSGEQGPPGlkgAKGEPGSNGDQGPKGDRGVPGIKGDRGEPGPRGQDGNPGLPGERGMAGPegkpglqgprgppgpvggh 2182
Cdd:NF038329   115 GDGEKGEPG---PAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGP------------------- 172
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2183 gdpgppgapglagpagpQGPSGLKGEPGETGPPGRglTGPtgavglpgppgpsglvgpQGSPGLPGQVGETGKPGAPGRD 2262
Cdd:NF038329   173 -----------------QGPAGKDGEAGAKGPAGE--KGP------------------QGPRGETGPAGEQGPAGPAGPD 215
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2263 GASGKDGDRGSPGVPGSpglpGPVGPKGEPGPTGAPGQAvvglpgakgekgAPGGLAGdlvgEPGAKGDRGLPGPRGEKG 2342
Cdd:NF038329   216 GEAGPAGEDGPAGPAGD----GQQGPDGDPGPTGEDGPQ------------GPDGPAG----KDGPRGDRGEAGPDGPDG 275
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462586997 2343 EAGRAGEPGDPGEDGQKGAPGPKGFKGDPGVGvpgspgppgppgvkgdlGLPGLPGAPGVVGFPGQTGPRGEMG 2416
Cdd:NF038329   276 KDGERGPVGPAGKDGQNGKDGLPGKDGKDGQN-----------------GKDGLPGKDGKDGQPGKDGLPGKDG 332
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1783-2075 4.83e-27

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 116.93  E-value: 4.83e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1783 LDGKPGAAGPSGPNGAAGKAGDPGRDGlpglrgeqglpgpsgppglpgkpgEDGKPGLNGKNGEPGDPGEDGRKGEKGDS 1862
Cdd:NF038329   115 GDGEKGEPGPAGPAGPAGEQGPRGDRG------------------------ETGPAGPAGPPGPQGERGEKGPAGPQGEA 170
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1863 GASGREGRDGPKGERGAPGilgpqgppglpgpvgppgqgfpgvpgGTGPKGDRGETGSKGEQGLPGERGLRGEPGSvpnv 1942
Cdd:NF038329   171 GPQGPAGKDGEAGAKGPAG--------------------------EKGPQGPRGETGPAGEQGPAGPAGPDGEAGP---- 220
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1943 drlletagikasalreivetwDESSGSFLPVPERRRGPKGDSGEQGPPGKEGPIGFPGERGLKGDRGDPGPQGPPGLAlG 2022
Cdd:NF038329   221 ---------------------AGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKD-G 278
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2462586997 2023 ERGPPGPSGLAGEPGKPGIPGLPGRAGGVGEAGRPGERGERGEKGERGEQGRD 2075
Cdd:NF038329   279 ERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKD 331
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1871-2166 3.96e-26

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 114.23  E-value: 3.96e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1871 DGPKGERGAPGilgpqgppglpgpvgppgqgFPGVPGGTGPKGDRGETGSKGEQGLPGERGLRGEPGSvpnvdrlletag 1950
Cdd:NF038329   116 DGEKGEPGPAG--------------------PAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGP------------ 163
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1951 ikasalreivetwdessgsflpvperrRGPKGDSGEQGPPGKEGPIGFPGERGLKGDRGDPgpqgppglalGERGPPGPS 2030
Cdd:NF038329   164 ---------------------------AGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPR----------GETGPAGEQ 206
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2031 GLAGEPGKPGIPGLPGRAGGVGEAGRpGERGERGEKGERGEQGRDGPPGLpgtpgppgppgpkvsvdepgPGLSGEQGPP 2110
Cdd:NF038329   207 GPAGPAGPDGEAGPAGEDGPAGPAGD-GQQGPDGDPGPTGEDGPQGPDGP--------------------AGKDGPRGDR 265
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2462586997 2111 GLKGAKGEPGSNGDQGPKGDRGVPGIKGDRGEP---GPRGQDGNPGLPGERGMAGPEGK 2166
Cdd:NF038329   266 GEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPgkdGKDGQNGKDGLPGKDGKDGQPGK 324
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1743-2045 1.11e-25

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 113.08  E-value: 1.11e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1743 GERGIEGFRGPPGPQGDPGVRGPAGEKGDRG---PPGLDGRSGLDGKPGAAGPSGPNGAAGKAGDPGRDGLPGLRGEQGL 1819
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGpagPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1820 PGPSGPPGLPGKPGEDGKPGLNGKNGEPGDPGEDGRkGEKGDSGASGREGRDGPKGERGAPGilgpqgppglpgpvgppg 1899
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD-GQQGPDGDPGPTGEDGPQGPDGPAG------------------ 257
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1900 qgfpgvpgGTGPKGDRGETGSKGEQGLPGERGLRGEPGSvpnvdrlletagikasalreivetwdessgsflpvperrrg 1979
Cdd:NF038329   258 --------KDGPRGDRGEAGPDGPDGKDGERGPVGPAGK----------------------------------------- 288
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462586997 1980 pKGDSGEQGPPGKEGPIGFPGERGLKGDRgdpgpqgppglalGERGPPGPSGLAGEPGKPGIPGLP 2045
Cdd:NF038329   289 -DGQNGKDGLPGKDGKDGQNGKDGLPGKD-------------GKDGQPGKDGLPGKDGKDGQPGKP 340
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1304-1585 2.92e-25

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 111.54  E-value: 2.92e-25
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1304 DGRPGSPGRAGNPGTPGAPGLKGSPGLPGPRGDPGERGPRGPKGEPGAPGQViGGEGP-GLPGRKGDPgpsgppgprgpl 1382
Cdd:NF038329   116 DGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQ-GEAGPqGPAGKDGEA------------ 182
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1383 gdpgprgppglpgtamkGDKGDRGERGPPGPGEGGIAPGEPGLPGLPGSPGPQGPVGPPGKKGEKGDSEDGAPGLPGQPG 1462
Cdd:NF038329   183 -----------------GAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTG 245
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1463 SPGEQGPRGPPGAIGPKGDRGFPGPLGEAGEKGERGPPGPAGSRGLPGVAGrpgakgpegppgPTGRQGEKGEPGRPGDP 1542
Cdd:NF038329   246 EDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDG------------LPGKDGKDGQNGKDGLP 313
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|...
gi 2462586997 1543 avvgpavagpkGEKGDVGPAGPRGATGVQGERGPPGLVLPGDP 1585
Cdd:NF038329   314 -----------GKDGKDGQPGKDGLPGKDGKDGQPGKPAPKTP 345
vWA_micronemal_protein cd01471
Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a ...
38-175 5.57e-22

Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.


Pssm-ID: 238748 [Multi-domain]  Cd Length: 186  Bit Score: 95.53  E-value: 5.57e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   38 DIVFLLDGSSSIGRSN-FREVRSFLEGLV--LPFSgaasAQGVRFATVQYSDDPRTEFGLDALGS-----GGDVIRAIRE 109
Cdd:cd01471      2 DLYLLVDGSGSIGYSNwVTHVVPFLHTFVqnLNIS----PDEINLYLVTFSTNAKELIRLSSPNStnkdlALNAIRALLS 77
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462586997  110 LSYKGGNTRTGAAILHVADHVFLPQLARPGVPKVCILITDGKSQDLVDT--AAQRLKGQGVKLFAVGI 175
Cdd:cd01471     78 LYYPNGSTNTTSALLVVEKHLFDTRGNRENAPQLVIIMTDGIPDSKFRTlkEARKLRERGVIIAVLGV 145
vWFA_subfamily_ECM cd01450
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
1053-1204 2.20e-21

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains


Pssm-ID: 238727 [Multi-domain]  Cd Length: 161  Bit Score: 93.12  E-value: 2.20e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1053 ADVVFLPHATQ--DNAHRAEAtRRVLERLVLALgPLGPQAVQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPS 1130
Cdd:cd01450      1 LDIVFLLDGSEsvGPENFEKV-KDFIEKLVEKL-DIGPDKTRVGLVQYSDDVRVEFSLNDYKSKDDLLKAVKNLKYLGGG 78
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462586997 1131 GNNLGTAVVTAHRYMLAPDApgRRQHVPGVMVLLVDEPL--RGDIFSPIREAQASGLNVVMLGMAGADPEQLRRLA 1204
Cdd:cd01450     79 GTNTGKALQYALEQLFSESN--ARENVPKVIIVLTDGRSddGGDPKEAAAKLKDEGIKVFVVGVGPADEEELREIA 152
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
301-813 2.68e-21

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 101.23  E-value: 2.68e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  301 TEYQVTVIALYANSIGEAVSGTARTTALEGPELTIQNTTAHSLLVAWRSVPGATGYRVTWRVLSGGPTQQQELGPGQGSV 380
Cdd:COG3401     39 YLAVVLSVTTKESPGTLLVAAGLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVP 118
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  381 LLRDLEPGTDYEVTVSTLFGRSVGPATSLMARTDASVEQTLRPVILGPTSILLSWNLVPEARGYRLewrretgleppqkV 460
Cdd:COG3401    119 SPAVGTATTATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVSPDTSATAAVATTS-------------L 185
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  461 VLPSDVTRYQLDGLQPGTEYRLTLYTLLEGHEVATPATVVPTGPELPVSPVTDLQATELPGQRVRVSWSPVP--GATQYR 538
Cdd:COG3401    186 TVTSTTLVDGGGDIEPGTTYYYRVAATDTGGESAPSNEVSVTTPTTPPSAPTGLTATADTPGSVTLSWDPVTesDATGYR 265
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  539 IIVRSTQGVERTLVLPGSQTAFDLDDVQAGLSYTVRVSA--RVGPREGSASVLTVRRELETPLAVPGLRVVVSDATRVRV 616
Cdd:COG3401    266 VYRSNSGDGPFTKVATVTTTSYTDTGLTNGTTYYYRVTAvdAAGNESAPSNVVSVTTDLTPPAAPSGLTATAVGSSSITL 345
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  617 AWGPVPG--ASGFRISWSTGSGPESSQ--TLPPDSTATDiTGLQPGTTYQVAVSVL--RGREEGPAAVIVART------- 683
Cdd:COG3401    346 SWTASSDadVTGYNVYRSTSGGGTYTKiaETVTTTSYTD-TGLTPGTTYYYKVTAVdaAGNESAPSEEVSATTasaasge 424
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  684 DPLGPVRTVHVTQASSSSVTITWTRVPG--ATGYRVSWHSAHGPEKSQL-VSGEATVAELDGLEPDTEYTVHVRAHVAGV 760
Cdd:COG3401    425 SLTASVDAVPLTDVAGATAAASAASNPGvsAAVLADGGDTGNAVPFTTTsSTVTATTTDTTTANLSVTTGSLVGGSGASS 504
                          490       500       510       520       530
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2462586997  761 DGPPASVVVRTAPEPVGRVSRLQILNASSDVLRITWVGVTGATAYRLAWGRSE 813
Cdd:COG3401    505 VTNSVSVIGASAAAAVGGAPDGTPNVTGASPVTVGASTGDVLITDLVSLTTSA 557
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1254-1565 3.14e-21

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 99.21  E-value: 3.14e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1254 GQKGEPGEMGLRGQVGPPGDPGLPGRTGAPGPQGPPGSATAKGERGFPGADGRPGSPGRAGNPGTPGAPGLKGSPGLPGP 1333
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1334 RGDPGERGPRGPKGEPGAPGQVIGGEGPGLPGRKGDPGpsgppgprgplgdpgprgppglpgtamKGDKGDrgergppgp 1413
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQ---------------------------QGPDGD--------- 240
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1414 geggiaPGEPGLPGLPGSPgpqgpvGPPGKKGEKGDseDGAPGLPGQPGSPGEQGPRGPPGAIGPKGDRGFPGPLGEAGE 1493
Cdd:NF038329   241 ------PGPTGEDGPQGPD------GPAGKDGPRGD--RGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQ 306
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462586997 1494 KGERGPPGPAGSRGLPGVAGRPgakgpegppgptgrqGEKGEPGRPGDPAVVGPAVAgpkgEKGDVGPAGPR 1565
Cdd:NF038329   307 NGKDGLPGKDGKDGQPGKDGLP---------------GKDGKDGQPGKPAPKTPEVP----QKPDTAPHTPK 359
vWA_collagen_alpha_1-VI-type cd01480
VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable ...
36-192 3.57e-21

VWA_collagen alpha(VI) type: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238757 [Multi-domain]  Cd Length: 186  Bit Score: 93.22  E-value: 3.57e-21
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   36 AADIVFLLDGSSSIGRSNFREVRSFLEGLVLPFSGAA----SAQGVRFATVQYSDDPRTEFGLDALGSGGDVIR-AIREL 110
Cdd:cd01480      2 PVDITFVLDSSESVGLQNFDITKNFVKRVAERFLKDYyrkdPAGSWRVGVVQYSDQQEVEAGFLRDIRNYTSLKeAVDNL 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  111 SYKGGNTRTGAAILHVADhvFLPQLARPGVPKVCILITDGKSQDLVDT----AAQRLKGQGVKLFAVGIKNADPEELKRV 186
Cdd:cd01480     82 EYIGGGTFTDCALKYATE--QLLEGSHQKENKFLLVITDGHSDGSPDGgiekAVNEADHLGIKIFFVAVGSQNEEPLSRI 159

                   ....*.
gi 2462586997  187 ASQPTS 192
Cdd:cd01480    160 ACDGKS 165
VWA smart00327
von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins ...
1054-1214 1.39e-19

von Willebrand factor (vWF) type A domain; VWA domains in extracellular eukaryotic proteins mediate adhesion via metal ion-dependent adhesion sites (MIDAS). Intracellular VWA domains and homologues in prokaryotes have recently been identified. The proposed VWA domains in integrin beta subunits have recently been substantiated using sequence-based methods.


Pssm-ID: 214621 [Multi-domain]  Cd Length: 175  Bit Score: 88.28  E-value: 1.39e-19
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  1054 DVVFLPHATQDNAH-RAEATRRVLERLVLALgPLGPQAVQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGN 1132
Cdd:smart00327    1 DVVFLLDGSGSMGGnRFELAKEFVLKLVEQL-DIGPDGDRVGLVTFSDDARVLFPLNDSRSKDALLEALASLSYKLGGGT 79
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  1133 NLGTAVVTAHRYMLAPDAPGRRQhVPGVMVLLVD---EPLRGDIFSPIREAQASGLNVVMLGM-AGADPEQLRRLAPGMD 1208
Cdd:smart00327   80 NLGAALQYALENLFSKSAGSRRG-APKVVILITDgesNDGPKDLLKAAKELKRSGVKVFVVGVgNDVDEEELKKLASAPG 158

                    ....*.
gi 2462586997  1209 SVQTFF 1214
Cdd:smart00327  159 GVYVFL 164
VWA_integrin_invertebrates cd01476
VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have ...
37-195 1.18e-18

VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.


Pssm-ID: 238753 [Multi-domain]  Cd Length: 163  Bit Score: 85.14  E-value: 1.18e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   37 ADIVFLLDGSSSIgRSNFREVRSFLEGLV--LPFSGAAsaqgVRFATVQYSDDPRT--EFGLDALGSGGDVIRAIRELSY 112
Cdd:cd01476      1 LDLLFVLDSSGSV-RGKFEKYKKYIERIVegLEIGPTA----TRVALITYSGRGRQrvRFNLPKHNDGEELLEKVDNLRF 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  113 KGGNTRTGAAILHVADHVFLPQLARPGVPKVCILITDGKSQDLVDTAAQRLKGQ-GVKLFAVGIK---NADPEELKRVAS 188
Cdd:cd01476     76 IGGTTATGAAIEVALQQLDPSEGRREGIPKVVVVLTDGRSHDDPEKQARILRAVpNIETFAVGTGdpgTVDTEELHSITG 155

                   ....*..
gi 2462586997  189 QPTSDFF 195
Cdd:cd01476    156 NEDHIFT 162
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
526-876 7.13e-18

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 90.06  E-value: 7.13e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  526 VSWSPVPGATQYRIIVRSTQGVERTLVLPGSQTAFDLDDVQAGLSYTVRVSARVGPREGSASVLTVRRELETPLAVPGLR 605
Cdd:COG3401     60 GLSSGGGLGTGGRAGTTSGVAAVAVAAAPPTATGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATTATAVAGGAATAG 139
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  606 VVVSDATRVRVAWGPVPGASGFRISWSTGSGPES----------SQTLPPDSTATDITGLQPGTTYQVAVSVLRGREEGP 675
Cdd:COG3401    140 TYALGAGLYGVDGANASGTTASSVAGAGVVVSPDtsataavattSLTVTSTTLVDGGGDIEPGTTYYYRVAATDTGGESA 219
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  676 AA----VIVARTDPLGPVRtVHVTQASSSSVTITWTRVP--GATGYRVSWHSAHGPEKSQLVSGEATVAELDGLEPDTEY 749
Cdd:COG3401    220 PSnevsVTTPTTPPSAPTG-LTATADTPGSVTLSWDPVTesDATGYRVYRSNSGDGPFTKVATVTTTSYTDTGLTNGTTY 298
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  750 TVHVRAH-VAGVDGPPASVV-VRTAPEPVGRVSRLQILNASSDVLRITWVGVTG--ATAYRLAWGRSEGGPMRHQILPGN 825
Cdd:COG3401    299 YYRVTAVdAAGNESAPSNVVsVTTDLTPPAAPSGLTATAVGSSSITLSWTASSDadVTGYNVYRSTSGGGTYTKIAETVT 378
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2462586997  826 TDSAEIRGLEGGVSYSVRVTAL-VGDREGTPVSIVVTTPPEAPPALGTLHVV 876
Cdd:COG3401    379 TTSYTDTGLTPGTTYYYKVTAVdAAGNESAPSEEVSATTASAASGESLTASV 430
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
561-1045 2.54e-17

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 88.52  E-value: 2.54e-17
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  561 DLDDVQAGLSYTVRVSARVGPREGSASVLTVRRELETPLAVPGLRVVVSDATRVRVAWGPVPGAS-----GFRISWSTGS 635
Cdd:COG3401     12 GIAASAAANTAVNALSKAGGSGKTILVYLAVVLSVTTKESPGTLLVAAGLSSGGGLGTGGRAGTTsgvaaVAVAAAPPTA 91
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  636 GPESSQTLPPDSTATDITGLQPGTTYQVAVSVLRGREEGPAAVIVARTDPLGPVRTVHVTQASSSSVTITWTRVPGATGY 715
Cdd:COG3401     92 TGLTTLTGSGSVGGATNTGLTSSDEVPSPAVGTATTATAVAGGAATAGTYALGAGLYGVDGANASGTTASSVAGAGVVVS 171
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  716 RVSWHSAHGPEKSQLVSGEATVAELDGLEPDTEYTVHVRAHVAGVDGPPASVV----VRTAPEPVgrvSRLQILNASSDV 791
Cdd:COG3401    172 PDTSATAAVATTSLTVTSTTLVDGGGDIEPGTTYYYRVAATDTGGESAPSNEVsvttPTTPPSAP---TGLTATADTPGS 248
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  792 LRITWVGVT--GATAYRLAWGRSEGGPMRhQILPGNTDSAEIRGLEGGVSYSVRVTALVGD-REGTPVSIV-VTTPPEAP 867
Cdd:COG3401    249 VTLSWDPVTesDATGYRVYRSNSGDGPFT-KVATVTTTSYTDTGLTNGTTYYYRVTAVDAAgNESAPSNVVsVTTDLTPP 327
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  868 PALGTLHVVQRGEHSLRLRWEPV--PRAQGFLLHWQPEGGQEQSRVLGPELSSYHLD-GLEPATQYRVRLSVLGPAGE-- 942
Cdd:COG3401    328 AAPSGLTATAVGSSSITLSWTASsdADVTGYNVYRSTSGGGTYTKIAETVTTTSYTDtGLTPGTTYYYKVTAVDAAGNes 407
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  943 GPSAEVTARTESPRV-PSIELRVVDTSIDSVTLAWTPVSRASSYILSWRPLRGPGQEVPGSPqtlPGISSSQRVTGLEPG 1021
Cdd:COG3401    408 APSEEVSATTASAASgESLTASVDAVPLTDVAGATAAASAASNPGVSAAVLADGGDTGNAVP---FTTTSSTVTATTTDT 484
                          490       500
                   ....*....|....*....|....
gi 2462586997 1022 VSYIFSLTPVLDGVRGPEASVTQT 1045
Cdd:COG3401    485 TTANLSVTTGSLVGGSGASSVTNS 508
fn3 pfam00041
Fibronectin type III domain;
233-317 3.90e-16

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 75.53  E-value: 3.90e-16
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  233 SAPRDLVLSEPSSQSLRVQWTAA---SGPVTGYKVQYTPLTGLGQPlpserQEVNVPAGETSVRLRGLRPLTEYQVTVIA 309
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPpdgNGPITGYEVEYRPKNSGEPW-----NEITVPGTTTSVTLTGLKPGTEYEVRVQA 75

                   ....*...
gi 2462586997  310 LYANSIGE 317
Cdd:pfam00041   76 VNGGGEGP 83
vWA_collagen_alphaI-XII-like cd01482
Collagen: The extracellular matrix represents a complex alloy of variable members of diverse ...
1053-1218 2.09e-14

Collagen: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238759 [Multi-domain]  Cd Length: 164  Bit Score: 73.09  E-value: 2.09e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1053 ADVVFLPHAT----QDNAHRaeaTRRVLERLVLALGpLGPQAVQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMd 1128
Cdd:cd01482      1 ADIVFLVDGSwsigRSNFNL---VRSFLSSVVEAFE-IGPDGVQVGLVQYSDDPRTEFDLNAYTSKEDVLAAIKNLPYK- 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1129 pSGN-NLGTAVVTAHRYMLAPDApGRRQHVPGVMVLLVDEPLRGDIFSPIREAQASGLNVVMLGMAGADPEQLRRLApGM 1207
Cdd:cd01482     76 -GGNtRTGKALTHVREKNFTPDA-GARPGVPKVVILITDGKSQDDVELPARVLRNLGVNVFAVGVKDADESELKMIA-SK 152
                          170
                   ....*....|.
gi 2462586997 1208 DSVQTFFAVDD 1218
Cdd:cd01482    153 PSETHVFNVAD 163
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
687-771 1.78e-13

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 68.29  E-value: 1.78e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  687 GPVRTVHVTQASSSSVTITWTRVPGA----TGYRVSWHSAHGPEKSQL--VSGEATVAELDGLEPDTEYTVHVRAHVAGV 760
Cdd:cd00063      2 SPPTNLRVTDVTSTSVTLSWTPPEDDggpiTGYVVEYREKGSGDWKEVevTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81
                           90
                   ....*....|..
gi 2462586997  761 DGPPA-SVVVRT 771
Cdd:cd00063     82 ESPPSeSVTVTT 93
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
233-326 3.32e-13

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 67.52  E-value: 3.32e-13
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  233 SAPRDLVLSEPSSQSLRVQWTAAS---GPVTGYKVQYTPLTGlgqplpSERQEVNVPAG-ETSVRLRGLRPLTEYQVTVI 308
Cdd:cd00063      2 SPPTNLRVTDVTSTSVTLSWTPPEddgGPITGYVVEYREKGS------GDWKEVEVTPGsETSYTLTGLKPGTEYEFRVR 75
                           90
                   ....*....|....*...
gi 2462586997  309 ALYANSIGEAVSGTARTT 326
Cdd:cd00063     76 AVNGGGESPPSESVTVTT 93
vWA_collagen cd01472
von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This ...
1053-1204 1.56e-12

von Willebrand factor (vWF) type A domain; equivalent to the I-domain of integrins. This domain has a variety of functions including: intermolecular adhesion, cell migration, signalling, transcription, and DNA repair. In integrins these domains form heterodimers while in vWF it forms homodimers and multimers. There are different interaction surfaces of this domain as seen by its complexes with collagen with either integrin or human vWFA. In integrins collagen binding occurs via the metal ion-dependent adhesion site (MIDAS) and involves three surface loops located on the upper surface of the molecule. In human vWFA, collagen binding is thought to occur on the bottom of the molecule and does not involve the vestigial MIDAS motif.


Pssm-ID: 238749 [Multi-domain]  Cd Length: 164  Bit Score: 67.64  E-value: 1.56e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1053 ADVVFLPHaTQDNA--HRAEATRRVLERLVLALGPlGPQAVQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDpS 1130
Cdd:cd01472      1 ADIVFLVD-GSESIglSNFNLVKDFVKRVVERLDI-GPDGVRVGVVQYSDDPRTEFYLNTYRSKDDVLEAVKNLRYIG-G 77
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462586997 1131 GNNLGTAVVTAHRYMLAPDApGRRQHVPGVMVLLVDEPLRGDIFSPIREAQASGLNVVMLGMAGADPEQLRRLA 1204
Cdd:cd01472     78 GTNTGKALKYVRENLFTEAS-GSREGVPKVLVVITDGKSQDDVEEPAVELKQAGIEVFAVGVKNADEEELKQIA 150
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
233-316 1.86e-12

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 64.94  E-value: 1.86e-12
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   233 SAPRDLVLSEPSSQSLRVQWTAASGP-VTGYKVQYTPLtglGQPLPSERQEVNVPAGETSVRLRGLRPLTEYQVTVIALY 311
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEPPPDDgITGYIVGYRVE---YREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVN 78

                    ....*
gi 2462586997   312 ANSIG 316
Cdd:smart00060   79 GAGEG 83
fn3 pfam00041
Fibronectin type III domain;
687-764 4.98e-12

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 63.59  E-value: 4.98e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  687 GPVRTVHVTQASSSSVTITWTRVPGA----TGYRVSWHSAHGPE--KSQLVSGEATVAELDGLEPDTEYTVHVRAHVAGV 760
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPDGngpiTGYEVEYRPKNSGEpwNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ....
gi 2462586997  761 DGPP 764
Cdd:pfam00041   81 EGPP 84
VWA_2 pfam13519
von Willebrand factor type A domain;
39-147 2.74e-11

von Willebrand factor type A domain;


Pssm-ID: 463909 [Multi-domain]  Cd Length: 103  Bit Score: 62.31  E-value: 2.74e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   39 IVFLLDGSSSI-----GRSNFREVRSFLEGLVlpfsgaASAQGVRFATVQYSDDPRTEFGLDalGSGGDVIRAIRELSYK 113
Cdd:pfam13519    1 LVFVLDTSGSMrngdyGPTRLEAAKDAVLALL------KSLPGDRVGLVTFGDGPEVLIPLT--KDRAKILRALRRLEPK 72
                           90       100       110
                   ....*....|....*....|....*....|....
gi 2462586997  114 GGNTRTGAAILHVADHVflpQLARPGVPKVCILI 147
Cdd:pfam13519   73 GGGTNLAAALQLARAAL---KHRRKNQPRRIVLI 103
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
687-756 2.80e-11

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 61.48  E-value: 2.80e-11
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462586997   687 GPVRTVHVTQASSSSVTITWTRVPGA------TGYRVSWHSAHGPEKSQLVSGEATVAELDGLEPDTEYTVHVRAH 756
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEPPPDDgitgyiVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAV 77
vWA_ATR cd01474
ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, ...
36-217 3.43e-11

ATR (Anthrax Toxin Receptor): Anthrax toxin is a key virulence factor for Bacillus anthracis, the causative agent of anthrax. ATR is the cellular receptor for the anthrax protective antigen and facilitates entry of the toxin into cells. The VWA domain in ATR contains the toxin binding site and mediates interaction with protective antigen. The binding is mediated by divalent cations that binds to the MIDAS motif. These proteins are a family of vertebrate ECM receptors expressed by endothelial cells.


Pssm-ID: 238751 [Multi-domain]  Cd Length: 185  Bit Score: 64.45  E-value: 3.43e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   36 AADIVFLLDGSSSIGrSNFREVRSFLEGLVLPFSgaasAQGVRFATVQYSDDPRTEFGLDalGSGGDVIRAIRELS--YK 113
Cdd:cd01474      4 HFDLYFVLDKSGSVA-ANWIEIYDFVEQLVDRFN----SPGLRFSFITFSTRATKILPLT--DDSSAIIKGLEVLKkvTP 76
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  114 GGNTRTGAAILHVADHVFLPQLARPGVPKVCILITDGKSQDLV----DTAAQRLKGQGVKLFAVGIKNADPEELKRVASQ 189
Cdd:cd01474     77 SGQTYIHEGLENANEQIFNRNGGGRETVSVIIALTDGQLLLNGhkypEHEAKLSRKLGAIVYCVGVTDFLKSQLINIADS 156
                          170       180
                   ....*....|....*....|....*....
gi 2462586997  190 PtsDFFFFVND-FSILRTLLPLVSRRVCT 217
Cdd:cd01474    157 K--EYVFPVTSgFQALSGIIESVVKKACI 183
ChlD COG1240
vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and ...
3-189 3.49e-11

vWFA (von Willebrand factor type A) domain of Mg and Co chelatases [Coenzyme transport and metabolism];


Pssm-ID: 440853 [Multi-domain]  Cd Length: 262  Bit Score: 66.12  E-value: 3.49e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997    3 LRLLVAALCAGILAEAPRVRAQHRERVtctrlyAADIVFLLDGSSSIGRSN-FREVRSFLEGLVlpfsgAASAQGVRFAT 81
Cdd:COG1240     65 ALLLLLAVLLLLLALALAPLALARPQR------GRDVVLVVDASGSMAAENrLEAAKGALLDFL-----DDYRPRDRVGL 133
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   82 VQYSDDPRTEFGLDAlgSGGDVIRAIRELSYKGGnTRTGAAILHVADHVflpQLARPGVPKVCILITDGK---SQDLVDT 158
Cdd:COG1240    134 VAFGGEAEVLLPLTR--DREALKRALDELPPGGG-TPLGDALALALELL---KRADPARRKVIVLLTDGRdnaGRIDPLE 207
                          170       180       190
                   ....*....|....*....|....*....|...
gi 2462586997  159 AAQRLKGQGVKLFAVGI--KNADPEELKRVASQ 189
Cdd:COG1240    208 AAELAAAAGIRIYTIGVgtEAVDEGLLREIAEA 240
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
867-952 5.93e-11

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 60.97  E-value: 5.93e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  867 PPALGTLHVVQRGEHSLRLRWEPVP----RAQGFLLHWQPEGGQEQSRVLGPELS--SYHLDGLEPATQYRVRLSVLGPA 940
Cdd:cd00063      1 PSPPTNLRVTDVTSTSVTLSWTPPEddggPITGYVVEYREKGSGDWKEVEVTPGSetSYTLTGLKPGTEYEFRVRAVNGG 80
                           90
                   ....*....|...
gi 2462586997  941 GEG-PSAEVTART 952
Cdd:cd00063     81 GESpPSESVTVTT 93
fn3 pfam00041
Fibronectin type III domain;
333-407 1.56e-09

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 56.65  E-value: 1.56e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  333 LTIQNTTAHSLLVAWRSVPGA----TGYRVTWR-VLSGGPTQQQELGPGQGSVLLRDLEPGTDYEVTVSTLFGRSVGPAT 407
Cdd:pfam00041    6 LTVTDVTSTSLTVSWTPPPDGngpiTGYEVEYRpKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEGPPS 85
fn3 pfam00041
Fibronectin type III domain;
426-487 1.70e-09

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 56.65  E-value: 1.70e-09
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2462586997  426 LGPTSILLSWNLVPEARG----YRLEWRRETGLEPPQKVVLPSDVTRYQLDGLQPGTEYRLTLYTL 487
Cdd:pfam00041   11 VTSTSLTVSWTPPPDGNGpitgYEVEYRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAV 76
vWFA cd00198
Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation ...
1070-1213 2.15e-09

Von Willebrand factor type A (vWA) domain was originally found in the blood coagulation protein von Willebrand factor (vWF). Typically, the vWA domain is made up of approximately 200 amino acid residues folded into a classic a/b para-rossmann type of fold. The vWA domain, since its discovery, has drawn great interest because of its widespread occurrence and its involvement in a wide variety of important cellular functions. These include basal membrane formation, cell migration, cell differentiation, adhesion, haemostasis, signaling, chromosomal stability, malignant transformation and in immune defenses In integrins these domains form heterodimers while in vWF it forms multimers. There are different interaction surfaces of this domain as seen by the various molecules it complexes with. Ligand binding in most cases is mediated by the presence of a metal ion dependent adhesion site termed as the MIDAS motif that is a characteristic feature of most, if not all A domains.


Pssm-ID: 238119 [Multi-domain]  Cd Length: 161  Bit Score: 58.73  E-value: 2.15e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1070 EATRRVLERLVLALGPLGPQaVQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPSGNNLGTAVVTAHRYMLAPD 1149
Cdd:cd00198     19 DKAKEALKALVSSLSASPPG-DRVGLVTFGSNARVVLPLTTDTDKADLLEAIDALKKGLGGGTNIGAALRLALELLKSAK 97
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462586997 1150 APGRRQhvpgVMVLLVDEPLRGDIFSP---IREAQASGLNV--VMLGMaGADPEQLRRLAPGMDSVQTF 1213
Cdd:cd00198     98 RPNARR----VIILLTDGEPNDGPELLaeaARELRKLGITVytIGIGD-DANEDELKEIADKTTGGAVF 161
ViaA COG2425
Uncharacterized conserved protein, contains a von Willebrand factor type A (vWA) domain ...
2-189 2.67e-09

Uncharacterized conserved protein, contains a von Willebrand factor type A (vWA) domain [Function unknown];


Pssm-ID: 441973 [Multi-domain]  Cd Length: 263  Bit Score: 60.46  E-value: 2.67e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997    2 TLRLLVAALCAGILAEAPRVRAQHRERVtctRLYAADIVFLLDGSSSIGRSNFREVRSFLEGLVlpfsgAASAQGVRFAT 81
Cdd:COG2425     87 ALLLAVLLLALLLLAALLLLAAPASAAV---PLLEGPVVLCVDTSGSMAGSKEAAAKAAALALL-----RALRPNRRFGV 158
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   82 VQYSDDPRTEFGLDALGSGGDVIRAIRELSYKGGnTRTGAAILHVADHVFLPQLARpgvpKVCILITDGKSQDLVDTAAQ 161
Cdd:COG2425    159 ILFDTEVVEDLPLTADDGLEDAIEFLSGLFAGGG-TDIAPALRAALELLEEPDYRN----ADIVLITDGEAGVSPEELLR 233
                          170       180       190
                   ....*....|....*....|....*....|
gi 2462586997  162 RL--KGQGVKLFAVGIKNADPEELKRVASQ 189
Cdd:COG2425    234 EVraKESGVRLFTVAIGDAGNPGLLEALAD 263
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
2287-2440 5.85e-09

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 61.07  E-value: 5.85e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2287 GPKGEPGPTGAPGQAvvglpgakgekgapgglagdlvGEPGAKGDRGLPGPRGEKGEAGRAGEPGDPGEDGQKGAPGPKG 2366
Cdd:NF038329   117 GEKGEPGPAGPAGPA----------------------GEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQG 174
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462586997 2367 fkgdpgvgvpgspgppgPPGVKGDLGLPGLPGAPGVVGFPGQTGPRGEMGSGRPPRERRNPRTPGATWTTGVSG 2440
Cdd:NF038329   175 -----------------PAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG 231
fn3 pfam00041
Fibronectin type III domain;
868-944 6.24e-09

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 55.11  E-value: 6.24e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  868 PALGTLHVVQRGEHSLRLRWEPVPRAQG----FLLHWQPEGGQE--QSRVLGPELSSYHLDGLEPATQYRVRLSVLGPAG 941
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPDGNGpitgYEVEYRPKNSGEpwNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ...
gi 2462586997  942 EGP 944
Cdd:pfam00041   81 EGP 83
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
598-680 7.84e-09

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 54.81  E-value: 7.84e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  598 PLAVPGLRVVVSDATRVRVAWGPVPGA----SGFRISWSTGSGPESSQ--TLPPDSTATDITGLQPGTTYQVAVSVLRGR 671
Cdd:cd00063      1 PSPPTNLRVTDVTSTSVTLSWTPPEDDggpiTGYVVEYREKGSGDWKEveVTPGSETSYTLTGLKPGTEYEFRVRAVNGG 80

                   ....*....
gi 2462586997  672 EEGPAAVIV 680
Cdd:cd00063     81 GESPPSESV 89
COG4733 COG4733
Phage-related protein, tail protein J [Mobilome: prophages, transposons];
495-693 1.20e-08

Phage-related protein, tail protein J [Mobilome: prophages, transposons];


Pssm-ID: 443767 [Multi-domain]  Cd Length: 978  Bit Score: 60.73  E-value: 1.20e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  495 TPATVVPTGPELPVSPVTDLQATELPGQRVRVSWSPVPGATQYRIIVRSTQGvERTLVLPGSQTAFDLDDVQAGlSYTVR 574
Cdd:COG4733    525 DDVPPQWPPVNVTTSESLSVVAQGTAVTTLTVSWDAPAGAVAYEVEWRRDDG-NWVSVPRTSGTSFEVPGIYAG-DYEVR 602
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  575 VSA-----RVGPREGSASVLTVRRELeTPLAVPGLRvVVSDATRVRVAWGPVPGA--SGFRISWSTGSGPESS---QTLP 644
Cdd:COG4733    603 VRAinalgVSSAWAASSETTVTGKTA-PPPAPTGLT-ATGGLGGITLSWSFPVDAdtLRTEIRYSTTGDWASAtvaQALY 680
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2462586997  645 PDSTATDiTGLQPGTTYQVAVSVL--RGREEGPAAVIVARTDPLGPVRTVH 693
Cdd:COG4733    681 PGNTYTL-AGLKAGQTYYYRARAVdrSGNVSAWWVSGQASADAAGILDAIT 730
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
867-943 1.41e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 53.77  E-value: 1.41e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   867 PPALGTLHVVQRGEHSLRLRWEPVPRAQ--GFLLHWQPEGGQEQSRVL----GPELSSYHLDGLEPATQYRVRLSVLGPA 940
Cdd:smart00060    1 PSPPSNLRVTDVTSTSVTLSWEPPPDDGitGYIVGYRVEYREEGSEWKevnvTPSSTSYTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 2462586997   941 GEG 943
Cdd:smart00060   81 GEG 83
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1191-1367 2.86e-08

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 58.76  E-value: 2.86e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1191 GMAGADPEQLRRLAPGMDSVQTFfAVDDGPSLDQAVSGLATALCQASFTTQPRPEPCPVYCPKGQKGEPGEMGLRGQVGP 1270
Cdd:NF038329   174 GPAGKDGEAGAKGPAGEKGPQGP-RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGP 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1271 PGDPGLPGRTGAPGPQGPPGSATAKGERGFPGADGRPGSPGRAGNPGTPGAPGLKGSPGLPGPRGDPGERGPRGPKGEPG 1350
Cdd:NF038329   253 DGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDG 332
                          170
                   ....*....|....*..
gi 2462586997 1351 APGQviggegPGLPGRK 1367
Cdd:NF038329   333 KDGQ------PGKPAPK 343
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
333-413 3.16e-08

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 53.27  E-value: 3.16e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  333 LTIQNTTAHSLLVAWRSVPGA----TGYRVTWRVLSGGPTQQQELGPG-QGSVLLRDLEPGTDYEVTVSTLFGRSVG-PA 406
Cdd:cd00063      7 LRVTDVTSTSVTLSWTPPEDDggpiTGYVVEYREKGSGDWKEVEVTPGsETSYTLTGLKPGTEYEFRVRAVNGGGESpPS 86

                   ....*..
gi 2462586997  407 TSLMART 413
Cdd:cd00063     87 ESVTVTT 93
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1299-1355 3.67e-08

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 51.73  E-value: 3.67e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462586997 1299 GFPGADGRPGSPGRAGNPGTPGAPGLKGSPGLPGPRGDPGERGPRGPKGEPGAPGQV 1355
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
598-674 4.46e-08

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 52.62  E-value: 4.46e-08
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   598 PLAVPGLRVVVSDATRVRVAWGPVPGAS------GFRISWSTGSGPESSQTLPPDSTATDITGLQPGTTYQVAVSVLRGR 671
Cdd:smart00060    1 PSPPSNLRVTDVTSTSVTLSWEPPPDDGitgyivGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGA 80

                    ...
gi 2462586997   672 EEG 674
Cdd:smart00060   81 GEG 83
TerY COG4245
Uncharacterized conserved protein YegL, contains vWA domain of TerY type [Function unknown];
38-189 4.77e-08

Uncharacterized conserved protein YegL, contains vWA domain of TerY type [Function unknown];


Pssm-ID: 443387 [Multi-domain]  Cd Length: 196  Bit Score: 55.32  E-value: 4.77e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   38 DIVFLLDGSSSIGRSNFREVRSFLEGLVLPFSGAASAQG-VRFATVQYSDDPRT--------EFGLDALGSGGDvirair 108
Cdd:COG4245      7 PVYLLLDTSGSMSGEPIEALNEGLQALIDELRQDPYALEtVEVSVITFDGEAKVllpltdleDFQPPDLSASGG------ 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  109 elsykggnTRTGAAILHVADHV-----FLPQLARPGVPKVCILITDGKSQDL-VDTAAQRLK----GQGVKLFAVGI-KN 177
Cdd:COG4245     81 --------TPLGAALELLLDLIerrvqKYTAEGKGDWRPVVFLITDGEPTDSdWEAALQRLKdgeaAKKANIFAIGVgPD 152
                          170
                   ....*....|..
gi 2462586997  178 ADPEELKRVASQ 189
Cdd:COG4245    153 ADTEVLKQLTDP 164
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1451-1506 6.17e-08

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 50.96  E-value: 6.17e-08
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462586997 1451 EDGAPGLPGQPGSPGEQGPRGPPGAIGPKGDRGFPGPLGEAGEKGERGPPGPAGSR 1506
Cdd:pfam01391    2 PPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
419-492 1.01e-07

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 51.73  E-value: 1.01e-07
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462586997  419 QTLRPVILGPTSILLSWNLVP----EARGYRLEWRRETGLEPPQKVVLPSDVTRYQLDGLQPGTEYRLTLYTLLEGHE 492
Cdd:cd00063      5 TNLRVTDVTSTSVTLSWTPPEddggPITGYVVEYREKGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGGE 82
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
955-1044 1.02e-07

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 51.73  E-value: 1.02e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  955 PRVPSiELRVVDTSIDSVTLAWTPV----SRASSYILSWRPL-RGPGQEVPGSPQTlpgiSSSQRVTGLEPGVSYIFSLT 1029
Cdd:cd00063      1 PSPPT-NLRVTDVTSTSVTLSWTPPeddgGPITGYVVEYREKgSGDWKEVEVTPGS----ETSYTLTGLKPGTEYEFRVR 75
                           90
                   ....*....|....*
gi 2462586997 1030 PVLDGVRGPEASVTQ 1044
Cdd:cd00063     76 AVNGGGESPPSESVT 90
fn3 pfam00041
Fibronectin type III domain;
777-855 1.03e-07

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 51.65  E-value: 1.03e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  777 GRVSRLQILNASSDVLRITWVGVTGA----TAYRLA-WGRSEGGPMRHQILPGNTDSAEIRGLEGGVSYSVRVTALVGDR 851
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPDGngpiTGYEVEyRPKNSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ....
gi 2462586997  852 EGTP 855
Cdd:pfam00041   81 EGPP 84
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
2242-2298 1.06e-07

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 50.57  E-value: 1.06e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462586997 2242 GSPGLPGQVGETGKPGAPGRDGASGKDGDRGSPGVPGSPGLPGPVGPKGEPGPTGAP 2298
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
fn3 pfam00041
Fibronectin type III domain;
600-675 1.42e-07

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 51.26  E-value: 1.42e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  600 AVPGLRVVVSDATRVRVAWGPVPGASG----FRISWSTGSGPES--SQTLPPDSTATDITGLQPGTTYQVAVSVLRGREE 673
Cdd:pfam00041    2 APSNLTVTDVTSTSLTVSWTPPPDGNGpitgYEVEYRPKNSGEPwnEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGE 81

                   ..
gi 2462586997  674 GP 675
Cdd:pfam00041   82 GP 83
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1453-1509 2.20e-07

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 49.41  E-value: 2.20e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462586997 1453 GAPGLPGQPGSPGEQGPRGPPGAIGPKGDRGFPGPLGEAGEKGERGPPGPAGSRGLP 1509
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1296-1349 3.46e-07

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 49.03  E-value: 3.46e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462586997 1296 GERGFPGADGRPGSPGRAGNPGTPGAPGLKGSPGLPGPRGDPGERGPRGPKGEP 1349
Cdd:pfam01391    4 GPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
vWA_collagen_alpha3-VI-like cd01481
VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable ...
1053-1204 4.49e-07

VWA_collagen alpha 3(VI) like: The extracellular matrix represents a complex alloy of variable members of diverse protein families defining structural integrity and various physiological functions. The most abundant family is the collagens with more than 20 different collagen types identified thus far. Collagens are centrally involved in the formation of fibrillar and microfibrillar networks of the extracellular matrix, basement membranes as well as other structures of the extracellular matrix. Some collagens have about 15-18 vWA domains in them. The VWA domains present in these collagens mediate protein-protein interactions.


Pssm-ID: 238758  Cd Length: 165  Bit Score: 51.94  E-value: 4.49e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1053 ADVVFLPHATqDNAHRAE--ATRRVLERLVLALGpLGPQAVQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPS 1130
Cdd:cd01481      1 KDIVFLIDGS-DNVGSGNfpAIRDFIERIVQSLD-VGPDKIRVAVVQFSDTPRPEFYLNTHSTKADVLGAVRRLRLRGGS 78
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 2462586997 1131 GNNLGTAVVTAHRYMLAPDAPGR-RQHVPGVMVLLVDEPLRGDIFSPIREAQASGLNVVMLGMAGADPEQLRRLA 1204
Cdd:cd01481     79 QLNTGSALDYVVKNLFTKSAGSRiEEGVPQFLVLITGGKSQDDVERPAVALKRAGIVPFAIGARNADLAELQQIA 153
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
2111-2166 4.60e-07

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 48.64  E-value: 4.60e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462586997 2111 GLKGAKGEPGSNGDQGPKGDRGVPGIKGDRGEPGPRGQDGNPGLPGERGMAGPEGK 2166
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
509-584 4.98e-07

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 49.54  E-value: 4.98e-07
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   509 SPVTDLQATELPGQRVRVSWSPVPGA------TQYRIIVRSTQGVERTLVLPGSQTAFDLDDVQAGLSYTVRVSARVGPR 582
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWEPPPDDgitgyiVGYRVEYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAG 81

                    ..
gi 2462586997   583 EG 584
Cdd:smart00060   82 EG 83
COG4733 COG4733
Phage-related protein, tail protein J [Mobilome: prophages, transposons];
212-654 5.23e-07

Phage-related protein, tail protein J [Mobilome: prophages, transposons];


Pssm-ID: 443767 [Multi-domain]  Cd Length: 978  Bit Score: 55.34  E-value: 5.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  212 SRRVCTTAGGVPVTRPPDDSTSAPRDLVLSEPSSQSLRVQWTAASGP-VTgykvqytpLTGLGQPLPSERQEVNVPAGET 290
Cdd:COG4733    415 GGRVSSVDGRVVTLDRPVTMEAGDRYLRVRLPDGTSVARTVQSVAGRtLT--------VSTAYSETPEAGAVWAFGPDEL 486
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  291 S---VRLRGLRPLTEYQVTVIALYANSIGEAV--SGTARTTALEGPELTI----------QNTTAHSLLVAWRSVPGATG 355
Cdd:COG4733    487 EtqlFRVVSIEENEDGTYTITAVQHAPEKYAAidAGAFDDVPPQWPPVNVttseslsvvaQGTAVTTLTVSWDAPAGAVA 566
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  356 YRVTWRvlSGGPTQQQELGPGQGSVLLRDLEPGtDYEVTVS--TLFGRSVGPATSlmARTDASVEQTLRP------VILG 427
Cdd:COG4733    567 YEVEWR--RDDGNWVSVPRTSGTSFEVPGIYAG-DYEVRVRaiNALGVSSAWAAS--SETTVTGKTAPPPaptgltATGG 641
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  428 PTSILLSWNLVPEARGYRLEWRRETGLEPPQKVVLPSDVT--RYQLDGLQPGTEYRLTL-YTLLEGHEVATPATVVPT-- 502
Cdd:COG4733    642 LGGITLSWSFPVDADTLRTEIRYSTTGDWASATVAQALYPgnTYTLAGLKAGQTYYYRArAVDRSGNVSAWWVSGQASad 721
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  503 -----------------GPELPVSPVTDLQATELPGQRVRVSWS--------PVPGATQYRIIVRSTQGVERTLVLPGSQ 557
Cdd:COG4733    722 aagildaitgqiletelGQELDAIIQNATVAEVVAATVTDVTAQidtavlfaGVATAAAIGAEARVAATVAESATAAAAT 801
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  558 TAFdldDVQAGLSYTVRVSARVGPREGSASVLTVRRELETplAVPGLRVVVSDATRVRV-AWGPVPGASGFRISWSTGSG 636
Cdd:COG4733    802 GTA---ADAAGDASGGVTAGTSGTTGAGDTAASTTRVAAA--VVLAGVVVYGDAIIESGnTGDIVATGDIASAAAGAVAT 876
                          490
                   ....*....|....*...
gi 2462586997  637 PESSQTLPPDSTATDITG 654
Cdd:COG4733    877 TVSGTTAADVSAVADSTA 894
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
331-404 5.34e-07

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 49.54  E-value: 5.34e-07
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462586997   331 PELTIQNTTAHSLLVAWRSV--PGATGYRVTWRVL---SGGPTQQQELGPGQGSVLLRDLEPGTDYEVTVSTLFGRSVG 404
Cdd:smart00060    5 SNLRVTDVTSTSVTLSWEPPpdDGITGYIVGYRVEyreEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGEG 83
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
509-587 6.77e-07

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 49.42  E-value: 6.77e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  509 SPVTDLQATELPGQRVRVSWSPVPGA----TQYRIIVR--STQGVERTLVLPGSQTAFDLDDVQAGLSYTVRVSARVGPR 582
Cdd:cd00063      2 SPPTNLRVTDVTSTSVTLSWTPPEDDggpiTGYVVEYRekGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81

                   ....*
gi 2462586997  583 EGSAS 587
Cdd:cd00063     82 ESPPS 86
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
2257-2315 8.61e-07

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 47.87  E-value: 8.61e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2462586997 2257 GAPGRDGASGKDGDRGSPGVPGSPGLPGPVGPKGEPGPTGAPGQAvvGLPGAKGEKGAP 2315
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPP--GPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
2102-2158 1.08e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 47.49  E-value: 1.08e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462586997 2102 GLSGEQGPPGLKGAKGEPGSNGDQGPKGDRGVPGIKGDRGEPGPRGQDGNPGLPGER 2158
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
419-492 1.69e-06

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 47.99  E-value: 1.69e-06
                            10        20        30        40        50        60        70
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462586997   419 QTLRPVILGPTSILLSWNLVPEA--RGYRLEWRRETGLEPP--QKVVLPSDVTRYQLDGLQPGTEYRLTLYTLLEGHE 492
Cdd:smart00060    5 SNLRVTDVTSTSVTLSWEPPPDDgiTGYIVGYRVEYREEGSewKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAGE 82
fn3 pfam00041
Fibronectin type III domain;
962-1039 2.75e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 47.41  E-value: 2.75e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  962 LRVVDTSIDSVTLAWTPVSRASS----YILSWRPLRGPGQEVPgspQTLPGISSSQRVTGLEPGVSYIFSLTPVLDGVRG 1037
Cdd:pfam00041    6 LTVTDVTSTSLTVSWTPPPDGNGpitgYEVEYRPKNSGEPWNE---ITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGGEG 82

                   ..
gi 2462586997 1038 PE 1039
Cdd:pfam00041   83 PP 84
fn3 pfam00041
Fibronectin type III domain;
509-587 2.83e-06

Fibronectin type III domain;


Pssm-ID: 394996 [Multi-domain]  Cd Length: 85  Bit Score: 47.41  E-value: 2.83e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  509 SPVTDLQATELPGQRVRVSWSPVPGA----TQYRIIVRS--TQGVERTLVLPGSQTAFDLDDVQAGLSYTVRVSARVGPR 582
Cdd:pfam00041    1 SAPSNLTVTDVTSTSLTVSWTPPPDGngpiTGYEVEYRPknSGEPWNEITVPGTTTSVTLTGLKPGTEYEVRVQAVNGGG 80

                   ....*
gi 2462586997  583 EGSAS 587
Cdd:pfam00041   81 EGPPS 85
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1468-1513 4.18e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 45.95  E-value: 4.18e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 2462586997 1468 GPRGPPGAIGPKGDRGFPGPLGEAGEKGERGPPGPAGSRGLPGVAG 1513
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPG 46
vWA_norD_type cd01454
norD type: Denitrifying bacteria contain both membrane bound and periplasmic nitrate ...
39-189 4.30e-06

norD type: Denitrifying bacteria contain both membrane bound and periplasmic nitrate reductases. Denitrification plays a major role in completing the nitrogen cycle by converting nitrate or nitrite to nitrogen gas. The pathway for microbial denitrification has been established as NO3- ------> NO2- ------> NO -------> N2O ---------> N2. This reaction generally occurs under oxygen limiting conditions. Genetic and biochemical studies have shown that the first srep of the biochemical pathway is catalyzed by periplasmic nitrate reductases. This family is widely present in proteobacteria and firmicutes. This version of the domain is also present in some archaeal members. The function of the vWA domain in this sub-group is not known. Members of this subgroup have a conserved MIDAS motif.


Pssm-ID: 238731 [Multi-domain]  Cd Length: 174  Bit Score: 49.25  E-value: 4.30e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   39 IVFLLDGSSSIgrSNFREVRSFLEGLVLpFSGAASAQGVRFATVQYSDD--PRTEFGLDALGS-----GGDVIRAIRELS 111
Cdd:cd01454      3 VTLLLDLSGSM--RSDRRIDVAKKAAVL-LAEALEACGVPHAILGFTTDagGRERVRWIKIKDfdeslHERARKRLAALS 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  112 yKGGNTRTGAAILHVADHVflpqLARPGVPKVCILITDGKSQDLVDT------------AAQRLKGQGVKLFAVGIKNAD 179
Cdd:cd01454     80 -PGGNTRDGAAIRHAAERL----LARPEKRKILLVISDGEPNDLDYYegnvfatedalrAVIEARKLGIEVFGITIDRDA 154
                          170
                   ....*....|....
gi 2462586997  180 P----EELKRVASQ 189
Cdd:cd01454    155 TtvdkEYLKNIFGE 168
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
2022-2316 5.23e-06

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 51.57  E-value: 5.23e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2022 GERGPPGPSGLAGEPGKPGIPGLPGRAGGVGEAGRPGERGERGEKGERGEQGRDGPPGLPGTPGPPGPpgpkvsvdepgP 2101
Cdd:COG5164      7 GKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGP-----------A 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2102 GLSGEQGPPGLKGAKGEPGSNGDQGPKGDRGVPGIKGDRGEPGPRGQDGNPGlPGERGMAGPEGkpGLQGPRGPPGPVGG 2181
Cdd:COG5164     76 QNQGGTTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTT-PPSGGSTTPPG--DGGSTPPGPGSTGP 152
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2182 HGDPGPPGAPGLAGPAGPQGPSGLKGEPGETGPPGRGLTGPTGavglpgPPGPSGLVGPQGSPGLPGQVGETGKPgaPGR 2261
Cdd:COG5164    153 GGSTTPPGDGGSTTPPGPGGSTTPPDDGGSTTPPNKGETGTDI------PTGGTPRQGPDGPVKKDDKNGKGNPP--DDR 224
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462586997 2262 DGASGKDGDRGSPGVPGSPGLPGPVGPKGEPGPTGAPGQAVVGLPGAKGEKGAPG 2316
Cdd:COG5164    225 GGKTGPKDQRPKTNPIERRGPERPEAAALPAELTALEAENRAANPEPATKTIPET 279
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
2304-2364 5.50e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 45.56  E-value: 5.50e-06
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462586997 2304 GLPGAKGEKGAPGGlagdlVGEPGAKGDRGLPGPRGEKGEAGRAGEPGDPGEDGQKGAPGP 2364
Cdd:pfam01391    1 GPPGPPGPPGPPGP-----PGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
958-1037 8.33e-06

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 46.07  E-value: 8.33e-06
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   958 PSIELRVVDTSIDSVTLAWTPVSRAS--SYILSWRPLRGPGQEvPGSPQTLPGISSSQRVTGLEPGVSYIFSLTPVLDGV 1035
Cdd:smart00060    3 PPSNLRVTDVTSTSVTLSWEPPPDDGitGYIVGYRVEYREEGS-EWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAG 81

                    ..
gi 2462586997  1036 RG 1037
Cdd:smart00060   82 EG 83
PksD COG3321
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ...
409-966 1.68e-05

Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 442550 [Multi-domain]  Cd Length: 1386  Bit Score: 50.64  E-value: 1.68e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  409 LMARTDASVEQTLRPVILGPTSILLS----WnlvpeARGYRLEWRRETGLEPPQKVVLPS-----DVTRYQLDGLQPGTE 479
Cdd:COG3321    811 LAAAGDAVVLPSLRRGEDELAQLLTAlaqlW-----VAGVPVDWSALYPGRGRRRVPLPTypfqrEDAAAALLAAALAAA 885
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  480 YRLTLYTLLEGHEVATPATVVPTGPELPVSPVTDLQATELPGQRVRVSWSPVPGATQYRIIVRSTQGVERTLVLPGSQTA 559
Cdd:COG3321    886 LAAAAALGALLLAALAAALAAALLALAAAAAAALALAAAALAALLALVALAAAAAALLALAAAAAAAAAALAAAEAGALL 965
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  560 FDLDDVQAGLSYTVRVSARVGPREGSASVLTVRRELETPLAVPGLRVVVSDATRVRVAWGPVPGASGFRISWSTGSGPES 639
Cdd:COG3321    966 LLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAALALLAAAALLLAAAAAAAALLALAALLAAAAAALAAAAAAAAAA 1045
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  640 SQTLPPDSTATDITGLQPGTTYQVAVSVLRGREEGPAAVIVARTDPLGPVRTVHVTQASSSSVTITWTRVPGATGYRVSW 719
Cdd:COG3321   1046 AALAALAAAAAAAAALALALAALLLLAALAELALAAAALALAAALAAAALALALAALAAALLLLALLAALALAAAAAALL 1125
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  720 HSAHGPEKSQLVSGEATVAELDGLEPDTEYTVHVRAHVAGVDGPPASVVVRTAPEPVGRVSRLQILNASSDVLRITWVGV 799
Cdd:COG3321   1126 ALAALLAAAAAAAALAAAAAAAAALALAAAAAALAAALAAALLAAAALLLALALALAAALAAALAGLAALLLAALLAALL 1205
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  800 TGATAYRLAWGRSEGGPMRHQILPGNTDSAEIRGLEGGVSYSVRVTALVGDREGTPVSIVVTTPPEAPPALGTLHVVQRG 879
Cdd:COG3321   1206 AALLALALAALAAAAAALLAAAAAAAALALLALAAAAAAVAALAAAAAALLAALAALALLAAAAGLAALAAAAAAAAAAL 1285
                          490       500       510       520       530       540       550       560
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  880 EHSLRLRWEPVPRAQGFLLHWQPEGGQEQSRVLGPELSSYHLDGLEPATQYRVRLSVLGPAGEGPSAEVTARTESPRVPS 959
Cdd:COG3321   1286 ALAAAAAAAAAALAALLAAAAAAAAAAAAAAAAAALAAALLAAALAALAAAVAAALALAAAAAAAAAAAAAAAAAAALAA 1365

                   ....*..
gi 2462586997  960 IELRVVD 966
Cdd:COG3321   1366 AAGAAAA 1372
vWA_Matrilin cd01475
VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and ...
1087-1261 2.12e-05

VWA_Matrilin: In cartilaginous plate, extracellular matrix molecules mediate cell-matrix and matrix-matrix interactions thereby providing tissue integrity. Some members of the matrilin family are expressed specifically in developing cartilage rudiments. The matrilin family consists of at least four members. All the members of the matrilin family contain VWA domains, EGF-like domains and a heptad repeat coiled-coiled domain at the carboxy terminus which is responsible for the oligomerization of the matrilins. The VWA domains have been shown to be essential for matrilin network formation by interacting with matrix ligands.


Pssm-ID: 238752 [Multi-domain]  Cd Length: 224  Bit Score: 48.15  E-value: 2.12e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1087 GPQAVQVGLLSYSHRPSPLFPLNGSHDLGIILQRIRDMPYMDPsgnnlGTAVVTAHRYML------APDAPGRRQHVPGV 1160
Cdd:cd01475     37 GPDATRVGLVQYSSTVKQEFPLGRFKSKADLKRAVRRMEYLET-----GTMTGLAIQYAMnnafseAEGARPGSERVPRV 111
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1161 MVLLVDEPLRGDIFSPIREAQASGLNVVMLGMAGADPEQLRRLA--PGMDSVqtfFAVDDGPSLDQAVSGLATALCQAsf 1238
Cdd:cd01475    112 GIVVTDGRPQDDVSEVAAKARALGIEMFAVGVGRADEEELREIAsePLADHV---FYVEDFSTIEELTKKFQGKICVV-- 186
                          170       180
                   ....*....|....*....|....*.
gi 2462586997 1239 ttqprPEPC---PVYCPKGQKGEPGE 1261
Cdd:cd01475    187 -----PDLCatlSHVCQQVCISTPGS 207
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
218-443 3.80e-05

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 49.23  E-value: 3.80e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  218 TAGGVPVTRPPddstSAPRDLVLSEPSSQSLRVQWTAASGP-VTGYKVQYTPLTGlgqplpSERQEVNVPAgETSVRLRG 296
Cdd:COG3401    223 EVSVTTPTTPP----SAPTGLTATADTPGSVTLSWDPVTESdATGYRVYRSNSGD------GPFTKVATVT-TTSYTDTG 291
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  297 LRPLTEYQVTVIALYANSIG----EAVSGTARTTALEGPE-LTIQNTTAHSLLVAWRSVPG--ATGYRVTWRVLSGGPTQ 369
Cdd:COG3401    292 LTNGTTYYYRVTAVDAAGNEsapsNVVSVTTDLTPPAAPSgLTATAVGSSSITLSWTASSDadVTGYNVYRSTSGGGTYT 371
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  370 QqeLGPGQGSVLLRD--LEPGTDYEVTVSTLF--GRSVGPATSLMAR--TDASVEQTLRPVILGPTSILLSWNLVPEARG 443
Cdd:COG3401    372 K--IAETVTTTSYTDtgLTPGTTYYYKVTAVDaaGNESAPSEEVSATtaSAASGESLTASVDAVPLTDVAGATAAASAAS 449
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
2324-2369 4.91e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.87  E-value: 4.91e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*.
gi 2462586997 2324 GEPGAKGDRGLPGPRGEKGEAGRAGEPGDPGEDGQKGAPGPKGFKG 2369
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPG 46
FN3 cd00063
Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein ...
777-862 5.06e-05

Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.


Pssm-ID: 238020 [Multi-domain]  Cd Length: 93  Bit Score: 44.02  E-value: 5.06e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  777 GRVSRLQILNASSDVLRITWVGVTGA----TAYRLAWGR-SEGGPMRHQILPGNTDSAEIRGLEGGVSYSVRVTALVGDR 851
Cdd:cd00063      2 SPPTNLRVTDVTSTSVTLSWTPPEDDggpiTGYVVEYREkGSGDWKEVEVTPGSETSYTLTGLKPGTEYEFRVRAVNGGG 81
                           90
                   ....*....|.
gi 2462586997  852 EGTPVSIVVTT 862
Cdd:cd00063     82 ESPPSESVTVT 92
Pur_ac_phosph_N pfam16656
Purple acid Phosphatase, N-terminal domain; This domain is found at the N-terminus of Purple ...
692-755 7.06e-05

Purple acid Phosphatase, N-terminal domain; This domain is found at the N-terminus of Purple acid phosphatase proteins.


Pssm-ID: 465220 [Multi-domain]  Cd Length: 93  Bit Score: 43.55  E-value: 7.06e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462586997  692 VHVT-QASSSSVTITWTRVPGATGYRVSWHSAHGpEKSQLVSGEATV-------------AELDGLEPDTEYTVHVRA 755
Cdd:pfam16656    4 VHLSlTGDSTSMTVSWVTPSAVTSPVVQYGTSSS-ALTSTATATSSTyttgdggtgyihrATLTGLEPGTTYYYRVGD 80
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1294-1340 8.77e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.10  E-value: 8.77e-05
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 2462586997 1294 AKGERGFPGADGRPGSPGRAGNPGTPGAPGLKGSPGLPGPRGDPGER 1340
Cdd:pfam01391   11 PPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
2281-2350 9.58e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.10  E-value: 9.58e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2281 GLPGPVGPKGEPGPTGAPGQavvglPGAKGEKGAPGglagdlvgEPGAKGDRGLPGPRGEKGEAGRAGEP 2350
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGP-----PGPPGPPGPPG--------EPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
2022-2071 1.64e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.33  E-value: 1.64e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2022 GERGPPGPSGLAGEPGKPGIPGLPGRAGGVGEAGRPGERGERGEKGERGE 2071
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGA 50
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
2201-2410 1.97e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 46.56  E-value: 1.97e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2201 GPSGLKGEPGETGPPG-RGLTGPTGAVGLPGPPGPSGLVGPQGSPGLPGQVGETGKPGAPGRDGASGKDGDRGSPGVPGS 2279
Cdd:COG5164      7 GKTGPSDPGGVTTPAGsQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQN 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2280 PGLPGPVGPKGEPGPTGAPGQAvvglpGAKGEKGAPGGLAGDLVGEPGAKGDRGLPGPRGEK----GEAGRAGEPGDPGE 2355
Cdd:COG5164     87 QGGTRPAGNTGGTTPAGDGGAT-----GPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTppgpGSTGPGGSTTPPGD 161
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462586997 2356 DGQKGAPGPKGFKGDPGVGVPGSPGPPGPPGVKGDLGLPGLPGAPGVVGFPGQTG 2410
Cdd:COG5164    162 GGSTTPPGPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNG 216
VWA_integrin_invertebrates cd01476
VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have ...
1053-1200 2.36e-04

VWA_integrin (invertebrates): Integrins are a family of cell surface receptors that have diverse functions in cell-cell and cell-extracellular matrix interactions. Because of their involvement in many biologically important adhesion processes, integrins are conserved across a wide range of multicellular animals. Integrins from invertebrates have been identified from six phyla. There are no data to date to suggest any immunological functions for the invertebrate integrins. The members of this sub-group have the conserved MIDAS motif that is charateristic of this domain suggesting the involvement of the integrins in the recognition and binding of multi-ligands.


Pssm-ID: 238753 [Multi-domain]  Cd Length: 163  Bit Score: 43.93  E-value: 2.36e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1053 ADVVFLPHATQDNAHRAEATRRVLERLVLALgPLGPQAVQVGLLSYS--HRPSPLFPLNGSHDLGIILQRIRDMPYMdpS 1130
Cdd:cd01476      1 LDLLFVLDSSGSVRGKFEKYKKYIERIVEGL-EIGPTATRVALITYSgrGRQRVRFNLPKHNDGEELLEKVDNLRFI--G 77
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462586997 1131 G-NNLGTAVVTAHRYMlaPDAPGRRQHVPGVMVLLVDEPLRGDIFSPIREAQAsGLNVVMLGMAGADPEQL 1200
Cdd:cd01476     78 GtTATGAAIEVALQQL--DPSEGRREGIPKVVVVLTDGRSHDDPEKQARILRA-VPNIETFAVGTGDPGTV 145
PHA03169 PHA03169
hypothetical protein; Provisional
1709-1876 2.62e-04

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 46.12  E-value: 2.62e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1709 VDTGPGAREK-GEPGDRGQEGPRGPKGDPGLPGAPGERGIEGFRGPPGPQGDPGVRGPAGEKGDRGPPG--LDGRSGLDG 1785
Cdd:PHA03169    71 SDTETAEESRhGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSppSHPGPHEPA 150
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1786 KPGAAGPSGPNGAAGKAGDPGRDGLPGLRGEQGLPGPSGPPGLPGKPGEDGKPGLNGKNgEPGDPGEDGRKGEKGDSGAS 1865
Cdd:PHA03169   151 PPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPD-EPGEPQSPTPQQAPSPNTQQ 229
                          170
                   ....*....|.
gi 2462586997 1866 GREGRDGPKGE 1876
Cdd:PHA03169   230 AVEHEDEPTEP 240
PhoD COG3540
Phosphodiesterase/alkaline phosphatase D [Inorganic ion transport and metabolism];
698-806 2.74e-04

Phosphodiesterase/alkaline phosphatase D [Inorganic ion transport and metabolism];


Pssm-ID: 442761 [Multi-domain]  Cd Length: 482  Bit Score: 46.07  E-value: 2.74e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  698 SSSSVTItWTRV---PGATGYRVSWHSAHGPEKSQLV-SGEATV---------AELDGLEPDTEYTVHVRAhvagvdGPP 764
Cdd:COG3540     44 TPDSVVL-WTRLapdPPARPVPVRWEVATDESFRRVVrSGTVTAtperdhtvkVDVTGLEPGTRYFYRFRA------GGE 116
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 2462586997  765 ASVV--VRTAPEPvGRVSRlqilnassdvLRITWVGVT-----GATAYR 806
Cdd:COG3540    117 TSPVgrFRTAPAP-GAPDR----------LRFAFASCQnyeggYFTAYR 154
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
1711-1936 3.65e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 45.79  E-value: 3.65e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1711 TGPGAreKGEPGDRGQEGPRGPKGDPGLPGAPGERGIEGFRGPPGPQGDPGVRGPAGEKGDRGPPGLDGRSGLDGKPGAA 1790
Cdd:COG5164      4 YGPGK--TGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGT 81
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1791 GPSGPNGAAGKAGDPGRDGLPGLRGEQGLPGPSGPPGLPGKPGEDGkPGLNGKNGEPGD----PGEDGRKGEKGDSGASG 1866
Cdd:COG5164     82 TPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTT-PPSGGSTTPPGDggstPPGPGSTGPGGSTTPPG 160
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1867 REGRDGPKGERGAPGILGPQGPPGLPGPVGPPGQGFPGVPGGTGPKGDRGETGSKGEQGLPGERGLRGEP 1936
Cdd:COG5164    161 DGGSTTPPGPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKTGP 230
PHA03169 PHA03169
hypothetical protein; Provisional
2242-2366 4.24e-04

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 45.35  E-value: 4.24e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 2242 GSPGLPGQVGETGKPGAPGRDGASGKDGDRGSPGVPGSPGLPGPVGPKGEPGPTGAPGQAVVGLPGAKGEKGAPGGLAGD 2321
Cdd:PHA03169   103 PTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPT 182
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*
gi 2462586997 2322 LVGEPGAKGDRGLPGPRGEKGEAGRAGEPGDPGEDGQKGAPGPKG 2366
Cdd:PHA03169   183 SEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNT 227
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1988-2060 4.80e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.17  E-value: 4.80e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2462586997 1988 GPPGKEGPIGFPGERGlkgdrgdpgpqgppglALGERGPPGPSGLAGEPGKPGIPGLPGRAGGVGEAGRPGER 2060
Cdd:pfam01391    1 GPPGPPGPPGPPGPPG----------------PPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
vWA_micronemal_protein cd01471
Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a ...
1066-1197 6.53e-04

Micronemal proteins: The Toxoplasma lytic cycle begins when the parasite actively invades a target cell. In association with invasion, T. gondii sequentially discharges three sets of secretory organelles beginning with the micronemes, which contain adhesive proteins involved in parasite attachment to a host cell. Deployed as protein complexes, several micronemal proteins possess vertebrate-derived adhesive sequences that function in binding receptors. The VWA domain likely mediates the protein-protein interactions of these with their interacting partners.


Pssm-ID: 238748 [Multi-domain]  Cd Length: 186  Bit Score: 43.14  E-value: 6.53e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1066 AHRAEATRRVLERLVLALgPLGPQAVQVGLLSYSHRPSPLFPLNGSHD-----LGIILQRIRDMPYmdPSGN-NLGTAVV 1139
Cdd:cd01471     16 SNWVTHVVPFLHTFVQNL-NISPDEINLYLVTFSTNAKELIRLSSPNStnkdlALNAIRALLSLYY--PNGStNTTSALL 92
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462586997 1140 TAHryMLAPDAPGRRQHVPGVMVLLVD-EPlrGDIFSPIREAQA---SGLNVVMLG------------MAGADP 1197
Cdd:cd01471     93 VVE--KHLFDTRGNRENAPQLVIIMTDgIP--DSKFRTLKEARKlreRGVIIAVLGvgqgvnheenrsLVGCDP 162
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1767-1816 7.26e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.78  E-value: 7.26e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1767 GEKGDRGPPGLDGRSGLDGKPGAAGPSGPNGAAGKAGDPGRDGLPGLRGE 1816
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGA 50
VWA_2 pfam13519
von Willebrand factor type A domain;
1060-1164 9.06e-04

von Willebrand factor type A domain;


Pssm-ID: 463909 [Multi-domain]  Cd Length: 103  Bit Score: 40.74  E-value: 9.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1060 HATQDNAHRAEATRRVLERLVLALgplgpQAVQVGLLSYSHRPSPLFPLNGshDLGIILQRIRDMPYMDPsGNNLGTAVV 1139
Cdd:pfam13519   12 RNGDYGPTRLEAAKDAVLALLKSL-----PGDRVGLVTFGDGPEVLIPLTK--DRAKILRALRRLEPKGG-GTNLAAALQ 83
                           90       100
                   ....*....|....*....|....*
gi 2462586997 1140 TAHRYMlapdaPGRRQHVPGVMVLL 1164
Cdd:pfam13519   84 LARAAL-----KHRRKNQPRRIVLI 103
FN3 smart00060
Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, ...
777-853 9.42e-04

Fibronectin type 3 domain; One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.


Pssm-ID: 214495 [Multi-domain]  Cd Length: 83  Bit Score: 40.29  E-value: 9.42e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   777 GRVSRLQILNASSDVLRITW--VGVTGATAYRLAW---GRSEGGPMRHQILPGNTDSAEIRGLEGGVSYSVRVTALVGDR 851
Cdd:smart00060    2 SPPSNLRVTDVTSTSVTLSWepPPDDGITGYIVGYrveYREEGSEWKEVNVTPSSTSYTLTGLKPGTEYEFRVRAVNGAG 81

                    ..
gi 2462586997   852 EG 853
Cdd:smart00060   82 EG 83
FN3 COG3401
Fibronectin type 3 domain [General function prediction only];
217-417 9.64e-04

Fibronectin type 3 domain [General function prediction only];


Pssm-ID: 442628 [Multi-domain]  Cd Length: 603  Bit Score: 44.61  E-value: 9.64e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  217 TTAGGVPVTRPPddstSAPRDLVLSEPSSQSLRVQWTAASGP-VTGYKVQYTplTGLGQPLpserQEVNVPAGETSVRLR 295
Cdd:COG3401    316 NVVSVTTDLTPP----AAPSGLTATAVGSSSITLSWTASSDAdVTGYNVYRS--TSGGGTY----TKIAETVTTTSYTDT 385
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  296 GLRPLTEYQVTVIALYANSIGEAVSGTARTTALEGPELTIQNTTAhsllVAWRSVPGATGYRVTWRVLSGGPTQQQELGP 375
Cdd:COG3401    386 GLTPGTTYYYKVTAVDAAGNESAPSEEVSATTASAASGESLTASV----DAVPLTDVAGATAAASAASNPGVSAAVLADG 461
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|...
gi 2462586997  376 GQ-GSVLLRDLEPGTDYEVTVSTLFGRSVGPATSLMARTDASV 417
Cdd:COG3401    462 GDtGNAVPFTTTSSTVTATTTDTTTANLSVTTGSLVGGSGASS 504
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1737-1806 1.43e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.63  E-value: 1.43e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1737 GLPGAPGERGIegfrgppgpqgdpgvRGPAGEKGDRGPPGLDGRSGLDGKPGAAGPSGPNGAAGKAGDPG 1806
Cdd:pfam01391    1 GPPGPPGPPGP---------------PGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
PRK12678 PRK12678
transcription termination factor Rho; Provisional
1654-1951 2.15e-03

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 43.35  E-value: 2.15e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1654 AGERGLRGAPGVRgpvgeKGDQGDPGEDGRNGSPGSSGPKGDRGEPgppgppgrlVDTGPGAREKGEPGDRGQEGPRGPK 1733
Cdd:PRK12678    35 AKQLGIKGTSGMR-----KGELIAAIKEARGGGAAAAAATPAAPAA---------AARRAARAAAAARQAEQPAAEAAAA 100
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1734 GDPGLPGAPGERGIEGFRGPPGPQGDPGVRGPAGEKGDRGPPGLDGRSGLDGKPGAAGPSGPNGAAGKAGDPGRDGLPGL 1813
Cdd:PRK12678   101 KAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDRED 180
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1814 RGEQGLPGPSGPPGLPGKPGEDGKPGLNGKNGEPGDpgEDGRKGEKGDSGASGREGRDGPKGERGapgilgpqgppglpg 1893
Cdd:PRK12678   181 RQAEAERGERGRREERGRDGDDRDRRDRREQGDRRE--ERGRRDGGDRRGRRRRRDRRDARGDDN--------------- 243
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462586997 1894 pvgPPGQGFPGVPGGTGPKGDRGETGSKGEQGlpGERGLRGEPGSVPNV---DRLLETAGI 1951
Cdd:PRK12678   244 ---REDRGDRDGDDGEGRGGRRGRRFRDRDRR--GRRGGDGGNEREPELredDVLVPVAGI 299
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
1445-1692 3.26e-03

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 42.71  E-value: 3.26e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1445 GEKGDSEDGAPGLP----GQPGSPGEQGPRGPPGAIGPKGDRGFPGPLGEAGEKGERGPPGPAGSRGLPGVAGRPGAKGP 1520
Cdd:COG5164      7 GKTGPSDPGGVTTPagsqGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQN 86
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1521 EGPPGPTGRQGEKGEPGR------PGDPAVVGPAVAGPKGEKGDVGPAGPRGATGvQGERGPPGLVLPGDPGPKGDPGDR 1594
Cdd:COG5164     87 QGGTRPAGNTGGTTPAGDggatgpPDDGGATGPPDDGGSTTPPSGGSTTPPGDGG-STPPGPGSTGPGGSTTPPGDGGST 165
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1595 GPIGLTGRAGPPGDSGPPGEKGDPGRPGPPGPVGPRGRDGEVGEKGDEGPPGDPGLPGKAGERGLRGAPGVRGPVGEKGD 1674
Cdd:COG5164    166 TPPGPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKTGPKDQRPKTNPIERRGP 245
                          250
                   ....*....|....*...
gi 2462586997 1675 QGDPGEDGRNGSPGSSGP 1692
Cdd:COG5164    246 ERPEAAALPAELTALEAE 263
vWA_CTRP cd01473
CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an ...
38-187 3.50e-03

CTRP for CS protein-TRAP-related protein: Adhesion of Plasmodium to host cells is an important phenomenon in parasite invasion and in malaria associated pathology.CTRP encodes a protein containing a putative signal sequence followed by a long extracellular region of 1990 amino acids, a transmembrane domain, and a short cytoplasmic segment. The extracellular region of CTRP contains two separated adhesive domains. The first domain contains six 210-amino acid-long homologous VWA domain repeats. The second domain contains seven repeats of 87-60 amino acids in length, which share similarities with the thrombospondin type 1 domain found in a variety of adhesive molecules. Finally, CTRP also contains consensus motifs found in the superfamily of haematopoietin receptors. The VWA domains in these proteins likely mediate protein-protein interactions.


Pssm-ID: 238750 [Multi-domain]  Cd Length: 192  Bit Score: 41.15  E-value: 3.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   38 DIVFLLDGSSSIGRSNFR-EVRSFLEGLVLPF---------SGAASAQGVRfATVQYSDDPRTEfgldalgsGGDVIRAI 107
Cdd:cd01473      2 DLTLILDESASIGYSNWRkDVIPFTEKIINNLniskdkvhvGILLFAEKNR-DVVPFSDEERYD--------KNELLKKI 72
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  108 REL--SYK-GGNTRTGAAILHVADHVFLPQLARPGVPKVCILITDG----KSQDLVDTAAQRLKGQGVKLFAVGIKNADP 180
Cdd:cd01473     73 NDLknSYRsGGETYIVEALKYGLKNYTKHGNRRKDAPKVTMLFTDGndtsASKKELQDISLLYKEENVKLLVVGVGAASE 152

                   ....*..
gi 2462586997  181 EELKRVA 187
Cdd:cd01473    153 NKLKLLA 159
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1534-1591 3.54e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.86  E-value: 3.54e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2462586997 1534 GEPGRPGDPAVVGPA-VAGPKGEKGDVGPAGPRGATGVQGERGPPGlvLPGDPGPKGDP 1591
Cdd:pfam01391    1 GPPGPPGPPGPPGPPgPPGPPGPPGPPGPPGEPGPPGPPGPPGPPG--PPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1551-1603 4.27e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.47  E-value: 4.27e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|...
gi 2462586997 1551 GPKGEKGDVGPAGPRGATGVQGERGPPGlvLPGDPGPKGDPGDRGPIGLTGRA 1603
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPG--PPGEPGPPGPPGPPGPPGPPGAP 51
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
2022-2065 4.95e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.47  E-value: 4.95e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 2462586997 2022 GERGPPGPSGLAGEPGKPGIPGLPGRAGGVGEAGRPGERGERGE 2065
Cdd:pfam01391   13 GPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1450-1596 5.47e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.28  E-value: 5.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997 1450 SEDGAPGLPGQPGSPGEQGPRGPPGAigPKGDRGFPGPLGEAGEKGERGPPGPAGSRGLPGVAGRPGAKGPEGPPGPTGR 1529
Cdd:PRK07764   610 EEAARPAAPAAPAAPAAPAPAGAAAA--PAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAP 687
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462586997 1530 QGEKGEPGRPGDPAVVGPAVAGPKGEKGDVGPAGPRGATGV--QGERGPPGLVLPGDPGPKGDPGDRGP 1596
Cdd:PRK07764   688 AAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGAsaPSPAADDPVPLPPEPDDPPDPAGAPA 756
vWA_complement_factors cd01470
Complement factors B and C2 are two critical proteases for complement activation. They both ...
38-198 8.47e-03

Complement factors B and C2 are two critical proteases for complement activation. They both contain three CCP or Sushi domains, a trypsin-type serine protease domain and a single VWA domain with a conserved metal ion dependent adhesion site referred commonly as the MIDAS motif. Orthologues of these molecules are found from echinoderms to chordates. During complement activation, the CCP domains are cleaved off, resulting in the formation of an active protease that cleaves and activates complement C3. Complement C2 is in the classical pathway and complement B is in the alternative pathway. The interaction of C2 with C4 and of factor B with C3b are both dependent on Mg2+ binding sites within the VWA domains and the VWA domain of factor B has been shown to mediate the binding of C3. This is consistent with the common inferred function of VWA domains as magnesium-dependent protein interaction domains.


Pssm-ID: 238747 [Multi-domain]  Cd Length: 198  Bit Score: 39.96  E-value: 8.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997   38 DIVFLLDGSSSIGRSNFREVRSFLEGLVlpfsGAASAQGV--RFATVQYSDDP------RTEFGLDAlgsgGDVIRAIRE 109
Cdd:cd01470      2 NIYIALDASDSIGEEDFDEAKNAIKTLI----EKISSYEVspRYEIISYASDPkeivsiRDFNSNDA----DDVIKRLED 73
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462586997  110 LSYKGGNTRTG----AAILHVADHVFLPQLARPG----VPKVCILITDGKS-------------QDLVD----TAAQRLK 164
Cdd:cd01470     74 FNYDDHGDKTGtntaAALKKVYERMALEKVRNKEafneTRHVIILFTDGKSnmggsplptvdkiKNLVYknnkSDNPRED 153
                          170       180       190
                   ....*....|....*....|....*....|....
gi 2462586997  165 GQGVKLFAVGiKNADPEELKRVASQPTSDFFFFV 198
Cdd:cd01470    154 YLDVYVFGVG-DDVNKEELNDLASKKDNERHFFK 186
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1979-2045 9.96e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.32  E-value: 9.96e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462586997 1979 GPKGDSGEQGPPGKEGPIGFPGERGLKGDRGDPGPqgppglalgeRGPPGPSGLAGEPGKPGIPGLP 2045
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGP----------PGPPGPPGPPGPPGAPGAPGPP 57
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH