NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|2462507488|ref|XP_054191762|]
View 

collagen alpha-1(XXIV) chain isoform X2 [Homo sapiens]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
924-1160 8.85e-42

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 160.46  E-value: 8.85e-42
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  924 GHVGARGPPGSQGPKGQRGSRGPDGLLGEQGIQGAKGEKGDQGKRGPHGLIGKTGNPGERGFQGKPGLQGLPGSTGDRGL 1003
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1004 PGEPGLRGLQGDVGPPGEMGMEGPPGTEGESGLQGEpgakgdvgtaGSVGGTGEPGLRGEPGAPGEEGLQGKDGLKGVPG 1083
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD----------GQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRG 266
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462507488 1084 GRGLPGEDGEKGEMGLPGIIGPLGRSGQTGLPGPEGIVGIPGQRGRPGKKGDKGQIGPTGEVGSRGPPGKIGKSGPK 1160
Cdd:NF038329   267 EAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPK 343
COLFI super family cl02436
Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1528-1715 3.03e-36

Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


The actual alignment was detected with superfamily member pfam01410:

Pssm-ID: 470578  Cd Length: 233  Bit Score: 137.86  E-value: 3.03e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1528 HSEEIFKTLNYLSNLLHSIKNPLGTRDNPARICKDLLNCEQKVSD-------------DAIEVFCNFSAGgQTCLPP--- 1591
Cdd:pfam01410    1 RDEEVMATLKSLSQQIENIRSPDGSKKNPARTCRDLKLCHPDWKSgeywidpnqgctrDAIKVFCNFETG-ETCIYPtka 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1592 -----VSVTK----------------LEFGVGK-------VQMNFLHLLSSEATHIITIHCLNTPRWTSTQTSGPGLPIG 1643
Cdd:pfam01410   80 siprkNWWTKeskhvwfgefmnggsqFSYGVDGvgpsvaaVQLTFLRLLSTEASQNITYHCKNSVAYMDQATGNLKKALL 159
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462507488 1644 FKGWNGQIFKV--NTLLEPKVLSDDCKIQDGSWHKATFLFHTQEPNQLPVIEVQKLPHLKTERKYYIDSSSVCF 1715
Cdd:pfam01410  160 LQGSNDEEIRAegNSRFTYTVLEDGCTKRTGQWGKTVIEYRTQKVSRLPIVDIAPMDIGGADQEFGVEVGPVCF 233
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1091-1328 5.00e-36

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 143.51  E-value: 5.00e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1091 DGEKGEMGlpgiigPLGRSGQTGLPGPEGIVGIPGQRGRPGKKGDKGQIGPTGEVGSRGPPGKIGKSGPKGARGTRGAVG 1170
Cdd:NF038329   116 DGEKGEPG------PAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAG 189
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1171 HLGLMGPDGEPGIPGYRGHQGQPGPSGLPGPKGEKGYPGEDSTvlGPPGPRGEPgpvgdqGERGEPGAEGYKGHVGVPGL 1250
Cdd:NF038329   190 EKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD--GQQGPDGDP------GPTGEDGPQGPDGPAGKDGP 261
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462507488 1251 RGATGQQGPPGEPGDQGEQGLKGERGSEGNKGKKGAPGPSGKPGIPGLQGLLGPKGIQGYHGADGISGNPGKIGPPGK 1328
Cdd:NF038329   262 RGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGK 339
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1227-1464 1.31e-32

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 133.11  E-value: 1.31e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1227 VGDQGERGEPGAEGYkghvgvpglRGATGQQGPPGEPGDQGEQGLKGERGSEGNKGKKGAPGPSGKPgipglqgllGPKG 1306
Cdd:NF038329   125 AGPAGPAGEQGPRGD---------RGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEA---------GAKG 186
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1307 IQGYHGADGISGNPGKIGPPGKQGLPGIRGGPGRTGLAGAPGPPGvKGSSGLPGSPGIQGPKGEQGLPGQPGIQGKRGHR 1386
Cdd:NF038329   187 PAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDR 265
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462507488 1387 GAQGDQGPCGDPGLKGQPGEYGVQGLTGFQGFPGPKGPEGDAGIVGISGPKGPIGHRGNTGPLGREGIIGPTGRTGPR 1464
Cdd:NF038329   266 GEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPK 343
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
512-756 1.40e-31

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 130.03  E-value: 1.40e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  512 RGPRGIPGPHGNPGLPGLPGPKGPKGDpgfspgqpvPGEKGDQGLSGLMGPPGMQGDKGLKGHPGLPGLPGEQGIPGFAG 591
Cdd:NF038329   128 AGPAGEQGPRGDRGETGPAGPAGPPGP---------QGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRG 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  592 NIGSPGYPGRQGLAGPEGNPGPKGAQGFIGSPGEagqlgpegergipGIRGKKGFKGRQGFPGDFGDRGPAGLDGSPGlv 671
Cdd:NF038329   199 ETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD-------------GQQGPDGDPGPTGEDGPQGPDGPAGKDGPRG-- 263
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  672 ggtgppgfpgLRGSVGPVGPIGPAGIPGPMGLSGNKGLPGIKGDKGEQGTAGELGEPGYPGDKGAVGLPGPPGMRGKSGP 751
Cdd:NF038329   264 ----------DRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGK 333

                   ....*
gi 2462507488  752 SGQTG 756
Cdd:NF038329   334 DGQPG 338
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
706-1007 3.85e-31

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 128.87  E-value: 3.85e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  706 NKGLPGIKGD--KGEQGTAGELGEPGYPGDKGAVGLPGPPGMRGKSGPSGQTGDPglqgpsgppgpegfpgdiGIPGQNG 783
Cdd:NF038329   107 DEGLQQLKGDgeKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEK------------------GPAGPQG 168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  784 PEGPKelvilwkhepfisekgllgnrgppgppglkGTQGEEGPIGAFGELGPRGKPGQKGYAGEPGPEGLKGEVGDQGNI 863
Cdd:NF038329   169 EAGPQ------------------------------GPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEA 218
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  864 GKIGETGPVGLPGEvGMTGSIGEKGERGSPGPLGPQGEKGVmgypgppgvpgpigplglPGHVGARGPPGSQGPKGQRGS 943
Cdd:NF038329   219 GPAGEDGPAGPAGD-GQQGPDGDPGPTGEDGPQGPDGPAGK------------------DGPRGDRGEAGPDGPDGKDGE 279
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462507488  944 RGPDGLLGEQGIQGAKGEKGDQGKRGPHgliGKTGNPGERGFQGKPGLQGLPGSTGDRGLPGEP 1007
Cdd:NF038329   280 RGPVGPAGKDGQNGKDGLPGKDGKDGQN---GKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKP 340
LamG super family cl22861
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
41-227 3.57e-11

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


The actual alignment was detected with superfamily member smart00210:

Pssm-ID: 473984  Cd Length: 184  Bit Score: 63.92  E-value: 3.57e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488    41 GIDILHQLGLGGKdvrhSSPVTAVPSASTPLPqgvhltesGVIFKNDAYIETPFVKILPVNLGQPFTILTGLQSHRVNNA 120
Cdd:smart00210    1 GQDLLQVFDLPSL----SFAIRQVVGPEPGSP--------AYRLGDPALVPQPTRDLFPSGLPEDFSLLTTFRQTPKSRG 68
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488   121 FLFSIRNK-NRLQLGVQL--LPKKLVVH---IRGKQPAVF--NYSVHDEQWHSFAITIRNQSVSMFVECGKKYfsTETIP 192
Cdd:smart00210   69 VLFAIYDAqNVRQFGLEVdgRANTLLLRyqgVDGKQHTVSfrNLPLADGQWHKLALSVSGSSATLYVDCNEID--SRPLD 146
                           170       180       190
                    ....*....|....*....|....*....|....*..
gi 2462507488   193 E--VQTFDSNSVFTLGSMNNNSIHFEGIVCQLDIIPS 227
Cdd:smart00210  147 RpgQPPIDTDGIEVRGAQAADRKPFQGDLQQLKIVCD 183
 
Name Accession Description Interval E-value
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
924-1160 8.85e-42

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 160.46  E-value: 8.85e-42
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  924 GHVGARGPPGSQGPKGQRGSRGPDGLLGEQGIQGAKGEKGDQGKRGPHGLIGKTGNPGERGFQGKPGLQGLPGSTGDRGL 1003
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1004 PGEPGLRGLQGDVGPPGEMGMEGPPGTEGESGLQGEpgakgdvgtaGSVGGTGEPGLRGEPGAPGEEGLQGKDGLKGVPG 1083
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD----------GQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRG 266
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462507488 1084 GRGLPGEDGEKGEMGLPGIIGPLGRSGQTGLPGPEGIVGIPGQRGRPGKKGDKGQIGPTGEVGSRGPPGKIGKSGPK 1160
Cdd:NF038329   267 EAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPK 343
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
947-1185 2.71e-40

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 155.83  E-value: 2.71e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  947 DGLLGEQGIQGAKGEKGDQGKRGPHGLIGKTGNPGERGFQGKpglqglPGSTGDRGLPGEPGLRGLQGDVGPPGEMGMEG 1026
Cdd:NF038329   107 DEGLQQLKGDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGP------AGPPGPQGERGEKGPAGPQGEAGPQGPAGKDG 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1027 PPGTEGESGLQGEPGAKGDVGTAGSVGGTGEPGLRGEPGAPGEEGLQGKDGlKGVPGGRGLPGEDGEKGEMGLPGIIGPL 1106
Cdd:NF038329   181 EAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAGKD 259
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462507488 1107 GRSGQTGLPGPEGIVGIPGQRGRPGKKGDKGQIGPTGEVGSRGPPGKIGKSGPKGARGTRGAVGHLGLMGPDGEPGIPG 1185
Cdd:NF038329   260 GPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
851-1117 5.20e-39

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 151.98  E-value: 5.20e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  851 EGLKGEVGDqGNIGKIGETGPVGLPGEVGMTGSIGEKGERGSPGPLGPQGEKGvmgypgppgvpgpigplgLPGHVGARG 930
Cdd:NF038329   108 EGLQQLKGD-GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERG------------------EKGPAGPQG 168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  931 PPGSQGPKGQRGSRGPDGLLGEQGIQGAKGEKGDQGKRGPHGLIGKTGNPGERGFQGKPGLQGlpgstgdrglPGEPGLR 1010
Cdd:NF038329   169 EAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG----------DGQQGPD 238
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1011 GLQGDVGPPGEMGMEGPPGTEGESGLQGEPGAKGDVGTAGSvggTGEPGLRGEPGAPGEEGLQGKDGLKGVPGGRGLPGE 1090
Cdd:NF038329   239 GDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGE---RGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGK 315
                          250       260
                   ....*....|....*....|....*..
gi 2462507488 1091 DGEKGEMGLPGIIGPLGRSGQTGLPGP 1117
Cdd:NF038329   316 DGKDGQPGKDGLPGKDGKDGQPGKPAP 342
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1008-1288 2.44e-37

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 147.36  E-value: 2.44e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1008 GLRGLQGDvGPPGEMGMEGPPGTEGESGLQGEPGAKGDVGTAGSVGGTGEPGLRGEPGAPGEEGLQGKDGLKGVPGGRGL 1087
Cdd:NF038329   109 GLQQLKGD-GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGP 187
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1088 PGEDGEKGEMGLPGiigPLGRSGQTGLPGPEGIVGIPGQRGRPGKKGDkgqigptGEVGSRGPPGKIGKSGPKGargtrg 1167
Cdd:NF038329   188 AGEKGPQGPRGETG---PAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD-------GQQGPDGDPGPTGEDGPQG------ 251
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1168 avghlglmgPDGEPGIPGYRGHQGQPGPSGLPGPKGEKgypgedstvlgppgprgepgpvgdqGERGEPGAEGYKGHVGV 1247
Cdd:NF038329   252 ---------PDGPAGKDGPRGDRGEAGPDGPDGKDGER-------------------------GPVGPAGKDGQNGKDGL 297
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 2462507488 1248 PGLRGATGQQGPPGEPGDQGEQGLKGERGSEGNKGKKGAPG 1288
Cdd:NF038329   298 PGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
COLFI pfam01410
Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1528-1715 3.03e-36

Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 460199  Cd Length: 233  Bit Score: 137.86  E-value: 3.03e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1528 HSEEIFKTLNYLSNLLHSIKNPLGTRDNPARICKDLLNCEQKVSD-------------DAIEVFCNFSAGgQTCLPP--- 1591
Cdd:pfam01410    1 RDEEVMATLKSLSQQIENIRSPDGSKKNPARTCRDLKLCHPDWKSgeywidpnqgctrDAIKVFCNFETG-ETCIYPtka 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1592 -----VSVTK----------------LEFGVGK-------VQMNFLHLLSSEATHIITIHCLNTPRWTSTQTSGPGLPIG 1643
Cdd:pfam01410   80 siprkNWWTKeskhvwfgefmnggsqFSYGVDGvgpsvaaVQLTFLRLLSTEASQNITYHCKNSVAYMDQATGNLKKALL 159
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462507488 1644 FKGWNGQIFKV--NTLLEPKVLSDDCKIQDGSWHKATFLFHTQEPNQLPVIEVQKLPHLKTERKYYIDSSSVCF 1715
Cdd:pfam01410  160 LQGSNDEEIRAegNSRFTYTVLEDGCTKRTGQWGKTVIEYRTQKVSRLPIVDIAPMDIGGADQEFGVEVGPVCF 233
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1091-1328 5.00e-36

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 143.51  E-value: 5.00e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1091 DGEKGEMGlpgiigPLGRSGQTGLPGPEGIVGIPGQRGRPGKKGDKGQIGPTGEVGSRGPPGKIGKSGPKGARGTRGAVG 1170
Cdd:NF038329   116 DGEKGEPG------PAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAG 189
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1171 HLGLMGPDGEPGIPGYRGHQGQPGPSGLPGPKGEKGYPGEDSTvlGPPGPRGEPgpvgdqGERGEPGAEGYKGHVGVPGL 1250
Cdd:NF038329   190 EKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD--GQQGPDGDP------GPTGEDGPQGPDGPAGKDGP 261
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462507488 1251 RGATGQQGPPGEPGDQGEQGLKGERGSEGNKGKKGAPGPSGKPGIPGLQGLLGPKGIQGYHGADGISGNPGKIGPPGK 1328
Cdd:NF038329   262 RGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGK 339
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
804-1044 2.36e-35

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 141.20  E-value: 2.36e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  804 GLLGNRGPPGPPGLKGTQGEEGPIGAFGELGPRGKPGQKGYAGEPGPEGLKGEVGDQGNIGKIGETGPVGLPGEVGMTGS 883
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  884 IGEKGERGSPGPLGPQGEKGvmgypgppgvpgpigplgLPGHVGARGPPGSQGpKGQRGSRGPDGLLGEQGIQGAKGEKG 963
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDG------------------EAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAG 257
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  964 DQGKRGPHGLIGKTGNPGERGFQGKPGLQGLPGSTGDRGLPGEPGLRGLQGDVGPPGEMGMEGPPGTEGESGLQGEPGAK 1043
Cdd:NF038329   258 KDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQP 337

                   .
gi 2462507488 1044 G 1044
Cdd:NF038329   338 G 338
COLFI smart00038
Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1530-1716 3.04e-33

Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 197483  Cd Length: 232  Bit Score: 129.13  E-value: 3.04e-33
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  1530 EEIFKTLNYLSNLLHSIKNPLGTRDNPARICKDLLNCEQKVSD-------------DAIEVFCNFsAGGQTCLPP----- 1591
Cdd:smart00038    2 EEVFASLKSLNNQIEQLKSPTGSRKNPARTCKDLKLCHPEWKSgeywvdpnqgcirDAIKVFCNF-ETGETCVSPspssi 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  1592 -------------------VSVTKLEFG------VGKVQMNFLHLLSSEATHIITIHCLNTPRWTSTQTSGPGLPIGFKG 1646
Cdd:smart00038   81 prktwysgkskhvwfgetmNGGFKFSYGdsegppVGVVQLTFLRLLSTEAHQNITYHCKNSVAYMDEATGNLKKALRLRG 160
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462507488  1647 WNGQIFK--VNTLLEPKVLSDDCKIQDGSWHKATFLFHTQEPNQLPVIEVQKLPHLKTERKYYIDSSSVCFL 1716
Cdd:smart00038  161 SNDVELSaeGNSKFTYEVLEDGCQKRTGKWGKTVIEYRTKKTERLPIVDIAPSDIGGPDQEFGVEIGPVCFS 232
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1227-1464 1.31e-32

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 133.11  E-value: 1.31e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1227 VGDQGERGEPGAEGYkghvgvpglRGATGQQGPPGEPGDQGEQGLKGERGSEGNKGKKGAPGPSGKPgipglqgllGPKG 1306
Cdd:NF038329   125 AGPAGPAGEQGPRGD---------RGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEA---------GAKG 186
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1307 IQGYHGADGISGNPGKIGPPGKQGLPGIRGGPGRTGLAGAPGPPGvKGSSGLPGSPGIQGPKGEQGLPGQPGIQGKRGHR 1386
Cdd:NF038329   187 PAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDR 265
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462507488 1387 GAQGDQGPCGDPGLKGQPGEYGVQGLTGFQGFPGPKGPEGDAGIVGISGPKGPIGHRGNTGPLGREGIIGPTGRTGPR 1464
Cdd:NF038329   266 GEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPK 343
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
512-756 1.40e-31

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 130.03  E-value: 1.40e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  512 RGPRGIPGPHGNPGLPGLPGPKGPKGDpgfspgqpvPGEKGDQGLSGLMGPPGMQGDKGLKGHPGLPGLPGEQGIPGFAG 591
Cdd:NF038329   128 AGPAGEQGPRGDRGETGPAGPAGPPGP---------QGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRG 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  592 NIGSPGYPGRQGLAGPEGNPGPKGAQGFIGSPGEagqlgpegergipGIRGKKGFKGRQGFPGDFGDRGPAGLDGSPGlv 671
Cdd:NF038329   199 ETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD-------------GQQGPDGDPGPTGEDGPQGPDGPAGKDGPRG-- 263
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  672 ggtgppgfpgLRGSVGPVGPIGPAGIPGPMGLSGNKGLPGIKGDKGEQGTAGELGEPGYPGDKGAVGLPGPPGMRGKSGP 751
Cdd:NF038329   264 ----------DRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGK 333

                   ....*
gi 2462507488  752 SGQTG 756
Cdd:NF038329   334 DGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
706-1007 3.85e-31

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 128.87  E-value: 3.85e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  706 NKGLPGIKGD--KGEQGTAGELGEPGYPGDKGAVGLPGPPGMRGKSGPSGQTGDPglqgpsgppgpegfpgdiGIPGQNG 783
Cdd:NF038329   107 DEGLQQLKGDgeKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEK------------------GPAGPQG 168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  784 PEGPKelvilwkhepfisekgllgnrgppgppglkGTQGEEGPIGAFGELGPRGKPGQKGYAGEPGPEGLKGEVGDQGNI 863
Cdd:NF038329   169 EAGPQ------------------------------GPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEA 218
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  864 GKIGETGPVGLPGEvGMTGSIGEKGERGSPGPLGPQGEKGVmgypgppgvpgpigplglPGHVGARGPPGSQGPKGQRGS 943
Cdd:NF038329   219 GPAGEDGPAGPAGD-GQQGPDGDPGPTGEDGPQGPDGPAGK------------------DGPRGDRGEAGPDGPDGKDGE 279
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462507488  944 RGPDGLLGEQGIQGAKGEKGDQGKRGPHgliGKTGNPGERGFQGKPGLQGLPGSTGDRGLPGEP 1007
Cdd:NF038329   280 RGPVGPAGKDGQNGKDGLPGKDGKDGQN---GKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKP 340
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
600-865 8.62e-28

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 118.85  E-value: 8.62e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  600 GRQGLAGPEGNPGPKGAQGFIGSPGEAGQLGPEGERGIPGIRGKKGFKGRQGFPGDfgdRGPAGLDGSPGLVGGTGPPGF 679
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGP---QGPAGKDGEAGAKGPAGEKGP 193
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  680 PGLRGSVGPVGPIGPAGIPGPMGLSGNKGLPGIKGD--KGEQGTAGELGEPGYPGDKGAVGLPGPPGMRGKSGPSGQTGD 757
Cdd:NF038329   194 QGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPagDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGP 273
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  758 PglqgpsgppgpegfpgdiGIPGQNGPEGPkelvilwkhepfisekgllgnrgpPGPPGLKGTQGEEGPIGAFGELGPRG 837
Cdd:NF038329   274 D------------------GKDGERGPVGP------------------------AGKDGQNGKDGLPGKDGKDGQNGKDG 311
                          250       260
                   ....*....|....*....|....*...
gi 2462507488  838 KPGQKGYAGEPGPEGLKGEVGDQGNIGK 865
Cdd:NF038329   312 LPGKDGKDGQPGKDGLPGKDGKDGQPGK 339
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
652-903 2.49e-24

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 108.45  E-value: 2.49e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  652 FPGDFGDRGPAGLDGSPGLVggtgppgfpglrGSVGPVGPIGPAGIPGPMGLSGNKGLPGIKGDKGEQGTAGELGEPGYP 731
Cdd:NF038329   115 GDGEKGEPGPAGPAGPAGEQ------------GPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEA 182
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  732 GDKGAVGLPGPPGMRGKSGPSGQTGdpglqgPSGPPGPEGFPGDIGIPGQNGPEGPKElvilwkhepfISEKGLLGNRGP 811
Cdd:NF038329   183 GAKGPAGEKGPQGPRGETGPAGEQG------PAGPAGPDGEAGPAGEDGPAGPAGDGQ----------QGPDGDPGPTGE 246
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  812 PGPPGLKGTQGEEGPIGAFGELGPRGKPGQKGYAGEPGPEGLKGEVGDQGNIGKIGETGPVGLPGEVGMTGSIGEKGERG 891
Cdd:NF038329   247 DGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDG 326
                          250
                   ....*....|..
gi 2462507488  892 SPGPLGPQGEKG 903
Cdd:NF038329   327 LPGKDGKDGQPG 338
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
957-1213 7.01e-14

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 76.22  E-value: 7.01e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  957 GAKGEKGDQGKRGPHGLIGKTGNPGERGFQGKPGLQGLPGSTGDRGLPGEPGLrglQGDVGPPGEMGMEGPPGTEGESGL 1036
Cdd:COG5164      7 GKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGN---TGGTRPAGNQGATGPAQNQGGTTP 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1037 QGEPGAKGDVGTAGSVGGTGEPGLRGEPGAPGEEGLQGKDGLKGVP--GGRGLPGEDGEK----GEMGLPGIIGPLGRSG 1110
Cdd:COG5164     84 AQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPsgGSTTPPGDGGSTppgpGSTGPGGSTTPPGDGG 163
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1111 QTGLPGPEGIVGIPGQRGR--PGKKGDKGQIGPTGEVGSRGPPGKIGKSGPKGARGTRGAVGhlGLMGPDGEPGIPGYRG 1188
Cdd:COG5164    164 STTPPGPGGSTTPPDDGGSttPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRG--GKTGPKDQRPKTNPIE 241
                          250       260
                   ....*....|....*....|....*
gi 2462507488 1189 HQGQPGPSGLPGPKGEKGYPGEDST 1213
Cdd:COG5164    242 RRGPERPEAAALPAELTALEAENRA 266
TSPN smart00210
Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of ...
41-227 3.57e-11

Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of thrombospondin


Pssm-ID: 214560  Cd Length: 184  Bit Score: 63.92  E-value: 3.57e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488    41 GIDILHQLGLGGKdvrhSSPVTAVPSASTPLPqgvhltesGVIFKNDAYIETPFVKILPVNLGQPFTILTGLQSHRVNNA 120
Cdd:smart00210    1 GQDLLQVFDLPSL----SFAIRQVVGPEPGSP--------AYRLGDPALVPQPTRDLFPSGLPEDFSLLTTFRQTPKSRG 68
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488   121 FLFSIRNK-NRLQLGVQL--LPKKLVVH---IRGKQPAVF--NYSVHDEQWHSFAITIRNQSVSMFVECGKKYfsTETIP 192
Cdd:smart00210   69 VLFAIYDAqNVRQFGLEVdgRANTLLLRyqgVDGKQHTVSfrNLPLADGQWHKLALSVSGSSATLYVDCNEID--SRPLD 146
                           170       180       190
                    ....*....|....*....|....*....|....*..
gi 2462507488   193 E--VQTFDSNSVFTLGSMNNNSIHFEGIVCQLDIIPS 227
Cdd:smart00210  147 RpgQPPIDTDGIEVRGAQAADRKPFQGDLQQLKIVCD 183
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
1193-1463 3.68e-11

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 67.75  E-value: 3.68e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1193 PGPSGLPGPKGEKGYPGeDSTVLGPPGPRGEPGPVGDQGERGEPGAEGYKGHVGVPGLRGATGQQGPPGEPGDQGEQGLK 1272
Cdd:COG5164      6 PGKTGPSDPGGVTTPAG-SQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPA 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1273 GERGSEGNKGKKGAPGPSGKPGIPGLQGLLGPKGIQGYHGADGISGNpGKIGPPGKQGlpGIRGGPGRTGLAGAPGPPGV 1352
Cdd:COG5164     85 QNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSG-GSTTPPGDGG--STPPGPGSTGPGGSTTPPGD 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1353 KGSSGLPGSPGIQGPKGEQGlPGQPGIQGKRGHRGAQGDQGPCGDPGLKGQPGEYGVQGLTGFQGfpGPKGPEGDAGIVG 1432
Cdd:COG5164    162 GGSTTPPGPGGSTTPPDDGG-STTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRG--GKTGPKDQRPKTN 238
                          250       260       270
                   ....*....|....*....|....*....|.
gi 2462507488 1433 ISGPKGPIGHRGNTGPLGREGIIGPTGRTGP 1463
Cdd:COG5164    239 PIERRGPERPEAAALPAELTALEAENRAANP 269
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1321-1377 3.55e-07

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 48.64  E-value: 3.55e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462507488 1321 GKIGPPGKQGLPGIRGGPGRTGLAGAPGPPGVKGSSGLPGSPGIQGPKGEQGLPGQP 1377
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
561-616 1.41e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 46.72  E-value: 1.41e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488  561 GPPGMQGDKGLKGHPGLPGLPGEQGIPGFAGNIGSPGYPGRQGLAGPEGNPGPKGA 616
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
1057-1420 4.68e-06

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 51.49  E-value: 4.68e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1057 EPGLRGEPGapgeEGLQGKDGLKGVPGGRGLPGEDGEKGEMglPGIIGPLGRSGQTGLPGPEGIVGIPGQRGRPGKkGDK 1136
Cdd:pfam03157  172 QSGQRQQPG----QGQQLRQGQQGQQSGQGQPGYYPTSSQQ--PGQLQQTGQGQQGQQPERGQQGQQPGQGQQPGQ-GQQ 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1137 GQIGPTGEVGSRGPPGKIGKSGPKGARGTRGAVGHLGLMGPDGEPGIPGYRGHQgqPGPSGLPGPKGEKGYPGEDSTvlG 1216
Cdd:pfam03157  245 GQQPGQPQQLGQGQQGYYPISPQQPRQWQQSGQGQQGYYPTSLQQPGQGQSGYY--PTSQQQAGQLQQEQQLGQEQQ--D 320
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1217 PPGPRGEPGPVGDQGERGEPGAEGYKGHVGVPGLRGATGQQGPPGEPGDQGEQGLKGERGSEGNKGKKGAPGPSGKPGIP 1296
Cdd:pfam03157  321 QQPGQGRQGQQPGQGQQGQQPAQGQQPGQGQPGYYPTSPQQPGQGQPGYYPTSQQQPQQGQQPEQGQQGQQQGQGQQGQQ 400
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1297 GLQGLLGPKGIQGYHGAD---GISGNPGKIGPPGKQGLPGIRGGPGRTGLAGAPGPPGVKGSSGLPGSPGiQGPKGEQGL 1373
Cdd:pfam03157  401 PGQGQQPGQGQPGYYPTSpqqSGQGQPGYYPTSPQQSGQGQQPGQGQQPGQEQPGQGQQPGQGQQGQQPG-QPEQGQQPG 479
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*..
gi 2462507488 1374 PGQPGIQGKRGHRGAQGDQGPCGDPGLKGQPGEYGVQGLTGFQGFPG 1420
Cdd:pfam03157  480 QGQPGYYPTSPQQSGQGQQLGQWQQQGQGQPGYYPTSPLQPGQGQPG 526
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1017-1071 8.69e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 44.79  E-value: 8.69e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488 1017 GPPGEMGMEGPPGTEGESGLQGEPGAKGDVGTAGSVGGTGEPGLRGEPGAPGEEG 1071
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
PHA03169 PHA03169
hypothetical protein; Provisional
1187-1349 3.42e-04

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 44.96  E-value: 3.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1187 RGHQGQPGPSGLPGPKGEKGYPGEDSTVLGPPGPRGEPGPVGDQGERGEPGAEGYKGHvgvpglrGATGQQGPPGEPGDQ 1266
Cdd:PHA03169    86 ERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSH-------PGPHEPAPPESHNPS 158
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1267 GEQGLKGERGSEGNKGKKGAPGPSGKPGIPGLQGLLGPKGIQGYHGADGiSGNPGKIGPPGKQGLPGIRGGPGRTGLAGA 1346
Cdd:PHA03169   159 PNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSP-PDEPGEPQSPTPQQAPSPNTQQAVEHEDEP 237

                   ...
gi 2462507488 1347 PGP 1349
Cdd:PHA03169   238 TEP 240
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
638-903 3.58e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 45.02  E-value: 3.58e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  638 PGIRGKKGFKGRQGFPGDFGDRGPAGLDGSpglvggTG------PPGFPGLRGSVGPVGPIG---PAGIPGPMGLSGNKG 708
Cdd:COG5164      6 PGKTGPSDPGGVTTPAGSQGSTKPAQNQGS------TRpagntgGTRPAQNQGSTTPAGNTGgtrPAGNQGATGPAQNQG 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  709 LPGIKGDKGEQGTAGELGEPGYPGDKGAVGLPGPPGMRGKSGPSGQTGDPGLQGPSGPPGPEGFPGDIGIPGQNGPEGPK 788
Cdd:COG5164     80 GTTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPGGSTTPP 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  789 ELVILWKHEPFISEKGLLGNRGPPGPP--GLKGTQGEEGPIGAFGELGPRGKPGQKGYAGEPGPEGLKGEVGDQ-GNIGK 865
Cdd:COG5164    160 GDGGSTTPPGPGGSTTPPDDGGSTTPPnkGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKTGPKDQrPKTNP 239
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 2462507488  866 IGETGPVGLPGEVGMTGSIGEKGER--GSPGPLGPQGEKG 903
Cdd:COG5164    240 IERRGPERPEAAALPAELTALEAENraANPEPATKTIPET 279
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1418-1475 7.63e-04

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 44.13  E-value: 7.63e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462507488 1418 FPGPKGPEGDAGIVGISGPKGPIGHRGNTGPLGREGIIGPTGRTGPRGEKGFRGETGP 1475
Cdd:NF038329   115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGP 172
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
819-872 7.75e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.01  E-value: 7.75e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462507488  819 GTQGEEGPIGAFGELGPRGKPGQKGYAGEPGPEGLKGEVGDQGNIGKIGETGPV 872
Cdd:pfam01391    4 GPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
PHA03169 PHA03169
hypothetical protein; Provisional
925-1155 8.17e-04

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 43.81  E-value: 8.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  925 HVGARGPPGSQGPKGQR------------GSRGPDGLLGEQGIQGAKGEKGDQGKRGPHGLIGKTGNPGERGFQGKPGLQ 992
Cdd:PHA03169    26 HGGTREQAGRRRGTAARaakpappapttsGPQVRAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTP 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  993 GLPGSTGDRGLPGEPGLRGLQGDVGPPGEMGMEGPPGTEGEsglqGEPGAKGDVGTAGSVGGTGEPGLRGEPGAPGEEGL 1072
Cdd:PHA03169   106 SPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGP----HEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPP 181
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1073 QGKDGLKGVPGGRGLPGEDGEKGEMGlPGIIGPLGRSGQTGLPGPEGIVGI-----PGQRGRPGkKGDKGQIGPTGEVGS 1147
Cdd:PHA03169   182 TSEPEPDSPGPPQSETPTSSPPPQSP-PDEPGEPQSPTPQQAPSPNTQQAVehedePTEPEREG-PPFPGHRSHSYTVVG 259

                   ....*...
gi 2462507488 1148 RGPPGKIG 1155
Cdd:PHA03169   260 WKPSTRPG 267
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
81-212 2.23e-03

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 40.48  E-value: 2.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488   81 GVIFKNDAYIETPFvkilPVNLGQPFTILTGLQShRVNNAFLFSIRNKNRLQ-LGVQLLPKKLVVHIR-GKQPAVFNY-- 156
Cdd:cd00110      1 GVSFSGSSYVRLPT----LPAPRTRLSISFSFRT-TSPNGLLLYAGSQNGGDfLALELEDGRLVLRYDlGSGSLVLSSkt 75
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488  157 SVHDEQWHSFAITIRNQSVSMFVEcGKKYFSTETIPEVQTFDSNSVFTLGSMNNNS 212
Cdd:cd00110     76 PLNDGQWHSVSVERNGRSVTLSVD-GERVVESGSPGGSALLNLDGPLYLGGLPEDL 130
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
1143-1435 5.51e-03

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 41.14  E-value: 5.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1143 GEVGSRGppGKIGKSGPKGARGTRGAVGHLGLMGPDGEPGIpgyrghQGQPGPSGLPGPKGEKGYPGEDStvLGPPGPRG 1222
Cdd:cd21118    100 NEIGRQA--EDIIRHGVDAVHNSWQGSGGHGAYGSQGGPGV------QGHGIPGGTGGPWASGGNYGTNS--LGGSVGQG 169
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1223 EPGPVGDQGERGEpGAEGYKGHVGVPGLRGATGQQGPPGEpgdqgeqglkGERGSEGNKGKKGAPGPSGKPGIPGLQGll 1302
Cdd:cd21118    170 GNGGPLNYGTNSQ-GAVAQPGYGTVRGNNQNSGCTNPPPS----------GSHESFSNSGGSSSSGSSGSQGSHGSNG-- 236
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1303 gpkgiQGYHGADGISGNPGKIGppgkqglpGIRGGPGRTGLAGAPGPPGVKGSSGLPGSPGIQGPKGEQGLPGQPGIQGK 1382
Cdd:cd21118    237 -----QGSSGSSGGQGNGGNNG--------SSSSNSGNSGGSNGGSSGNSGSGSGGSSSGGSNGWGGSSSSGGSGGSGGG 303
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462507488 1383 RGHRgaqgdqgpCGDPGLK-GQPGEYGVQGLTGFQGFPGPKGPEGDAGIVGISG 1435
Cdd:cd21118    304 NKPE--------CNNPGNDvRMAGGGGSQGSKESSGSHGSNGGNGQAEAVGGLN 349
 
Name Accession Description Interval E-value
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
924-1160 8.85e-42

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 160.46  E-value: 8.85e-42
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  924 GHVGARGPPGSQGPKGQRGSRGPDGLLGEQGIQGAKGEKGDQGKRGPHGLIGKTGNPGERGFQGKPGLQGLPGSTGDRGL 1003
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1004 PGEPGLRGLQGDVGPPGEMGMEGPPGTEGESGLQGEpgakgdvgtaGSVGGTGEPGLRGEPGAPGEEGLQGKDGLKGVPG 1083
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD----------GQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRG 266
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2462507488 1084 GRGLPGEDGEKGEMGLPGIIGPLGRSGQTGLPGPEGIVGIPGQRGRPGKKGDKGQIGPTGEVGSRGPPGKIGKSGPK 1160
Cdd:NF038329   267 EAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPK 343
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
947-1185 2.71e-40

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 155.83  E-value: 2.71e-40
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  947 DGLLGEQGIQGAKGEKGDQGKRGPHGLIGKTGNPGERGFQGKpglqglPGSTGDRGLPGEPGLRGLQGDVGPPGEMGMEG 1026
Cdd:NF038329   107 DEGLQQLKGDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGP------AGPPGPQGERGEKGPAGPQGEAGPQGPAGKDG 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1027 PPGTEGESGLQGEPGAKGDVGTAGSVGGTGEPGLRGEPGAPGEEGLQGKDGlKGVPGGRGLPGEDGEKGEMGLPGIIGPL 1106
Cdd:NF038329   181 EAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAGKD 259
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 2462507488 1107 GRSGQTGLPGPEGIVGIPGQRGRPGKKGDKGQIGPTGEVGSRGPPGKIGKSGPKGARGTRGAVGHLGLMGPDGEPGIPG 1185
Cdd:NF038329   260 GPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
851-1117 5.20e-39

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 151.98  E-value: 5.20e-39
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  851 EGLKGEVGDqGNIGKIGETGPVGLPGEVGMTGSIGEKGERGSPGPLGPQGEKGvmgypgppgvpgpigplgLPGHVGARG 930
Cdd:NF038329   108 EGLQQLKGD-GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERG------------------EKGPAGPQG 168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  931 PPGSQGPKGQRGSRGPDGLLGEQGIQGAKGEKGDQGKRGPHGLIGKTGNPGERGFQGKPGLQGlpgstgdrglPGEPGLR 1010
Cdd:NF038329   169 EAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG----------DGQQGPD 238
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1011 GLQGDVGPPGEMGMEGPPGTEGESGLQGEPGAKGDVGTAGSvggTGEPGLRGEPGAPGEEGLQGKDGLKGVPGGRGLPGE 1090
Cdd:NF038329   239 GDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGE---RGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGK 315
                          250       260
                   ....*....|....*....|....*..
gi 2462507488 1091 DGEKGEMGLPGIIGPLGRSGQTGLPGP 1117
Cdd:NF038329   316 DGKDGQPGKDGLPGKDGKDGQPGKPAP 342
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1008-1288 2.44e-37

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 147.36  E-value: 2.44e-37
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1008 GLRGLQGDvGPPGEMGMEGPPGTEGESGLQGEPGAKGDVGTAGSVGGTGEPGLRGEPGAPGEEGLQGKDGLKGVPGGRGL 1087
Cdd:NF038329   109 GLQQLKGD-GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGP 187
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1088 PGEDGEKGEMGLPGiigPLGRSGQTGLPGPEGIVGIPGQRGRPGKKGDkgqigptGEVGSRGPPGKIGKSGPKGargtrg 1167
Cdd:NF038329   188 AGEKGPQGPRGETG---PAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD-------GQQGPDGDPGPTGEDGPQG------ 251
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1168 avghlglmgPDGEPGIPGYRGHQGQPGPSGLPGPKGEKgypgedstvlgppgprgepgpvgdqGERGEPGAEGYKGHVGV 1247
Cdd:NF038329   252 ---------PDGPAGKDGPRGDRGEAGPDGPDGKDGER-------------------------GPVGPAGKDGQNGKDGL 297
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|.
gi 2462507488 1248 PGLRGATGQQGPPGEPGDQGEQGLKGERGSEGNKGKKGAPG 1288
Cdd:NF038329   298 PGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
COLFI pfam01410
Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1528-1715 3.03e-36

Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 460199  Cd Length: 233  Bit Score: 137.86  E-value: 3.03e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1528 HSEEIFKTLNYLSNLLHSIKNPLGTRDNPARICKDLLNCEQKVSD-------------DAIEVFCNFSAGgQTCLPP--- 1591
Cdd:pfam01410    1 RDEEVMATLKSLSQQIENIRSPDGSKKNPARTCRDLKLCHPDWKSgeywidpnqgctrDAIKVFCNFETG-ETCIYPtka 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1592 -----VSVTK----------------LEFGVGK-------VQMNFLHLLSSEATHIITIHCLNTPRWTSTQTSGPGLPIG 1643
Cdd:pfam01410   80 siprkNWWTKeskhvwfgefmnggsqFSYGVDGvgpsvaaVQLTFLRLLSTEASQNITYHCKNSVAYMDQATGNLKKALL 159
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462507488 1644 FKGWNGQIFKV--NTLLEPKVLSDDCKIQDGSWHKATFLFHTQEPNQLPVIEVQKLPHLKTERKYYIDSSSVCF 1715
Cdd:pfam01410  160 LQGSNDEEIRAegNSRFTYTVLEDGCTKRTGQWGKTVIEYRTQKVSRLPIVDIAPMDIGGADQEFGVEVGPVCF 233
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1091-1328 5.00e-36

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 143.51  E-value: 5.00e-36
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1091 DGEKGEMGlpgiigPLGRSGQTGLPGPEGIVGIPGQRGRPGKKGDKGQIGPTGEVGSRGPPGKIGKSGPKGARGTRGAVG 1170
Cdd:NF038329   116 DGEKGEPG------PAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAG 189
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1171 HLGLMGPDGEPGIPGYRGHQGQPGPSGLPGPKGEKGYPGEDSTvlGPPGPRGEPgpvgdqGERGEPGAEGYKGHVGVPGL 1250
Cdd:NF038329   190 EKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD--GQQGPDGDP------GPTGEDGPQGPDGPAGKDGP 261
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462507488 1251 RGATGQQGPPGEPGDQGEQGLKGERGSEGNKGKKGAPGPSGKPGIPGLQGLLGPKGIQGYHGADGISGNPGKIGPPGK 1328
Cdd:NF038329   262 RGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGK 339
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
804-1044 2.36e-35

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 141.20  E-value: 2.36e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  804 GLLGNRGPPGPPGLKGTQGEEGPIGAFGELGPRGKPGQKGYAGEPGPEGLKGEVGDQGNIGKIGETGPVGLPGEVGMTGS 883
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  884 IGEKGERGSPGPLGPQGEKGvmgypgppgvpgpigplgLPGHVGARGPPGSQGpKGQRGSRGPDGLLGEQGIQGAKGEKG 963
Cdd:NF038329   197 RGETGPAGEQGPAGPAGPDG------------------EAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAG 257
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  964 DQGKRGPHGLIGKTGNPGERGFQGKPGLQGLPGSTGDRGLPGEPGLRGLQGDVGPPGEMGMEGPPGTEGESGLQGEPGAK 1043
Cdd:NF038329   258 KDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQP 337

                   .
gi 2462507488 1044 G 1044
Cdd:NF038329   338 G 338
COLFI smart00038
Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1530-1716 3.04e-33

Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 197483  Cd Length: 232  Bit Score: 129.13  E-value: 3.04e-33
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  1530 EEIFKTLNYLSNLLHSIKNPLGTRDNPARICKDLLNCEQKVSD-------------DAIEVFCNFsAGGQTCLPP----- 1591
Cdd:smart00038    2 EEVFASLKSLNNQIEQLKSPTGSRKNPARTCKDLKLCHPEWKSgeywvdpnqgcirDAIKVFCNF-ETGETCVSPspssi 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  1592 -------------------VSVTKLEFG------VGKVQMNFLHLLSSEATHIITIHCLNTPRWTSTQTSGPGLPIGFKG 1646
Cdd:smart00038   81 prktwysgkskhvwfgetmNGGFKFSYGdsegppVGVVQLTFLRLLSTEAHQNITYHCKNSVAYMDEATGNLKKALRLRG 160
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2462507488  1647 WNGQIFK--VNTLLEPKVLSDDCKIQDGSWHKATFLFHTQEPNQLPVIEVQKLPHLKTERKYYIDSSSVCFL 1716
Cdd:smart00038  161 SNDVELSaeGNSKFTYEVLEDGCQKRTGKWGKTVIEYRTKKTERLPIVDIAPSDIGGPDQEFGVEIGPVCFS 232
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1227-1464 1.31e-32

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 133.11  E-value: 1.31e-32
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1227 VGDQGERGEPGAEGYkghvgvpglRGATGQQGPPGEPGDQGEQGLKGERGSEGNKGKKGAPGPSGKPgipglqgllGPKG 1306
Cdd:NF038329   125 AGPAGPAGEQGPRGD---------RGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEA---------GAKG 186
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1307 IQGYHGADGISGNPGKIGPPGKQGLPGIRGGPGRTGLAGAPGPPGvKGSSGLPGSPGIQGPKGEQGLPGQPGIQGKRGHR 1386
Cdd:NF038329   187 PAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDR 265
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2462507488 1387 GAQGDQGPCGDPGLKGQPGEYGVQGLTGFQGFPGPKGPEGDAGIVGISGPKGPIGHRGNTGPLGREGIIGPTGRTGPR 1464
Cdd:NF038329   266 GEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPK 343
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
512-756 1.40e-31

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 130.03  E-value: 1.40e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  512 RGPRGIPGPHGNPGLPGLPGPKGPKGDpgfspgqpvPGEKGDQGLSGLMGPPGMQGDKGLKGHPGLPGLPGEQGIPGFAG 591
Cdd:NF038329   128 AGPAGEQGPRGDRGETGPAGPAGPPGP---------QGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRG 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  592 NIGSPGYPGRQGLAGPEGNPGPKGAQGFIGSPGEagqlgpegergipGIRGKKGFKGRQGFPGDFGDRGPAGLDGSPGlv 671
Cdd:NF038329   199 ETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGD-------------GQQGPDGDPGPTGEDGPQGPDGPAGKDGPRG-- 263
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  672 ggtgppgfpgLRGSVGPVGPIGPAGIPGPMGLSGNKGLPGIKGDKGEQGTAGELGEPGYPGDKGAVGLPGPPGMRGKSGP 751
Cdd:NF038329   264 ----------DRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGK 333

                   ....*
gi 2462507488  752 SGQTG 756
Cdd:NF038329   334 DGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
706-1007 3.85e-31

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 128.87  E-value: 3.85e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  706 NKGLPGIKGD--KGEQGTAGELGEPGYPGDKGAVGLPGPPGMRGKSGPSGQTGDPglqgpsgppgpegfpgdiGIPGQNG 783
Cdd:NF038329   107 DEGLQQLKGDgeKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEK------------------GPAGPQG 168
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  784 PEGPKelvilwkhepfisekgllgnrgppgppglkGTQGEEGPIGAFGELGPRGKPGQKGYAGEPGPEGLKGEVGDQGNI 863
Cdd:NF038329   169 EAGPQ------------------------------GPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEA 218
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  864 GKIGETGPVGLPGEvGMTGSIGEKGERGSPGPLGPQGEKGVmgypgppgvpgpigplglPGHVGARGPPGSQGPKGQRGS 943
Cdd:NF038329   219 GPAGEDGPAGPAGD-GQQGPDGDPGPTGEDGPQGPDGPAGK------------------DGPRGDRGEAGPDGPDGKDGE 279
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2462507488  944 RGPDGLLGEQGIQGAKGEKGDQGKRGPHgliGKTGNPGERGFQGKPGLQGLPGSTGDRGLPGEP 1007
Cdd:NF038329   280 RGPVGPAGKDGQNGKDGLPGKDGKDGQN---GKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKP 340
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
600-865 8.62e-28

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 118.85  E-value: 8.62e-28
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  600 GRQGLAGPEGNPGPKGAQGFIGSPGEAGQLGPEGERGIPGIRGKKGFKGRQGFPGDfgdRGPAGLDGSPGLVGGTGPPGF 679
Cdd:NF038329   117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGP---QGPAGKDGEAGAKGPAGEKGP 193
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  680 PGLRGSVGPVGPIGPAGIPGPMGLSGNKGLPGIKGD--KGEQGTAGELGEPGYPGDKGAVGLPGPPGMRGKSGPSGQTGD 757
Cdd:NF038329   194 QGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPagDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGP 273
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  758 PglqgpsgppgpegfpgdiGIPGQNGPEGPkelvilwkhepfisekgllgnrgpPGPPGLKGTQGEEGPIGAFGELGPRG 837
Cdd:NF038329   274 D------------------GKDGERGPVGP------------------------AGKDGQNGKDGLPGKDGKDGQNGKDG 311
                          250       260
                   ....*....|....*....|....*...
gi 2462507488  838 KPGQKGYAGEPGPEGLKGEVGDQGNIGK 865
Cdd:NF038329   312 LPGKDGKDGQPGKDGLPGKDGKDGQPGK 339
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
652-903 2.49e-24

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 108.45  E-value: 2.49e-24
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  652 FPGDFGDRGPAGLDGSPGLVggtgppgfpglrGSVGPVGPIGPAGIPGPMGLSGNKGLPGIKGDKGEQGTAGELGEPGYP 731
Cdd:NF038329   115 GDGEKGEPGPAGPAGPAGEQ------------GPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEA 182
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  732 GDKGAVGLPGPPGMRGKSGPSGQTGdpglqgPSGPPGPEGFPGDIGIPGQNGPEGPKElvilwkhepfISEKGLLGNRGP 811
Cdd:NF038329   183 GAKGPAGEKGPQGPRGETGPAGEQG------PAGPAGPDGEAGPAGEDGPAGPAGDGQ----------QGPDGDPGPTGE 246
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  812 PGPPGLKGTQGEEGPIGAFGELGPRGKPGQKGYAGEPGPEGLKGEVGDQGNIGKIGETGPVGLPGEVGMTGSIGEKGERG 891
Cdd:NF038329   247 DGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDG 326
                          250
                   ....*....|..
gi 2462507488  892 SPGPLGPQGEKG 903
Cdd:NF038329   327 LPGKDGKDGQPG 338
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
957-1213 7.01e-14

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 76.22  E-value: 7.01e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  957 GAKGEKGDQGKRGPHGLIGKTGNPGERGFQGKPGLQGLPGSTGDRGLPGEPGLrglQGDVGPPGEMGMEGPPGTEGESGL 1036
Cdd:COG5164      7 GKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGN---TGGTRPAGNQGATGPAQNQGGTTP 83
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1037 QGEPGAKGDVGTAGSVGGTGEPGLRGEPGAPGEEGLQGKDGLKGVP--GGRGLPGEDGEK----GEMGLPGIIGPLGRSG 1110
Cdd:COG5164     84 AQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPsgGSTTPPGDGGSTppgpGSTGPGGSTTPPGDGG 163
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1111 QTGLPGPEGIVGIPGQRGR--PGKKGDKGQIGPTGEVGSRGPPGKIGKSGPKGARGTRGAVGhlGLMGPDGEPGIPGYRG 1188
Cdd:COG5164    164 STTPPGPGGSTTPPDDGGSttPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRG--GKTGPKDQRPKTNPIE 241
                          250       260
                   ....*....|....*....|....*
gi 2462507488 1189 HQGQPGPSGLPGPKGEKGYPGEDST 1213
Cdd:COG5164    242 RRGPERPEAAALPAELTALEAENRA 266
TSPN smart00210
Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of ...
41-227 3.57e-11

Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of thrombospondin


Pssm-ID: 214560  Cd Length: 184  Bit Score: 63.92  E-value: 3.57e-11
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488    41 GIDILHQLGLGGKdvrhSSPVTAVPSASTPLPqgvhltesGVIFKNDAYIETPFVKILPVNLGQPFTILTGLQSHRVNNA 120
Cdd:smart00210    1 GQDLLQVFDLPSL----SFAIRQVVGPEPGSP--------AYRLGDPALVPQPTRDLFPSGLPEDFSLLTTFRQTPKSRG 68
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488   121 FLFSIRNK-NRLQLGVQL--LPKKLVVH---IRGKQPAVF--NYSVHDEQWHSFAITIRNQSVSMFVECGKKYfsTETIP 192
Cdd:smart00210   69 VLFAIYDAqNVRQFGLEVdgRANTLLLRyqgVDGKQHTVSfrNLPLADGQWHKLALSVSGSSATLYVDCNEID--SRPLD 146
                           170       180       190
                    ....*....|....*....|....*....|....*..
gi 2462507488   193 E--VQTFDSNSVFTLGSMNNNSIHFEGIVCQLDIIPS 227
Cdd:smart00210  147 RpgQPPIDTDGIEVRGAQAADRKPFQGDLQQLKIVCD 183
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
1193-1463 3.68e-11

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 67.75  E-value: 3.68e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1193 PGPSGLPGPKGEKGYPGeDSTVLGPPGPRGEPGPVGDQGERGEPGAEGYKGHVGVPGLRGATGQQGPPGEPGDQGEQGLK 1272
Cdd:COG5164      6 PGKTGPSDPGGVTTPAG-SQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPA 84
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1273 GERGSEGNKGKKGAPGPSGKPGIPGLQGLLGPKGIQGYHGADGISGNpGKIGPPGKQGlpGIRGGPGRTGLAGAPGPPGV 1352
Cdd:COG5164     85 QNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSG-GSTTPPGDGG--STPPGPGSTGPGGSTTPPGD 161
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1353 KGSSGLPGSPGIQGPKGEQGlPGQPGIQGKRGHRGAQGDQGPCGDPGLKGQPGEYGVQGLTGFQGfpGPKGPEGDAGIVG 1432
Cdd:COG5164    162 GGSTTPPGPGGSTTPPDDGG-STTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRG--GKTGPKDQRPKTN 238
                          250       260       270
                   ....*....|....*....|....*....|.
gi 2462507488 1433 ISGPKGPIGHRGNTGPLGREGIIGPTGRTGP 1463
Cdd:COG5164    239 PIERRGPERPEAAALPAELTALEAENRAANP 269
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1321-1377 3.55e-07

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 48.64  E-value: 3.55e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462507488 1321 GKIGPPGKQGLPGIRGGPGRTGLAGAPGPPGVKGSSGLPGSPGIQGPKGEQGLPGQP 1377
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1324-1378 5.05e-07

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 48.26  E-value: 5.05e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488 1324 GPPGKQGLPGIRGGPGRTGLAGAPGPPGVKGSSGLPGSPGIQGPKGEQGLPGQPG 1378
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1315-1370 5.15e-07

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 48.26  E-value: 5.15e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488 1315 GISGNPGKIGPPGKQGLPGIRGGPGRTGLAGAPGPPGVKGSSGLPGSPGIQGPKGE 1370
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1342-1398 9.18e-07

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 47.49  E-value: 9.18e-07
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462507488 1342 GLAGAPGPPGVKGSSGLPGSPGIQGPKGEQGLPGQPGIQGKRGHRGAQGDQGPCGDP 1398
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
561-616 1.41e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 46.72  E-value: 1.41e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488  561 GPPGMQGDKGLKGHPGLPGLPGEQGIPGFAGNIGSPGYPGRQGLAGPEGNPGPKGA 616
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1357-1411 3.04e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 45.95  E-value: 3.04e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488 1357 GLPGSPGIQGPKGEQGLPGQPGIQGKRGHRGAQGDQGPCGDPGLKGQPGEYGVQG 1411
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
1057-1420 4.68e-06

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 51.49  E-value: 4.68e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1057 EPGLRGEPGapgeEGLQGKDGLKGVPGGRGLPGEDGEKGEMglPGIIGPLGRSGQTGLPGPEGIVGIPGQRGRPGKkGDK 1136
Cdd:pfam03157  172 QSGQRQQPG----QGQQLRQGQQGQQSGQGQPGYYPTSSQQ--PGQLQQTGQGQQGQQPERGQQGQQPGQGQQPGQ-GQQ 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1137 GQIGPTGEVGSRGPPGKIGKSGPKGARGTRGAVGHLGLMGPDGEPGIPGYRGHQgqPGPSGLPGPKGEKGYPGEDSTvlG 1216
Cdd:pfam03157  245 GQQPGQPQQLGQGQQGYYPISPQQPRQWQQSGQGQQGYYPTSLQQPGQGQSGYY--PTSQQQAGQLQQEQQLGQEQQ--D 320
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1217 PPGPRGEPGPVGDQGERGEPGAEGYKGHVGVPGLRGATGQQGPPGEPGDQGEQGLKGERGSEGNKGKKGAPGPSGKPGIP 1296
Cdd:pfam03157  321 QQPGQGRQGQQPGQGQQGQQPAQGQQPGQGQPGYYPTSPQQPGQGQPGYYPTSQQQPQQGQQPEQGQQGQQQGQGQQGQQ 400
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1297 GLQGLLGPKGIQGYHGAD---GISGNPGKIGPPGKQGLPGIRGGPGRTGLAGAPGPPGVKGSSGLPGSPGiQGPKGEQGL 1373
Cdd:pfam03157  401 PGQGQQPGQGQPGYYPTSpqqSGQGQPGYYPTSPQQSGQGQQPGQGQQPGQEQPGQGQQPGQGQQGQQPG-QPEQGQQPG 479
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*..
gi 2462507488 1374 PGQPGIQGKRGHRGAQGDQGPCGDPGLKGQPGEYGVQGLTGFQGFPG 1420
Cdd:pfam03157  480 QGQPGYYPTSPQQSGQGQQLGQWQQQGQGQPGYYPTSPLQPGQGQPG 526
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1252-1306 6.47e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 44.79  E-value: 6.47e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488 1252 GATGQQGPPGEPGDQGEQGLKGERGSEGNKGKKGAPGPSGKPGIPGLQGLLGPKG 1306
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
573-628 6.66e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 44.79  E-value: 6.66e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488  573 GHPGLPGLPGEQGIPGFAGNIGSPGYPGRQGLAGPEGNPGPKGAQGFIGSPGEAGQ 628
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
687-743 6.80e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 44.79  E-value: 6.80e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462507488  687 GPVGPIGPAGIPGPMGLSGNKGLPGIKGDKGEQGTAGELGEPGYPGDKGAVGLPGPP 743
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1155-1210 8.35e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 44.79  E-value: 8.35e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488 1155 GKSGPKGARGTRGAVGHLGLMGPDGEPGIPGYRGHQGQPGPSGLPGPKGEKGYPGE 1210
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1017-1071 8.69e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 44.79  E-value: 8.69e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488 1017 GPPGEMGMEGPPGTEGESGLQGEPGAKGDVGTAGSVGGTGEPGLRGEPGAPGEEG 1071
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1330-1384 9.21e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 44.41  E-value: 9.21e-06
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488 1330 GLPGIRGGPGRTGLAGAPGPPGVKGSSGLPGSPGIQGPKGEQGLPGQPGIQGKRG 1384
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1372-1428 1.08e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 44.41  E-value: 1.08e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462507488 1372 GLPGQPGIQGKRGHRGAQGDQGPCGDPGLKGQPGEYGVQGLTGFQGFPGPKGPEGDA 1428
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1246-1300 1.18e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 44.41  E-value: 1.18e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488 1246 GVPGLRGATGQQGPPGEPGDQGEQGLKGERGSEGNKGKKGAPGPSGKPGIPGLQG 1300
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1309-1363 2.74e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.25  E-value: 2.74e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488 1309 GYHGADGISGNPGKIGPPGKQGLPGIRGGPGRTGLAGAPGPPGVKGSSGLPGSPG 1363
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1255-1309 2.79e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.25  E-value: 2.79e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488 1255 GQQGPPGEPGDQGEQGLKGERGSEGNKGKKGAPGPSGKPGIPGLQGLLGPKGIQG 1309
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
975-1029 3.02e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.25  E-value: 3.02e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488  975 GKTGNPGERGFQGKPGLQGLPGSTGDRGLPGEPGLRGLQGDVGPPGEMGMEGPPG 1029
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1056-1111 3.14e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.87  E-value: 3.14e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488 1056 GEPGLRGEPGAPGEEGLQGKDGLKGVPGGRGLPGEDGEKGEMGLPGIIGPLGRSGQ 1111
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
966-1021 3.37e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.87  E-value: 3.37e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488  966 GKRGPHGLIGKTGNPGERGFQGKPGLQGLPGSTGDRGLPGEPGLRGLQGDVGPPGE 1021
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
987-1042 3.43e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.87  E-value: 3.43e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488  987 GKPGLQGLPGSTGDRGLPGEPGLRGLQGDVGPPGEMGMEGPPGTEGESGLQGEPGA 1042
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1071-1127 3.86e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.87  E-value: 3.86e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462507488 1071 GLQGKDGLKGVPGGRGLPGEDGEKGEMGLPGIIGPLGRSGQTGLPGPEGIVGIPGQR 1127
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
576-631 3.94e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.87  E-value: 3.94e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488  576 GLPGLPGEQGIPGFAGNIGSPGYPGRQGLAGPEGNPGPKGAQGFIGSPGEAGQLGP 631
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
693-748 4.80e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.48  E-value: 4.80e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488  693 GPAGIPGPMGLSGNKGLPGIKGDKGEQGTAGELGEPGYPGDKGAVGLPGPPGMRGK 748
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
564-618 5.34e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.48  E-value: 5.34e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488  564 GMQGDKGLKGHPGLPGLPGEQGIPGFAGNIGSPGYPGRQGLAGPEGNPGPKGAQG 618
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1312-1367 5.34e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.48  E-value: 5.34e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488 1312 GADGISGNPGKIGPPGKQGLPGIRGGPGRTGLAGAPGPPGVKGSSGLPGSPGIQGP 1367
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
984-1040 5.34e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.48  E-value: 5.34e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462507488  984 GFQGKPGLQGLPGSTGDRGLPGEPGLRGLQGDVGPPGEMGMEGPPGTEGESGLQGEP 1040
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1134-1188 5.50e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.48  E-value: 5.50e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488 1134 GDKGQIGPTGEVGSRGPPGKIGKSGPKGARGTRGAVGHLGLMGPDGEPGIPGYRG 1188
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
930-984 5.84e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.10  E-value: 5.84e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488  930 GPPGSQGPKGQRGSRGPDGLLGEQGIQGAKGEKGDQGKRGPHGLIGKTGNPGERG 984
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
990-1044 6.19e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.10  E-value: 6.19e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488  990 GLQGLPGSTGDRGLPGEPGLRGLQGDVGPPGEMGMEGPPGTEGESGLQGEPGAKG 1044
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1354-1408 6.50e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.10  E-value: 6.50e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488 1354 GSSGLPGSPGIQGPKGEQGLPGQPGIQGKRGHRGAQGDQGPCGDPGLKGQPGEYG 1408
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1333-1388 7.39e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.10  E-value: 7.39e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488 1333 GIRGGPGRTGLAGAPGPPGVKGSSGLPGSPGIQGPKGEQGLPGQPGIQGKRGHRGA 1388
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
588-643 8.73e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.71  E-value: 8.73e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488  588 GFAGNIGSPGYPGRQGLAGPEGNPGPKGAQGFIGSPGEAGQLGPEGERGIPGIRGK 643
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1113-1169 1.06e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.71  E-value: 1.06e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462507488 1113 GLPGPEGIVGIPGQRGRPGKKGDKGQIGPTGEVGSRGPPGKIGKSGPKGARGTRGAV 1169
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1053-1107 1.14e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.33  E-value: 1.14e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488 1053 GGTGEPGLRGEPGAPGEEGLQGKDGLKGVPGGRGLPGEDGEKGEMGLPGIIGPLG 1107
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1270-1326 1.21e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.33  E-value: 1.21e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462507488 1270 GLKGERGSEGNKGKKGAPGPSGKPGIPGLQGLLGPKGIQGYHGADGISGNPGKIGPP 1326
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1038-1093 1.23e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.33  E-value: 1.23e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488 1038 GEPGAKGDVGTAGSVGGTGEPGLRGEPGAPGEEGLQGKDGLKGVPGGRGLPGEDGE 1093
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
LamG smart00282
Laminin G domain;
119-212 1.25e-04

Laminin G domain;


Pssm-ID: 214598 [Multi-domain]  Cd Length: 132  Bit Score: 43.48  E-value: 1.25e-04
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488   119 NAFLFSIRNKNRLQ-LGVQLLPKKLVVHIR-GKQPAVF---NYSVHDEQWHSFAITIRNQSVSMFVECGKKyFSTETIPE 193
Cdd:smart00282   12 NGLLLYAGSKGGGDyLALELRDGRLVLRYDlGSGPARLtsdPTPLNDGQWHRVAVERNGRSVTLSVDGGNR-VSGESPGG 90
                            90
                    ....*....|....*....
gi 2462507488   194 VQTFDSNSVFTLGSMNNNS 212
Cdd:smart00282   91 LTILNLDGPLYLGGLPEDL 109
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1306-1362 1.53e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.94  E-value: 1.53e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462507488 1306 GIQGYHGADGISGNPGKIGPPGKQGLPGIRGGPGRTGLAGAPGPPGVKGSSGLPGSP 1362
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1047-1101 1.83e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.94  E-value: 1.83e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488 1047 GTAGSVGGTGEPGLRGEPGAPGEEGLQGKDGLKGVPGGRGLPGEDGEKGEMGLPG 1101
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
603-659 2.33e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.55  E-value: 2.33e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462507488  603 GLAGPEGNPGPKGAQGFIGSPGEAGQLGPEGERGIPGIRGKKGFKGRQGFPGDFGDR 659
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1035-1090 2.60e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.55  E-value: 2.60e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488 1035 GLQGEPGAKGDVGTAGSVGGTGEPGLRGEPGAPGEEGLQGKDGLKGVPGGRGLPGE 1090
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
528-588 2.73e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.55  E-value: 2.73e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462507488  528 GLPGPKGPKGDPGFspgqpvPGEKGDQGLSGLMGPPGMQGDKGLKGHPGLPGLPGEQGIPG 588
Cdd:pfam01391    1 GPPGPPGPPGPPGP------PGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
954-1008 3.23e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.17  E-value: 3.23e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488  954 GIQGAKGEKGDQGKRGPHGLIGKTGNPGERGFQGKPGLQGLPGSTGDRGLPGEPG 1008
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
PHA03169 PHA03169
hypothetical protein; Provisional
1187-1349 3.42e-04

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 44.96  E-value: 3.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1187 RGHQGQPGPSGLPGPKGEKGYPGEDSTVLGPPGPRGEPGPVGDQGERGEPGAEGYKGHvgvpglrGATGQQGPPGEPGDQ 1266
Cdd:PHA03169    86 ERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSH-------PGPHEPAPPESHNPS 158
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1267 GEQGLKGERGSEGNKGKKGAPGPSGKPGIPGLQGLLGPKGIQGYHGADGiSGNPGKIGPPGKQGLPGIRGGPGRTGLAGA 1346
Cdd:PHA03169   159 PNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSP-PDEPGEPQSPTPQQAPSPNTQQAVEHEDEP 237

                   ...
gi 2462507488 1347 PGP 1349
Cdd:PHA03169   238 TEP 240
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
638-903 3.58e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 45.02  E-value: 3.58e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  638 PGIRGKKGFKGRQGFPGDFGDRGPAGLDGSpglvggTG------PPGFPGLRGSVGPVGPIG---PAGIPGPMGLSGNKG 708
Cdd:COG5164      6 PGKTGPSDPGGVTTPAGSQGSTKPAQNQGS------TRpagntgGTRPAQNQGSTTPAGNTGgtrPAGNQGATGPAQNQG 79
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  709 LPGIKGDKGEQGTAGELGEPGYPGDKGAVGLPGPPGMRGKSGPSGQTGDPGLQGPSGPPGPEGFPGDIGIPGQNGPEGPK 788
Cdd:COG5164     80 GTTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPGGSTTPP 159
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  789 ELVILWKHEPFISEKGLLGNRGPPGPP--GLKGTQGEEGPIGAFGELGPRGKPGQKGYAGEPGPEGLKGEVGDQ-GNIGK 865
Cdd:COG5164    160 GDGGSTTPPGPGGSTTPPDDGGSTTPPnkGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKTGPKDQrPKTNP 239
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|
gi 2462507488  866 IGETGPVGLPGEVGMTGSIGEKGER--GSPGPLGPQGEKG 903
Cdd:COG5164    240 IERRGPERPEAAALPAELTALEAENraANPEPATKTIPET 279
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
511-564 3.63e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.17  E-value: 3.63e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462507488  511 KRGPRGIPGPHGNPGLPGLPGPKGPKGDPGFSPGQPVPGEKGDQGLSGLMGPPG 564
Cdd:pfam01391    2 PPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1080-1138 4.21e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.78  E-value: 4.21e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2462507488 1080 GVPGGRGLPGEDGEKGEmglPGIIGPLGRSGQTGLPGPEGIVGIPGQRGRPGKKGDKGQ 1138
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGP---PGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
708-756 4.38e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.78  E-value: 4.38e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 2462507488  708 GLPGIKGDKGEQGTAGELGEPGYPGDKGAVGLPGPPGMRGKSGPSGQTG 756
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPG 49
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
600-654 4.56e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.78  E-value: 4.56e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488  600 GRQGLAGPEGNPGPKGAQGFIGSPGEAGQLGPEGERGIPGIRGKKGFKGRQGFPG 654
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
996-1050 4.60e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.78  E-value: 4.60e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488  996 GSTGDRGLPGEPGLRGLQGDVGPPGEMGMEGPPGTEGESGLQGEPGAKGDVGTAG 1050
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1107-1163 5.88e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.40  E-value: 5.88e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2462507488 1107 GRSGQTGLPGPEGIVGIPGQRGRPGKKGDKGQIGPTGEVGSRGPPGKIGKSGPKGAR 1163
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1104-1159 6.18e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.40  E-value: 6.18e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488 1104 GPLGRSGQTGLPGPEGIVGIPGQRGRPGKKGDKGQIGPTGEVGSRGPPGKIGKSGP 1159
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
926-1197 6.25e-04

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 44.61  E-value: 6.25e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  926 VGARGPPGSQ-GPKGQRGSRGPDGLLGEQGIQGAKGEKGDQGKRGPHGLIGKTGNPGERGFQGKPGLQGLPGSTGDRGLP 1004
Cdd:pfam09606  104 PGPGGPMGQQmGGPGTASNLLASLGRPQMPMGGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQG 183
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1005 GEPGLRGLQGD---VGPPGEMGMEGPPGT-EGESGLQGEPGAKGDVGTAGSVGGTGEPGLRGEPgaPGEEGLQGKDGLKG 1080
Cdd:pfam09606  184 QAGGMNGGQQGpmgGQMPPQMGVPGMPGPaDAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQ--PQQQGQQSQLGMGI 261
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1081 VPGGRgLPGEDGEKGEMGLPGIIGPLGRSGQTGLP---GPEGIVGIPGQRGRPGKKGDKGQIGPTGE---VGSRGPPGKI 1154
Cdd:pfam09606  262 NQMQQ-MPQGVGGGAGQGGPGQPMGPPGQQPGAMPnvmSIGDQNNYQQQQTRQQQQQQGGNHPAAHQqqmNQSVGQGGQV 340
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|...
gi 2462507488 1155 GKSGPKGARGTRGAVGHLGLMGPDGEPGIPGYRGhQGQPGPSG 1197
Cdd:pfam09606  341 VALGGLNHLETWNPGNFGGLGANPMQRGQPGMMS-SPSPVPGQ 382
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1131-1185 6.49e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.40  E-value: 6.49e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488 1131 GKKGDKGQIGPTGEVGSRGPPGKIGKSGPKGARGTRGAVGHLGLMGPDGEPGIPG 1185
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
594-648 7.52e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.01  E-value: 7.52e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488  594 GSPGYPGRQGLAGPEGNPGPKGAQGFIGSPGEAGQLGPEGERGIPGIRGKKGFKG 648
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1418-1475 7.63e-04

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 44.13  E-value: 7.63e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2462507488 1418 FPGPKGPEGDAGIVGISGPKGPIGHRGNTGPLGREGIIGPTGRTGPRGEKGFRGETGP 1475
Cdd:NF038329   115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGP 172
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
819-872 7.75e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.01  E-value: 7.75e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462507488  819 GTQGEEGPIGAFGELGPRGKPGQKGYAGEPGPEGLKGEVGDQGNIGKIGETGPV 872
Cdd:pfam01391    4 GPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
PHA03169 PHA03169
hypothetical protein; Provisional
925-1155 8.17e-04

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 43.81  E-value: 8.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  925 HVGARGPPGSQGPKGQR------------GSRGPDGLLGEQGIQGAKGEKGDQGKRGPHGLIGKTGNPGERGFQGKPGLQ 992
Cdd:PHA03169    26 HGGTREQAGRRRGTAARaakpappapttsGPQVRAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTP 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488  993 GLPGSTGDRGLPGEPGLRGLQGDVGPPGEMGMEGPPGTEGEsglqGEPGAKGDVGTAGSVGGTGEPGLRGEPGAPGEEGL 1072
Cdd:PHA03169   106 SPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGP----HEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPP 181
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1073 QGKDGLKGVPGGRGLPGEDGEKGEMGlPGIIGPLGRSGQTGLPGPEGIVGI-----PGQRGRPGkKGDKGQIGPTGEVGS 1147
Cdd:PHA03169   182 TSEPEPDSPGPPQSETPTSSPPPQSP-PDEPGEPQSPTPQQAPSPNTQQAVehedePTEPEREG-PPFPGHRSHSYTVVG 259

                   ....*...
gi 2462507488 1148 RGPPGKIG 1155
Cdd:PHA03169   260 WKPSTRPG 267
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1110-1164 8.98e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.01  E-value: 8.98e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488 1110 GQTGLPGPEGIVGIPGQRGRPGKKGDKGQIGPTGEVGSRGPPGKIGKSGPKGARG 1164
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
609-663 1.28e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.63  E-value: 1.28e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488  609 GNPGPKGAQGFIGSPGEAGQLGPEGERGIPGIRGKKGFKGRQGFPGDFGDRGPAG 663
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
PHA03169 PHA03169
hypothetical protein; Provisional
1005-1200 1.46e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 43.04  E-value: 1.46e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1005 GEPGLRGLQGDVGPpGEMGMEGPPGTEGESGLQGEPGAKGDVGTAGSVGGTGEPGLRGEPGAPGEEGLQGKDGLKGVPGG 1084
Cdd:PHA03169    82 GEKEERGQGGPSGS-GSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPN 160
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1085 RGLPGEDGEKGEMglpgiiGPLGRSGQTGLPGPEGivGIPGQRGRPGKKGDKGQigptgEVGSRGPPGKIGKSGPKGARG 1164
Cdd:PHA03169   161 QQPSSFLQPSHED------SPEEPEPPTSEPEPDS--PGPPQSETPTSSPPPQS-----PPDEPGEPQSPTPQQAPSPNT 227
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|
gi 2462507488 1165 TRGAVGHLGLMGPDGE-PGIPGYRGHQ---GQPGPSGLPG 1200
Cdd:PHA03169   228 QQAVEHEDEPTEPEREgPPFPGHRSHSytvVGWKPSTRPG 267
PHA03169 PHA03169
hypothetical protein; Provisional
1126-1294 1.97e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 42.65  E-value: 1.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1126 QRGRPGKKGDKGQIGPTGEvgsrgppGKIGKSGPKGARGTRGAVGHLGLMGPDGEPGIP-GYRGHQGQPGPSGLPGPkGE 1204
Cdd:PHA03169    77 EESRHGEKEERGQGGPSGS-------GSESVGSPTPSPSGSAEELASGLSPENTSGSSPeSPASHSPPPSPPSHPGP-HE 148
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1205 KGYPGEDSTVLGPPGPRGEPGPVGDQGERGEPGAEGYKGHVGVPGLRGATGQQGPPGEPGDqgeqglkgERGSEGNKGKK 1284
Cdd:PHA03169   149 PAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPD--------EPGEPQSPTPQ 220
                          170
                   ....*....|
gi 2462507488 1285 GAPGPSGKPG 1294
Cdd:PHA03169   221 QAPSPNTQQA 230
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
81-212 2.23e-03

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 40.48  E-value: 2.23e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488   81 GVIFKNDAYIETPFvkilPVNLGQPFTILTGLQShRVNNAFLFSIRNKNRLQ-LGVQLLPKKLVVHIR-GKQPAVFNY-- 156
Cdd:cd00110      1 GVSFSGSSYVRLPT----LPAPRTRLSISFSFRT-TSPNGLLLYAGSQNGGDfLALELEDGRLVLRYDlGSGSLVLSSkt 75
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2462507488  157 SVHDEQWHSFAITIRNQSVSMFVEcGKKYFSTETIPEVQTFDSNSVFTLGSMNNNS 212
Cdd:cd00110     76 PLNDGQWHSVSVERNGRSVTLSVD-GERVVESGSPGGSALLNLDGPLYLGGLPEDL 130
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
621-669 2.65e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.47  E-value: 2.65e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 2462507488  621 GSPGEAGQLGPEGERGIPGIRGKKGFKGRQGFPGDFGDRGPAGLDGSPG 669
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPG 49
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1387-1441 3.43e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.47  E-value: 3.43e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 2462507488 1387 GAQGDQGPCGDPGLKGQPGEYGVQGLTGFQGFPGPKGPEGDAGIVGISGPKGPIG 1441
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
dermokine cd21118
dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a ...
1143-1435 5.51e-03

dermokine; Dermokine, also known as epidermis-specific secreted protein SK30/SK89, is a skin-specific glycoprotein that may play a regulatory role in the crosstalk between barrier dysfunction and inflammation, and therefore play a role in inflammatory diseases such as psoriasis. Dermokine is one of the most highly expressed proteins in differentiating keratinocytes, found mainly in the spinous and granular layers of the epidermis, but also in the epithelia of the small intestine, macrophages of the lung, and endothelial cells of the lung. Mouse dermokine has been reported to be encoded by 22 exons, and its expression leads to alpha, beta, and gamma transcripts.


Pssm-ID: 411053 [Multi-domain]  Cd Length: 495  Bit Score: 41.14  E-value: 5.51e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1143 GEVGSRGppGKIGKSGPKGARGTRGAVGHLGLMGPDGEPGIpgyrghQGQPGPSGLPGPKGEKGYPGEDStvLGPPGPRG 1222
Cdd:cd21118    100 NEIGRQA--EDIIRHGVDAVHNSWQGSGGHGAYGSQGGPGV------QGHGIPGGTGGPWASGGNYGTNS--LGGSVGQG 169
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1223 EPGPVGDQGERGEpGAEGYKGHVGVPGLRGATGQQGPPGEpgdqgeqglkGERGSEGNKGKKGAPGPSGKPGIPGLQGll 1302
Cdd:cd21118    170 GNGGPLNYGTNSQ-GAVAQPGYGTVRGNNQNSGCTNPPPS----------GSHESFSNSGGSSSSGSSGSQGSHGSNG-- 236
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2462507488 1303 gpkgiQGYHGADGISGNPGKIGppgkqglpGIRGGPGRTGLAGAPGPPGVKGSSGLPGSPGIQGPKGEQGLPGQPGIQGK 1382
Cdd:cd21118    237 -----QGSSGSSGGQGNGGNNG--------SSSSNSGNSGGSNGGSSGNSGSGSGGSSSGGSNGWGGSSSSGGSGGSGGG 303
                          250       260       270       280       290
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2462507488 1383 RGHRgaqgdqgpCGDPGLK-GQPGEYGVQGLTGFQGFPGPKGPEGDAGIVGISG 1435
Cdd:cd21118    304 NKPE--------CNNPGNDvRMAGGGGSQGSKESSGSHGSNGGNGQAEAVGGLN 349
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
630-700 5.95e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.70  E-value: 5.95e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2462507488  630 GPEGERGIPGIRGKKGFKGRQGFPGDFGDRGPAGLDGSPGLvggtgppgfpglrgsVGPVGPIGPAGIPGP 700
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGP---------------PGPPGPPGAPGAPGP 56
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH