NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|38570073|ref|NP_115907|]
View 

collagen alpha-1(XXV) chain isoform 2 [Homo sapiens]

Protein Classification

collagen-like domain-containing protein( domain architecture ID 1903237)

collagen-like domain-containing protein such as collagens, which are extracellular structural proteins involved in formation of connective tissue structure

PubMed:  21421911|15837519

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
258-495 1.40e-28

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 118.85  E-value: 1.40e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  258 GQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGENGPKGDTGEKGDPGSSA-AGIKGEPGESGRPGQKGEPGLPGLPGL 336
Cdd:NF038329 117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGeAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  337 PGIKGEPGFIGPQGEPGLPGLPGTKGERGEAGPPGRGERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGP 416
Cdd:NF038329 197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGK 276
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 38570073  417 PGQKGDQGATEIIDYNGnlhealqrittltvtgppgPPGPQGLQGPKGEQGSPGIPGMDGEQGLKGSKGDMGDPGMTGE 495
Cdd:NF038329 277 DGERGPVGPAGKDGQNG-------------------KDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQ 336
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
458-614 1.59e-12

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 69.93  E-value: 1.59e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  458 GLQGPKGEQGSPGIPGMDGEQGLKGSKGDMGDPGMTGEKGGIGLPGLPGANGMKGEKGDSGMPGPQgpSIIGPPGPPGPH 537
Cdd:NF038329 129 GPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQ--GPRGETGPAGEQ 206
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 38570073  538 GPPGPMGPHGLPGPKGTDGPMGPHGPaGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFPGPRGEKGDLGEKGE 614
Cdd:NF038329 207 GPAGPAGPDGEAGPAGEDGPAGPAGD-GQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGP 282
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
216-272 1.08e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


:

Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.25  E-value: 1.08e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 38570073   216 GPRGMPGVPGEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQN 272
Cdd:pfam01391   1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
 
Name Accession Description Interval E-value
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
258-495 1.40e-28

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 118.85  E-value: 1.40e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  258 GQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGENGPKGDTGEKGDPGSSA-AGIKGEPGESGRPGQKGEPGLPGLPGL 336
Cdd:NF038329 117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGeAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  337 PGIKGEPGFIGPQGEPGLPGLPGTKGERGEAGPPGRGERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGP 416
Cdd:NF038329 197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGK 276
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 38570073  417 PGQKGDQGATEIIDYNGnlhealqrittltvtgppgPPGPQGLQGPKGEQGSPGIPGMDGEQGLKGSKGDMGDPGMTGE 495
Cdd:NF038329 277 DGERGPVGPAGKDGQNG-------------------KDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQ 336
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
192-421 9.08e-28

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 116.54  E-value: 9.08e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  192 GQAGPPGPPGPPGPRGPPGDTGKDGPRGMPGVPGEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQ 271
Cdd:NF038329 117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  272 NGIPGPKGEPGEQGEKGDAGENGPKGDTGEKGDPGSSAAGIKGEPGESGRPGQKGEpglpglpglpgiKGEPGFIGPQGE 351
Cdd:NF038329 197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGP------------DGPAGKDGPRGD 264
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 38570073  352 PGLPGLPGTKGERGEAGPPGR----GERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGPPGQKG 421
Cdd:NF038329 265 RGEAGPDGPDGKDGERGPVGPagkdGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
340-604 8.52e-23

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 101.52  E-value: 8.52e-23
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  340 KGEPGFIGPQGEPGLPGLPGTKGERGEAGPPG-RGERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGPPG 418
Cdd:NF038329 119 KGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGpPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRG 198
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  419 QKGDQGAteiidyngnlhealqrittltvtgppgpPGPQGLQGPKGEQGSPGIPGMDGEQGlKGSKGDMGDPGMTGEKGG 498
Cdd:NF038329 199 ETGPAGE----------------------------QGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGP 249
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  499 IGLPGLPGANGMKGEKGDSGMPGPQGPSIIGPPGPPGphgppgpmgphglpgpkGTDGPMGPHGPAGPKGERGEKGAMGE 578
Cdd:NF038329 250 QGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPA-----------------GKDGQNGKDGLPGKDGKDGQNGKDGL 312
                        250       260
                 ....*....|....*....|....*.
gi 38570073  579 PGPRGPYGLPGKDGEPGLDGFPGPRG 604
Cdd:NF038329 313 PGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
121-384 2.93e-15

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 78.41  E-value: 2.93e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  121 PAGPPGKRGKRGRRGESGPPGQPGPQGPPGPKGDKGEQGDQGPRmvfpkinhgflsadqqlikrrlikgdqgqagppgpp 200
Cdd:NF038329 145 PAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPA------------------------------------ 188
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  201 gppgprgppgdtGKDGPRGMPGVPGEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGmNGQKGEPGLPGAVGQNGIPGPKGE 280
Cdd:NF038329 189 ------------GEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGP 255
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  281 PGEQGEKGDAGENGPKGDTGEKGDpgssaagiKGEPGESGRPGQKGEpglpglpglpgiKGEPGFIGPQGEPGLPGLPGT 360
Cdd:NF038329 256 AGKDGPRGDRGEAGPDGPDGKDGE--------RGPVGPAGKDGQNGK------------DGLPGKDGKDGQNGKDGLPGK 315
                        250       260
                 ....*....|....*....|....*...
gi 38570073  361 KGERGEAGPPGR----GERGEPGAPGPK 384
Cdd:NF038329 316 DGKDGQPGKDGLpgkdGKDGQPGKPAPK 343
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
458-614 1.59e-12

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 69.93  E-value: 1.59e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  458 GLQGPKGEQGSPGIPGMDGEQGLKGSKGDMGDPGMTGEKGGIGLPGLPGANGMKGEKGDSGMPGPQgpSIIGPPGPPGPH 537
Cdd:NF038329 129 GPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQ--GPRGETGPAGEQ 206
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 38570073  538 GPPGPMGPHGLPGPKGTDGPMGPHGPaGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFPGPRGEKGDLGEKGE 614
Cdd:NF038329 207 GPAGPAGPDGEAGPAGEDGPAGPAGD-GQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGP 282
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
463-615 1.37e-10

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 63.77  E-value: 1.37e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  463 KGEQGSPGIPGMDGEQGLKGSKGDMGDPGMTGEKggiglpglpganGMKGEKGDSGMPGPQgpsiigppgppgphgppgp 542
Cdd:NF038329 119 KGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPP------------GPQGERGEKGPAGPQ------------------- 167
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 38570073  543 mgphglpgpkgtdGPMGPHGPAGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFPGPRGEKGDLGEKGEK 615
Cdd:NF038329 168 -------------GEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPA 227
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
556-611 2.28e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 45.18  E-value: 2.28e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 38570073   556 GPMGPHGPAGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFPGPRGEKGDLGE 611
Cdd:pfam01391   1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
373-425 6.91e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.64  E-value: 6.91e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 38570073   373 GERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGPPGQKGDQGA 425
Cdd:pfam01391   1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGA 53
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
216-272 1.08e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.25  E-value: 1.08e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 38570073   216 GPRGMPGVPGEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQN 272
Cdd:pfam01391   1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
PHA03169 PHA03169
hypothetical protein; Provisional
264-421 1.11e-04

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 44.96  E-value: 1.11e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  264 GLPGAVGQNGIPGPKGEPGEQGEKG-------DAGENGPKGDTGEKGDPGSSAAGIKGEPGESGRPGQKGEPGLPGLPGL 336
Cdd:PHA03169  90 GGPSGSGSESVGSPTPSPSGSAEELasglspeNTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQP 169
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  337 PGIKGEPGFIGPQGEPGLPGLPGTKGERGEAGPPGRGERGEPGAPgpkgkQGESGTRGPKGSKGDRGEKGDSGAQGPRGP 416
Cdd:PHA03169 170 SHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEP-----QSPTPQQAPSPNTQQAVEHEDEPTEPEREG 244

                 ....*
gi 38570073  417 PGQKG 421
Cdd:PHA03169 245 PPFPG 249
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
211-392 1.56e-03

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 41.55  E-value: 1.56e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073 211 DTGKDGPRGMPGVPGEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGI---------------P 275
Cdd:COG5164  68 NQGATGPAQNQGGTTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGgsttppgdggstppgP 147
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073 276 GPKGEPGEQGEKGDAGENGPKGDTGEKGDPGSSAAGIKGEPGESGRPGQKGEPGLPGLPGLPGIKGEPGFIGPQGEPGlp 355
Cdd:COG5164 148 GSTGPGGSTTPPGDGGSTTPPGPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRG-- 225
                       170       180       190
                ....*....|....*....|....*....|....*...
gi 38570073 356 GLPGTKGERGEAGPPG-RGERGEPGAPGPKGKQGESGT 392
Cdd:COG5164 226 GKTGPKDQRPKTNPIErRGPERPEAAALPAELTALEAE 263
 
Name Accession Description Interval E-value
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
258-495 1.40e-28

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 118.85  E-value: 1.40e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  258 GQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGENGPKGDTGEKGDPGSSA-AGIKGEPGESGRPGQKGEPGLPGLPGL 336
Cdd:NF038329 117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGeAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  337 PGIKGEPGFIGPQGEPGLPGLPGTKGERGEAGPPGRGERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGP 416
Cdd:NF038329 197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGK 276
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 38570073  417 PGQKGDQGATEIIDYNGnlhealqrittltvtgppgPPGPQGLQGPKGEQGSPGIPGMDGEQGLKGSKGDMGDPGMTGE 495
Cdd:NF038329 277 DGERGPVGPAGKDGQNG-------------------KDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQ 336
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
192-421 9.08e-28

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 116.54  E-value: 9.08e-28
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  192 GQAGPPGPPGPPGPRGPPGDTGKDGPRGMPGVPGEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQ 271
Cdd:NF038329 117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  272 NGIPGPKGEPGEQGEKGDAGENGPKGDTGEKGDPGSSAAGIKGEPGESGRPGQKGEpglpglpglpgiKGEPGFIGPQGE 351
Cdd:NF038329 197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDGQQGPDGDPGPTGEDGPQGP------------DGPAGKDGPRGD 264
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 38570073  352 PGLPGLPGTKGERGEAGPPGR----GERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGPPGQKG 421
Cdd:NF038329 265 RGEAGPDGPDGKDGERGPVGPagkdGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
340-604 8.52e-23

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 101.52  E-value: 8.52e-23
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  340 KGEPGFIGPQGEPGLPGLPGTKGERGEAGPPG-RGERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGPPG 418
Cdd:NF038329 119 KGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGpPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRG 198
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  419 QKGDQGAteiidyngnlhealqrittltvtgppgpPGPQGLQGPKGEQGSPGIPGMDGEQGlKGSKGDMGDPGMTGEKGG 498
Cdd:NF038329 199 ETGPAGE----------------------------QGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGP 249
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  499 IGLPGLPGANGMKGEKGDSGMPGPQGPSIIGPPGPPGphgppgpmgphglpgpkGTDGPMGPHGPAGPKGERGEKGAMGE 578
Cdd:NF038329 250 QGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPA-----------------GKDGQNGKDGLPGKDGKDGQNGKDGL 312
                        250       260
                 ....*....|....*....|....*.
gi 38570073  579 PGPRGPYGLPGKDGEPGLDGFPGPRG 604
Cdd:NF038329 313 PGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
121-384 2.93e-15

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 78.41  E-value: 2.93e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  121 PAGPPGKRGKRGRRGESGPPGQPGPQGPPGPKGDKGEQGDQGPRmvfpkinhgflsadqqlikrrlikgdqgqagppgpp 200
Cdd:NF038329 145 PAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPA------------------------------------ 188
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  201 gppgprgppgdtGKDGPRGMPGVPGEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGmNGQKGEPGLPGAVGQNGIPGPKGE 280
Cdd:NF038329 189 ------------GEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGP 255
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  281 PGEQGEKGDAGENGPKGDTGEKGDpgssaagiKGEPGESGRPGQKGEpglpglpglpgiKGEPGFIGPQGEPGLPGLPGT 360
Cdd:NF038329 256 AGKDGPRGDRGEAGPDGPDGKDGE--------RGPVGPAGKDGQNGK------------DGLPGKDGKDGQNGKDGLPGK 315
                        250       260
                 ....*....|....*....|....*...
gi 38570073  361 KGERGEAGPPGR----GERGEPGAPGPK 384
Cdd:NF038329 316 DGKDGQPGKDGLpgkdGKDGQPGKPAPK 343
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
458-614 1.59e-12

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 69.93  E-value: 1.59e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  458 GLQGPKGEQGSPGIPGMDGEQGLKGSKGDMGDPGMTGEKGGIGLPGLPGANGMKGEKGDSGMPGPQgpSIIGPPGPPGPH 537
Cdd:NF038329 129 GPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQ--GPRGETGPAGEQ 206
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 38570073  538 GPPGPMGPHGLPGPKGTDGPMGPHGPaGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFPGPRGEKGDLGEKGE 614
Cdd:NF038329 207 GPAGPAGPDGEAGPAGEDGPAGPAGD-GQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGP 282
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
463-615 1.37e-10

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 63.77  E-value: 1.37e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  463 KGEQGSPGIPGMDGEQGLKGSKGDMGDPGMTGEKggiglpglpganGMKGEKGDSGMPGPQgpsiigppgppgphgppgp 542
Cdd:NF038329 119 KGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPP------------GPQGERGEKGPAGPQ------------------- 167
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 38570073  543 mgphglpgpkgtdGPMGPHGPAGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFPGPRGEKGDLGEKGEK 615
Cdd:NF038329 168 -------------GEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPA 227
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
556-611 2.28e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 45.18  E-value: 2.28e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 38570073   556 GPMGPHGPAGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFPGPRGEKGDLGE 611
Cdd:pfam01391   1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
373-425 6.91e-06

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.64  E-value: 6.91e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 38570073   373 GERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGPPGQKGDQGA 425
Cdd:pfam01391   1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGA 53
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
216-272 1.08e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.25  E-value: 1.08e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 38570073   216 GPRGMPGVPGEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQN 272
Cdd:pfam01391   1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
553-607 2.65e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.10  E-value: 2.65e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 38570073   553 GTDGPMGPHGPAGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFPGPRGEKG 607
Cdd:pfam01391   1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
559-614 2.93e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.10  E-value: 2.93e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 38570073   559 GPHGPAGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFPGPRGEKGDLGEKGE 614
Cdd:pfam01391   1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
237-293 7.08e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.94  E-value: 7.08e-05
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 38570073   237 GPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGEN 293
Cdd:pfam01391   1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
240-295 1.10e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.17  E-value: 1.10e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 38570073   240 GPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGENGP 295
Cdd:pfam01391   1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
PHA03169 PHA03169
hypothetical protein; Provisional
264-421 1.11e-04

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 44.96  E-value: 1.11e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  264 GLPGAVGQNGIPGPKGEPGEQGEKG-------DAGENGPKGDTGEKGDPGSSAAGIKGEPGESGRPGQKGEPGLPGLPGL 336
Cdd:PHA03169  90 GGPSGSGSESVGSPTPSPSGSAEELasglspeNTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQP 169
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  337 PGIKGEPGFIGPQGEPGLPGLPGTKGERGEAGPPGRGERGEPGAPgpkgkQGESGTRGPKGSKGDRGEKGDSGAQGPRGP 416
Cdd:PHA03169 170 SHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEP-----QSPTPQQAPSPNTQQAVEHEDEPTEPEREG 244

                 ....*
gi 38570073  417 PGQKG 421
Cdd:PHA03169 245 PPFPG 249
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
252-306 1.34e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.17  E-value: 1.34e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 38570073   252 GIPGMNGQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGENGPKGDTGEKGDPG 306
Cdd:pfam01391   1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
365-419 1.79e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.78  E-value: 1.79e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 38570073   365 GEAGPPG-RGERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGPPGQ 419
Cdd:pfam01391   1 GPPGPPGpPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
553-605 2.01e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.40  E-value: 2.01e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 38570073   553 GTDGPMGPHGPAGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFPGPRGE 605
Cdd:pfam01391   4 GPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
368-421 3.01e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.01  E-value: 3.01e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 38570073   368 GPPG-RGERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGPPGQKG 421
Cdd:pfam01391   1 GPPGpPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
362-417 3.04e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.01  E-value: 3.04e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 38570073   362 GERGEAGPPG-RGERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPRGPP 417
Cdd:pfam01391   1 GPPGPPGPPGpPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
219-277 3.19e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.01  E-value: 3.19e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 38570073   219 GMPGVPGEPGKPGEQglmGPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGIPGP 277
Cdd:pfam01391   1 GPPGPPGPPGPPGPP---GPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
249-305 3.66e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.63  E-value: 3.66e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 38570073   249 GAPGIPGMNGQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGENGPKGDTGEKGDP 305
Cdd:pfam01391   1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
213-263 7.58e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.86  E-value: 7.58e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 38570073   213 GKDGPRGMPGVPGEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGMNGQKGEP 263
Cdd:pfam01391   7 GPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
556-603 8.12e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.86  E-value: 8.12e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 38570073   556 GPMGPHGPAGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFPGPR 603
Cdd:pfam01391  10 GPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
PHA03169 PHA03169
hypothetical protein; Provisional
213-395 9.63e-04

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 41.88  E-value: 9.63e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  213 GKDGPRGMPGVPGEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGE 292
Cdd:PHA03169  90 GGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQP 169
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  293 NGPKGDtgEKGDPGSSAAGIKGEPGESGRPGQKGEPGLPGLPGLpgikGEPGFIGPQGEPGLPGLPGTKGERGEAGPpgr 372
Cdd:PHA03169 170 SHEDSP--EEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEP----GEPQSPTPQQAPSPNTQQAVEHEDEPTEP--- 240
                        170       180
                 ....*....|....*....|...
gi 38570073  373 gERGEPGAPGPKGKQGESGTRGP 395
Cdd:PHA03169 241 -EREGPPFPGHRSHSYTVVGWKP 262
PHA03169 PHA03169
hypothetical protein; Provisional
276-438 1.05e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 41.88  E-value: 1.05e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  276 GPKGEPGEQGEKGDAGENGPKGDTGEKGDPGSSAAGIKGEPGESGRPGQKGEPGLPGLPGLPGIKGEPGFIGPQGEPGLP 355
Cdd:PHA03169  70 ESDTETAEESRHGEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEP 149
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  356 GLPGTKGERGEAGPPGR-GERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSGAQGPR--GPPGQKGDQGATEIIDYN 432
Cdd:PHA03169 150 APPESHNPSPNQQPSSFlQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDepGEPQSPTPQQAPSPNTQQ 229

                 ....*.
gi 38570073  433 GNLHEA 438
Cdd:PHA03169 230 AVEHED 235
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
255-307 1.40e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.09  E-value: 1.40e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 38570073   255 GMNGQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGENGPKGDTGEKGDPGS 307
Cdd:pfam01391   1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGA 53
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
562-615 1.41e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.09  E-value: 1.41e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....
gi 38570073   562 GPAGPKGERGEKGAMGEPGPRGPYGLPGKDGEPGLDGFPGPRGEKGDLGEKGEK 615
Cdd:pfam01391   1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAP 54
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
211-392 1.56e-03

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 41.55  E-value: 1.56e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073 211 DTGKDGPRGMPGVPGEPGKPGEQGLMGPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGI---------------P 275
Cdd:COG5164  68 NQGATGPAQNQGGTTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGgsttppgdggstppgP 147
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073 276 GPKGEPGEQGEKGDAGENGPKGDTGEKGDPGSSAAGIKGEPGESGRPGQKGEPGLPGLPGLPGIKGEPGFIGPQGEPGlp 355
Cdd:COG5164 148 GSTGPGGSTTPPGDGGSTTPPGPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRG-- 225
                       170       180       190
                ....*....|....*....|....*....|....*...
gi 38570073 356 GLPGTKGERGEAGPPG-RGERGEPGAPGPKGKQGESGT 392
Cdd:COG5164 226 GKTGPKDQRPKTNPIErRGPERPEAAALPAELTALEAE 263
PHA03169 PHA03169
hypothetical protein; Provisional
267-409 2.66e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 40.72  E-value: 2.66e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  267 GAVGQNGIPGPKGEPGEQGEKGDAGENGPKGDTGEKGDP------GSSAAGIKGEPGESGRPGQKGEPGLPGLPGLPGIK 340
Cdd:PHA03169  82 GEKEERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPentsgsSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQ 161
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  341 GEPGFIGPQGEPGL-PGLPGTKGERGEAGPPGRGERGEPGAPGPKGKQGESGTRGPKGSKGDRGEKGDSG 409
Cdd:PHA03169 162 QPSSFLQPSHEDSPeEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQSPTPQQAPSPNTQQAV 231
PRK12678 PRK12678
transcription termination factor Rho; Provisional
246-424 6.00e-03

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 39.89  E-value: 6.00e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  246 GSIGAPGIPGMngQKGEpgLPGAV--GQNGIPGPKGEPGEQGEKGDAGENGPKGDTGEKGDPGSSAAGIKGEPGESGRPG 323
Cdd:PRK12678  36 KQLGIKGTSGM--RKGE--LIAAIkeARGGGAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAA 111
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  324 QKGEPGLPGLPGLPGIKGEPGfigpQGEPGLPGLPGTKGERGEagppgRGERGEPGAPGPKGKQGESGTRGPKGSKGDRG 403
Cdd:PRK12678 112 AAAAAEAASAPEAAQARERRE----RGEAARRGAARKAGEGGE-----QPATEARADAAERTEEEERDERRRRGDREDRQ 182
                        170       180
                 ....*....|....*....|.
gi 38570073  404 EKGDSGAQGPRGPPGQKGDQG 424
Cdd:PRK12678 183 AEAERGERGRREERGRDGDDR 203
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
314-370 6.00e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 35.55  E-value: 6.00e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 38570073   314 GEPGESGRPGQKGEPGLPGLPGLPGIKGEPGFIGPQGEPGLPGLPGTKGERGEAGPP 370
Cdd:pfam01391   1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
237-422 6.95e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 39.58  E-value: 6.95e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  237 GPLGPPGQKGSIGAPGIPGMNGQKGEPGLPGAVGQNGIPGPKGEPGEQGEKGDAGENGPKGDTGEKGDPGSSAA--GIKG 314
Cdd:PRK07764 589 GPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDAsdGGDG 668
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 38570073  315 EPGESGRPGQKGEPGLPGLPGLPGIKGEPGFIGPQGEPGLPGLPGTKGERGEAGPPGRG-----ERGEPGAPGPKGKQGE 389
Cdd:PRK07764 669 WPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGasapsPAADDPVPLPPEPDDP 748
                        170       180       190
                 ....*....|....*....|....*....|...
gi 38570073  390 SGTRGPKGSKGDRGEKGDSGAQGPRGPPGQKGD 422
Cdd:PRK07764 749 PDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSE 781
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH