Conserved domains on  [gi|1032536545|gb|ANG88125|]

hypothetical protein SZ58_007105 [Mycobacterium tuberculosis variant bovis]

Protein Classification

PPE family protein (domain architecture ID 11475754)

proline-proline-glutamate (PPE) family protein containing pentapeptide repeats, similar to various Mycobacterium tuberculosis PPE virulence/immunogenicity factors

CATH:  1.10.287.850
SCOP:  4001235


List of domain hits

Name    Accession    Description                              Interval    E-value
PPE     COG5651      PPE-repeat protein [Function unknown]    3-401       1.87e-27

Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 113.06  E-value: 1.87e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545   3 NFSVLPPEINSGRMFFGAGSGPMLAAAAAWDGLAAELGLAAESFGLVTSGLAGGSGQAWQGAAAAAMVVaaaPYAGWLAA 82
Cdd:COG5651     2 DFMALPPEVNSARMYAGPGSGPLLAAAAAWDGLAAELASAAASLESVLSGLTGGSWQGPAAAAMAAAAA---PYVAWLTA 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545  83 AAARAGGAAVQAKAVAGAFEAARAAMVDPVVVAANRSAFVQLVLSNVFGQNAPAIAAAEATYEQMWAADVAAMVGYHGGA 162
Cdd:COG5651    79 AAAQAEQAAAQAEAAAAAYEAALAAMVPPAEVAANRAQLAVLVATNFFGQNTPAIAANEADYAEMWAQDAAAMYGYAAAS 158
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545 163 SAAAALAPWQQAVPGLLglldsaqSSAQAVTAQAVGStvpgPLQGINFGFGNIGSLNLGSGNTGDTNVGSGNIGnTNLGG 242
Cdd:COG5651   159 AAAVALTPFTQPPPTIT-------NPGGLLGAQNAGS----GNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIG-LNSGP 226
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545 243 GNIGSFNLGsGNQGDINLGIGNVGNLNLGSGNFGSQNLGSGNIGSTNVGSGNIGSTNVGSGNIGDTNFGNGNNGNFNFGS 322
Cdd:COG5651   227 GNTGFAGTG-AAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAA 305
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1032536545 323 GNTGSNNIGFGNTGSGNFGFGNTGNNNIGIGLTGDGQIGIGGLNSGSGNIGFGNSGTGNVGLFNSGTGNVGFGNSGTAN 401
Cdd:COG5651   306 TGLGLGAGGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAGAASGGGAA 384
PTZ00395 super family    cl33180    Sec24-related protein; Provisional    366-441    3.44e-05

The actual alignment was detected with superfamily member PTZ00395:

Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 46.61  E-value: 3.44e-05
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1032536545  366 NSGSGNIGFGNSGTGNVGLFNSGTGNVGFGNSGTANTGFGNAGNVNTGFWNGGSTNTGLANAGAGNTGFFDAGNYN 441
Cdd:PTZ00395   396 NAAQSNAAQSNAGFSNAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSN 471
 
PPE pfam00823
PPE family; This family is named after a PPE motif near the amino terminus of the domain. The ...
6-161 9.05e-26

PPE family; This family is named after a PPE motif near the amino terminus of the domain. The PPE family proteins all contain an amino-terminal region of about 180 amino acids. The carboxyl termini of this family are variable, and on the basis of this region the proteins fall into at least three groups. The MPTR subgroup has tandem copies of the motif NXGXGNXG. The second subgroup contains a conserved motif at about position 350. The third group is related only in the amino-terminal region. The function of these proteins is uncertain, but it has been suggested that they may be related to antigenic variation of Mycobacterium tuberculosis.


Pssm-ID: 425887  Cd Length: 158  Bit Score: 102.66  E-value: 9.05e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545   6 VLPPEINSGRMFFGAGSGPMLAAAAAWDGLAAELGLAAESFGLVTSGLAGGSGQAWQGAAAAAMVVaaaPYAGWLAAAAA 85
Cdd:pfam00823   1 ALPPEVNSARLYAGPGSGPLLAAAAAWDGLAAELASAAASYSSVLAGLTGGAWQGPSSAAMAAAAA---PYVAWLTAAAA 77
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1032536545  86 RAGGAAVQAKAVAGAFEAARAAMVDPVVVAANRSAFVQLVLSNVFGQNAPAIAAAEATYEQMWAADVAAMVGYHGG 161
Cdd:pfam00823  78 QAEQAAAQAEAAAAAYEAALAAMVPPAEIAANRAELAVLVATNFFGQNTPAIAATEADYAEMWAQDAAAMYGYAAA 153
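The pfam00823 description mentions that the MPTR subgroup carries tandem copies of the motif NXGXGNXG. As a minimal sketch (not part of the CDD output), the motif can be searched for with a regular expression in which each X is treated as "any residue". The function name `find_mptr_motif` and the `offset` parameter are hypothetical; the segment is query residues 279-307 as shown in the PHA02515 alignment on this page.

```python
import re

# NXGXGNXG with X = any residue; the lookahead lets overlapping hits be reported.
MPTR_MOTIF = re.compile(r"(?=(N.G.GN.G))")

def find_mptr_motif(segment, offset=1):
    """Return (position, matched text) pairs; position is 1-based in the full protein.

    `offset` is the protein coordinate of the first residue of `segment`.
    """
    return [(m.start() + offset, m.group(1)) for m in MPTR_MOTIF.finditer(segment)]

# Query residues 279-307 (gi 1032536545), copied from the alignment above.
segment = "NLGSGNIGSTNVGSGNIGSTNVGSGNIGD"
hits = find_mptr_motif(segment, offset=279)
for position, text in hits:
    print(position, text)
```

A plain pattern match like this only illustrates the motif; it is not a substitute for the position-specific scoring that produces the E-values listed here.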
PHA02515 PHA02515
hypothetical protein; Provisional
199-307 2.00e-06

hypothetical protein; Provisional


Pssm-ID: 107197 [Multi-domain]  Cd Length: 508  Bit Score: 50.16  E-value: 2.00e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545 199 STVPGPLQGINFGFGNIGSLNLGSGNTGDTNVGSGNIGNTNLGGGNIGSFNLGSGNQGDINLGIGNVGNLNLGSGNFGSQ 278
Cdd:PHA02515  281 NTVAGANANVNTVASNILDVGTVAGNIDDVQAVAGNAANINVVADNADNINATAANQANINAAVGNADNINAAVANQANI 360
                          90       100
                  ....*....|....*....|....*....
gi 1032536545 279 NLGSGNIGSTNVGSGNIGSTNVGSGNIGD 307
Cdd:PHA02515  361 NAVVGNANNINAVAANEGNVNTVVDNLAD 389
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
400-438 1.27e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. They are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 39.08  E-value: 1.27e-04
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1032536545 400 ANTGFGNAGNVNTGFWNGGSTNTGLANAGAGNTGFFDAG 438
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
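The pfam01469 description gives the repeat as approximately XNXGX. A rough sketch of how one might count tandem five-residue units of that form in an aligned query segment is shown below; the helper name `count_pentapeptide_units` and its `frame` parameter are hypothetical, and the segment is query residues 400-438 from the alignment above.

```python
import re

# One repeat unit: XNXGX, with X = any residue.
UNIT = re.compile(r".N.G.")

def count_pentapeptide_units(segment, frame=0):
    """Count consecutive 5-residue windows, starting at `frame`, that fit XNXGX.

    Trailing residues that do not fill a complete window are ignored.
    """
    units = [segment[i:i + 5] for i in range(frame, len(segment) - 4, 5)]
    return sum(1 for unit in units if UNIT.fullmatch(unit))

# Query residues 400-438 (gi 1032536545), copied from the alignment above.
segment = "ANTGFGNAGNVNTGFWNGGSTNTGLANAGAGNTGFFDAG"
n_units = count_pentapeptide_units(segment)
print(n_units)
```

The count depends on the reading frame chosen, so in practice one would likely try each of the five frames and compare.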
 
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
212-456 5.32e-09

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 58.62  E-value: 5.32e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545  212 FGNIGSLNLGSGNTGDTNVGSGNIGNTNLGGGNIGSFNLGSGNQGDINLGIGNVGNLNLGSGNFGSQNLGSGNIGSTNVG 291
Cdd:COG3210    815 TGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAG 894
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545  292 SGNIGSTNVGSGNIGDTNFGNGNNGNFNFGSGNTGSNNIGFGNTGSGNFGFGNTGNNNIGIGLTGDGQIGIGGLNSGSGN 371
Cdd:COG3210    895 TLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGS 974
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545  372 IGFGNSGTGNVGLFNSGTGNVGFGNSGTANTGFGNAGNVNTGFWNGGSTNTGLANAGAGNTGFFDAGNYNFGSLNAGNIN 451
Cdd:COG3210    975 SAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGIS 1054

                   ....*
gi 1032536545  452 SSFVG 456
Cdd:COG3210   1055 GGNAA 1059
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
183-451 6.55e-09

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 58.62  E-value: 6.55e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545  183 DSAQSSAQAVTAQAVGSTVPGPLQGINFGFGNIGSLNLGSGNTGDTNVGSGNIGNTNLGGGNIGSFNLGSGNQGDINLGI 262
Cdd:COG3210    463 NTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGS 542
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545  263 GNVGNLNLGSGNFGSQNLGSGNIGSTNVGSGNIGSTNVGSGNIGDTNFGNGNNGNFNFGSGNTGSNNIGFGNTGSGNFGF 342
Cdd:COG3210    543 GLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSG 622
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545  343 GNTGNNNIGIGLTGDGQIGIGGLNSGSGNIGFGNSGTGNVGLFNSGTGNVGFGNSGTANTGFGNAGNVNTGFWNGGSTNT 422
Cdd:COG3210    623 AGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNA 702
                          250       260
                   ....*....|....*....|....*....
gi 1032536545  423 GLANAGAGNTGFFDAGNYNFGSLNAGNIN 451
Cdd:COG3210    703 GNTLTISTGSITVTGQIGALANANGDTVT 731
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
174-458 2.80e-08

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 56.31  E-value: 2.80e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545  174 AVPGLLGLLDSAQSSAQAVTAQAVGSTVPGPLQGINFGFGNIGSLNLGSGNTGDTnvGSGNIGNTNLGGGNIGSFNLGSG 253
Cdd:COG3210    904 NAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAAS--ASDGAGDTGASSAAGSSAVGTSA 981
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545  254 NQGDINLGIGNVGNLNLGSGNFGSQNLGSGNIGSTNVGSGNIGSTNVGSGNIGDTNFGNGNNGNFNFGSGNTGSNNIGFG 333
Cdd:COG3210    982 NSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAAL 1061
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545  334 NTGSGNFGFGNTGNNNIGIGLTGDGQIGIGGLNSGSGNIGFGNSGTGNVGLFNSGTGNVGFGNSGTANTGFGNAGNVNTG 413
Cdd:COG3210   1062 TASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAG 1141
                          250       260       270       280
                   ....*....|....*....|....*....|....*....|....*
gi 1032536545  414 FWNGGSTNTGLANAGAGNTGFFDAGNYNFGSLNAGNINSSFVGRG 458
Cdd:COG3210   1142 AGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGAD 1186
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
177-444 1.15e-06

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 51.32  E-value: 1.15e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545 177 GLLGLLDSAQSSAQAVTAQAVGSTVPGPLQGINFGFGNIGSLNLGSGNTGDTNVGSGNIGNTNLGGGNIGSFNLGSGNQG 256
Cdd:COG4625   233 GGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGG 312
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545 257 DINLGIGNVGNLNLGSGNFGSQNLGSGNIGSTNVGSGNIGSTNVGSGNIGDTNFGNGNNGNFNFGSGNTGSNNIGFGNTG 336
Cdd:COG4625   313 GGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGG 392
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545 337 SGNFGFGNTGNNNIGIGLTGDGQIGIGGLNSGSGNIGFGNSGTGNVGLFNSGTGNVGFGNSGTANTGFGNAGNVNTGFWN 416
Cdd:COG4625   393 GGGAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSG 472
                         250       260
                  ....*....|....*....|....*...
gi 1032536545 417 GGSTNTGLANAGAGNTGFFDAGNYNFGS 444
Cdd:COG4625   473 AGTLTLTGNNTYTGTTTVNGGGNYTQSA 500
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
370-408 1.64e-05

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. They are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 41.77  E-value: 1.64e-05
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1032536545 370 GNIGFGNSGTGNVGLFNSGTGNVGFGNSGTANTGFGNAG 408
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
184-433 2.09e-05

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 47.07  E-value: 2.09e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545  184 SAQSSAQAVTAQAVGSTVPGPLQGINFGFGNIGSLNLGSGNTGDTNVGSGNIGNTNLGGGNIGSFNLGSGNQGDINLGIG 263
Cdd:COG3210    489 GIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLG 568
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545  264 NVGNLNLGSGNFGSQNLGSGNIGSTNVGSGNIGSTNVGSGNIGDTNFGNGNNGNFNFGSGNTGSNNIGFGNTGSGNFGFG 343
Cdd:COG3210    569 VLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGS 648
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545  344 NTGNNNIGIGLTGDGQIGIGGLNSGSGNIGFGNSGTGNVGLFNSGTGNVGFGNSGTANTGFGNAGNVNTGFWNGGSTNTG 423
Cdd:COG3210    649 GTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTGQIGALANANGD 728
                          250
                   ....*....|
gi 1032536545  424 LANAGAGNTG 433
Cdd:COG3210    729 TVTFGNLGTG 738
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
213-251 4.50e-05

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. They are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 40.62  E-value: 4.50e-05
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1032536545 213 GNIGSLNLGSGNTGDTNVGSGNIGNTNLGGGNIGSFNLG 251
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
263-301 9.31e-05

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. They are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 39.46  E-value: 9.31e-05
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1032536545 263 GNVGNLNLGSGNFGSQNLGSGNIGSTNVGSGNIGSTNVG 301
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
PHA02515 PHA02515
hypothetical protein; Provisional
183-307 1.17e-04

hypothetical protein; Provisional


Pssm-ID: 107197 [Multi-domain]  Cd Length: 508  Bit Score: 44.38  E-value: 1.17e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545 183 DSAQSSAQAVTAQAVGSTVPGPLQGINFGFGNIGSLNLGSGNTGDTNVGSGNIGNTNLGGGNIGSFNLGSGNQGDINLGI 262
Cdd:PHA02515  275 DNAANINTVAGANANVNTVASNILDVGTVAGNIDDVQAVAGNAANINVVADNADNINATAANQANINAAVGNADNINAAV 354
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*
gi 1032536545 263 GNVGNLNLGSGNFGSQNLGSGNIGSTNVGSGNIGSTNVGSGNIGD 307
Cdd:PHA02515  355 ANQANINAVVGNANNINAVAANEGNVNTVVDNLADVQTVAGIAAD 399
PTZ00395 PTZ00395
Sec24-related protein; Provisional
366-434 1.55e-04

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 44.30  E-value: 1.55e-04
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1032536545  366 NSGSGNIGFGNSGTGNVGLFNSGTGNVGFGNSGTANTGFGNAGNVNTGFWNGGSTNTGLANAGAGNTGF 434
Cdd:PTZ00395   386 NASYNCAAYSNAAQSNAAQSNAGFSNAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPY 454
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
380-418 1.87e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. They are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 38.69  E-value: 1.87e-04
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1032536545 380 GNVGLFNSGTGNVGFGNSGTANTGFGNAGNVNTGFWNGG 418
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
362-398 3.24e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. They are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 37.92  E-value: 3.24e-04
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 1032536545 362 IGGLNSGSGNIGFGNSGTGNVGLFNSGTGNVGFGNSG 398
Cdd:pfam01469   3 TGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
223-261 3.95e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. They are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 37.92  E-value: 3.95e-04
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1032536545 223 GNTGDTNVGSGNIGNTNLGGGNIGSFNLGSGNQGDINLG 261
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
390-428 4.15e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. They are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 37.92  E-value: 4.15e-04
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1032536545 390 GNVGFGNSGTANTGFGNAGNVNTGFWNGGSTNTGLANAG 428
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
253-291 8.76e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 36.77  E-value: 8.76e-04
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1032536545 253 GNQGDINLGIGNVGNLNLGSGNFGSQNLGSGNIGSTNVG 291
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
PTZ00395 PTZ00395
Sec24-related protein; Provisional
358-431 9.21e-04

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 41.98  E-value: 9.21e-04
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1032536545  358 GQIGIGGLNSGSGNIGFGNSGTGNVGLFNSGTGNVGFGNSGTANTGFGNAGNVNTGFWNGGSTNTGLANAGAGN 431
Cdd:PTZ00395   398 AQSNAAQSNAGFSNAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSN 471
PHA02515 PHA02515
hypothetical protein; Provisional
200-304 1.05e-03

hypothetical protein; Provisional


Pssm-ID: 107197 [Multi-domain]  Cd Length: 508  Bit Score: 41.30  E-value: 1.05e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545 200 TVPGPLQGINFGFGNIGSLNLGSGNTGDTNVGSGNIGNTNLGGGNIGSFNLGSGNQGDINLGIGNVGNLNLGSGNFGSQN 279
Cdd:PHA02515  262 SVAGDLENIDAVADNAANINTVAGANANVNTVASNILDVGTVAGNIDDVQAVAGNAANINVVADNADNINATAANQANIN 341
                          90       100
                  ....*....|....*....|....*
gi 1032536545 280 LGSGNIGSTNVGSGNIGSTNVGSGN 304
Cdd:PHA02515  342 AAVGNADNINAAVANQANINAVVGN 366
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
18-430 1.18e-03

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 41.30  E-value: 1.18e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545  18 FGAGSGPMLAAAAAWDGLAAELGLAAESFGLVTSGLAGGSGQAWQGAAAAAMVVAAAPYAGWLAAAAARAGGAAVQAKAV 97
Cdd:COG4625   103 GGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGGGGGGG 182
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545  98 AGAFEAARAAMVDPVVVAANRSAFVQLVLSNVFGQNAPAIAAAEATYEQMWAADVAAMVGYHGGASAAAALAPWQQAVPG 177
Cdd:COG4625   183 GGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGG 262
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545 178 LLGLLDSAQSSAQAVTAQAVGSTVPGPLQGINFGFGNIGSLNLGSGNTGDTNVGSGNIGNTNLGGGNIGSFNLGSGNQGD 257
Cdd:COG4625   263 GAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGG 342
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545 258 INLGIGNVGNLNLGSGNFGSQNLGSGNIGSTNVGSGNIGSTNVGSGNIGDTNFGNGNNGNFNFGSGNTGSNNIGFGNTGS 337
Cdd:COG4625   343 AGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGGAAGG 422
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545 338 GNFGFGNTGNNNIGIGLTGDGqIGIGGLNSGSGNIGFGNSGTGNVGLFNSGTGNVGFGNSGTANTGFGNAGNVNTGFWNG 417
Cdd:COG4625   423 GGGGTGAGGGGGGGGTGAGGG-GATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGGNYTQSAG 501
                         410
                  ....*....|...
gi 1032536545 418 GSTNTGLANAGAG 430
Cdd:COG4625   502 STLAVEVDAANSD 514
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
233-271 1.20e-03

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 36.38  E-value: 1.20e-03
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1032536545 233 GNIGNTNLGGGNIGSFNLGSGNQGDINLGIGNVGNLNLG 271
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
273-308 1.85e-03

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 36.00  E-value: 1.85e-03
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 1032536545 273 GNFGSQNLGSGNIGSTNVGSGNIGSTNVGSGNIGDT 308
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNG 36
COG2931 COG2931
Ca2+-binding protein, RTX toxin-related [Secondary metabolites biosynthesis, transport and ...
209-458 3.67e-03

Ca2+-binding protein, RTX toxin-related [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 442175 [Multi-domain]  Cd Length: 252  Bit Score: 39.12  E-value: 3.67e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545 209 NFGFGNIGSLNLGSGNTGDTNVGSGNIGNTNLGGGNIGSFNLGSGNQGDINLGIGNVGNLNLGSGNFGSQNLGSGNIGST 288
Cdd:COG2931     1 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGDGGGGGGGGGGGGGGGGLDGGGGGGGGDGGG 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545 289 NVGSGNIGSTNVGSGNIGDTNFGNGNNGNFNFGSGNTGSNNIGFGNTGSGNFGFGNTGNNNIGIGLTGD---GQIGIGGL 365
Cdd:COG2931    81 GGGGDDTDGGGDGGDGGGGGTGDDTGDGGGGNDTLTGGDGNDTLTGGAGDDTLYGGAGNDTLTGGAGNDtlyGGAGNDTL 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1032536545 366 NSGSGNIGFgNSGTGNVGLFNSGTGNVGFGNSGTANTGFGNAGNVNTGFWNGGSTNTGLANAGAGNTGFFDAGNYNFGSL 445
Cdd:COG2931   161 YGGAGNDTL-DGGAGNDTLTGGAGNDTLTGGAGNDTLDGGGGDDTLGGGGGDDGLDGGDGDDGLGGGGGDDTLGGGGGGD 239
                         250
                  ....*....|...
gi 1032536545 446 NAGNINSSFVGRG 458
Cdd:COG2931   240 GGGGGGGDDGLGG 252
PTZ00395 PTZ00395
Sec24-related protein; Provisional
386-454 3.96e-03

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 40.06  E-value: 3.96e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1032536545  386 NSGTGNVGFGNSGTANTGFGNAGNVNTGFWNGGSTNTGLANAGAGNTGFFDAGNYNFGSLNAGNINSSF 454
Cdd:PTZ00395   386 NASYNCAAYSNAAQSNAAQSNAGFSNAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPY 454
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
209-241 4.19e-03

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 34.84  E-value: 4.19e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1032536545 209 NFGFGNIGSLNLGSGNTGDTNVGSGNIGNTNLG 241
Cdd:pfam01469   7 NSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
228-266 4.28e-03

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 34.84  E-value: 4.28e-03
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1032536545 228 TNVGSGNIGNTNLGGGNIGSFNLGSGNQGDINLGIGNVG 266
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
238-276 9.99e-03

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805, however it is not clear if these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 33.68  E-value: 9.99e-03
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 1032536545 238 TNLGGGNIGSFNLGSGNQGDINLGIGNVGNLNLGSGNFG 276
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
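
The E-value threshold of 0.01 is why every hit listed on this page has an E-value below 1e-02. As a minimal sketch, the "Pssm-ID" summary lines could be parsed and filtered against such a cutoff as shown below. The function names, the dictionary keys, and the inclusive comparison are assumptions made for illustration and do not describe CD-Search internals.

```python
import re

# Matches the per-hit summary line as it appears on this page.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(\d+).*?Cd Length:\s*(\d+)"
    r"\s+Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)

def parse_summary(line: str):
    """Return the numeric fields of a summary line, or None if it does not match."""
    m = SUMMARY.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group(1)),
        "cd_length": int(m.group(2)),
        "bit_score": float(m.group(3)),
        "e_value": float(m.group(4)),  # float() accepts "3.24e-04"
    }

def passes_threshold(hit: dict, threshold: float = 0.01) -> bool:
    # Assumption: the cutoff is treated as inclusive here.
    return hit["e_value"] <= threshold

# Two summary lines copied from the hits above.
lines = [
    "Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 37.92  E-value: 3.24e-04",
    "Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 33.68  E-value: 9.99e-03",
]
hits = [parse_summary(line) for line in lines]
kept = [h for h in hits if h is not None and passes_threshold(h)]
print(len(kept))
```

A stricter cutoff can be tried by passing a smaller threshold; with threshold=1e-03, for example, only the first of the two lines is retained.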
