Conserved domains on  [gi|1509759454|gb|AYQ76966|]
PPE family protein [Mycobacterium avium subsp. paratuberculosis]

Protein Classification

PPE family protein (domain architecture ID 11475825)

proline-proline-glutamate (PPE) family protein containing pentapeptide repeats, similar to various Mycobacterium tuberculosis PPE virulence/immunogenicity factors

CATH:  1.10.287.850
SCOP:  4001235

List of domain hits

Name     Accession  Description                           Interval  E-value
PPE      pfam00823  PPE family                            6-155     8.53e-41

PPE family; This family is named after a PPE motif near the amino terminus of the domain. PPE family proteins all contain an amino-terminal region of about 180 amino acids. The carboxyl termini of these proteins are variable, and on the basis of this region the proteins fall into at least three groups. The MPTR subgroup has tandem copies of the motif NXGXGNXG. The second subgroup contains a conserved motif at about position 350. Members of the third group are related only in the amino-terminal region. The function of these proteins is uncertain, but it has been suggested that they may be related to antigenic variation of Mycobacterium tuberculosis.


Pssm-ID: 425887  Cd Length: 158  Bit Score: 141.56  E-value: 8.53e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1509759454   6 ALPPEVNSARMYAGPGPLSLTAAAVAWDALAAELHAAASCYRSVIAGLTTGRWLGPSSLAMASAFAPYMAWMAGAAGRAA 85
Cdd:pfam00823   1 ALPPEVNSARLYAGPGSGPLLAAAAAWDGLAAELASAAASYSSVLAGLTGGAWQGPSSAAMAAAAAPYVAWLTAAAAQAE 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1509759454  86 ETAGQARLAVEVFEAAFAMTVPPPAVAANRVQLATLIATNFFGQNAAAIAATEAEYAEMWAQDAAAMYEY 155
Cdd:pfam00823  81 QAAAQAEAAAAAYEAALAAMVPPAEIAANRAELAVLVATNFFGQNTPAIAATEADYAEMWAQDAAAMYGY 150
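
The PPE family description above mentions an MPTR subgroup with tandem copies of the motif NXGXGNXG. A minimal sketch of how such copies could be located in a protein sequence follows. It assumes that X means any amino acid in one-letter code; the function name `find_mptr_motifs` and the test sequence are hypothetical and are not taken from this record.

```python
import re

# NXGXGNXG, where X is assumed to stand for any amino acid (one-letter code).
# The lookahead lets overlapping or back-to-back copies all be reported.
MPTR_MOTIF = re.compile(r"(?=(N[A-Z]G[A-Z]GN[A-Z]G))")


def find_mptr_motifs(sequence):
    """Return (1-based start, matched text) for every NXGXGNXG occurrence."""
    return [
        (match.start() + 1, match.group(1))
        for match in MPTR_MOTIF.finditer(sequence.upper())
    ]


# Hypothetical test sequence, not part of the record shown on this page.
example = "AANAGTGNSGNIGLGNTGAA"
hits = find_mptr_motifs(example)
for start, motif in hits:
    print(start, motif)
```

Positions are reported 1-based to match the residue numbering used in the alignments on this page.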
PPE      COG5651    PPE-repeat protein [Function unknown] 3-374     1.29e-32


Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 126.16  E-value: 1.29e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1509759454   3 DFGALPPEVNSARMYAGPGPLSLTAAAVAWDALAAELHAAASCYRSVIAGLTTGRWLGPSSLAMASAFAPYMAWMAGAAG 82
Cdd:COG5651     2 DFMALPPEVNSARMYAGPGSGPLLAAAAAWDGLAAELASAAASLESVLSGLTGGSWQGPAAAAMAAAAAPYVAWLTAAAA 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1509759454  83 RAAETAGQARLAVEVFEAAFAMTVPPPAVAANRVQLATLIATNFFGQNAAAIAATEAEYAEMWAQDAAAMYEYAAGSAAA 162
Cdd:COG5651    82 QAEQAAAQAEAAAAAYEAALAAMVPPAEVAANRAQLAVLVATNFFGQNTPAIAANEADYAEMWAQDAAAMYGYAAASAAA 161
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1509759454 163 CAVTPFTPPPDTTDEAGvarqAAVVGQVTTQAELNHTVSKIPTTLQGLSSPMTAGLDTVTDTGSGAAGGAAAGAANSTTS 242
Cdd:COG5651   162 VALTPFTQPPPTITNPG----GLLGAQNAGSGNTSSNPGFANLGLTGLNQVGIGGLNSGSGPIGLNSGPGNTGFAGTGAA 237
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1509759454 243 SIMTGLASGIPGAIPSAFSAAATPLYGMSSILGIAQTAQGLAKAAGDGVAAAASGVASAASSGAGALGSLGSGVLGTLGK 322
Cdd:COG5651   238 AGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLAGGGAGAAAATGLGLGAGGAAG 317
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1509759454 323 AAALGPLAVPASWTSVIPAAHSAVSALPTINLAGANVPPSVMGSLPRLAAAS 374
Cdd:COG5651   318 AAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGG 369
 
PPE-SVP  pfam12484  PPE-SVP subfamily C-terminal region   317-392   1.36e-04

PPE-SVP subfamily C-terminal region; This domain family is found in bacteria and is approximately 90 amino acids in length. It occurs C-terminal to pfam00823. A conserved SVP sequence motif is diagnostic of this subfamily. A single completely conserved tryptophan (W) residue may be functionally important. The proteins in this family are PPE proteins implicated in immunostimulation and virulence.


Pssm-ID: 372141  Cd Length: 80  Bit Score: 39.99  E-value: 1.36e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1509759454 317 LGTLGKAAALGPLAVPASWTSVIPAAHSAVSALP-TINLAGANVPPSVMGSLPRLAAASGKTLG---PRYGVIPTVMTRP 392
Cdd:pfam12484   1 SAGLGRAASVGALSVPPSWAAAAPAASAAAAALPgTTVAAAAAAAGAALGGMPGGAAGAGGGGGgggRRGGFRPTVMPRP 80
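
The pfam12484 alignment above contains gaps in the query row ("-") and lowercase residues in the model row where the query has no counterpart. As a rough illustration of how such a pair of rows can be summarized, the sketch below counts gap columns and identical residues. It assumes identity is measured over non-gap columns only, which is a choice made for this example and may differ from the convention used by BLAST-style reports; the function name `alignment_identity` is hypothetical.

```python
def alignment_identity(query, subject):
    """Summarize two equal-length aligned rows (gaps written as '-')."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    gaps = 0
    identical = 0
    # Compare case-insensitively so lowercase (unaligned) residues are handled.
    for q, s in zip(query.upper(), subject.upper()):
        if q == "-" or s == "-":
            gaps += 1
        elif q == s:
            identical += 1
    aligned = len(query) - gaps
    return {
        "columns": len(query),
        "gaps": gaps,
        "aligned": aligned,
        "identical": identical,
        # Assumption: identity is taken over non-gap columns only.
        "identity": identical / aligned if aligned else 0.0,
    }


# Rows copied from the pfam12484 alignment on this page.
query_row = "LGTLGKAAALGPLAVPASWTSVIPAAHSAVSALP-TINLAGANVPPSVMGSLPRLAAASGKTLG---PRYGVIPTVMTRP"
model_row = "SAGLGRAASVGALSVPPSWAAAAPAASAAAAALPgTTVAAAAAAAGAALGGMPGGAAGAGGGGGgggRRGGFRPTVMPRP"
stats = alignment_identity(query_row, model_row)
print(stats)
```

The 80 columns minus 4 gap columns leave 76 aligned positions, which matches the query interval 317-392.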
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:
  Database: CDSEARCH/cdd
  Low complexity filter: no
  Composition Based Adjustment: yes
  E-value threshold: 0.01
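
The E-value threshold of 0.01 determines which hits appear in the list. The sketch below shows one way to apply such a cutoff to the three hits reported on this page and to compute how much of each domain model the alignment covers. The `DomainHit` class, the `filter_hits` function, and the inclusive comparison (`<=`) are assumptions made for illustration; they do not describe how CD-Search applies the threshold internally. The intervals, model lengths, and E-values are copied from the hit list and alignments above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    accession: str
    query_from: int    # first aligned residue on the query
    query_to: int      # last aligned residue on the query
    model_from: int    # first aligned position on the domain model
    model_to: int      # last aligned position on the domain model
    model_length: int  # "Cd Length" in the report
    e_value: float

    def model_coverage(self):
        """Fraction of the domain model spanned by the alignment."""
        return (self.model_to - self.model_from + 1) / self.model_length


# Values transcribed from the report on this page.
HITS = [
    DomainHit("pfam00823", 6, 155, 1, 150, 158, 8.53e-41),
    DomainHit("COG5651", 3, 374, 2, 369, 385, 1.29e-32),
    DomainHit("pfam12484", 317, 392, 1, 80, 80, 1.36e-04),
]


def filter_hits(hits, e_value_threshold=0.01):
    """Keep hits at or below the threshold (inclusive cutoff is an assumption)."""
    return [hit for hit in hits if hit.e_value <= e_value_threshold]


for hit in filter_hits(HITS):
    print(f"{hit.accession}: E={hit.e_value:.2e}, coverage={hit.model_coverage():.1%}")
```

With the preset threshold all three hits are retained. A stricter hypothetical cutoff such as 1e-5 would drop the pfam12484 hit, whose E-value is 1.36e-04.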
