Conserved domains on  [gi|57116882|ref|YP_177817|]

PPE family protein PPE21 [Mycobacterium tuberculosis H37Rv]

Protein Classification

PPE family protein (domain architecture ID 12841089)

proline-proline-glutamate (PPE) family protein similar to various Mycobacterium tuberculosis PPE virulence/immunogenicity factors


List of domain hits

Name Accession Description Interval E-value
PPE pfam00823
PPE family; This family is named after a PPE motif near the amino terminus of the domain. The ...
5-162 5.44e-69

PPE family; This family is named after a PPE motif near the amino terminus of the domain. The PPE family proteins all contain an amino-terminal region of about 180 amino acids. The carboxyl termini of this family are variable, and on the basis of this region the proteins fall into at least three groups. The MPTR subgroup has tandem copies of a motif NXGXGNXG. The second subgroup contains a conserved motif at about position 350. The members of the third group are related only in the amino-terminal region. The function of these proteins is uncertain, but it has been suggested that they may be related to antigenic variation of Mycobacterium tuberculosis.



Pssm-ID: 425887  Cd Length: 158  Bit Score: 222.07  E-value: 5.44e-69
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882     5 VLPPEINSALMFAGAGPGPMLAAASAWTGLAGDLGSAAASFSAVTSQLATGSWQGPASAAMTGVAASYARWLTTAAAQAE 84
Cdd:pfam00823   1 ALPPEVNSARLYAGPGSGPLLAAAAAWDGLAAELASAAASYSSVLAGLTGGAWQGPSSAAMAAAAAPYVAWLTAAAAQAE 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 57116882    85 QAAGQAQAAVSAFEAALAATVHPGAVSANRGRLRSLVASNLLGQNAPAIAAVEAVYEQMWAADVAAMLGYHGEASAVA 162
Cdd:pfam00823  81 QAAAQAEAAAAAYEAALAAMVPPAEIAANRAELAVLVATNFFGQNTPAIAATEADYAEMWAQDAAAMYGYAAASAAAA 158
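The PPE family description above notes that the MPTR subgroup carries tandem copies of the motif NXGXGNXG. As a minimal sketch, and not part of the CDD output itself, the motif can be written as a regular expression and scanned against part of the query. The fragment below is residues 210-289 of gi 57116882 as printed in the alignment rows on this page; the function name `find_mptr_motifs` and the reading of X as "any residue" are assumptions made for illustration.

```python
import re

# Residues 210-289 of gi 57116882, copied from the alignment rows on this page.
FRAGMENT = (
    "GNLGSFNPGSANTGSVNLGNANIGDLNLGSGNIGSYNLGG"
    "GNTGDLNPDSGNTGTLNWGSGNIGSYNLGGGNLGSYNLGS"
)
FRAGMENT_START = 210  # protein coordinate of the first residue in FRAGMENT

# NXGXGNXG with X read as any residue (an assumption). The lookahead lets
# overlapping copies be reported as separate hits.
MPTR_MOTIF = re.compile(r"(?=(N.G.GN.G))")


def find_mptr_motifs(seq, offset=1):
    """Return (position, matched 8-mer) pairs, positions counted from `offset`."""
    return [(m.start() + offset, m.group(1)) for m in MPTR_MOTIF.finditer(seq)]


hits = find_mptr_motifs(FRAGMENT, FRAGMENT_START)
for position, motif in hits:
    print(position, motif)
```

On this fragment the scan reports four copies, starting at residues 236, 246, 266 and 276. A pattern match of this kind only flags candidate repeats; it says nothing about statistical significance, which is what the E-values in the hit list address.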
PPE COG5651
PPE-repeat protein [Function unknown];
1-406 1.84e-62

PPE-repeat protein [Function unknown];



Pssm-ID: 444372 [Multi-domain]  Cd Length: 385  Bit Score: 212.83  E-value: 1.84e-62
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   1 MNFSVLPPEINSALMFAGAGPGPMLAAASAWTGLAGDLGSAAASFSAVTSQLATGSWQGPASAAMTGVAASYARWLTTAA 80
Cdd:COG5651   1 MDFMALPPEVNSARMYAGPGSGPLLAAAAAWDGLAAELASAAASLESVLSGLTGGSWQGPAAAAMAAAAAPYVAWLTAAA 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  81 AQAEQAAGQAQAAVSAFEAALAATVHPGAVSANRGRLRSLVASNLLGQNAPAIAAVEAVYEQMWAADVAAMLGYHGeASA 160
Cdd:COG5651  81 AQAEQAAAQAEAAAAAYEAALAAMVPPAEVAANRAQLAVLVATNFFGQNTPAIAANEADYAEMWAQDAAAMYGYAA-ASA 159
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 161 VALSLTPFTPSPSAAATPGGaviiagfpfldlgnvtiggfNLASGNLGLGNlGSFNPGSANTGSVNLGNANIGDLNLGSG 240
Cdd:COG5651 160 AAVALTPFTQPPPTITNPGG--------------------LLGAQNAGSGN-TSSNPGFANLGLTGLNQVGIGGLNSGSG 218
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 241 NIGSyNLGGGNTGDLNPdSGNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFGFG 320
Cdd:COG5651 219 PIGL-NSGPGNTGFAGT-GAAAGAAAAAAAAAAAAGAGASAALASLAATLLNASSLGLAATAASSAATNLGLAGSPLGLA 296
                       330       340       350       360       370       380       390       400
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 321 NTGNVNFGNGNTGDTNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTGDNQIGFGALNSGSGNLGFGNSGNGNIG 400
Cdd:COG5651 297 GGGAGAAAATGLGLGAGGAAGAAGATGAGAALGAGAAAAAAGAAAGAGAAAAAAAGGAGGGGGGALGAGGGGGSAGAAAG 376

                ....*.
gi 57116882 401 FFNSGN 406
Cdd:COG5651 377 AASGGG 382
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
210-675 1.10e-15

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];



Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 81.35  E-value: 1.10e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  210 GNLGSFNPGSANTGSVNLGNANIGDLNLGSGNIGSYNLGGGNTGDLNPDSGNTGTLNWGSGNIGSYNLGGGNLGSYNLGS 289
Cdd:COG3210  816 GSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGT 895
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  290 GNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNNIGF 369
Cdd:COG3210  896 LTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSS 975
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  370 GLTGDNQIGFGALNSGSGNLGFGNSGNGNIGFFNSGNNNIGMGNSGNGVGALSVEFGSSAERSSGFGNSGELSTGIGNSG 449
Cdd:COG3210  976 AVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISG 1055
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  450 QLSTGWFNSATTSTGWFNSGTTNTGWFNSGTTNTGIGNSGGNLVTGSMGLFNSGHTNTGSFNAGSMNTGDFNSGNVNTGY 529
Cdd:COG3210 1056 GNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTA 1135
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  530 FNSGNINTGFFNSGDLNTGLFNSVNQPVQNSGWLHTGTNNSGYANAGTFNSGFDNNARDEHAEFVTGNSGLANVGNYNAG 609
Cdd:COG3210 1136 STEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTTN 1215
                        410       420       430       440       450       460
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 57116882  610 IINVGDHLSGFRNSVPTITGTANISGFVNAGTSISGFFNFGSLMSGFANFDDEVSGYLNGDSRASG 675
Cdd:COG3210 1216 VTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVAGNAGATA 1281
 
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
259-499 3.82e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 53.47  E-value: 3.82e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   259 SGNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVnfgngntgdtnfg 338
Cdd:NF033849  327 QSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGF------------- 393
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   339 sgnlgSGNIGFGNKGSHNIGFGNSGNNNIGfglTGDNQIGFGALNSGSGNLGFGNSGNGNIGFFNSGNNNIGMGNSGNGV 418
Cdd:NF033849  394 -----SGGIAGGGVTSEGLGASQGGSEGWG---SGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   419 GALSVEFGSSAERSSGFGNSGELSTGIGNSGQLSTGWFNSATTSTGwfNSGTTNTGWFNSGTTNTGIGnsggnlvtGSMG 498
Cdd:NF033849  466 EGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTG--RSESQGTSLGTSGGRTSGAG--------GSMG 535

                  .
gi 57116882   499 L 499
Cdd:NF033849  536 L 536
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
260-523 9.92e-07

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 52.31  E-value: 9.92e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   260 GNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGS 339
Cdd:NF033849  278 GHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE 357
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   340 GNLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTGdnqiGFGAlnsgsGNLGFGNSGNGnIGFFNSGNNNIGMGNSGNGVG 419
Cdd:NF033849  358 SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSG----GFSG-----GIAGGGVTSEG-LGASQGGSEGWGSGDSVQSVS 427
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   420 AlSVEFGSSAERSSGFGNSGELSTGIGNSGQLSTG--WFNSATTSTGWfnSGTTNTGWFNSGTTNTGIGNSGGNLVTGSM 497
Cdd:NF033849  428 Q-SYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGtsWSEGTGTSQGQ--SVGTSESWSTSQSETDSVGDSTGTSESVSQ 504
                         250       260
                  ....*....|....*....|....*.
gi 57116882   498 GlFNSGHTNTGSFNAGSMNTGDFNSG 523
Cdd:NF033849  505 G-DGRSTGRSESQGTSLGTSGGRTSG 529
PHA02515 PHA02515
hypothetical protein; Provisional
148-366 1.46e-06

hypothetical protein; Provisional


Pssm-ID: 107197 [Multi-domain]  Cd Length: 508  Bit Score: 51.32  E-value: 1.46e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  148 VAAMLGYHGEASAVAlslTPFTPSPSAAATPGGAVIIAGfpflDLGNVTIGGFNLASGNLGLGNLGSFNPGSANT----- 222
Cdd:PHA02515 153 VSAVYGHLANIDAVA---TNEADIDTVAASVGAVDTVAG----DLGGTWAAGVSYDFGSIAVPPIGNTSPPGGNIvivan 225
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  223 --GSVNLGNANIGDLNLGS----------GNIGSYNLGGGNTGDLNPDSGNTGTLNWGSGNIGSYNLGGGNLGSYNLGSG 290
Cdd:PHA02515 226 siGNVDTVAENIGDVSTVSthlssmlavaNDIDSVVSVAGDLENIDAVADNAANINTVAGANANVNTVASNILDVGTVAG 305
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 57116882  291 NTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNN 366
Cdd:PHA02515 306 NIDDVQAVAGNAANINVVADNADNINATAANQANINAAVGNADNINAAVANQANINAVVGNANNINAVAANEGNVN 381
PTZ00395 PTZ00395
Sec24-related protein; Provisional
425-575 7.78e-06

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 49.30  E-value: 7.78e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   425 FGSSAERSSGFGNSGELSTGIGNsgQLSTGWFNSATTST-GWFNSGTTNTGWFNSGTTNtgignsggnlvtgsmglfNSG 503
Cdd:PTZ00395  339 YGGFHDGSPNAASAGAPFNGLGN--QADGGHINQVHPDArGAWAGGPHSNASYNCAAYS------------------NAA 398
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 57116882   504 HTNTGSFNAGSMNTGDFNSGNVNTGYFNSGNINTGFFNSGDLNTGLFN--SVNQPVQNSGWLHTGTNNSGYANA 575
Cdd:PTZ00395  399 QSNAAQSNAGFSNAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNppNSNPPYSNLPYSNTPYSNAPLSNA 472
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
505-543 1.30e-05

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. They are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 42.54  E-value: 1.30e-05
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 57116882   505 TNTGSFNAGSMNTGDFNSGNVNTGYFNSGNINTGFFNSG 543
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
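The pfam01469 description approximates the repeat as XNXGX. A rough way to see that periodicity in the hit above is to cut the aligned query interval (residues 505-543) into consecutive five-residue windows and test each one against the pattern. This is only an illustrative sketch: the function name `count_pentapeptide_repeats` is hypothetical, the frame is assumed to start at the first residue of the interval, and X is read as any residue.

```python
import re

# Query residues 505-543 of gi 57116882, copied from the alignment row above.
HIT_505_543 = "TNTGSFNAGSMNTGDFNSGNVNTGYFNSGNINTGFFNSG"

# XNXGX with X read as any residue (an assumption).
PENTAPEPTIDE = re.compile(r".N.G.")


def count_pentapeptide_repeats(seq):
    """Count complete, non-overlapping 5-residue windows that match XNXGX.

    Windows are taken in frame from the first residue; a trailing window
    shorter than five residues is ignored.
    """
    windows = [seq[i:i + 5] for i in range(0, len(seq) - 4, 5)]
    return sum(1 for window in windows if PENTAPEPTIDE.fullmatch(window))


n_repeats = count_pentapeptide_repeats(HIT_505_543)
print(n_repeats)
```

For this 39-residue interval the function counts seven complete windows, all of which fit the pattern; the remaining four residues (FNSG) form an incomplete eighth copy.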
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
260-298 2.83e-05

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. They are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 41.39  E-value: 2.83e-05
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 57116882   260 GNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFG 298
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
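Each alignment block on this page pairs a query row (`gi ...`) with a domain-model row (`Cdd:...`), each carrying a start coordinate, the aligned residues, and an end coordinate. The sketch below shows one possible way to read such a pair and count identical columns, using the two rows of the hit above. The helper names `parse_row` and `identity` are hypothetical, and the choices to skip gap columns (`-`) and to compare residues case-insensitively are assumptions of this example, not something the page specifies.

```python
# The two rows of the pfam01469 hit covering query residues 260-298.
QUERY_ROW = "gi 57116882   260 GNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFG 298"
MODEL_ROW = "Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39"


def parse_row(row):
    """Return (start, aligned_sequence, end) from one alignment row.

    The last three whitespace-separated fields are taken as start, sequence
    and end; everything before them is treated as the row label.
    """
    _label, start, seq, end = row.rsplit(None, 3)
    return int(start), seq, int(end)


def identity(query_seq, model_seq):
    """Return (identical_columns, compared_columns), skipping gap columns."""
    pairs = [(q, m) for q, m in zip(query_seq, model_seq) if q != "-" and m != "-"]
    same = sum(1 for q, m in pairs if q.upper() == m.upper())
    return same, len(pairs)


q_start, q_seq, q_end = parse_row(QUERY_ROW)
m_start, m_seq, m_end = parse_row(MODEL_ROW)
same, compared = identity(q_seq, m_seq)
print(f"query {q_start}-{q_end} vs model {m_start}-{m_end}: {same}/{compared} identical")
```

For this hit, 27 of the 39 aligned columns are identical. Percent identity is a coarser measure than the bit score and E-value reported by CDD, so it is best read as a quick sanity check on an alignment rather than as a replacement for those statistics.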
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
301-532 2.57e-04

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 44.61  E-value: 2.57e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   301 NTGNLNVGGGNTGNSNFG----------------FGNTGNVNFGNGNTGDTNFG--SGNLGSGNIGFGNKGSHNIGFGNS 362
Cdd:NF033849  231 YAANLGQSAGTGYGESVGhstsqgqshsvgtsesHSVGTSQSQSHTTGHGSTRGwsHTQSTSESESTGQSSSVGTSESQS 310
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   363 GNNNIGFGLTGDNQIGFGALNSGSGNLGFGNS----GNGNIGFFNSGNNNIGMGNSGNGVGALSVEFGSSAERSSGFGNS 438
Cdd:NF033849  311 HGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSShsdgTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVS 390
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   439 GELSTGIGNSGQLSTGWFNSATTSTGWfNSGTTNTGWFNSGTTNTGIGNSGGNLVTGSMGLfNSGHTNTGSFNAG-SMNT 517
Cdd:NF033849  391 GGFSGGIAGGGVTSEGLGASQGGSEGW-GSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHST-SSGQADSVSQGTSwSEGT 468
                         250
                  ....*....|....*
gi 57116882   518 GDFNSGNVNTGYFNS 532
Cdd:NF033849  469 GTSQGQSVGTSESWS 483
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
425-580 3.21e-04

serine-rich protein; This serine-rich protein belongs to a family of large proteins (over 1000 amino acids) with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 44.23  E-value: 3.21e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   425 FGSSAERSSGFGNSGELSTGIGNSGQLSTGWFNSATTSTGwfNSGTTNTGWFNSGTTNTGIGNSGGNLVTGSMGLFNSGH 504
Cdd:NF033849  283 RGWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDS--SSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSS 360
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 57116882   505 TNTGSFNAGSMNTGdfNSGNVNTGYFNSGNINTGFfnSGDLNTGLFNSVNQPV---QNSGWLHTGTNNSGYANAGTFNS 580
Cdd:NF033849  361 ESTGTSVGHSTSSS--VSSSESSSRSSSSGVSGGF--SGGIAGGGVTSEGLGAsqgGSEGWGSGDSVQSVSQSYGSSSS 435
 
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
32-645 8.85e-15

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 78.65  E-value: 8.85e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   32 TGLAGDLGSAAASFSAVTSQLATGSWQGPASAAMTGVAASYARWLTTAAAQAEQAAGQAQAAVSAFEAALAATVHPGAVS 111
Cdd:COG3210  108 SNSTGNGTLTTTAASATTGNNTGGTTTSSTNTVTTLGGTTTGNTVLSTSGAGNNTNTNNSSSGTNIGNSIPTTGGSLNVV 187
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  112 ANRGRLRSLVASNLLGQNAPAIAAVEAVYEQMWAADVAAMLGYHGEASAVALSLTPFTPSPSAAATPGGAVIIAGfpflD 191
Cdd:COG3210  188 AANPTGVTGVGGALINATAGVLANAGGGTAGGVASANSTLTGGVVAAGTGAGVISTGGTDISSLSVAAGAGTGGA----G 263
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  192 LGNVTIGGFNLASGNLGLGNLGSFNPGSANTGSVNLGNANIGDLNLGSGNIGSYNLGGGNTGDLNPDSGNTGTLNWGSGN 271
Cdd:COG3210  264 GTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGL 343
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  272 IGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGSGNLGSGNIGFGN 351
Cdd:COG3210  344 VSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANA 423
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  352 KGSHNIGFGNSGNNNIGFGLTGDNQIGFGALNSGSGNLGFGNSGNGNIGFFNSGNNNIGMGNSGNGVGALSVEFGSSAER 431
Cdd:COG3210  424 GGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNA 503
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  432 SSGFGNSGELSTGIGNSGQLSTGWFNSATTSTGWFNSGTTNTGWFNSGTTNTGIGNSGGNLVTGSMGLFNSGHTNTGSFN 511
Cdd:COG3210  504 GGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAG 583
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  512 AGSMNTGDFNSGNVNTGYFNSGNINTGFFNSGDLNTGLFNSVNQPVQNSGWLHTGTNNSGYANAGTFNSGFDNNARDEHA 591
Cdd:COG3210  584 NSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTG 663
                        570       580       590       600       610
                 ....*....|....*....|....*....|....*....|....*....|....
gi 57116882  592 EFVTGNSGLANVGNYNAGIINVGDHLSGFRNSVPTITGTANISGFVNAGTSISG 645
Cdd:COG3210  664 VNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTG 717
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
239-675 3.17e-14

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 76.73  E-value: 3.17e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  239 SGNIGSYNLGGGNTGDLNPDSGNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFG 318
Cdd:COG3210  815 TGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAG 894
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  319 FGNTGNVNFGNGNTGDTNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTGDNQIGFGALNSGSGNLGFGNSGNGN 398
Cdd:COG3210  895 TLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGS 974
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  399 IGFFNSGNNNIGMGNSGNGVGALSVEFGSSAERSSGFGNSGELSTGIGNSGQLSTGWFNSATTSTGWFNSGTTNTGWFNS 478
Cdd:COG3210  975 SAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGIS 1054
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  479 GTTNTGIGNSGGNLVTGSMGLFNSGHTNTGSFNAGSMNTGDFNSGNVNTGYFNSGNINTGFFNSGDLNTGLFNSVNQPVQ 558
Cdd:COG3210 1055 GGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTST 1134
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  559 NSGWLHTGTNNSGYANAGTFNSGFDNNARDEHAEFVTGNSGLANVGNYNAGIINVGDHLSGFRNSVPTITGTANISGFVN 638
Cdd:COG3210 1135 ASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTT 1214
                        410       420       430
                 ....*....|....*....|....*....|....*..
gi 57116882  639 AGTSISGFFNFGSLMSGFANFDDEVSGYLNGDSRASG 675
Cdd:COG3210 1215 NVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSAS 1251
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
173-635 7.65e-14

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 75.20  E-value: 7.65e-14
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 173 SAAATPGGAVIIAGFPFLDLGNVTIGGFNLASGNLGLGNLGSFNPGSANTGSVNLGNANIGDLNLGSGNIGSYNLGGGNT 252
Cdd:COG4625  18 GGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGG 97
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 253 GDLNPDSGNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNT 332
Cdd:COG4625  98 GGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGG 177
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 333 GDTNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTGDNQIGFGALNSGSGNLGFGNSGNGNIGFFNSGNNNIGMG 412
Cdd:COG4625 178 GGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGG 257
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 413 NSGNGVGALSVEFGSSAERSSGFGNSGELSTGIGNSGQLSTGWFNSATTSTGWFNSGTTNTGWFNSGTTNTGIGNSGGNL 492
Cdd:COG4625 258 NGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGG 337
                       330       340       350       360       370       380       390       400
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 493 VTGSMGLFNSGHTNTGSFNAGSMNTGDFNSGNVNTGYFNSGNINTGFFNSGDLNTGLFNSVNQPVQNSGWLHTGTNNSGY 572
Cdd:COG4625 338 GGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGG 417
                       410       420       430       440       450       460
                ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 57116882 573 ANAGTFNSGFDNNARDEHAEFVTGNSGLANVGNYNAGIINVGDHLSGFRNSVPTITGTANISG 635
Cdd:COG4625 418 GAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTG 480
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
219-677 3.32e-13

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 73.26  E-value: 3.32e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  219 SANTGSVNLGNANIGDLNLGSGNIGSYNLGGGNTGDLNPDSGNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFG 298
Cdd:COG3210  815 TGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAG 894
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  299 GGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGSGNLGSGNIGFGNKGSHNIGFGNSGNN-NIGFGLTGDNQI 377
Cdd:COG3210  895 TLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDgAGDTGASSAAGS 974
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  378 GFGALNSGSGNLGFGNSGNGNIGFFNSGNNNIGMGNSGNGVGALSVEFGSSAERSSGFGNSGELSTGIGNSGQLSTGWFN 457
Cdd:COG3210  975 SAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGIS 1054
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  458 SATTSTGWFNSGTTNTGWFNSGTTNTGIGNSGGNLVTGSMGLFNSGHTNTGSFNAGSMNTGDFNSGNVNTGYFNSGNINT 537
Cdd:COG3210 1055 GGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTST 1134
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  538 GFFNSGDLNTGLFNSVNQPVQNSGWLHTGTNNSGYANAGTFNSGFDNNARDEHAEFVTGNSGLANVGNYNAGIINVGDHL 617
Cdd:COG3210 1135 ASTEAAGAGTLTGLVAVSAVAGGASSASAGDTTAVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIGTT 1214
                        410       420       430       440       450       460
                 ....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  618 SGFRNSVPTITGTANISGFVNAGTSISGFFNFGSLMSGFANFDDEVSGYLNGDSRASGWI 677
Cdd:COG3210 1215 NVTTTTTLTASDTGNTTATGGSSAGQTGSFVAAGSASGTGDATTGATAGAVSNGATSTVA 1274
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
94-581 7.34e-13

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 72.12  E-value: 7.34e-13
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  94 VSAFEAALAATVHPGAVSANRGRLRSLVASNLLGQNAPAIAAVEAVYEQMWAADVAAMLGYHGEASAVALSLTPFTPSPS 173
Cdd:COG4625  16 GTGGGGAGGGGGAGGGAGGGGAGGGGGGGGGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGG 95
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 174 AAATPGGAVIIAGFPFLDLGNVTIGGFNLASGNLGLGNLGSFNPGSANTGSVNLGNANIGDLNLGSGNIGSYNLGGGNTG 253
Cdd:COG4625  96 GGGGGGGGGGGGGGGGGGGGGGSAGGGGGGAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGG 175
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 254 DLNPDSGNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTG 333
Cdd:COG4625 176 GGGGGGGGGGGGGGGGGNGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGG 255
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 334 DTNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTGDNQIGFGALNSGSGNLGFGNSGNGNIGffNSGNNNIGMGN 413
Cdd:COG4625 256 GGNGGGGGAGGGGGGGGGGSGGGGGGGGGGGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGG--GGGGGGGGGGG 333
                       330       340       350       360       370       380       390       400
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 414 SGNGVGALSVEFGSSAERSSGFGNSGELSTGIGNSGQLSTGWFNSATTSTGWFNSGTTNTGWFNSGTTNTGIGNSGGNLV 493
Cdd:COG4625 334 AGGGGGSGGAGAGGGGAGGGGAGGGGGGGTGGGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGA 413
                       410       420       430       440       450       460       470       480
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 494 TGSMGLFNSGHTNTGSFNAGSMNTGDFNSGNVNTGYFNSGNINTGFFNSGDLNTGLFNSVNQPVQNSGWLHTGTNNSGYA 573
Cdd:COG4625 414 GGGGGAAGGGGGGTGAGGGGGGGGTGAGGGGATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGG 493

                ....*...
gi 57116882 574 NAGTFNSG 581
Cdd:COG4625 494 GNYTQSAG 501
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
10-675 7.61e-13

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 72.11  E-value: 7.61e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   10 INSALMFAGAGPGPMLAAASAWTGLAGDLGSAAASFSAVTSQLATGSWQGPASAAMTGVAASYARWLTTAAAQAEQAAGQ 89
Cdd:COG3210  528 TSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGS 607
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   90 AQAAVSAFEAALAATVHPGAVSANRGRLRSLVASNLLGQNAPAIAAVEAVYEQMWAADVAAMLGYHGEASAVALSLTPFT 169
Cdd:COG3210  608 AGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTT 687
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  170 PSPSAAATPGGAVIIAGFPFLDLGNVTIGGF--NLASGNLGLGNLGSFNPGSANT---------GSVNLGNANIGDLNLG 238
Cdd:COG3210  688 GTTLNAATGGTLNNAGNTLTISTGSITVTGQigALANANGDTVTFGNLGTGATLTlnagvtitsGNAGTLSIGLTANTTA 767
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  239 SGNIGSYNLGGGNTGDLNPDSGNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLN--------VGGG 310
Cdd:COG3210  768 SGTTLTLANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDttsgaggsNTTD 847
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  311 NTGNSNFGFGNTGNVNFGNGNTGDTNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTGDNQIGFGALNSGSGNLG 390
Cdd:COG3210  848 TTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLT 927
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  391 FGNSGNGNIGFFNSGNNNIGMGNSGNGVGALSVEFGSSAERSSGFGNSGELSTGIGNSGQLSTGWFNSATTSTGWFNSGT 470
Cdd:COG3210  928 GGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGSSAVGTSANSAGSTGGVIAATGILVAGNSGTTAS 1007
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  471 TNTGWFNSGTTNTGIGNSGGNLVTGSMGLFNSGHTNTGSFNAGSMNTGDFNSGNVNTGYFNSGNINTGFFNSGDLNTGLF 550
Cdd:COG3210 1008 TTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGISGGNAAALTASGTAGTTGGTAASNGGGGTAQASG 1087
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  551 NSVNQPVQNSGWLHTGTNNSGYANAGTFNSGFDNNARDEHAEFVTGNSGLANVGNYNAGIINVGDHLSGFRNSVPTITGT 630
Cdd:COG3210 1088 AGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTSTASTEAAGAGTLTGLVAVSAVAGGASSASAGDTT 1167
                        650       660       670       680
                 ....*....|....*....|....*....|....*....|....*
gi 57116882  631 ANISGFVNAGTSISGFFNFGSLMSGFANFDDEVSGYLNGDSRASG 675
Cdd:COG3210 1168 AVAAATTTTTGSAINGGADSAATEGTAGTDLKGGDSTGGSTTTIG 1212
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
26-675 9.74e-13

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 72.11  E-value: 9.74e-13
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   26 AAASAWTGLAGDLGSAAASFSAVTSQLATGSWQGPASAAMTGVAASYARWLTTAAAQAEQAAGQAQAAVSAFEAALAATV 105
Cdd:COG3210  231 VVAAGTGAGVISTGGTDISSLSVAAGAGTGGAGGTGNAGNTTIGTTVTGTNATGSNTAGASSGDTTTNGTSSVTGAGGTG 310
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  106 HPGAVSANRGRLRSLVASNLLGQNAPAIAAVEAVYEQMWAADVAAMLGYHGEASAVALSLTPFTPSPSAAATPGGAVIIA 185
Cdd:COG3210  311 VLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTATAS 390
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  186 GFPFLDLGNVTIGGFNLASGNLGLGNLGSFNPGSANTGSVNLGNANIGDLNLGSGNIGSYNLGGGNTGDLNPDSGNTGTL 265
Cdd:COG3210  391 TGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSGTG 470
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  266 NWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGSGNLGSG 345
Cdd:COG3210  471 TVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSG 550
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  346 NIGFGNKGSHNIGFGNSGNNNIGFGLTGDNQIGFGALNSGSGNLGFGNSGNGNIGFFNSGNNNIGMGNSGNGVGALSVEF 425
Cdd:COG3210  551 GASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGG 630
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  426 GSSAERSSGFGNSGELSTGIGNSGQLSTGWFNSATTSTGWFNSGTTNTGWFNSGTTNTGIGNSGGNLVTGSMGLFNSGHT 505
Cdd:COG3210  631 GAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTIST 710
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  506 NTGSFNAGSMNTGDFNSGNVNTGYFNSGN---INTGFFNSGDLNTGLFNSVNQPVQNSGWLHTGTNNSGYANAGTF---N 579
Cdd:COG3210  711 GSITVTGQIGALANANGDTVTFGNLGTGAtltLNAGVTITSGNAGTLSIGLTANTTASGTTLTLANANGNTSAGATldnA 790
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  580 SGFDNNARDEHAEFVTGNSGLANVGNYNAGIINVGDHLSGFRNSVPTITGTANISGFVNAGTSISGFFNFGSLMSGFANF 659
Cdd:COG3210  791 GAEISIDITADGTITAAGTTAINVTGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSL 870
                        650
                 ....*....|....*.
gi 57116882  660 DDEVSGYLNGDSRASG 675
Cdd:COG3210  871 AATAASITVGSGGVAT 886
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
17-677 3.82e-12

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 70.18  E-value: 3.82e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   17 AGAGPGPMLAAASAWTGLAGDLGSAAASFSAVTSQLATGSWQGPASAAMTGVAASYARWLTTAAAQAEQAAGQAQAAVSA 96
Cdd:COG3210  495 TNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTVSGGASGTTAASGSNTANTLGVLAATG 574
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   97 FEAALAATVHPGAVSANRGRLRSLVASNLLGQNAPAIAAVEAVYEQMWAADVAAMLGYHGEASAVALSLTPFTPSPSAAA 176
Cdd:COG3210  575 GTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANATGGGAGLTGSAVGAALSGTGSGTTGTA 654
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  177 TPGGAVIIAGFPFLDLGNVTIGGFNLASGNLGLGNLGSFNPGSANTGSVNLGNANIGDLNLGSGNIGSYNLGGGNTGDLN 256
Cdd:COG3210  655 SANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTISTGSITVTGQIGALANANGDTVTFGN 734
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  257 PDSGNTGTLNWG-----------SGNIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNV 325
Cdd:COG3210  735 LGTGATLTLNAGvtitsgnagtlSIGLTANTTASGTTLTLANANGNTSAGATLDNAGAEISIDITADGTITAAGTTAINV 814
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  326 NFGNGNTGDTNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTGDNQIGFGALNSGSGNLGFGNSGNGNIGFFNSG 405
Cdd:COG3210  815 TGSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSGSLAATAASITVGSGGVATSTGTANAG 894
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  406 NNNIGMGNSGNGVGALSVEFGSSAERSSGFGNSGELSTGIGNSGQLSTGWFNSATTSTGWFNSGTTNTGWFNSGTTNTGI 485
Cdd:COG3210  895 TLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGTQGNAGLSAASASDGAGDTGASSAAGS 974
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  486 GNSGGNLVTGSMGLFNSGHTNTGSFNAGSMNTGDFNSGNVNTGYFNSGNINTGFFNSGDLNTGLFNSVNQPVQNSGWLHT 565
Cdd:COG3210  975 SAVGTSANSAGSTGGVIAATGILVAGNSGTTASTTGGSGAIVAGGNGVTGTTGTASATGTGTAATAGGQNGVGVNASGIS 1054
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  566 GTNNSGYANAGTFNSGFDNNARDEHAEFVTGNSGLANVGNYNAGIINVGDHLSGFRNSVPTITGTANISGFVNAGTSISG 645
Cdd:COG3210 1055 GGNAAALTASGTAGTTGGTAASNGGGGTAQASGAGTTHTLGGITNGGATGTSGGTTTSTGGVTASKVGGTTTVGATGTST 1134
                        650       660       670
                 ....*....|....*....|....*....|..
gi 57116882  646 FFNFGSLMSGFANFDDEVSGYLNGDSRASGWI 677
Cdd:COG3210 1135 ASTEAAGAGTLTGLVAVSAVAGGASSASAGDT 1166
AidA COG3468
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ...
209-624 1.28e-11

Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442691 [Multi-domain]  Cd Length: 846  Bit Score: 68.05  E-value: 1.28e-11
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 209 LGNLGSFNPGSANTGSVNLGNANIGDLNLGSGNIGSYNLGGGNTGDLNPDSGNTGTLNWGSGNIGSYNLGGGNLGSYNLG 288
Cdd:COG3468   1 TASGGGGGATGLGGGGTGGGGGLGGTGGGNAGLGIGNGGGGGAASGSGAGGVAGNGGGGGGGAGGGGGGAGSGGGLAGAG 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 289 SGNTGD--TNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNN 366
Cdd:COG3468  81 SGGTGGnsTGGGGGNSGTGGTGGGGGGGGSGNGGGGGGGGGGGGTGGGGGGGTGSAGGGGGGGGGGTGVGGTGAAAAGGG 160
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 367 IGFGLTGDNQIGFGALNSGSGNLGFGNSGNGNIGFFNSGNNNIGMGNSGNGVGALSVEFGSSAERSSGFGNSGELSTGIG 446
Cdd:COG3468 161 TGSGGGGSGGGGGAGGGGGGGAGGSGGAGSTGSGAGGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGGGVGG 240
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 447 NSGQLSTGWFNSATTSTGWFNSGTTNTGWFNSGTTNTGIGN-SGGNLVTGSMGLFNSGHTNTGSFNAGSMNTGDFNSGNV 525
Cdd:COG3468 241 GGGSAGGTGGGGLTGGGAAGTGGGGGGTGTGSGGGGGGGANgGGSGGGGGASGTGGGGTASTGGGGGGGGGNGGGGGGGS 320
                       330       340       350       360       370       380       390       400
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 526 NTGYFNSGNINTGFFNSGDLNTGLFNSVNQPVQNSGWLHTGTNNSGYANAGTFNSGFDNNARDEHAEFVTGNSGLANVGN 605
Cdd:COG3468 321 NAGGGSGGGGGGGGGGGGGGTTLNGAGSAGGGTGAALAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDGVGTGLTTGGT 400
                       410
                ....*....|....*....
gi 57116882 606 YNAGIINVGDHLSGFRNSV 624
Cdd:COG3468 401 GNNGGGGVGGGGGGGLTLT 419
FhaB COG3210
Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, ...
32-677 3.92e-11

Large exoprotein involved in heme utilization or adhesion [Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442443 [Multi-domain]  Cd Length: 1698  Bit Score: 66.71  E-value: 3.92e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   32 TGLAGDLGSAAASFSAVTSQLATGSWQGPASAAMTGVAASYARWLTTAAAQAEQAAGQAQAAVSAFEAALAATVHPGAVS 111
Cdd:COG3210  309 TGVLGGGTAAGITTTNTVGGNGDGNNTTANSGAGLVSGGTGGNNGTTGTGAGSGLTGTGNGGGLTTAGAGTVASTVGTAT 388
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  112 ANRGRLRSLVASNLLGQNAPAIAAVEAVYEQMWAADVAAMLGYHGEASAVALSLTPFTPSPSAAATPGGAVIIAGFPFLD 191
Cdd:COG3210  389 ASTGNASSTTVLGSGSLATGNTGTTIAGNGGSANAGGFTTTGGVLGITGNGTVTGGTIGGLTGSGTTNGAGLSGNTDVSG 468
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  192 LGNVTIGGFNLASGNLGLGNLGSFNPGSANTGSVNLGNANIGDLNLGSGNIGSYNLGGGNTGDLNPDSGNTGTLNWGSGN 271
Cdd:COG3210  469 TGTVTNSAGNTTSATTLAGGGIGTVTTNATISNNAGGDANGIATGLTGITAGGGGGGNATSGGTGGDGTTLSGSGLTTTV 548
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  272 IGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGSGNLGSGNIGFGN 351
Cdd:COG3210  549 SGGASGTTAASGSNTANTLGVLAATGGTSNATTAGNSTSATGGTGTNSGGTVLSIGTGSAGATGTITLGAGTSGAGANAT 628
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  352 KGSHNIGFGNSGNNNIGFGLTGDNQIGFGALNSGSGNLGFGNSGNGNIGFFNSGNNNIGMGNSGNGVGALSVEFGSSAER 431
Cdd:COG3210  629 GGGAGLTGSAVGAALSGTGSGTTGTASANGSNTTGVNTAGGTGGGTTGTVTSGATGGTTGTTLNAATGGTLNNAGNTLTI 708
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  432 SSGFGNSGELSTGIGNSGQLSTGWFNSATTSTGWFNSGTT------------NTGWFNSGTTNTGIGNSGGNLVTGSMGL 499
Cdd:COG3210  709 STGSITVTGQIGALANANGDTVTFGNLGTGATLTLNAGVTitsgnagtlsigLTANTTASGTTLTLANANGNTSAGATLD 788
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  500 FNSGHTNTGSFNAGSMNTGDFNSGNV--NTGYFNSGNINTGFFNSGDLNTGLFNSVNQPVQNSGWLHTGTNNSGYANAGT 577
Cdd:COG3210  789 NAGAEISIDITADGTITAAGTTAINVtgSGGTITINTATTGLTGTGDTTSGAGGSNTTDTTTGTTSDGASGGGTAGANSG 868
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  578 FNSGFDNNARDEHAEFVTGNSGLANVGNYNAGIINVGDHLSGFRNSVPTITGTANISGFVNAGTSISGFFNFGSLMSGFA 657
Cdd:COG3210  869 SLAATAASITVGSGGVATSTGTANAGTLTNLGTTTNAASGNGAVLATVTATGTGGGGLTGGNAAAGGTGAGNGTTALSGT 948
                        650       660
                 ....*....|....*....|
gi 57116882  658 NFDDEVSGYLNGDSRASGWI 677
Cdd:COG3210  949 QGNAGLSAASASDGAGDTGA 968
AidA COG3468
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ...
145-534 1.21e-09

Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442691 [Multi-domain]  Cd Length: 846  Bit Score: 61.50  E-value: 1.21e-09
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 145 AADVAAMLGYHGEASAVALSLTPFTPSPSAAATPGGAVIIAGFPFLDLGNVTIGGFNLASGNLGLGNLGSFNPGSANTGS 224
Cdd:COG3468  43 AASGSGAGGVAGNGGGGGGGAGGGGGGAGSGGGLAGAGSGGTGGNSTGGGGGNSGTGGTGGGGGGGGSGNGGGGGGGGGG 122
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 225 VNLGNANIGDLNLGSGNIGSYNLGGGNTGDLNPDSGNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGN 304
Cdd:COG3468 123 GGTGGGGGGGTGSAGGGGGGGGGGTGVGGTGAAAAGGGTGSGGGGSGGGGGAGGGGGGGAGGSGGAGSTGSGAGGGGGGS 202
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 305 LNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTGDNQIGFGALNS 384
Cdd:COG3468 203 GGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGGGVGGGGGSAGGTGGGGLTGGGAAGTGGGGGGTGTGSGGGGGGGANG 282
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 385 GSGNLGFGNSGNGNIGFFNSGNNNIGMGNSGNGVGALSVEFGSSAERSSGFGNSGELSTGIGNSGQLSTGWFNSATTSTG 464
Cdd:COG3468 283 GGSGGGGGASGTGGGGTASTGGGGGGGGGNGGGGGGGSNAGGGSGGGGGGGGGGGGGGTTLNGAGSAGGGTGAALAGTGG 362
                       330       340       350       360       370       380       390
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 465 WFNSGTTNTGWFNSGTTNTGIGNSGGNLVTGSMGLFNSGHTNTGSFNAGSMNTGDFNSGNVNTGYFNSGN 534
Cdd:COG3468 363 SGSGGGGGGGSGGGGGAGGGGANTGSDGVGTGLTTGGTGNNGGGGVGGGGGGGLTLTGGTLTVNGNYTGN 432
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
17-612 1.16e-08

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 58.24  E-value: 1.16e-08
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  17 AGAGPGPMLAAASAWTGLAGDLGSAAASFSAVTSQLATGSWQGPASAAMTGVAASYARWLTTAAAQAEQAAGQAQAAVSA 96
Cdd:COG5295  27 GSSATVTSAAQSTGSAATSSGSSSAAGGSGSTSSLTAAAATAGAGSGGTSATAASSVASGGASAATAASTGTGNTAGTAA 106
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  97 FEAALAATVHPGAVSANRGRLRSLVASNLLGQNAPAIAAVEAVYEQMWAADVAAMLGYHGEASAVALSLTPFTPSPSAAA 176
Cdd:COG5295 107 TVAGAASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATATGSSTANAATAAAGATSTSASGSSSGA 186
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 177 TPGGAVIIAGFPFLDLGNVTIGGFNLASGNLGLGNLGSFNPGSANTGSVNLGNANIGDLNLGSGNIGSYNLGGGNTGDLN 256
Cdd:COG5295 187 SGAAAASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSASAGAASGNATTASASSVSGSAVAAGTA 266
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 257 PDSGNTGTLNWGSGNIGSYNLGGGNLGSynlGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTN 336
Cdd:COG5295 267 STATTASTTAASGAAGTATAAAGGDAAA---AGSASSTGAANATAGGGNAGSGGGGAAALGSAGGSSGVGTASGASAAAA 343
                       330       340       350       360       370       380       390       400
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 337 FGSGNLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTGDNQIGFGALNSGSGNLGFGNSGNGNIGFFNSGNNNIGMGNSGN 416
Cdd:COG5295 344 TNDGTANGAGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGSGGSSTGASAGGGASAAGGAAAGSAA 423
                       410       420       430       440       450       460       470       480
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 417 GVGALSVEFGSSAERSSGFGNSGELSTGIGNSGQLSTGWFNSATTSTGWFNSGTTNTGWF-NSGTTNTGIGNSGGNLVTG 495
Cdd:COG5295 424 AGTSSNTSAVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAATSsAAIAGATATGAGAAAGGAG 503
                       490       500       510       520       530       540       550       560
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 496 SMGLFNSGHTNTGSFNAGSMNTGDFNSGNVNTGYFNSGNINTGFFNSGDLNTGLFNSVNQPVQNSGWLHTGTNNSGYANA 575
Cdd:COG5295 504 AGAAGGAGSAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGNNTATGANSVALGAGSVASGANSV 583
                       570       580       590
                ....*....|....*....|....*....|....*..
gi 57116882 576 GTFNSGFDNNARDEHAEFVTGNSGLANVGNYNAGIIN 612
Cdd:COG5295 584 SVGAAGAENVAAGATDTDAVNGGGAVATGDNSVAVGN 620
COG4625 COG4625
Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function ...
17-474 1.43e-08

Uncharacterized conserved protein, contains a C-terminal beta-barrel porin domain [Function unknown];


Pssm-ID: 443664 [Multi-domain]  Cd Length: 900  Bit Score: 58.25  E-value: 1.43e-08
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  17 AGAGPGPMLAAASAWTGLAGDLGSAAASFSAVTSQLATGSWQGPASAAMTGVAASYARWLTTAAAQAEQAAGQAQAAVSA 96
Cdd:COG4625  45 GGGGGAGGGGGGGGTGGGGGGGGGGGGGGAGGGGGGGGGGGGGGGTGGVGGGGGGGGGGGGGGGGGGGGGGGGSAGGGGG 124
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  97 FEAALAATVHPGAVSANRGRLRSLVASNLLGQNAPAIAAVEAVYEQMWAADVAAMLGYHGEASAVALSLTPFTPSPSAAA 176
Cdd:COG4625 125 GAGGAGGGGGGGAGGGGGGGGGGGAGGGGGGGAGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGNGGGGGGGGGGG 204
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 177 TPGGAVIIAGFPFLDLGNVTIGGFNLASGNLGLGNLGSFNPGSANTGSVNLGNANIGDLNLGSGNIGSYNLGGGNTGDLN 256
Cdd:COG4625 205 GGGGGGGGGAGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGGGGGNGGGGGAGGGGGGGGGGSGGGGGGGGG 284
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 257 PDSGNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTN 336
Cdd:COG4625 285 GGSGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGAGGGGGSGGAGAGGGGAGGGGAGGGGGGGTG 364
                       330       340       350       360       370       380       390       400
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 337 FGSGNLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTGDNQIGFGALNSGSGNLGFGNSGNGNIGFFNSGNNNIGMGNSGN 416
Cdd:COG4625 365 GGGGGGGGGGGGSGGGGAGGGGGSGGGGGGGAGGGGGGGGAGGTGGGGAGGGGGAAGGGGGGTGAGGGGGGGGTGAGGGG 444
                       410       420       430       440       450
                ....*....|....*....|....*....|....*....|....*....|....*...
gi 57116882 417 GVGALSVEFGSSAERSSGFGNSGELSTGIGNSGQLSTGWFNSATTSTGWFNSGTTNTG 474
Cdd:COG4625 445 ATGGGGGGGGGAGGSGGGAGAGGGSGSGAGTLTLTGNNTYTGTTTVNGGGNYTQSAGS 502
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
259-499 3.82e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 53.47  E-value: 3.82e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   259 SGNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVnfgngntgdtnfg 338
Cdd:NF033849  327 QSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSGGF------------- 393
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   339 sgnlgSGNIGFGNKGSHNIGFGNSGNNNIGfglTGDNQIGFGALNSGSGNLGFGNSGNGNIGFFNSGNNNIGMGNSGNGV 418
Cdd:NF033849  394 -----SGGIAGGGVTSEGLGASQGGSEGWG---SGDSVQSVSQSYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGTSWS 465
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   419 GALSVEFGSSAERSSGFGNSGELSTGIGNSGQLSTGWFNSATTSTGwfNSGTTNTGWFNSGTTNTGIGnsggnlvtGSMG 498
Cdd:NF033849  466 EGTGTSQGQSVGTSESWSTSQSETDSVGDSTGTSESVSQGDGRSTG--RSESQGTSLGTSGGRTSGAG--------GSMG 535

                  .
gi 57116882   499 L 499
Cdd:NF033849  536 L 536
Hia COG5295
Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, ...
158-645 7.75e-07

Autotransporter adhesin [Intracellular trafficking, secretion, and vesicular transport, Extracellular structures];


Pssm-ID: 444098 [Multi-domain]  Cd Length: 785  Bit Score: 52.47  E-value: 7.75e-07
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 158 ASAVALSLTPFTPSPSAAATPGGAVIIAGFPFLDLGNVTIGGFNLASGNLGLGNLGSFNPGSANTGSVNLGNANIGDLNL 237
Cdd:COG5295 112 ASSGSATNAGASAGASAAAAAGSTAAAGGAAASTGGSSAAGGSNTATATGSSTANAATAAAGATSTSASGSSSGASGAAA 191
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 238 GSGNIGSYNLGGGNTGDLNPDSGNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNF 317
Cdd:COG5295 192 ASAATGASAGGTASAAASASSSATGTSASVGVNAGAATGSAASAGGSASAGAASGNATTASASSVSGSAVAAGTASTATT 271
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 318 GFGNTGNVNFGNGNTGDTNFGSGNLGSGNIGFGNKGSH--NIGFGNSGNNNIGFGLT----GDNQIGFGALNSGSGNLGF 391
Cdd:COG5295 272 ASTTAASGAAGTATAAAGGDAAAAGSASSTGAANATAGggNAGSGGGGAAALGSAGGssgvGTASGASAAAATNDGTANG 351
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 392 GNSGNGNIGFFNSGNNNIGMGNSGNGVGALSVEFGSSAERSSGFGNSGELSTGIGNSGQLSTGWFNSATTSTGWFNSGTT 471
Cdd:COG5295 352 AGTSAAADATSGGGAGGGGAAATSSSGGSATAAGNAAGAAGAGSAGSGGSSTGASAGGGASAAGGAAAGSAAAGTSSNTS 431
                       330       340       350       360       370       380       390       400
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 472 NTGWFNSGTTNTGIGNSGGNLVTGSMGLFNSGHTNTGSFNAGSMNTGDFNSGNVNTGYFNSGNINTGFFNSGDLNTGLFN 551
Cdd:COG5295 432 AVGASNGASGTSSSASSAGAAGGGTAGAGGAANVGAATTAASAAATAAAATSSAAIAGATATGAGAAAGGAGAGAAGGAG 511
                       410       420       430       440       450       460       470       480
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 552 SVNQPVQNSGWLHTGTNNSGYANAGTFNSGFDNNARDEHAEFVTGNSGLANVGNYNAGIINVGDHLSGFRNSVPTITGTA 631
Cdd:COG5295 512 SAAAGGAANAAAASGATATAGSAGGGAAAAAGGGSTTAATGTNSVAVGNNTATGANSVALGAGSVASGANSVSVGAAGAE 591
                       490
                ....*....|....
gi 57116882 632 NISGFVNAGTSISG 645
Cdd:COG5295 592 NVAAGATDTDAVNG 605
AidA COG3468
Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular ...
193-459 9.42e-07

Autotransporter adhesin AidA [Cell wall/membrane/envelope biogenesis, Intracellular trafficking, secretion, and vesicular transport];


Pssm-ID: 442691 [Multi-domain]  Cd Length: 846  Bit Score: 52.26  E-value: 9.42e-07
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 193 GNVTIGGFNLASGNLGLGNLGSFNPGSANTGSVNLGNANIGDLNLGSGNIGSYNLGGGNTGDLNPDSGNTGTLNWGSGNI 272
Cdd:COG3468 175 GGGGGGGAGGSGGAGSTGSGAGGGGGGSGGGGGAAGTGGGGGGGGGAGGATGGAGSGGNTGGGVGGGGGSAGGTGGGGLT 254
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 273 GSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGSGNLGSGNIGFGNK 352
Cdd:COG3468 255 GGGAAGTGGGGGGTGTGSGGGGGGGANGGGSGGGGGASGTGGGGTASTGGGGGGGGGNGGGGGGGSNAGGGSGGGGGGGG 334
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 353 GSHNIGFGNSGNNNIGFGLTGDNQIGFGALNSGSGNLGFGNSGNGNIGFFNSGNNNIGMGNSGNGVGALSVEFGSSAERS 432
Cdd:COG3468 335 GGGGGGTTLNGAGSAGGGTGAALAGTGGSGSGGGGGGGSGGGGGAGGGGANTGSDGVGTGLTTGGTGNNGGGGVGGGGGG 414
                       250       260
                ....*....|....*....|....*..
gi 57116882 433 SGFGNSGELSTGIGNSGQLSTGWFNSA 459
Cdd:COG3468 415 GLTLTGGTLTVNGNYTGNNGTLVLNTV 441
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
260-523 9.92e-07

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 52.31  E-value: 9.92e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   260 GNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGS 339
Cdd:NF033849  278 GHGSTRGWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSE 357
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   340 GNLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTGdnqiGFGAlnsgsGNLGFGNSGNGnIGFFNSGNNNIGMGNSGNGVG 419
Cdd:NF033849  358 SSSESTGTSVGHSTSSSVSSSESSSRSSSSGVSG----GFSG-----GIAGGGVTSEG-LGASQGGSEGWGSGDSVQSVS 427
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   420 AlSVEFGSSAERSSGFGNSGELSTGIGNSGQLSTG--WFNSATTSTGWfnSGTTNTGWFNSGTTNTGIGNSGGNLVTGSM 497
Cdd:NF033849  428 Q-SYGSSSSTGTSSGHSDSSSHSTSSGQADSVSQGtsWSEGTGTSQGQ--SVGTSESWSTSQSETDSVGDSTGTSESVSQ 504
                         250       260
                  ....*....|....*....|....*.
gi 57116882   498 GlFNSGHTNTGSFNAGSMNTGDFNSG 523
Cdd:NF033849  505 G-DGRSTGRSESQGTSLGTSGGRTSG 529
PHA02515 PHA02515
hypothetical protein; Provisional
148-366 1.46e-06

hypothetical protein; Provisional


Pssm-ID: 107197 [Multi-domain]  Cd Length: 508  Bit Score: 51.32  E-value: 1.46e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  148 VAAMLGYHGEASAVAlslTPFTPSPSAAATPGGAVIIAGfpflDLGNVTIGGFNLASGNLGLGNLGSFNPGSANT----- 222
Cdd:PHA02515 153 VSAVYGHLANIDAVA---TNEADIDTVAASVGAVDTVAG----DLGGTWAAGVSYDFGSIAVPPIGNTSPPGGNIvivan 225
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  223 --GSVNLGNANIGDLNLGS----------GNIGSYNLGGGNTGDLNPDSGNTGTLNWGSGNIGSYNLGGGNLGSYNLGSG 290
Cdd:PHA02515 226 siGNVDTVAENIGDVSTVSthlssmlavaNDIDSVVSVAGDLENIDAVADNAANINTVAGANANVNTVASNILDVGTVAG 305
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 57116882  291 NTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNN 366
Cdd:PHA02515 306 NIDDVQAVAGNAANINVVADNADNINATAANQANINAAVGNADNINAAVANQANINAVVGNANNINAVAANEGNVN 381
PTZ00395 PTZ00395
Sec24-related protein; Provisional
425-575 7.78e-06

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 49.30  E-value: 7.78e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   425 FGSSAERSSGFGNSGELSTGIGNsgQLSTGWFNSATTST-GWFNSGTTNTGWFNSGTTNtgignsggnlvtgsmglfNSG 503
Cdd:PTZ00395  339 YGGFHDGSPNAASAGAPFNGLGN--QADGGHINQVHPDArGAWAGGPHSNASYNCAAYS------------------NAA 398
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 57116882   504 HTNTGSFNAGSMNTGDFNSGNVNTGYFNSGNINTGFFNSGDLNTGLFN--SVNQPVQNSGWLHTGTNNSGYANA 575
Cdd:PTZ00395  399 QSNAAQSNAGFSNAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNppNSNPPYSNLPYSNTPYSNAPLSNA 472
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
505-543 1.30e-05

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 42.54  E-value: 1.30e-05
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 57116882   505 TNTGSFNAGSMNTGDFNSGNVNTGYFNSGNINTGFFNSG 543
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
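The approximate XNXGX description given for pfam01469 can be checked against the query segment aligned above (residues 505-543). The following is a minimal sketch, not the method CD-Search itself uses (CD-Search scores against a position-specific scoring matrix); the helper name `longest_tandem_run` and the plain regular-expression reading of "X can be any amino acid" are assumptions made for illustration only.

```python
import re

# Assumed literal reading of XNXGX: any residue, N, any residue, G, any residue.
PENTAPEPTIDE = re.compile(r"[A-Z]N[A-Z]G[A-Z]")


def longest_tandem_run(seq):
    """Return (start, copies) for the longest run of back-to-back XNXGX units.

    start is a 0-based offset into seq; copies counts consecutive
    pentapeptides in one reading frame with no gaps between them.
    """
    best = (0, 0)
    for start in range(len(seq)):
        copies = 0
        pos = start
        # fullmatch with pos/endpos tests exactly the 5-residue window;
        # a window cut short by the end of seq cannot match.
        while PENTAPEPTIDE.fullmatch(seq, pos, pos + 5):
            copies += 1
            pos += 5
        if copies > best[1]:
            best = (start, copies)
    return best


# Query residues 505-543 of gi 57116882, copied from the alignment above.
segment = "TNTGSFNAGSMNTGDFNSGNVNTGYFNSGNINTGFFNSG"
print(longest_tandem_run(segment))  # (0, 7)
```

On this 39-residue segment the sketch finds seven complete in-frame pentapeptides starting at the first residue, followed by a partial unit (FNSG). The domain model is annotated as 8 copies, so the final unit is only partly represented in a strict regular-expression match.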
PHA02515 PHA02515
hypothetical protein; Provisional
209-340 2.76e-05

hypothetical protein; Provisional


Pssm-ID: 107197 [Multi-domain]  Cd Length: 508  Bit Score: 47.08  E-value: 2.76e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882  209 LGNLGSFNPGSANTGSVNLGNANIgdlNLGSGNIGSYNLGGGNTGDLNPDSGNTGTLNWGSGNIGSYNLGGGNLGSYNLG 288
Cdd:PHA02515 267 LENIDAVADNAANINTVAGANANV---NTVASNILDVGTVAGNIDDVQAVAGNAANINVVADNADNINATAANQANINAA 343
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|..
gi 57116882  289 SGNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGSG 340
Cdd:PHA02515 344 VGNADNINAAVANQANINAVVGNANNINAVAANEGNVNTVVDNLADVQTVAG 395
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
260-298 2.83e-05

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 41.39  E-value: 2.83e-05
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 57116882   260 GNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFG 298
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
270-308 8.09e-05

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 40.23  E-value: 8.09e-05
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 57116882   270 GNIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVG 308
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
300-338 9.95e-05

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 39.85  E-value: 9.95e-05
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 57116882   300 GNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFG 338
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
515-552 1.36e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 39.46  E-value: 1.36e-04
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 57116882   515 MNTGDFNSGNVNTGYFNSGNINTGFFNSGDLNTGLFNS 552
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNT 38
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
330-368 1.38e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 39.46  E-value: 1.38e-04
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 57116882   330 GNTGDTNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNNIG 368
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
379-415 1.46e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 39.46  E-value: 1.46e-04
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 57116882   379 FGALNSGSGNLGFGNSGNGNIGFFNSGNNNIGMGNSG 415
Cdd:pfam01469   3 TGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
310-348 1.72e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 39.46  E-value: 1.72e-04
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 57116882   310 GNTGNSNFGFGNTGNVNFGNGNTGDTNFGSGNLGSGNIG 348
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
PTZ00395 PTZ00395
Sec24-related protein; Provisional
218-433 1.84e-04

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 45.07  E-value: 1.84e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   218 GSANTGSVNLGNANIGDLNLGSgnigsyNLGGGNTGDLNPDSGNTgtlnWGSG--NIGSYNLGG-GNLGSYNLGSGNTGD 294
Cdd:PTZ00395  340 GGFHDGSPNAASAGAPFNGLGN------QADGGHINQVHPDARGA----WAGGphSNASYNCAAySNAAQSNAAQSNAGF 409
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   295 TNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTGD 374
Cdd:PTZ00395  410 SNAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQ 489
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 57116882   375 NQigfgALNSGSGNLGFGNSgngnigffNSGNNNIGMGNSGNGVGALSVEFGSSAERSS 433
Cdd:PTZ00395  490 HR----AANQPAANLPTANQ--------PAANNFHGAAGNSVGNPFASRPFGSAPYGGN 536
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
290-328 2.06e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 39.08  E-value: 2.06e-04
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 57116882   290 GNTGDTNFGGGNTGNLNVGGGNTGNSNFGFGNTGNVNFG 328
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
301-532 2.57e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids), with a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 44.61  E-value: 2.57e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   301 NTGNLNVGGGNTGNSNFG----------------FGNTGNVNFGNGNTGDTNFG--SGNLGSGNIGFGNKGSHNIGFGNS 362
Cdd:NF033849  231 YAANLGQSAGTGYGESVGhstsqgqshsvgtsesHSVGTSQSQSHTTGHGSTRGwsHTQSTSESESTGQSSSVGTSESQS 310
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   363 GNNNIGFGLTGDNQIGFGALNSGSGNLGFGNS----GNGNIGFFNSGNNNIGMGNSGNGVGALSVEFGSSAERSSGFGNS 438
Cdd:NF033849  311 HGTTEGTSTTDSSSHSQSSSYNVSSGTGVSSShsdgTSQSTSISHSESSSESTGTSVGHSTSSSVSSSESSSRSSSSGVS 390
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   439 GELSTGIGNSGQLSTGWFNSATTSTGWfNSGTTNTGWFNSGTTNTGIGNSGGNLVTGSMGLfNSGHTNTGSFNAG-SMNT 517
Cdd:NF033849  391 GGFSGGIAGGGVTSEGLGASQGGSEGW-GSGDSVQSVSQSYGSSSSTGTSSGHSDSSSHST-SSGQADSVSQGTSwSEGT 468
                         250
                  ....*....|....*
gi 57116882   518 GDFNSGNVNTGYFNS 532
Cdd:NF033849  469 GTSQGQSVGTSESWS 483
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
220-257 2.61e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 38.69  E-value: 2.61e-04
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 57116882   220 ANTGSVNLGNANIGDLNLGSGNIGSYNLGGGNTGDLNP 257
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNT 38
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
326-363 2.79e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether these two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 38.69  E-value: 2.79e-04
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 57116882   326 NFGNGNTGDTNFGSGNLGSGNIGFGNKGSHNIGFGNSG 363
Cdd:pfam01469   2 NTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
210-248 3.20e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 38.69  E-value: 3.20e-04
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 57116882   210 GNLGSFNPGSANTGSVNLGNANIGDLNLGSGNIGSYNLG 248
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
ser_rich_anae_1 NF033849
serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 ...
425-580 3.21e-04

serine-rich protein; This serine-rich protein belongs to a family with large size (over 1000 amino acids) and a highly serine-rich central region that averages over 300 aa in length. Species encoding members of this family of proteins tend to be anaerobic bacteria, including Gram-positive bacteria of the human gut microbiome and Chloroflexi from marine sediments.


Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 44.23  E-value: 3.21e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882   425 FGSSAERSSGFGNSGELSTGIGNSGQLSTGWFNSATTSTGwfNSGTTNTGWFNSGTTNTGIGNSGGNLVTGSMGLFNSGH 504
Cdd:NF033849  283 RGWSHTQSTSESESTGQSSSVGTSESQSHGTTEGTSTTDS--SSHSQSSSYNVSSGTGVSSSHSDGTSQSTSISHSESSS 360
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 57116882   505 TNTGSFNAGSMNTGdfNSGNVNTGYFNSGNINTGFfnSGDLNTGLFNSVNQPV---QNSGWLHTGTNNSGYANAGTFNS 580
Cdd:NF033849  361 ESTGTSVGHSTSSS--VSSSESSSRSSSSGVSGGF--SGGIAGGGVTSEGLGAsqgGSEGWGSGDSVQSVSQSYGSSSS 435
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
316-353 3.57e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 38.31  E-value: 3.57e-04
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 57116882   316 NFGFGNTGNVNFGNGNTGDTNFGSGNLGSGNIGFGNKG 353
Cdd:pfam01469   2 NTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
335-373 5.84e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 37.92  E-value: 5.84e-04
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 57116882   335 TNFGSGNLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTG 373
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
280-318 8.40e-04

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 37.54  E-value: 8.40e-04
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 57116882   280 GNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTGNSNFG 318
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
306-343 2.48e-03

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 36.00  E-value: 2.48e-03
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 57116882   306 NVGGGNTGNSNFGFGNTGNVNFGNGNTGDTNFGSGNLG 343
Cdd:pfam01469   2 NTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
250-288 3.11e-03

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 35.61  E-value: 3.11e-03
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 57116882   250 GNTGDLNPDSGNTGTLNWGSGNIGSYNLGGGNLGSYNLG 288
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
498-533 4.02e-03

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 35.61  E-value: 4.02e-03
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 57116882   498 GLFNSGHTNTGSFNAGSMNTGDFNSGNVNTGYFNSG 533
Cdd:pfam01469   4 GSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
COG2931 COG2931
Ca2+-binding protein, RTX toxin-related [Secondary metabolites biosynthesis, transport and ...
173-423 4.54e-03

Ca2+-binding protein, RTX toxin-related [Secondary metabolites biosynthesis, transport and catabolism];


Pssm-ID: 442175 [Multi-domain]  Cd Length: 252  Bit Score: 39.50  E-value: 4.54e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 173 SAAATPGGAVIIAGFPFLDLGNVTIGGFNLASGNLGLGNLGSFNPGSANTGSVNLGNANIGDLNLGSGNIGSYNLGGGNT 252
Cdd:COG2931   1 GGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGGDGGGGGGGGGGGGGGGGLDGGGGGGGGDGGG 80
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 253 GDLNPDSGNTGTLNWGSGNIGSYNLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNtgnsNFGFGNTGNVNFGNGNT 332
Cdd:COG2931  81 GGGGDDTDGGGDGGDGGGGGTGDDTGDGGGGNDTLTGGDGNDTLTGGAGDDTLYGGAGN----DTLTGGAGNDTLYGGAG 156
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 57116882 333 GDTNFGSG-----NLGSGNIGFGNKGSHNIGFGNSGNNNIGFGLTGDNQIGFGALNSGSGNLGFGNSGNGNIGFFNSGNN 407
Cdd:COG2931 157 NDTLYGGAgndtlDGGAGNDTLTGGAGNDTLTGGAGNDTLDGGGGDDTLGGGGGDDGLDGGDGDDGLGGGGGDDTLGGGG 236
                       250
                ....*....|....*.
gi 57116882 408 NIGMGNSGNGVGALSV 423
Cdd:COG2931 237 GGDGGGGGGGDDGLGG 252
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
230-268 5.41e-03

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 35.23  E-value: 5.41e-03
                          10        20        30
                  ....*....|....*....|....*....|....*....
gi 57116882   230 ANIGDLNLGSGNIGSYNLGGGNTGDLNPDSGNTGTLNWG 268
Cdd:pfam01469   1 GNTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
201-238 6.15e-03

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 34.84  E-value: 6.15e-03
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 57116882   201 NLASGNLGLGNLGSFNPGSANTGSVNLGNANIGDLNLG 238
Cdd:pfam01469   2 NTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
Pentapeptide_2 pfam01469
Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These ...
276-313 8.18e-03

Pentapeptide repeats (8 copies); These repeats are found in many mycobacterial proteins. These repeats are most common in the pfam00823 family of proteins, where they are found in the MPTR subfamily of PPE proteins. The function of these repeats is unknown. The repeat can be approximately described as XNXGX, where X can be any amino acid. These repeats are similar to pfam00805; however, it is not clear whether the two families are structurally related.


Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 34.46  E-value: 8.18e-03
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 57116882   276 NLGGGNLGSYNLGSGNTGDTNFGGGNTGNLNVGGGNTG 313
Cdd:pfam01469   2 NTGSGNSGSGNTGFFNLGSGNTGSFNLGSGNTGNGNTG 39
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
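
The E-value threshold listed in the search parameters can be applied to the per-hit summary lines when this report is processed as text. The sketch below is an illustration only: the regular expression, the `parse_summary` helper, and the assumption that a hit is reported when its E-value is at or below the threshold are not taken from the CD-Search documentation. The three sample lines are copied from hits listed above.

```python
import re

# Pattern for the "Pssm-ID ... E-value" summary line printed under each hit.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+).*?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

E_VALUE_THRESHOLD = 0.01  # from the "Preset Options" line


def parse_summary(line):
    """Parse one summary line into a dict of typed fields (hypothetical helper)."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


summary_lines = [
    "Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 38.69  E-value: 2.61e-04",
    "Pssm-ID: 468206 [Multi-domain]  Cd Length: 1122  Bit Score: 44.23  E-value: 3.21e-04",
    "Pssm-ID: 279771 [Multi-domain]  Cd Length: 39  Bit Score: 34.46  E-value: 8.18e-03",
]

hits = [parse_summary(line) for line in summary_lines]
# Assumption: a hit passes when its E-value does not exceed the threshold.
passing = [hit for hit in hits if hit["e_value"] <= E_VALUE_THRESHOLD]
print(len(passing), "of", len(hits), "hits pass the threshold")
```

All three sample hits fall below 0.01, which is consistent with their appearing in the list of domain hits.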

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D1):D384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D1):D265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D1):D200-3.