|
Name |
Accession |
Description |
Interval |
E-value |
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
824-1082 |
1.18e-08 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 57.82 E-value: 1.18e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 824 LGNMGHARG--ARALRE----AEREPE-LEARV--AVLATQLGMLEDAEQLYRKCKRYD--------LLNRFYQAAGQWQ 886
Cdd:COG2956 48 LGNLYRRRGeyDRAIRIhqklLERDPDrAEALLelAQDYLKAGLLDRAEELLEKLLELDpddaealrLLAEIYEQEGDWE 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 887 KALKVAERHDRVHLRSTYHHY--AGHLEASADCSRALSYYEKsdthrfevprMLSEDLPSLELYVNKmkdktlwrwwAQY 964
Cdd:COG2956 128 KAIEVLERLLKLGPENAHAYCelAELYLEQGDYDEAIEALEK----------ALKLDPDCARALLLL----------AEL 187
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 965 LESQGEMDAALHYYelaqdhfslvrihcfqgnvQKAAQIANEtgNLAASYHLARQYESQEEVGQAVHFYTRAQAfknair 1044
Cdd:COG2956 188 YLEQGDYEEAIAAL-------------------ERALEQDPD--YLPALPRLAELYEKLGDPEEALELLRKALE------ 240
|
250 260 270
....*....|....*....|....*....|....*...
gi 1622913461 1045 lckENSLDDQLMNLALLSSPEDMIEAARYYEEKGMQMD 1082
Cdd:COG2956 241 ---LDPSDDLLLALADLLERKEGLEAALALLERQLRRH 275
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
24-133 |
3.19e-08 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 57.61 E-value: 3.19e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 24 HPVHPFLAVAY-------ISTTSTGSVDIY-LEQGECVPdTHVERPFRVASLCWHPTRPVLAMGWETGEVTVFNKQDKEQ 95
Cdd:COG2319 286 GHSGGVNSVAFspdgkllASGSDDGTVRLWdLATGKLLR-TLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGEL 364
|
90 100 110
....*....|....*....|....*....|....*...
gi 1622913461 96 HTTPPTHTADIAVLSWSPSGNCLLSGDRLGVLLLWRLD 133
Cdd:COG2319 365 LRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
15-167 |
2.99e-06 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 50.80 E-value: 2.99e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 15 AGSPSHISWHPVHPFLAvayiSTTSTGSVDIY-LEQGECVpDTHVERPFRVASLCWHPTRPVLAMGWETGEVTVFNKQDK 93
Cdd:cd00200 51 TGPVRDVAASADGTYLA----SGSSDKTIRLWdLETGECV-RTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETG 125
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1622913461 94 EQHTTPPTHTADIAVLSWSPSGNCLLSGDRLGVLLLWRLDQrGRVQGTpLLKHEygKPLTHCIFRlpPPGEDLV 167
Cdd:cd00200 126 KCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRT-GKCVAT-LTGHT--GEVNSVAFS--PDGEKLL 193
|
|
| PEP_TPR_lipo |
TIGR02917 |
putative PEP-CTERM system TPR-repeat lipoprotein; This protein family occurs in strictly ... |
833-1233 |
3.44e-05 |
|
putative PEP-CTERM system TPR-repeat lipoprotein; This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Pssm-ID: 274350 [Multi-domain] Cd Length: 899 Bit Score: 48.54 E-value: 3.44e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 833 ARALREAEREPELEARVAVLATQLGM--------------LEDAEQLYRKCKRYDLLNRF-YQAAGQWQKALKVAERhdr 897
Cdd:TIGR02917 380 EKAAEYLAKATELDPENAAARTQLGIsklsqgdpseaiadLETAAQLDPELGRADLLLILsYLRSGQFDKALAAAKK--- 456
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 898 vhLR-------STYHHYAGHLEASADCSRALSYYEKSdthrfevprmLS---EDLPSL-ELyvnkmkdktlwrwwAQYLE 966
Cdd:TIGR02917 457 --LEkkqpdnaSLHNLLGAIYLGKGDLAKAREAFEKA----------LSiepDFFPAAaNL--------------ARIDI 510
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 967 SQGEMDAALHYYElaqdhfSLVRIHcfqgnvqkaaqianeTGNLAASYHLARQYESQEEVGQAVHFYTRA-----QAFKN 1041
Cdd:TIGR02917 511 QEGNPDDAIQRFE------KVLTID---------------PKNLRAILALAGLYLRTGNEEEAVAWLEKAaelnpQEIEP 569
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 1042 AIRLCKENSLDDQLMN-LALLSSPEDMI-EAARYYEEKGMqmdravmLYHKAGHFSKALElAFTtqqfaalQLIAEDLDE 1119
Cdd:TIGR02917 570 ALALAQYYLGKGQLKKaLAILNEAADAApDSPEAWLMLGR-------AQLAAGDLNKAVS-SFK-------KLLALQPDS 634
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 1120 tsdPALLARCSDFFIEHRQYERAvelllaARKYREALQL----CLEQNMSITEEMAEKMTV-AKDSSDLPEESRRE---L 1191
Cdd:TIGR02917 635 ---ALALLLLADAYAVMKNYAKA------ITSLKRALELkpdnTEAQIGLAQLLLAAKRTEsAKKIAKSLQKQHPKaalG 705
|
410 420 430 440
....*....|....*....|....*....|....*....|....*....
gi 1622913461 1192 LEQIANCCMRQGSYHLATKKYTQA-------GNKLKAMRALLKSGDTEK 1233
Cdd:TIGR02917 706 FELEGDLYLRQKDYPAAIQAYRKAlkrapssQNAIKLHRALLASGNTAE 754
|
|
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
973-1227 |
2.72e-03 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 41.25 E-value: 2.72e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 973 AALHYYELAQDHFslvrihcFQGNVQKAAQ-----IANETGNLAASYHLARQYESQEEVGQAVHFYtraqafKNAIRLCK 1047
Cdd:COG2956 7 AALGWYFKGLNYL-------LNGQPDKAIDlleeaLELDPETVEAHLALGNLYRRRGEYDRAIRIH------QKLLERDP 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 1048 ENslDDQLMNLAllsspedmieaaryyeekgmqmdravMLYHKAGHFSKALELafttqqfaALQLIAEDLDetsDPALLA 1127
Cdd:COG2956 74 DR--AEALLELA--------------------------QDYLKAGLLDRAEEL--------LEKLLELDPD---DAEALR 114
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 1128 RCSDFFIEHRQYERAVELLLAARKYRealqlclEQNMSITEEMAEKMTVAKDssdlPEESRRELLE--QIANCCMR---- 1201
Cdd:COG2956 115 LLAEIYEQEGDWEKAIEVLERLLKLG-------PENAHAYCELAELYLEQGD----YDEAIEALEKalKLDPDCARalll 183
|
250 260
....*....|....*....|....*.
gi 1622913461 1202 QGSYHLATKKYTQAgnkLKAMRALLK 1227
Cdd:COG2956 184 LAELYLEQGDYEEA---IAALERALE 206
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
824-1082 |
1.18e-08 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 57.82 E-value: 1.18e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 824 LGNMGHARG--ARALRE----AEREPE-LEARV--AVLATQLGMLEDAEQLYRKCKRYD--------LLNRFYQAAGQWQ 886
Cdd:COG2956 48 LGNLYRRRGeyDRAIRIhqklLERDPDrAEALLelAQDYLKAGLLDRAEELLEKLLELDpddaealrLLAEIYEQEGDWE 127
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 887 KALKVAERHDRVHLRSTYHHY--AGHLEASADCSRALSYYEKsdthrfevprMLSEDLPSLELYVNKmkdktlwrwwAQY 964
Cdd:COG2956 128 KAIEVLERLLKLGPENAHAYCelAELYLEQGDYDEAIEALEK----------ALKLDPDCARALLLL----------AEL 187
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 965 LESQGEMDAALHYYelaqdhfslvrihcfqgnvQKAAQIANEtgNLAASYHLARQYESQEEVGQAVHFYTRAQAfknair 1044
Cdd:COG2956 188 YLEQGDYEEAIAAL-------------------ERALEQDPD--YLPALPRLAELYEKLGDPEEALELLRKALE------ 240
|
250 260 270
....*....|....*....|....*....|....*...
gi 1622913461 1045 lckENSLDDQLMNLALLSSPEDMIEAARYYEEKGMQMD 1082
Cdd:COG2956 241 ---LDPSDDLLLALADLLERKEGLEAALALLERQLRRH 275
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
24-133 |
3.19e-08 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 57.61 E-value: 3.19e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 24 HPVHPFLAVAY-------ISTTSTGSVDIY-LEQGECVPdTHVERPFRVASLCWHPTRPVLAMGWETGEVTVFNKQDKEQ 95
Cdd:COG2319 286 GHSGGVNSVAFspdgkllASGSDDGTVRLWdLATGKLLR-TLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGEL 364
|
90 100 110
....*....|....*....|....*....|....*...
gi 1622913461 96 HTTPPTHTADIAVLSWSPSGNCLLSGDRLGVLLLWRLD 133
Cdd:COG2319 365 LRTLTGHTGAVTSVAFSPDGRTLASGSADGTVRLWDLA 402
|
|
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
877-1149 |
1.64e-07 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 54.35 E-value: 1.64e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 877 RFYQAAGQWQKALKVAERHDRVHLRSTYHHYA-GHL-EASADCSRALSYYEKsdthrfevprMLSEDLPSLELYVNKmkd 954
Cdd:COG2956 16 LNYLLNGQPDKAIDLLEEALELDPETVEAHLAlGNLyRRRGEYDRAIRIHQK----------LLERDPDRAEALLEL--- 82
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 955 ktlwrwwAQYLESQGEMDAALHYYELAQD--------HFSLVRIHCFQGNVQKAAQIANE-----TGNLAASYHLARQYE 1021
Cdd:COG2956 83 -------AQDYLKAGLLDRAEELLEKLLEldpddaeaLRLLAEIYEQEGDWEKAIEVLERllklgPENAHAYCELAELYL 155
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 1022 SQEEVGQAVHFYTRA-QAFKNAIRLckenslddqLMNLA-LLSSPEDMIEAARYYEEKGMQMDRAVMLYHKAGHFSKALE 1099
Cdd:COG2956 156 EQGDYDEAIEALEKAlKLDPDCARA---------LLLLAeLYLEQGDYEEAIAALERALEQDPDYLPALPRLAELYEKLG 226
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|
gi 1622913461 1100 lafttQQFAALQLIAEDLDETSDPALLARCSDFFIEHRQYERAVELLLAA 1149
Cdd:COG2956 227 -----DPEEALELLRKALELDPSDDLLLALADLLERKEGLEAALALLERQ 271
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
16-139 |
3.11e-07 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 54.53 E-value: 3.11e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 16 GSPSHISWHPVHPFLAvayiSTTSTGSVDIY-LEQGECVpDTHVERPFRVASLCWHPTRPVLAMGWETGEVTVFNKQDKE 94
Cdd:COG2319 247 GSVRSVAFSPDGRLLA----SGSADGTVRLWdLATGELL-RTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGK 321
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 1622913461 95 QHTTPPTHTADIAVLSWSPSGNCLLSGDRLGVLLLWRLDQRGRVQ 139
Cdd:COG2319 322 LLRTLTGHTGAVRSVAFSPDGKTLASGSDDGTVRLWDLATGELLR 366
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
15-167 |
2.99e-06 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 50.80 E-value: 2.99e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 15 AGSPSHISWHPVHPFLAvayiSTTSTGSVDIY-LEQGECVpDTHVERPFRVASLCWHPTRPVLAMGWETGEVTVFNKQDK 93
Cdd:cd00200 51 TGPVRDVAASADGTYLA----SGSSDKTIRLWdLETGECV-RTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETG 125
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 1622913461 94 EQHTTPPTHTADIAVLSWSPSGNCLLSGDRLGVLLLWRLDQrGRVQGTpLLKHEygKPLTHCIFRlpPPGEDLV 167
Cdd:cd00200 126 KCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRT-GKCVAT-LTGHT--GEVNSVAFS--PDGEKLL 193
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
16-140 |
5.66e-06 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 50.29 E-value: 5.66e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 16 GSPSHISWHPVHPFLAVAyistTSTGSVDIY-LEQGECVpDTHVERPFRVASLCWHPTRPVLAMGWETGEVTVFNKQDKE 94
Cdd:COG2319 205 GAVRSVAFSPDGKLLASG----SADGTVRLWdLATGKLL-RTLTGHSGSVRSVAFSPDGRLLASGSADGTVRLWDLATGE 279
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 1622913461 95 QHTTPPTHTADIAVLSWSPSGNCLLSGDRLGVLLLWRLDQRGRVQG 140
Cdd:COG2319 280 LLRTLTGHSGGVNSVAFSPDGKLLASGSDDGTVRLWDLATGKLLRT 325
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
16-139 |
3.11e-05 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 47.33 E-value: 3.11e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 16 GSPSHISWHPVHPFLAVAyistTSTGSVDIY-LEQGECVpDTHVERPFRVASLCWHPTRPVLAMGWETGEVTVFNKQDKE 94
Cdd:cd00200 136 DWVNSVAFSPDGTFVASS----SQDGTIKLWdLRTGKCV-ATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGK 210
|
90 100 110 120
....*....|....*....|....*....|....*....|....*
gi 1622913461 95 QHTTPPTHTADIAVLSWSPSGNCLLSGDRLGVLLLWRLDQRGRVQ 139
Cdd:cd00200 211 CLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQ 255
|
|
| PEP_TPR_lipo |
TIGR02917 |
putative PEP-CTERM system TPR-repeat lipoprotein; This protein family occurs in strictly ... |
833-1233 |
3.44e-05 |
|
putative PEP-CTERM system TPR-repeat lipoprotein; This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Pssm-ID: 274350 [Multi-domain] Cd Length: 899 Bit Score: 48.54 E-value: 3.44e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 833 ARALREAEREPELEARVAVLATQLGM--------------LEDAEQLYRKCKRYDLLNRF-YQAAGQWQKALKVAERhdr 897
Cdd:TIGR02917 380 EKAAEYLAKATELDPENAAARTQLGIsklsqgdpseaiadLETAAQLDPELGRADLLLILsYLRSGQFDKALAAAKK--- 456
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 898 vhLR-------STYHHYAGHLEASADCSRALSYYEKSdthrfevprmLS---EDLPSL-ELyvnkmkdktlwrwwAQYLE 966
Cdd:TIGR02917 457 --LEkkqpdnaSLHNLLGAIYLGKGDLAKAREAFEKA----------LSiepDFFPAAaNL--------------ARIDI 510
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 967 SQGEMDAALHYYElaqdhfSLVRIHcfqgnvqkaaqianeTGNLAASYHLARQYESQEEVGQAVHFYTRA-----QAFKN 1041
Cdd:TIGR02917 511 QEGNPDDAIQRFE------KVLTID---------------PKNLRAILALAGLYLRTGNEEEAVAWLEKAaelnpQEIEP 569
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 1042 AIRLCKENSLDDQLMN-LALLSSPEDMI-EAARYYEEKGMqmdravmLYHKAGHFSKALElAFTtqqfaalQLIAEDLDE 1119
Cdd:TIGR02917 570 ALALAQYYLGKGQLKKaLAILNEAADAApDSPEAWLMLGR-------AQLAAGDLNKAVS-SFK-------KLLALQPDS 634
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 1120 tsdPALLARCSDFFIEHRQYERAvelllaARKYREALQL----CLEQNMSITEEMAEKMTV-AKDSSDLPEESRRE---L 1191
Cdd:TIGR02917 635 ---ALALLLLADAYAVMKNYAKA------ITSLKRALELkpdnTEAQIGLAQLLLAAKRTEsAKKIAKSLQKQHPKaalG 705
|
410 420 430 440
....*....|....*....|....*....|....*....|....*....
gi 1622913461 1192 LEQIANCCMRQGSYHLATKKYTQA-------GNKLKAMRALLKSGDTEK 1233
Cdd:TIGR02917 706 FELEGDLYLRQKDYPAAIQAYRKAlkrapssQNAIKLHRALLASGNTAE 754
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
35-139 |
4.08e-05 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 47.60 E-value: 4.08e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 35 ISTTSTGSVDIY-LEQGECVPdTHVERPFRVASLCWHPTRPVLAMGWETGEVTVFNKQDKEQHTTPPTHTADIAVLSWSP 113
Cdd:COG2319 94 ASASADGTVRLWdLATGLLLR-TLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLRTLTGHSGAVTSVAFSP 172
|
90 100
....*....|....*....|....*.
gi 1622913461 114 SGNCLLSGDRLGVLLLWRLDQRGRVQ 139
Cdd:COG2319 173 DGKLLASGSDDGTVRLWDLATGKLLR 198
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
14-139 |
5.11e-05 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 47.21 E-value: 5.11e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 14 AAGSPSHISWHPVHPFLAVAY-------ISTTSTGSVDIY-LEQGECVpDTHVERPFRVASLCWHPTRPVLAMGWETGEV 85
Cdd:COG2319 108 ATGLLLRTLTGHTGAVRSVAFspdgktlASGSADGTVRLWdLATGKLL-RTLTGHSGAVTSVAFSPDGKLLASGSDDGTV 186
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....
gi 1622913461 86 TVFNKQDKEQHTTPPTHTADIAVLSWSPSGNCLLSGDRLGVLLLWRLDQRGRVQ 139
Cdd:COG2319 187 RLWDLATGKLLRTLTGHTGAVRSVAFSPDGKLLASGSADGTVRLWDLATGKLLR 240
|
|
| TPR |
COG0457 |
Tetratricopeptide (TPR) repeat [General function prediction only]; |
860-1076 |
5.32e-05 |
|
Tetratricopeptide (TPR) repeat [General function prediction only];
Pssm-ID: 440225 [Multi-domain] Cd Length: 245 Bit Score: 46.54 E-value: 5.32e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 860 EDAEQLYRKCKRYDLLNRFYQAAGQWQKALKVAERHDRVH-LRSTYHHYAGHLEasadcsRALSYYEKsdthrfevprml 938
Cdd:COG0457 6 DDAEAYNNLGLAYRRLGRYEEAIEDYEKALELDPDDAEALyNLGLAYLRLGRYE------EALADYEQ------------ 67
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 939 sedlpSLELYVNkmkDKTLWRWWAQYLESQGEMDAALHYYE--------LAQDHFSLVRIHCFQGNVQKAAQ-----IAN 1005
Cdd:COG0457 68 -----ALELDPD---DAEALNNLGLALQALGRYEEALEDYDkaleldpdDAEALYNLGLALLELGRYDEAIEayeraLEL 139
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1622913461 1006 ETGNLAASYHLARQYESQEEVGQAVHFYTRAQAFKNAIRLCKENSLDDQLMNLALLSSPEDMIEAARYYEE 1076
Cdd:COG0457 140 DPDDADALYNLGIALEKLGRYEEALELLEKLEAAALAALLAAALGEAALALAAAEVLLALLLALEQALRKK 210
|
|
| BepA |
COG4783 |
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell ... |
785-894 |
6.64e-05 |
|
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443813 [Multi-domain] Cd Length: 139 Bit Score: 44.41 E-value: 6.64e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 785 IGDMDEAFKSI-KLIKSE----AVWENMARMCVKTQRLDVAKVCLgnmghargARALREAEREPELEARVAVLATQLGML 859
Cdd:COG4783 17 AGDYDEAEALLeKALELDpdnpEAFALLGEILLQLGDLDEAIVLL--------HEALELDPDEPEARLNLGLALLKAGDY 88
|
90 100 110 120
....*....|....*....|....*....|....*....|...
gi 1622913461 860 EDAEQLYRKCKR--------YDLLNRFYQAAGQWQKALKVAER 894
Cdd:COG4783 89 DEALALLEKALKldpehpeaYLRLARAYRALGRPDEAIAALEK 131
|
|
| WD40 |
COG2319 |
WD40 repeat [General function prediction only]; |
14-139 |
1.41e-04 |
|
WD40 repeat [General function prediction only];
Pssm-ID: 441893 [Multi-domain] Cd Length: 403 Bit Score: 46.06 E-value: 1.41e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 14 AAGSPSHISWHPVHPFLAVAYISTTSTGSVDIYLEQGECVPDTHVERPFRVASLCWHPTRPVLAMGWETGEVTVFNKQDK 93
Cdd:COG2319 31 LLLGLAAAVASLAASPDGARLAAGAGDLTLLLLDAAAGALLATLLGHTAAVLSVAFSPDGRLLASASADGTVRLWDLATG 110
|
90 100 110 120
....*....|....*....|....*....|....*....|....*.
gi 1622913461 94 EQHTTPPTHTADIAVLSWSPSGNCLLSGDRLGVLLLWRLDQRGRVQ 139
Cdd:COG2319 111 LLLRTLTGHTGAVRSVAFSPDGKTLASGSADGTVRLWDLATGKLLR 156
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
63-139 |
4.84e-04 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 43.86 E-value: 4.84e-04
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1622913461 63 RVASLCWHPTRPVLAMGWETGEVTVFNKQDKEQHTTPPTHTADIAVLSWSPSGNCLLSGDRLGVLLLWRLDQRGRVQ 139
Cdd:cd00200 11 GVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVR 87
|
|
| HemYx |
COG3071 |
Uncharacterized protein HemY, contains HemY_N domain and TPR repeats (unrelated to ... |
829-1091 |
5.94e-04 |
|
Uncharacterized protein HemY, contains HemY_N domain and TPR repeats (unrelated to protoporphyrinogen oxidase HemY) [Function unknown];
Pssm-ID: 442305 [Multi-domain] Cd Length: 323 Bit Score: 43.75 E-value: 5.94e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 829 HARGARALRE------AEREPELEARVAVLATQL----GMLEDA----EQLYRKCKR----YDLLNRFYQAAGQWQKALK 890
Cdd:COG3071 61 QALGDYERRDeylaqaLELAPEAELAVLLTRAELlldqGQAEQAlatlEALRAGAPRhpqvLRLLLQAYRQLGDWEELLE 140
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 891 VAER------HDRVHLRSTYHH-YAGHLEASADCSRALsyyeksdthrfevpRMLSEDLPSLElyvnkMKDKTLWRWWAQ 963
Cdd:COG3071 141 LLPAlrkhkaLSAEEAQALERRaYLGLLRQAARDAEAL--------------KALWKALPRAE-----RRDPELAAAYAR 201
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 964 YLESQGEMDAALHYYELAQDHF---SLVRI--HCFQGNVQKAAQIAN-----ETGNLAASYHLARQYESQEEVGQAVHFY 1033
Cdd:COG3071 202 ALIALGDHDEAERLLREALKRQwdpRLVRLygRLQGGDPAKQLKRAEkwlkkHPNDPDLLLALGRLCLRNQLWGKAREYL 281
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|....*...
gi 1622913461 1034 TRAqafknairlckenslddqlmnLALLSSPEDMIEAARYYEEKGmQMDRAVMLYHKA 1091
Cdd:COG3071 282 EAA---------------------LALRPSAEAYAELARLLEQLG-DPEEAAEHYRKA 317
|
|
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
973-1227 |
2.72e-03 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 41.25 E-value: 2.72e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 973 AALHYYELAQDHFslvrihcFQGNVQKAAQ-----IANETGNLAASYHLARQYESQEEVGQAVHFYtraqafKNAIRLCK 1047
Cdd:COG2956 7 AALGWYFKGLNYL-------LNGQPDKAIDlleeaLELDPETVEAHLALGNLYRRRGEYDRAIRIH------QKLLERDP 73
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 1048 ENslDDQLMNLAllsspedmieaaryyeekgmqmdravMLYHKAGHFSKALELafttqqfaALQLIAEDLDetsDPALLA 1127
Cdd:COG2956 74 DR--AEALLELA--------------------------QDYLKAGLLDRAEEL--------LEKLLELDPD---DAEALR 114
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 1128 RCSDFFIEHRQYERAVELLLAARKYRealqlclEQNMSITEEMAEKMTVAKDssdlPEESRRELLE--QIANCCMR---- 1201
Cdd:COG2956 115 LLAEIYEQEGDWEKAIEVLERLLKLG-------PENAHAYCELAELYLEQGD----YDEAIEALEKalKLDPDCARalll 183
|
250 260
....*....|....*....|....*.
gi 1622913461 1202 QGSYHLATKKYTQAgnkLKAMRALLK 1227
Cdd:COG2956 184 LAELYLEQGDYEEA---IAALERALE 206
|
|
| TPR |
COG0457 |
Tetratricopeptide (TPR) repeat [General function prediction only]; |
803-1055 |
3.10e-03 |
|
Tetratricopeptide (TPR) repeat [General function prediction only];
Pssm-ID: 440225 [Multi-domain] Cd Length: 245 Bit Score: 41.15 E-value: 3.10e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 803 VWENMARMCVKTQRLDVAKVCLgnmghargARALREAEREPELEARVAVLATQLGMLEDAEQLYRKCKRYD------LLN 876
Cdd:COG0457 10 AYNNLGLAYRRLGRYEEAIEDY--------EKALELDPDDAEALYNLGLAYLRLGRYEEALADYEQALELDpddaeaLNN 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 877 R--FYQAAGQWQKALKVAERHDRVH--LRSTYHHYAGHLEASADCSRALSYYEKsdthrfevprmlsedlpSLELyvnKM 952
Cdd:COG0457 82 LglALQALGRYEEALEDYDKALELDpdDAEALYNLGLALLELGRYDEAIEAYER-----------------ALEL---DP 141
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 953 KDKTLWRWWAQYLESQGEMDAALHYYELAQDHFSLVRIHCFQGNVQKAAQIANETGNLAASYHLARQYESQEEVGQAVHF 1032
Cdd:COG0457 142 DDADALYNLGIALEKLGRYEEALELLEKLEAAALAALLAAALGEAALALAAAEVLLALLLALEQALRKKLAILTLAALAE 221
|
250 260
....*....|....*....|...
gi 1622913461 1033 YTRAQAFKNAIRLCKENSLDDQL 1055
Cdd:COG0457 222 LLLLALALLLALRLAALALYQYR 244
|
|
| WD40 |
cd00200 |
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions ... |
19-121 |
6.05e-03 |
|
WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and bottom surface of the propeller are proposed to coordinate interactions with other proteins and/or small ligands; 7 copies of the repeat are present in this alignment.
Pssm-ID: 238121 [Multi-domain] Cd Length: 289 Bit Score: 40.40 E-value: 6.05e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1622913461 19 SHISWHPVHPFLavayISTTSTGSVDIY-LEQGECVpDTHVERPFRVASLCWHPTRPVLAMGWETGEVTVFNKQDKEQHT 97
Cdd:cd00200 181 NSVAFSPDGEKL----LSSSSDGTIKLWdLSTGKCL-GTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQ 255
|
90 100
....*....|....*....|....
gi 1622913461 98 TPPTHTADIAVLSWSPSGNCLLSG 121
Cdd:cd00200 256 TLSGHTNSVTSLAWSPDGKRLASG 279
|
|
|