Conserved domains on  [gi|8176554|gb|AAB35488|]
bile salt-dependent lipase [Homo sapiens]

Protein Classification

Esterase_lipase and Mucin-like domain-containing protein (domain architecture ID 11987879)

List of domain hits

Name Accession Description Interval E-value
COesterase pfam00135
Carboxylesterase family;
26-542 0e+00

Carboxylesterase family;


Pssm-ID: 395084 [Multi-domain]  Cd Length: 513  Bit Score: 608.54  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554     26 VYTEGGFVEGVNKKLGLlGDSVDIFKGIPFAAPTKAL---ENPQPHPGWQGTLKAKNFKKRCLQATITQDSTY----GDE 98
Cdd:pfam00135   5 VTTSLGRVRGKRLKVDG-GKPVYAFLGIPYAEPPVGElrfQPPEPPEPWTGVRDATKFGPRCPQNGDLTSPGSsgleGSE 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554     99 DCLYLNIWVPQGRKQVSRDLPVMIWIYGGAFLMGSGHganflnnyLYDGEEIATRGNVIVVTFNYRVGPLGFLSTGDANL 178
Cdd:pfam00135  84 DCLYLNVYTPKELKENKNKLPVMVWIHGGGFMFGSGS--------LYDGSYLAAEGDVIVVTINYRLGPLGFLSTGDDEA 155
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    179 PGNYGLRDQHMAIAWVKRNIAAFGGDPNNITLFGESAGGASVSLQTLSPYNKGLIRAAISQSGVALSPWVIQKNPLFWAK 258
Cdd:pfam00135 156 PGNYGLLDQVLALRWVQENIASFGGDPNRVTLFGESAGAASVSLLLLSPLSKGLFHRAILMSGSALSPWAIQSNARQRAK 235
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    259 KVAEKVGCPVGDAARMAQCLKVTDPAAVTVAYKVPlagLEYPMLHYVGFVPVIDEDFIPADPINLYA--NAADIDYIAGT 336
Cdd:pfam00135 236 ELAKLVGCPTSDSAELVECLRSKPAEELLDAQLKL---LVYGSVPFVPFGPVVDGDFLPEHPEELLKsgNFPKVPLLIGV 312
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    337 NNMDGHIFASIDMPAINKGNKKVTEEDFYKLVSEFTITKGL---RGAKTTFDVYTEsWAQDPSQENKKKIVVDFETDVLF 413
Cdd:pfam00135 313 TKDEGLLFAAYILDNVDILKALEEKLLRSLLIDLLYLLLVDlpeEISAALREEYLD-WGDRDDPETSRRALVELLTDYLF 391
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    414 LVPTEIALAQHRanAKSAKTYAYLFSHPSRMPVYPKWVGADHADDIQYVFGKPFATPTGYRPQDRTVFKAMIAYWTNFAK 493
Cdd:pfam00135 392 NCPVIRFADLHA--SRGTPVYMYSFDYRGSSLRYPKWVGVDHGDELPYVFGTPFVGALLFTEEDEKLSRKMMTYWTNFAK 469
                         490       500       510       520
                  ....*....|....*....|....*....|....*....|....*....
gi 8176554    494 TGDPNMGdsAVPTHWEPYTTENSGYLEITKKMgssSMKRSLRTNFLRYW 542
Cdd:pfam00135 470 TGNPNGP--EGLPKWPPYTDENGQYLSIDLEP---RVKQGLKAERCAFW 513
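Each alignment row above carries a label, a start coordinate, the aligned residues (with "-" for gaps), and an end coordinate. The following is a minimal sketch, assuming that row layout, of how such a row could be parsed and sanity-checked; the names ROW, parse_row and is_consistent are hypothetical helpers, not part of any NCBI tool.

```python
import re

# Hypothetical pattern for rows such as
# "gi 8176554     26 VYTEGG...GDE 98" or "Cdd:pfam00135   5 VTTSLG...GSE 83".
ROW = re.compile(
    r"^(?P<label>.+?)\s+(?P<start>\d+)\s+(?P<seq>[A-Za-z-]+)\s+(?P<end>\d+)\s*$"
)

def parse_row(line):
    """Split one alignment row into label, coordinates and aligned string."""
    m = ROW.match(line.strip())
    if m is None:
        raise ValueError(f"not an alignment row: {line!r}")
    seq = m["seq"]
    return {
        "label": m["label"],
        "start": int(m["start"]),
        "end": int(m["end"]),
        "seq": seq,
        # Gap characters do not consume a residue number.
        "residues": sum(1 for c in seq if c != "-"),
    }

def is_consistent(row):
    """True when the non-gap residue count matches the coordinate span."""
    return row["residues"] == row["end"] - row["start"] + 1

# First query row of the pfam00135 alignment, copied from the listing above.
first = parse_row(
    "gi 8176554     26 VYTEGGFVEGVNKKLGLlGDSVDIFKGIPFAAPTKAL---ENPQPHPGWQ"
    "GTLKAKNFKKRCLQATITQDSTY----GDE 98"
)
print(first["start"], first["end"], first["residues"], is_consistent(first))
```

A check like is_consistent is a cheap way to detect rows that were truncated or re-wrapped during copy and paste.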
Mucin-like pfam16058
Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated ...
584-677 1.72e-22

Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated lipase, where it is O-glycosylated. It has a biased amino acid composition and is likely to be disordered. The region contains many copies of an approximately 11-residue degenerate repeat.


Pssm-ID: 464997 [Multi-domain]  Cd Length: 94  Bit Score: 92.48  E-value: 1.72e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    584 PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTG 663
Cdd:pfam16058   1 PSSSITEPPRDPSGSYGEPPRAPSSSYTEPQRDPSSSITEPPADPSSSYTEPPRDPSGSYTEPQRDPSSSSTEPQRDPSS 80
                          90
                  ....*....|....
gi 8176554    664 DAGPPPVPPTGDSG 677
Cdd:pfam16058  81 SITEPPRDPSGSYT 94
Mucin-like pfam16058
Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated ...
639-731 7.09e-19

Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated lipase, where it is O-glycosylated. It has a biased amino acid composition and is likely to be disordered. The region contains many copies of an approximately 11-residue degenerate repeat.


Pssm-ID: 464997 [Multi-domain]  Cd Length: 94  Bit Score: 82.08  E-value: 7.09e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    639 PTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTG 718
Cdd:pfam16058   1 PSSSITEPPRDPSGSYGEPPRAPSSSYTEPQRDPSSSITEPPADPSSSYTEPPRDPSGSYTEPQRDPSSSSTEPQRDPSS 80
                          90
                  ....*....|...
gi 8176554    719 DSEAAPVPPTDDS 731
Cdd:pfam16058  81 SITEPPRDPSGSY 93
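The Mucin-like description above mentions a degenerate repeat of roughly 11 residues. As a rough illustration, the sketch below cuts the query segment 584-677 into consecutive 11-residue windows and tallies them. The fixed period of 11 and the helper name chunk are assumptions made for this example; a degenerate repeat will not always stay in register with a fixed window.

```python
from collections import Counter

def chunk(seq, period=11):
    """Split seq into consecutive windows of `period` residues.

    The last window may be shorter than `period`.
    """
    return [seq[i:i + period] for i in range(0, len(seq), period)]

# Query residues 584-677, copied from the two rows of the first
# Mucin-like alignment above.
segment = (
    "PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTG"
    "DAGPPPVPPTGDSG"
)

# Tally identical windows; variants of the repeat show up as separate keys.
for window, count in Counter(chunk(segment)).most_common():
    print(window, count)
```

Windows that differ by one or two residues from the most common one are what "degenerate" refers to in the description.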
 
Esterase_lipase cd00312
Esterases and lipases (includes fungal lipases, cholinesterases, etc.) These enzymes act on ...
25-532 0e+00

Esterases and lipases (includes fungal lipases, cholinesterases, etc.) These enzymes act on carboxylic esters (EC: 3.1.1.-). The catalytic apparatus involves three residues (catalytic triad): a serine, a glutamate or aspartate, and a histidine. These catalytic residues are responsible for the nucleophilic attack on the carbonyl carbon atom of the ester bond. In contrast with other alpha/beta hydrolase fold family members, p-nitrobenzyl esterase and acetylcholine esterase have a Glu instead of Asp at the active site carboxylate.


Pssm-ID: 238191 [Multi-domain]  Cd Length: 493  Bit Score: 561.95  E-value: 0e+00
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   25 AVYTEGGFVEGVNKklgllgDSVDIFKGIPFAAPT---KALENPQPHPGWQGTLKAKNFKKRCLQATITQDS-----TYG 96
Cdd:cd00312   1 LVVTPNGKVRGVDE------GGVYSFLGIPYAEPPvgdLRFKEPQPYEPWSDVLDATSYPPSCMQWDQLGGGlwnakLPG 74
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   97 DEDCLYLNIWVPQGRKqVSRDLPVMIWIYGGAFLMGSGHganflnnyLYDGEEIATRG-NVIVVTFNYRVGPLGFLSTGD 175
Cdd:cd00312  75 SEDCLYLNVYTPKNTK-PGNSLPVMVWIHGGGFMFGSGS--------LYPGDGLAREGdNVIVVSINYRLGVLGFLSTGD 145
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  176 ANLPGNYGLRDQHMAIAWVKRNIAAFGGDPNNITLFGESAGGASVSLQTLSPYNKGLIRAAISQSGVALSPWVIQKNPLF 255
Cdd:cd00312 146 IELPGNYGLKDQRLALKWVQDNIAAFGGDPDSVTIFGESAGGASVSLLLLSPDSKGLFHRAISQSGSALSPWAIQENARG 225
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  256 WAKKVAEKVGCPVGDAARMAQCLKVTDPAAVTVAYKVPlagLEYPMLHYVGFVPVIDEDFIPADPINLYA--NAADIDYI 333
Cdd:cd00312 226 RAKRLARLLGCNDTSSAELLDCLRSKSAEELLDATRKL---LLFSYSPFLPFGPVVDGDFIPDDPEELIKegKFAKVPLI 302
                       330       340       350       360       370       380       390       400
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  334 AGTNNMDGHIFASIDMPAINKgNKKVTEEDFYKLVSEFTITKGLRGAKTTFDVYTESWAQdpsQENKKKIVVDFETDVLF 413
Cdd:cd00312 303 IGVTKDEGGYFAAMLLNFDAK-LIIETNDRWLELLPYLLFYADDALADKVLEKYPGDVDD---SVESRKNLSDMLTDLLF 378
                       410       420       430       440       450       460       470       480
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  414 LVPTEIALAQHRANAKSaKTYAYLFSHPSRMPV--YPKWVGADHADDIQYVFGKPFATPTGYrPQDRTVFKAMIAYWTNF 491
Cdd:cd00312 379 KCPARYFLAQHRKAGGS-PVYAYVFDHRSSLSVgrWPPWLGTVHGDEIFFVFGNPLLKEGLR-EEEEKLSRTMMKYWANF 456
                       490       500       510       520
                ....*....|....*....|....*....|....*....|.
gi 8176554  492 AKTGDPNMGDsaVPTHWEPYTTENSGYLEITkkMGSSSMKR 532
Cdd:cd00312 457 AKTGNPNTEG--NLVVWPAYTSESEKYLDIN--IEGTEIKQ 493
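The Esterase_lipase description above names a catalytic serine but does not say where it sits. The sketch below rests on an assumption that is not stated on this page: in many alpha/beta-hydrolase esterases and lipases the nucleophilic serine lies in a G-X-S-X-G pentapeptide. It scans one ungapped query row for that pattern and reports the residue number of the central serine. The function name find_gxsxg is hypothetical, and a motif match is only a hint, not evidence of catalytic function.

```python
import re

# Assumption (not from this page): the catalytic serine of many
# esterases/lipases sits in a G-X-S-X-G pentapeptide.
# The lookahead lets overlapping candidates be reported as well.
GXSXG = re.compile(r"(?=(G.S.G))")

def find_gxsxg(seq, first_residue=1):
    """Return (serine_residue_number, motif) pairs for an ungapped row.

    `first_residue` is the coordinate printed at the start of the row.
    """
    seq = seq.upper()
    return [
        (first_residue + m.start(1) + 2, m.group(1))
        for m in GXSXG.finditer(seq)
    ]

# Query row 176-255 of the cd00312 alignment above (it contains no gaps).
row = (
    "ANLPGNYGLRDQHMAIAWVKRNIAAFGGDPNNITLFGESAGGASVSLQTLSPYNKGLIRAAISQSGVALSPWVIQKNPLF"
)
print(find_gxsxg(row, first_residue=176))
```

Rows that contain "-" characters would need the gaps removed before residue numbers are computed this way.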
PnbA COG2272
Carboxylesterase type B [Lipid transport and metabolism];
26-521 4.14e-123

Carboxylesterase type B [Lipid transport and metabolism];


Pssm-ID: 441873  Cd Length: 500  Bit Score: 377.31  E-value: 4.14e-123
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   26 VYTEGGFVEGVnkklglLGDSVDIFKGIPFAAPT------KAlenPQPHPGWQGTLKAKNFKKRCLQATITQD---STYG 96
Cdd:COG2272  15 VRTEAGRVRGV------VEGGVRVFLGIPYAAPPvgelrwRA---PQPVEPWTGVRDATEFGPACPQPPRPGDpggPAPG 85
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   97 DEDCLYLNIWVPqgRKQVSRDLPVMIWIYGGAFLMGSGHGAnflnnyLYDGEEIATRGnVIVVTFNYRVGPLGF-----L 171
Cdd:COG2272  86 SEDCLYLNVWTP--ALAAGAKLPVMVWIHGGGFVSGSGSEP------LYDGAALARRG-VVVVTINYRLGALGFlalpaL 156
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  172 STGDANLPGNYGLRDQHMAIAWVKRNIAAFGGDPNNITLFGESAGGASVSLQTLSPYNKGLIRAAISQSGVAL---SPWV 248
Cdd:COG2272 157 SGESYGASGNYGLLDQIAALRWVRDNIAAFGGDPDNVTIFGESAGAASVAALLASPLAKGLFHRAIAQSGAGLsvlTLAE 236
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  249 IQKnplfWAKKVAEKVGCPVGDAArmaqCLKVTDPAAVTVAYKVPLAGLEYPMlhyvGFVPVIDEDFIPADPINLYAN-- 326
Cdd:COG2272 237 AEA----VGAAFAAALGVAPATLA----ALRALPAEELLAAQAALAAEGPGGL----PFGPVVDGDVLPEDPLEAFAAgr 304
                       330       340       350       360       370       380       390       400
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  327 AADIDYIAGTNNMDGHIFASIDMPAinkgnKKVTEEDFyklvsEFTITKGLRG-AKTTFDVYTESWAQDpsqenkkkIVV 405
Cdd:COG2272 305 AADVPLLIGTNRDEGRLFAALLGDL-----GPLTAADY-----RAALRRRFGDdADEVLAAYPAASPAE--------ALA 366
                       410       420       430       440       450       460       470       480
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  406 DFETDVLFLVPTeIALAQHRAnAKSAKTYAYLFSHPSRMPVYPKWvGADHADDIQYVFGKPFA-TPTGYRPQDRTVFKAM 484
Cdd:COG2272 367 ALATDRVFRCPA-RRLAEAHA-AAGAPVYLYRFDWRSPPLRGFGL-GAFHGAELPFVFGNLDApALTGLTPADRALSDQM 443
                       490       500       510
                ....*....|....*....|....*....|....*..
gi 8176554  485 IAYWTNFAKTGDPNMGDsavPTHWEPYTTENSGYLEI 521
Cdd:COG2272 444 QAYWVNFARTGDPNGPG---LPEWPAYDPEDRAVMVF 477
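Every hit in this listing has a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...", with an optional "[Multi-domain]" flag. A minimal sketch of turning that line into typed fields follows; the pattern reflects only the lines visible on this page, and the names SUMMARY and parse_summary are hypothetical.

```python
import re

# Pattern derived from the summary lines shown on this page only.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)

def parse_summary(line):
    """Parse one 'Pssm-ID: ...' summary line into a dict of typed values."""
    m = SUMMARY.search(line)
    if m is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["flag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        # "0e+00" parses to 0.0, i.e. below the displayed precision.
        "e_value": float(m["e_value"]),
    }

print(parse_summary(
    "Pssm-ID: 441873  Cd Length: 500  Bit Score: 377.31  E-value: 4.14e-123"
))
```

Keeping the E-value as a float makes it straightforward to sort or filter hits by significance.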
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
556-738 9.12e-19

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 91.20  E-value: 9.12e-19
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   556 EATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAP 635
Cdd:PRK07764 610 EEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAA 689
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   636 PVPPTGDSGAPPVPpTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPptgdSGAPPVTPTGDSETAPVPPTGDSGAPPVP 715
Cdd:PRK07764 690 PAAPAGAAPAQPAP-APAATPPAGQADDPAAQPPQAAQGASAPSPAA----DDPVPLPPEPDDPPDPAGAPAQPPPPPAP 764
                        170       180
                 ....*....|....*....|...
gi 8176554   716 PTGDSEAAPVPPTDDSKEAQMPA 738
Cdd:PRK07764 765 APAAAPAAAPPPSPPSEEEEMAE 787
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
551-738 2.44e-16

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 83.50  E-value: 2.44e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   551 TVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTG 630
Cdd:PRK07764 583 QVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDA 662
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   631 DSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPptgdsGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSG 710
Cdd:PRK07764 663 SDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAP-----APAATPPAGQADDPAAQPPQAAQGASAPSPAADD 737
                        170       180
                 ....*....|....*....|....*...
gi 8176554   711 APPVPPTGDSEAAPVPPTDDSKEAQMPA 738
Cdd:PRK07764 738 PVPLPPEPDDPPDPAGAPAQPPPPPAPA 765
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
558-737 3.27e-14

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 75.96  E-value: 3.27e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   558 TPVPPTGDSEATPVP---PTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGA 634
Cdd:NF033839 283 TPKEPGNKKPSAPKPgmqPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPK 362
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   635 PPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTP---TGDSETAPVPPTGDSGA 711
Cdd:NF033839 363 PEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPqpeKPKPEVKPQPEKPKPEV 442
                        170       180
                 ....*....|....*....|....*.
gi 8176554   712 PPVPPTGDSEAAPVPPTDDSKEAQMP 737
Cdd:NF033839 443 KPQPEKPKPEVKPQPETPKPEVKPQP 468
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
564-728 1.93e-12

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 70.57  E-value: 1.93e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   564 GDSEATPVPPTGDSETAPVP---PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPT 640
Cdd:NF033839 278 GLTQDTPKEPGNKKPSAPKPgmqPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQ 357
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   641 GDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTP---TGDSETAPVPPTGDSGAPPVPPT 717
Cdd:NF033839 358 PEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPqpeKPKPEVKPQPEKPKPEVKPQPEK 437
                        170
                 ....*....|.
gi 8176554   718 GDSEAAPVPPT 728
Cdd:NF033839 438 PKPEVKPQPEK 448
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
550-728 3.93e-11

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 66.33  E-value: 3.93e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   550 PTVTDQEATPVPPTgdseaTPVPPTGDSETAPVPPTgdsGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPT 629
Cdd:NF033839 297 PGMQPSPQPEKKEV-----KPEPETPKPEVKPQLEK---PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPEVKPQ 368
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   630 GDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTP---TGDSETAPVPPT 706
Cdd:NF033839 369 PEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPqpeKPKPEVKPQPEK 448
                        170       180
                 ....*....|....*....|..
gi 8176554   707 GDSGAPPVPPTGDSEAAPVPPT 728
Cdd:NF033839 449 PKPEVKPQPETPKPEVKPQPEK 470
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
556-732 1.12e-10

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 64.79  E-value: 1.12e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   556 EATPVPP---TGDSEATPVPPTGDSETAPVPPTGDsgaPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDS 632
Cdd:NF033839 306 EKKEVKPepeTPKPEVKPQLEKPKPEVKPQPEKPK---PEVKPQLETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPET 382
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   633 GAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTP---TGDSETAPVPPTGDS 709
Cdd:NF033839 383 PKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPqpeKPKPEVKPQPETPKP 462
                        170       180
                 ....*....|....*....|....*.
gi 8176554   710 GAPPVPPTGDSEAAP---VPPTDDSK 732
Cdd:NF033839 463 EVKPQPEKPKPEVKPqpeKPKPDNSK 488
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
550-735 4.40e-10

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 62.86  E-value: 4.40e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   550 PTVTDQEATPVPPTGDSEATPVPPTGDSETAP--------VPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDS 621
Cdd:NF033839 303 PQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPqpekpkpeVKPQLETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPET 382
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   622 GAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETA 701
Cdd:NF033839 383 PKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKP 462
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*
gi 8176554   702 PVPPTGDSGAPPVPPTGDS-----------EAAPVPPTDDSKEAQ 735
Cdd:NF033839 463 EVKPQPEKPKPEVKPQPEKpkpdnskpqadDKKPSTPNNLSKDKQ 507
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
575-737 1.33e-09

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 61.32  E-value: 1.33e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   575 GDSETAPVPPTGDSGAPPVP---PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPT 651
Cdd:NF033839 278 GLTQDTPKEPGNKKPSAPKPgmqPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQ 357
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   652 GDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTP---TGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPT 728
Cdd:NF033839 358 PEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPqpeKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEK 437

                 ....*....
gi 8176554   729 DDSKEAQMP 737
Cdd:NF033839 438 PKPEVKPQP 446
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
621-739 2.78e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 57.30  E-value: 2.78e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   621 SGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTgDSGPPPVPPTGDSGAPPVTPTGDSET 700
Cdd:PRK07764 383 RRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAP-APAPAPPSPAGNAPAGGAPSPPPAAA 461
                         90       100       110
                 ....*....|....*....|....*....|....*....
gi 8176554   701 APVPPTGdSGAPPVPPTGDSEAAPVPPTDDSKEAQMPAV 739
Cdd:PRK07764 462 PSAQPAP-APAAAPEPTAAPAPAPPAAPAPAAAPAAPAA 499
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
566-715 5.31e-08

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 55.54  E-value: 5.31e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   566 SEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPV-PPTGDSGAPPVPPTGDSG 644
Cdd:NF040712 188 IDPDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRRrAGVEQPEDEPVGPGAAPA 267
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 8176554   645 APPVPPTGDSGAPPVPPTGDAGPPPvPPTGDSGPPPVPPTGDSGAPPVTPtgdSETAPVPPTGDSGAPPVP 715
Cdd:NF040712 268 AEPDEATRDAGEPPAPGAAETPEAA-EPPAPAPAAPAAPAAPEAEEPARP---EPPPAPKPKRRRRRASVP 334
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
628-731 4.38e-07

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 52.98  E-value: 4.38e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   628 PTGDSGAP-PVPPtgdsgaPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTgdSGAPPVTPTGDSETAPVPPT 706
Cdd:NF040983  79 PVGDRTLPnKVPP------PPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPP--PSPPPPTTTPPTRTTPSTTT 150
                         90       100
                 ....*....|....*....|....*
gi 8176554   707 GDSGAPPVPPTGDSEAAPVPPTDDS 731
Cdd:NF040983 151 PTPSMHPIQPTQLPSIPNATPTSGS 175
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
558-692 6.17e-07

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 52.08  E-value: 6.17e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   558 TPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPV 637
Cdd:NF040712 200 ATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRRRAGVEQPEDEPVGPGAAPAAEPDEATRDAGE 279
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 8176554   638 PPTgdSGAPPVPPTGDSGAP-PVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPV 692
Cdd:NF040712 280 PPA--PGAAETPEAAEPPAPaPAAPAAPAAPEAEEPARPEPPPAPKPKRRRRRASV 333
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
556-704 6.98e-07

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 52.08  E-value: 6.98e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   556 EATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPV-PPTGDSGAPPVPPTGDSGA 634
Cdd:NF040712 189 DPDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRRrAGVEQPEDEPVGPGAAPAA 268
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   635 PPVPPTGDSGAPPVPPTGDSGAPPvPPTGDAGPPPVPPTGDSGPPPVPPtgdSGAPPVTPTGDSETAPVP 704
Cdd:NF040712 269 EPDEATRDAGEPPAPGAAETPEAA-EPPAPAPAAPAAPAAPEAEEPARP---EPPPAPKPKRRRRRASVP 334
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
590-738 7.36e-07

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 52.08  E-value: 7.36e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   590 APPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPtgDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPP 669
Cdd:NF040712 193 GRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDS--DPAEAGTPDDLASARRRRAGVEQPEDEPVGPGAAPAAEP 270
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 8176554   670 VPPTGDSGPPPVPPTGDSGAPPVTPtgdsETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMPA 738
Cdd:NF040712 271 DEATRDAGEPPAPGAAETPEAAEPP----APAPAAPAAPAAPEAEEPARPEPPPAPKPKRRRRRASVPS 335
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
552-671 9.85e-07

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 51.69  E-value: 9.85e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   552 VTDQEATPVPPTGDSEATPVPPTGDSETAPV--PPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvPPT 629
Cdd:NF040712 217 VEPAPAAEGAPATDSDPAEAGTPDDLASARRrrAGVEQPEDEPVGPGAAPAAEPDEATRDAGEPPAPGAAETPEAA-EPP 295
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|..
gi 8176554   630 GDSGAPPVPPTGDSGAPPVPPtgdSGAPPVPPTGDAGPPPVP 671
Cdd:NF040712 296 APAPAAPAAPAAPEAEEPARP---EPPPAPKPKRRRRRASVP 334
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
571-730 1.79e-06

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 50.54  E-value: 1.79e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   571 VPPTGDSETAPVPPTGDSGAPPVPP-----TGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPV-PPTGDSG 644
Cdd:NF040712 177 VTALDDEARWLIDPDFGRPLRPLATvprlaREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRRrAGVEQPE 256
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   645 APPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPvPPTGDSGAPPVTPTGDSETAPVPPtgdSGAPPVPPTGDSEAAP 724
Cdd:NF040712 257 DEPVGPGAAPAAEPDEATRDAGEPPAPGAAETPEAA-EPPAPAPAAPAAPAAPEAEEPARP---EPPPAPKPKRRRRRAS 332

                 ....*.
gi 8176554   725 VPPTDD 730
Cdd:NF040712 333 VPSWDD 338
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
559-730 2.00e-06

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 51.22  E-value: 2.00e-06
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  559 PVPPTGDSEATPVPPTGDSETAPVPPtGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVP 638
Cdd:COG5180 233 KVDPPSTSEARSRPATVDAQPEMRPP-ADAKERRRAAIGDTPAAEPPGLPVLEAGSEPQSDAPEAETARPIDVKGVASAP 311
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  639 PTGDSGAPPvPPTGDSGAPP----------VPPTG-DAGPPPVPPTGDSGPPPVPPtGDSGAPPVTPTGDSETAPVPPTG 707
Cdd:COG5180 312 PATRPVRPP-GGARDPGTPRpgqpterpagVPEAAsDAGQPPSAYPPAEEAVPGKP-LEQGAPRPGSSGGDGAPFQPPNG 389
                       170       180
                ....*....|....*....|....*..
gi 8176554  708 D----SGAPPVPPTGDSEAAPVPPTDD 730
Cdd:COG5180 390 ApqpgLGRRGAPGPPMGAGDLVQAALD 416
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
622-705 7.14e-06

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 49.23  E-value: 7.14e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   622 GAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPvtPTGDSETA 701
Cdd:NF041121  16 GRAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAP--GAALPVRV 93

                 ....
gi 8176554   702 PVPP 705
Cdd:NF041121  94 PAPP 97
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
548-660 2.75e-05

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 47.07  E-value: 2.75e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   548 ALPTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvPPTGDSGAPPVP 627
Cdd:NF040712 226 APATDSDPAEAGTPDDLASARRRRAGVEQPEDEPVGPGAAPAAEPDEATRDAGEPPAPGAAETPEAA-EPPAPAPAAPAA 304
                         90       100       110
                 ....*....|....*....|....*....|...
gi 8176554   628 PTGDSGAPPVPPtgdSGAPPVPPTGDSGAPPVP 660
Cdd:NF040712 305 PAAPEAEEPARP---EPPPAPKPKRRRRRASVP 334
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
589-682 3.11e-05

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 47.30  E-value: 3.11e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   589 GAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvpptGDAGPP 668
Cdd:NF041121  16 GRAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAP----GAALPV 91
                         90
                 ....*....|....
gi 8176554   669 PVPptgdsGPPPVP 682
Cdd:NF041121  92 RVP-----APPALP 100
half-pint TIGR01645
poly-U binding splicing factor, half-pint family; The proteins represented by this model ...
575-738 3.84e-05

poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. In the case of PUF60 (GP|6176532), in complex with p54, and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.


Pssm-ID: 130706 [Multi-domain]  Cd Length: 612  Bit Score: 46.99  E-value: 3.84e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    575 GDSETAPVP---PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPtGDSGAPPVPPTG----DSGAPPVPPTGDSGAPP 647
Cdd:TIGR01645 279 GKCVTPPDAllqPATVSAIPAAAAVAAAAATAKIMAAEAVAGAAVL-GPRAQSPATPSSslptDIGNKAVVSSAKKEAEE 357
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    648 VPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGdsgapPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPP 727
Cdd:TIGR01645 358 VPPLPQAAPAVVKPGPMEIPTPVPPPGLAIPSLVAPPG-----LVAPTEINPSFLASPRKKMKREKLPVTFGALDDTLAW 432
                         170
                  ....*....|.
gi 8176554    728 TDDSKEAQMPA 738
Cdd:TIGR01645 433 KEPSKEDQTSE 443
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
600-695 3.85e-05

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 46.92  E-value: 3.85e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   600 GAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPvpptGDSGPP 679
Cdd:NF041121  16 GRAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAP----GAALPV 91
                         90
                 ....*....|....*.
gi 8176554   680 PVPptgdsgAPPVTPT 695
Cdd:NF041121  92 RVP------APPALPN 101
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
629-727 9.97e-05

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 45.38  E-value: 9.97e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   629 TGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPvPPTGDSGPPPVPPtgdsgappvtptgdsETAPVPPTGD 708
Cdd:NF041121  15 MGRAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPP-APEPAPLPAPYPG---------------SLAPPPPPPP 78
                         90
                 ....*....|....*....
gi 8176554   709 SGAPPVPPTGDSEAAPVPP 727
Cdd:NF041121  79 GPAGAAPGAALPVRVPAPP 97
DamX COG3266
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ...
641-732 1.25e-04

Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442497 [Multi-domain]  Cd Length: 455  Bit Score: 45.23  E-value: 1.25e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  641 GDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGP--PPVPPTGDSGAPPVTPT--GDSETAPVPPTGDSGAPPVPP 716
Cdd:COG3266 262 SSASAPATTSLGEQQEVSLPPAVAAQPAAAAAAQPSAValPAAPAAAAAAAAPAEAAapQPTAAKPVVTETAAPAAPAPE 341
                        90
                ....*....|....*.
gi 8176554  717 TGDSEAAPVPPTDDSK 732
Cdd:COG3266 342 AAAAAAAPAAPAVAKK 357
BimA_first NF040984
trimeric autotransporter actin-nucleating factor BimA; BimA (B. pseudomallei intracellular ...
635-704 2.09e-04

trimeric autotransporter actin-nucleating factor BimA; BimA (B. pseudomallei intracellular motility protein A) is a trimeric autotransporter, homologous in its C-terminal half to a number of trimeric autotransporter adhesins. It is a virulence factor that nucleates actin, so that actin polymerization can drive escape by B. pseudomallei out of one cell and into a neighboring cell. HMM NF040983 describes a homolog with similar activity but substantial difference in sequence architecture in the N-terminal region.


Pssm-ID: 468914 [Multi-domain]  Cd Length: 517  Bit Score: 44.48  E-value: 2.09e-04
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 8176554   635 PPVPPTGDSgaPPVPPTGDSGAPPVPPtgdagPPPVPPTGDSGPPPVPPTGDSGA-----PPVTPTGDSETAPVP 704
Cdd:NF040984  42 PPEPPGGTN--IPVPPPMPGGGANIPV-----PPPMPGGGANIPPPPPPPGGIGGatpspPPLTPVNGNPGASTP 109
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
578-671 3.17e-04

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 43.84  E-value: 3.17e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   578 ETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvpptGDSGAP 657
Cdd:NF041121  16 GRAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAP----GAALPV 91
                         90
                 ....*....|....
gi 8176554   658 PVPptgdaGPPPVP 671
Cdd:NF041121  92 RVP-----APPALP 100
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
563-662 4.25e-04

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 43.45  E-value: 4.25e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   563 TGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPvpptGDSGAPPVPPTGDSGAPPVPPTGDsgaPPVPPTGD 642
Cdd:NF041121  15 MGRAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPP----APEPAPLPAPYPGSLAPPPPPPPG---PAGAAPGA 87
                         90       100
                 ....*....|....*....|
gi 8176554   643 SGAPPVPptgdsgAPPVPPT 662
Cdd:NF041121  88 ALPVRVP------APPALPN 101
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
557-651 1.34e-03

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 41.91  E-value: 1.34e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   557 ATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPvPPTGDSGAPPVPptGDSGAPPVPPTGDSGAPPvpptGDSGAPP 636
Cdd:NF041121  20 APPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPP-APEPAPLPAPYP--GSLAPPPPPPPGPAGAAP----GAALPVR 92
                         90
                 ....*....|....*
gi 8176554   637 VPptgdsgAPPVPPT 651
Cdd:NF041121  93 VP------APPALPN 101
KLF17_N cd21574
N-terminal domain of Kruppel-like factor 17; Kruppel-like factor 17 (KLF17), or Krueppel-like ...
601-729 1.85e-03

N-terminal domain of Kruppel-like factor 17; Kruppel-like factor 17 (KLF17), or Krueppel-like factor 17, is a protein that, in humans, is encoded by the KLF17 gene and acts as a tumor suppressor. It negatively regulates epithelial-mesenchymal transition and metastasis in breast cancer. KLF17 is thought to be the human ortholog of the mouse gene, zinc finger protein 393 (Zfp393), although it has diverged significantly. KLF17 can regulate gene transcription from CACCC-box elements. It belongs to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domain of KLF17.


Pssm-ID: 410567  Cd Length: 286  Bit Score: 40.83  E-value: 1.85e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  601 APPVPPTGDSGAPPVPPTGDSGAPPVPPTGdSGAPPVPPTGdsgaPPVPPTgdSGAPPVPPTGdagpppVPPTGDSGPPP 680
Cdd:cd21574 111 SPSQPGMMIFKGPQMMPLGEPNIPGVAMTF-SGNLRMPPSG----LPVSAS--SGIPMMSHIR------APTMPYSGPPT 177
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|.
gi 8176554  681 VPPTGDSGAPPV--TPTgdsetapVPPTgdsGAPPVPPtgdSEAAPVPPTD 729
Cdd:cd21574 178 VPSNRDSLTPKMllAPT-------MPST---EAQAVLP---SLAQMLPPRD 215
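
The KLF17_N description above writes the SP/KLF consensus motif in IUPAC nucleotide codes (NNRCRCCYY, with N any nucleotide, R = A/G, Y = C/T). As a rough illustration only, the sketch below expands such a motif into a regular expression and scans one strand of a DNA string for strict matches. The names `IUPAC`, `motif_to_regex` and `find_sites` are hypothetical helpers introduced here; they are not part of CDD or any NCBI tool.

```python
import re

# IUPAC codes named in the description: N = any nucleotide, R = A/G, Y = C/T.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "N": "[ACGT]", "R": "[AG]", "Y": "[CT]",
}

def motif_to_regex(motif):
    """Expand an IUPAC motif string into an equivalent regex pattern string."""
    return "".join(IUPAC[code] for code in motif.upper())

def find_sites(sequence, motif="NNRCRCCYY"):
    """Return 0-based start positions of strict motif matches on the given strand.

    A zero-width lookahead is used so that overlapping matches are all reported.
    """
    pattern = re.compile(f"(?=({motif_to_regex(motif)}))")
    return [m.start() for m in pattern.finditer(sequence.upper())]

if __name__ == "__main__":
    # Made-up test sequence with one match (AAGCGCCTT) starting at index 2.
    print(find_sites("GGAAGCGCCTTGG"))
```

Because the description calls the consensus loose, a strict scan like this will not necessarily recover every GC or GT box; a position weight matrix or a mismatch-tolerant search would be the usual next step.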
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
640-720 3.89e-03

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 40.26  E-value: 3.89e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    640 TGDSGAPPVPPTGdsgappvPPTGDAGPPPVPPTGDSGPPPvpptgdSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGD 719
Cdd:TIGR00601  81 TGKVAPPAATPTS-------APTPTPSPPASPASGMSAAPA------SAVEEKSPSEESATATAPESPSTSVPSSGSDAA 147

                  .
gi 8176554    720 S 720
Cdd:TIGR00601 148 S 148
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
623-741 4.22e-03

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 40.14  E-value: 4.22e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   623 APPVPPTGDSGAPPVPPTGDSGAPPVPPTgdsgAPPVPPTGDAGPPPVPPTGD--SGPPPVPPTGDSGAPPVTPTGDSET 700
Cdd:NF040712 193 GRPLRPLATVPRLAREPADARPEEVEPAP----AAEGAPATDSDPAEAGTPDDlaSARRRRAGVEQPEDEPVGPGAAPAA 268
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|.
gi 8176554   701 APVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMPAVIR 741
Cdd:NF040712 269 EPDEATRDAGEPPAPGAAETPEAAEPPAPAPAAPAAPAAPE 309
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
660-739 6.16e-03

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 39.60  E-value: 6.16e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   660 PPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMPAV 739
Cdd:NF041121  17 RAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAPGAALPVRVPAP 96
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
542-640 7.98e-03

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 39.60  E-value: 7.98e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   542 WTLTYLALPTVTDQEATPVPPTGDSEATPVPPTGDS--ETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDsgaPPVPPTG 619
Cdd:NF041121  10 WLAAQMGRAAAPPSPEGPAPTAASQPATPPPPAAPPspPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPG---PAGAAPG 86
                         90       100
                 ....*....|....*....|.
gi 8176554   620 DSGAPPVPptgdsgAPPVPPT 640
Cdd:NF041121  87 AALPVRVP------APPALPN 101
 
Esterase_lipase cd00312
Esterases and lipases (includes fungal lipases, cholinesterases, etc.) These enzymes act on ...
25-532 0e+00

Esterases and lipases (includes fungal lipases, cholinesterases, etc.). These enzymes act on carboxylic esters (EC: 3.1.1.-). The catalytic apparatus involves three residues (catalytic triad): a serine, a glutamate or aspartate and a histidine. These catalytic residues are responsible for the nucleophilic attack on the carbonyl carbon atom of the ester bond. In contrast with other alpha/beta hydrolase fold family members, p-nitrobenzyl esterase and acetylcholine esterase have a Glu instead of Asp at the active site carboxylate.


Pssm-ID: 238191 [Multi-domain]  Cd Length: 493  Bit Score: 561.95  E-value: 0e+00
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   25 AVYTEGGFVEGVNKklgllgDSVDIFKGIPFAAPT---KALENPQPHPGWQGTLKAKNFKKRCLQATITQDS-----TYG 96
Cdd:cd00312   1 LVVTPNGKVRGVDE------GGVYSFLGIPYAEPPvgdLRFKEPQPYEPWSDVLDATSYPPSCMQWDQLGGGlwnakLPG 74
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   97 DEDCLYLNIWVPQGRKqVSRDLPVMIWIYGGAFLMGSGHganflnnyLYDGEEIATRG-NVIVVTFNYRVGPLGFLSTGD 175
Cdd:cd00312  75 SEDCLYLNVYTPKNTK-PGNSLPVMVWIHGGGFMFGSGS--------LYPGDGLAREGdNVIVVSINYRLGVLGFLSTGD 145
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  176 ANLPGNYGLRDQHMAIAWVKRNIAAFGGDPNNITLFGESAGGASVSLQTLSPYNKGLIRAAISQSGVALSPWVIQKNPLF 255
Cdd:cd00312 146 IELPGNYGLKDQRLALKWVQDNIAAFGGDPDSVTIFGESAGGASVSLLLLSPDSKGLFHRAISQSGSALSPWAIQENARG 225
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  256 WAKKVAEKVGCPVGDAARMAQCLKVTDPAAVTVAYKVPlagLEYPMLHYVGFVPVIDEDFIPADPINLYA--NAADIDYI 333
Cdd:cd00312 226 RAKRLARLLGCNDTSSAELLDCLRSKSAEELLDATRKL---LLFSYSPFLPFGPVVDGDFIPDDPEELIKegKFAKVPLI 302
                       330       340       350       360       370       380       390       400
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  334 AGTNNMDGHIFASIDMPAINKgNKKVTEEDFYKLVSEFTITKGLRGAKTTFDVYTESWAQdpsQENKKKIVVDFETDVLF 413
Cdd:cd00312 303 IGVTKDEGGYFAAMLLNFDAK-LIIETNDRWLELLPYLLFYADDALADKVLEKYPGDVDD---SVESRKNLSDMLTDLLF 378
                       410       420       430       440       450       460       470       480
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  414 LVPTEIALAQHRANAKSaKTYAYLFSHPSRMPV--YPKWVGADHADDIQYVFGKPFATPTGYrPQDRTVFKAMIAYWTNF 491
Cdd:cd00312 379 KCPARYFLAQHRKAGGS-PVYAYVFDHRSSLSVgrWPPWLGTVHGDEIFFVFGNPLLKEGLR-EEEEKLSRTMMKYWANF 456
                       490       500       510       520
                ....*....|....*....|....*....|....*....|.
gi 8176554  492 AKTGDPNMGDsaVPTHWEPYTTENSGYLEITkkMGSSSMKR 532
Cdd:cd00312 457 AKTGNPNTEG--NLVVWPAYTSESEKYLDIN--IEGTEIKQ 493
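The cd00312 description above names a catalytic serine but does not give its position. As a rough sketch, the Python below scans query residues 176-255 (copied from the cd00312 alignment) for the G-x-S-x-G pentapeptide that commonly surrounds the nucleophilic serine in alpha/beta hydrolase fold enzymes. The motif pattern, the function name `find_gxsxg`, and the use of this motif as a proxy for the active-site serine are assumptions made for illustration; none of them is part of the CD-Search output.

```python
import re

# Query residues 176-255 of gi 8176554, copied from the cd00312 alignment block.
SEGMENT_START = 176
SEGMENT = (
    "ANLPGNYGLRDQHMAIAWVKRNIAAFGGDPNNITLFGESAGGASVSLQTLSPYNKGLIRAAISQSGVALSPWVIQKNPLF"
)


def find_gxsxg(segment, start):
    """Return (serine residue number, matched pentapeptide) for each G-x-S-x-G hit.

    The G-x-S-x-G pattern is an assumed proxy for the nucleophile elbow;
    it is not reported by CD-Search itself.
    """
    hits = []
    # A lookahead is used so that overlapping candidates are not skipped.
    for m in re.finditer(r"(?=(G.S.G))", segment):
        serine_pos = start + m.start() + 2  # serine is the third residue of the motif
        hits.append((serine_pos, m.group(1)))
    return hits


if __name__ == "__main__":
    for pos, motif in find_gxsxg(SEGMENT, SEGMENT_START):
        print(f"candidate nucleophile: Ser{pos} in {motif}")
```

On this segment the scan finds a single candidate, the serine in GESAG, numbered in the alignment's query coordinates. A motif hit is only a hint; confirming the catalytic residue requires the site annotations or structural evidence.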
PnbA COG2272
Carboxylesterase type B [Lipid transport and metabolism];
26-521 4.14e-123

Carboxylesterase type B [Lipid transport and metabolism];


Pssm-ID: 441873  Cd Length: 500  Bit Score: 377.31  E-value: 4.14e-123
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   26 VYTEGGFVEGVnkklglLGDSVDIFKGIPFAAPT------KAlenPQPHPGWQGTLKAKNFKKRCLQATITQD---STYG 96
Cdd:COG2272  15 VRTEAGRVRGV------VEGGVRVFLGIPYAAPPvgelrwRA---PQPVEPWTGVRDATEFGPACPQPPRPGDpggPAPG 85
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   97 DEDCLYLNIWVPqgRKQVSRDLPVMIWIYGGAFLMGSGHGAnflnnyLYDGEEIATRGnVIVVTFNYRVGPLGF-----L 171
Cdd:COG2272  86 SEDCLYLNVWTP--ALAAGAKLPVMVWIHGGGFVSGSGSEP------LYDGAALARRG-VVVVTINYRLGALGFlalpaL 156
                       170       180       190       200       210       220       230       240
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  172 STGDANLPGNYGLRDQHMAIAWVKRNIAAFGGDPNNITLFGESAGGASVSLQTLSPYNKGLIRAAISQSGVAL---SPWV 248
Cdd:COG2272 157 SGESYGASGNYGLLDQIAALRWVRDNIAAFGGDPDNVTIFGESAGAASVAALLASPLAKGLFHRAIAQSGAGLsvlTLAE 236
                       250       260       270       280       290       300       310       320
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  249 IQKnplfWAKKVAEKVGCPVGDAArmaqCLKVTDPAAVTVAYKVPLAGLEYPMlhyvGFVPVIDEDFIPADPINLYAN-- 326
Cdd:COG2272 237 AEA----VGAAFAAALGVAPATLA----ALRALPAEELLAAQAALAAEGPGGL----PFGPVVDGDVLPEDPLEAFAAgr 304
                       330       340       350       360       370       380       390       400
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  327 AADIDYIAGTNNMDGHIFASIDMPAinkgnKKVTEEDFyklvsEFTITKGLRG-AKTTFDVYTESWAQDpsqenkkkIVV 405
Cdd:COG2272 305 AADVPLLIGTNRDEGRLFAALLGDL-----GPLTAADY-----RAALRRRFGDdADEVLAAYPAASPAE--------ALA 366
                       410       420       430       440       450       460       470       480
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  406 DFETDVLFLVPTeIALAQHRAnAKSAKTYAYLFSHPSRMPVYPKWvGADHADDIQYVFGKPFA-TPTGYRPQDRTVFKAM 484
Cdd:COG2272 367 ALATDRVFRCPA-RRLAEAHA-AAGAPVYLYRFDWRSPPLRGFGL-GAFHGAELPFVFGNLDApALTGLTPADRALSDQM 443
                       490       500       510
                ....*....|....*....|....*....|....*..
gi 8176554  485 IAYWTNFAKTGDPNMGDsavPTHWEPYTTENSGYLEI 521
Cdd:COG2272 444 QAYWVNFARTGDPNGPG---LPEWPAYDPEDRAVMVF 477
Mucin-like pfam16058
Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated ...
584-677 1.72e-22

Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated lipase, where it is O-glycosylated. It has a biased amino acid composition and is likely to be disordered. The region contains many copies of an approximately 11-residue degenerate repeat.


Pssm-ID: 464997 [Multi-domain]  Cd Length: 94  Bit Score: 92.48  E-value: 1.72e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    584 PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTG 663
Cdd:pfam16058   1 PSSSITEPPRDPSGSYGEPPRAPSSSYTEPQRDPSSSITEPPADPSSSYTEPPRDPSGSYTEPQRDPSSSSTEPQRDPSS 80
                          90
                  ....*....|....
gi 8176554    664 DAGPPPVPPTGDSG 677
Cdd:pfam16058  81 SITEPPRDPSGSYT 94
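The pfam16058 description refers to an approximately 11-residue degenerate repeat. One way to check that figure against the query itself is to measure how often each residue matches the residue a fixed distance downstream. The sketch below does this for query residues 584-677 (copied from the alignment above). The function names and the lag range of 2 to 20 are illustrative assumptions, not part of the CD-Search output.

```python
# Query residues 584-677 of gi 8176554, copied from the pfam16058 alignment above.
MUCIN_SEGMENT = (
    "PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTG"
    "DAGPPPVPPTGDSG"
)


def lag_identity(seq, lag):
    """Fraction of positions i where seq[i] equals seq[i + lag]."""
    pairs = len(seq) - lag
    matches = sum(1 for i in range(pairs) if seq[i] == seq[i + lag])
    return matches / pairs


def best_period(seq, min_lag=2, max_lag=20):
    """Return (lag, identity) for the lag with the highest self-identity.

    The lag range is an arbitrary choice for this sketch; multiples of the
    true period would score similarly if the range were widened.
    """
    scores = {lag: lag_identity(seq, lag) for lag in range(min_lag, max_lag + 1)}
    lag = max(scores, key=scores.get)
    return lag, scores[lag]


if __name__ == "__main__":
    lag, identity = best_period(MUCIN_SEGMENT)
    print(f"best period: {lag} residues (self-identity {identity:.2f})")
```

For this segment the best-scoring lag is 11, in line with the description; the few mismatches come from the DAGPPPVP variant near the end of the segment.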
Mucin-like pfam16058
Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated ...
595-688 8.46e-22

Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated lipase, where it is O-glycosylated. It has a biased amino acid composition and is likely to be disordered. The region contains many copies of an approximately 11-residue degenerate repeat.


Pssm-ID: 464997 [Multi-domain]  Cd Length: 94  Bit Score: 90.17  E-value: 8.46e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    595 PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTG 674
Cdd:pfam16058   1 PSSSITEPPRDPSGSYGEPPRAPSSSYTEPQRDPSSSITEPPADPSSSYTEPPRDPSGSYTEPQRDPSSSSTEPQRDPSS 80
                          90
                  ....*....|....
gi 8176554    675 DSGPPPVPPTGDSG 688
Cdd:pfam16058  81 SITEPPRDPSGSYT 94
Mucin-like pfam16058
Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated ...
573-666 8.54e-22

Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated lipase, where it is O-glycosylated. It has a biased amino acid composition and is likely to be disordered. The region contains many copies of an approximately 11-residue degenerate repeat.


Pssm-ID: 464997 [Multi-domain]  Cd Length: 94  Bit Score: 90.17  E-value: 8.54e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    573 PTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTG 652
Cdd:pfam16058   1 PSSSITEPPRDPSGSYGEPPRAPSSSYTEPQRDPSSSITEPPADPSSSYTEPPRDPSGSYTEPQRDPSSSSTEPQRDPSS 80
                          90
                  ....*....|....
gi 8176554    653 DSGAPPVPPTGDAG 666
Cdd:pfam16058  81 SITEPPRDPSGSYT 94
Mucin-like pfam16058
Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated ...
606-698 1.32e-21

Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated lipase, where it is O-glycosylated. It has a biased amino acid composition and is likely to be disordered. The region contains many copies of an approximately 11-residue degenerate repeat.


Pssm-ID: 464997 [Multi-domain]  Cd Length: 94  Bit Score: 89.79  E-value: 1.32e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    606 PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTG 685
Cdd:pfam16058   1 PSSSITEPPRDPSGSYGEPPRAPSSSYTEPQRDPSSSITEPPADPSSSYTEPPRDPSGSYTEPQRDPSSSSTEPQRDPSS 80
                          90
                  ....*....|...
gi 8176554    686 DSGAPPVTPTGDS 698
Cdd:pfam16058  81 SITEPPRDPSGSY 93
Mucin-like pfam16058
Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated ...
628-720 9.81e-21

Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated lipase, where it is O-glycosylated. It has a biased amino acid composition and is likely to be disordered. The region contains many copies of an approximately 11-residue degenerate repeat.


Pssm-ID: 464997 [Multi-domain]  Cd Length: 94  Bit Score: 87.48  E-value: 9.81e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    628 PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTG 707
Cdd:pfam16058   1 PSSSITEPPRDPSGSYGEPPRAPSSSYTEPQRDPSSSITEPPADPSSSYTEPPRDPSGSYTEPQRDPSSSSTEPQRDPSS 80
                          90
                  ....*....|...
gi 8176554    708 DSGAPPVPPTGDS 720
Cdd:pfam16058  81 SITEPPRDPSGSY 93
Mucin-like pfam16058
Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated ...
617-710 1.28e-20

Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated lipase, where it is O-glycosylated. It has a biased amino acid composition and is likely to be disordered. The region contains many copies of an approximately 11-residue degenerate repeat.


Pssm-ID: 464997 [Multi-domain]  Cd Length: 94  Bit Score: 87.09  E-value: 1.28e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    617 PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTG 696
Cdd:pfam16058   1 PSSSITEPPRDPSGSYGEPPRAPSSSYTEPQRDPSSSITEPPADPSSSYTEPPRDPSGSYTEPQRDPSSSSTEPQRDPSS 80
                          90
                  ....*....|....
gi 8176554    697 DSETAPVPPTGDSG 710
Cdd:pfam16058  81 SITEPPRDPSGSYT 94
Mucin-like pfam16058
Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated ...
562-655 3.37e-20

Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated lipase, where it is O-glycosylated. It has a biased amino acid composition and is likely to be disordered. The region contains many copies of an approximately 11-residue degenerate repeat.


Pssm-ID: 464997 [Multi-domain]  Cd Length: 94  Bit Score: 85.94  E-value: 3.37e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    562 PTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTG 641
Cdd:pfam16058   1 PSSSITEPPRDPSGSYGEPPRAPSSSYTEPQRDPSSSITEPPADPSSSYTEPPRDPSGSYTEPQRDPSSSSTEPQRDPSS 80
                          90
                  ....*....|....
gi 8176554    642 DSGAPPVPPTGDSG 655
Cdd:pfam16058  81 SITEPPRDPSGSYT 94
Mucin-like pfam16058
Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated ...
639-731 7.09e-19

Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated lipase, where it is O-glycosylated. It has a biased amino acid composition and is likely to be disordered. The region contains many copies of an approximately 11-residue degenerate repeat.


Pssm-ID: 464997 [Multi-domain]  Cd Length: 94  Bit Score: 82.08  E-value: 7.09e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    639 PTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTG 718
Cdd:pfam16058   1 PSSSITEPPRDPSGSYGEPPRAPSSSYTEPQRDPSSSITEPPADPSSSYTEPPRDPSGSYTEPQRDPSSSSTEPQRDPSS 80
                          90
                  ....*....|...
gi 8176554    719 DSEAAPVPPTDDS 731
Cdd:pfam16058  81 SITEPPRDPSGSY 93
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
556-738 9.12e-19

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 91.20  E-value: 9.12e-19
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   556 EATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAP 635
Cdd:PRK07764 610 EEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAA 689
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   636 PVPPTGDSGAPPVPpTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPptgdSGAPPVTPTGDSETAPVPPTGDSGAPPVP 715
Cdd:PRK07764 690 PAAPAGAAPAQPAP-APAATPPAGQADDPAAQPPQAAQGASAPSPAA----DDPVPLPPEPDDPPDPAGAPAQPPPPPAP 764
                        170       180
                 ....*....|....*....|...
gi 8176554   716 PTGDSEAAPVPPTDDSKEAQMPA 738
Cdd:PRK07764 765 APAAAPAAAPPPSPPSEEEEMAE 787
Mucin-like pfam16058
Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated ...
553-644 2.52e-17

Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated lipase, where it is O-glycosylated. It has a biased amino acid composition and is likely to be disordered. The region contains many copies of an approximately 11-residue degenerate repeat.


Pssm-ID: 464997 [Multi-domain]  Cd Length: 94  Bit Score: 77.46  E-value: 2.52e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    553 TDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDS 632
Cdd:pfam16058   3 SSITEPPRDPSGSYGEPPRAPSSSYTEPQRDPSSSITEPPADPSSSYTEPPRDPSGSYTEPQRDPSSSSTEPQRDPSSSI 82
                          90
                  ....*....|..
gi 8176554    633 GAPPVPPTGDSG 644
Cdd:pfam16058  83 TEPPRDPSGSYT 94
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
551-738 2.44e-16

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 83.50  E-value: 2.44e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   551 TVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTG 630
Cdd:PRK07764 583 QVEAVVGPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDA 662
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   631 DSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPptgdsGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSG 710
Cdd:PRK07764 663 SDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAP-----APAATPPAGQADDPAAQPPQAAQGASAPSPAADD 737
                        170       180
                 ....*....|....*....|....*...
gi 8176554   711 APPVPPTGDSEAAPVPPTDDSKEAQMPA 738
Cdd:PRK07764 738 PVPLPPEPDDPPDPAGAPAQPPPPPAPA 765
PHA03247 PHA03247
large tegument protein UL36; Provisional
550-729 3.65e-16

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 83.45  E-value: 3.65e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    550 PTVTDQEATP-VPPTGDSEATPVPPTGDS--ETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPP- 625
Cdd:PHA03247 2580 PAVTSRARRPdAPPQSARPRAPVDDRGDPrgPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPg 2659
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    626 -----------VPPTGDSgAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPP-------PVPPTGDSGPPPVPPTGDS 687
Cdd:PHA03247 2660 rvsrprrarrlGRAAQAS-SPPQRPRRRAARPTVGSLTSLADPPPPPPTPEPAPhalvsatPLPPGPAAARQASPALPAA 2738
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 8176554    688 GAPPVTPTGdsetaPVPPTGDSgAPPVPPTGDSEAAPVPPTD 729
Cdd:PHA03247 2739 PAPPAVPAG-----PATPGGPA-RPARPPTTAGPPAPAPPAA 2774
PHA03247 PHA03247
large tegument protein UL36; Provisional
543-738 5.45e-16

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 82.68  E-value: 5.45e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    543 TLTYLALPTvtDQEATPVP-PTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDS-----GAPPVPPTGDSGAPPVP 616
Cdd:PHA03247 2694 SLTSLADPP--PPPPTPEPaPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPAtpggpARPARPPTTAGPPAPAP 2771
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    617 PTGDSGAPP---VPPTGDSGA-----------PPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVP 682
Cdd:PHA03247 2772 PAAPAAGPPrrlTRPAVASLSesreslpspwdPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 8176554    683 PTGD------------SGAPPVTPTgdseTAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMPA 738
Cdd:PHA03247 2852 LGGSvapggdvrrrppSRSPAAKPA----APARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPP 2915
Mucin-like pfam16058
Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated ...
650-735 1.08e-15

Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated lipase, where it is O-glycosylated. It has a biased amino acid composition and is likely to be disordered. The region contains many copies of an approximately 11-residue degenerate repeat.


Pssm-ID: 464997 [Multi-domain]  Cd Length: 94  Bit Score: 72.84  E-value: 1.08e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    650 PTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTD 729
Cdd:pfam16058   1 PSSSITEPPRDPSGSYGEPPRAPSSSYTEPQRDPSSSITEPPADPSSSYTEPPRDPSGSYTEPQRDPSSSSTEPQRDPSS 80

                  ....*.
gi 8176554    730 DSKEAQ 735
Cdd:pfam16058  81 SITEPP 86
Aes COG0657
Acetyl esterase/lipase [Lipid transport and metabolism];
105-246 1.91e-15

Acetyl esterase/lipase [Lipid transport and metabolism];


Pssm-ID: 440422 [Multi-domain]  Cd Length: 207  Bit Score: 75.68  E-value: 1.91e-15
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  105 IWVPQGRKqvsRDLPVMIWIYGGAFLMGSGHGANFLnnylydGEEIATRGNVIVVTFNYRVGPlgflstgDANLPGnyGL 184
Cdd:COG0657   3 VYRPAGAK---GPLPVVVYFHGGGWVSGSKDTHDPL------ARRLAARAGAAVVSVDYRLAP-------EHPFPA--AL 64
                        90       100       110       120       130       140
                ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 8176554  185 RDQHMAIAWVKRNIAAFGGDPNNITLFGESAGG--ASVSLQTLSPYNKGLIRAAISQSGV---ALSP 246
Cdd:COG0657  65 EDAYAALRWLRANAAELGIDPDRIAVAGDSAGGhlAAALALRARDRGGPRPAAQVLIYPVldlTASP 131
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
550-728 2.21e-15

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 80.42  E-value: 2.21e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   550 PTVTDQEATPVPPTGDSEA-TPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAP-PVPPTGDSGAPPVPPtgdsgAPPVP 627
Cdd:PRK07764 615 PAAPAAPAAPAAPAPAGAAaAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGwPAKAGGAAPAAPPPA-----PAPAA 689
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   628 PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTG 707
Cdd:PRK07764 690 PAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAA 769
                        170       180
                 ....*....|....*....|.
gi 8176554   708 DSGAPPVPPTGDSEAAPVPPT 728
Cdd:PRK07764 770 PAAAPPPSPPSEEEEMAEDDA 790
PHA03247 PHA03247
large tegument protein UL36; Provisional
548-739 3.46e-15

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 80.37  E-value: 3.46e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    548 ALPTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSG--------APPVPPTGDSGAPPVPPTG 619
Cdd:PHA03247 2805 ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVApggdvrrrPPSRSPAAKPAAPARPPVR 2884
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    620 DSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSE 699
Cdd:PHA03247 2885 RLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWL 2964
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|
gi 8176554    700 TAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMPAV 739
Cdd:PHA03247 2965 GALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRV 3004
PHA03247 PHA03247
large tegument protein UL36; Provisional
548-735 2.47e-14

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 77.29  E-value: 2.47e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    548 ALPTVTDQEATPVPPTGD----SEATPVPPTGDSETAPVPPTgdsgAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSga 623
Cdd:PHA03247 2773 AAPAAGPPRRLTRPAVASlsesRESLPSPWDPADPPAAVLAP----AAALPPAASPAGPLPPPTSAQPTAPPPPPGPP-- 2846
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    624 PPVPPTGDSGAP--PVPPTGDSGAPPVPPTgdsgAPPVPPTGDAGPPPVPPTGDSGP-----PPVPPTGDSGAPPVTPtg 696
Cdd:PHA03247 2847 PPSLPLGGSVAPggDVRRRPPSRSPAAKPA----APARPPVRRLARPAVSRSTESFAlppdqPERPPQPQAPPPPQPQ-- 2920
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 8176554    697 dsETAPVPPTgDSGAPPVPPTGDSEAAPVPPTDDSKEAQ 735
Cdd:PHA03247 2921 --PQPPPPPQ-PQPPPPPPPRPQPPLAPTTDPAGAGEPS 2956
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
558-737 3.27e-14

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 75.96  E-value: 3.27e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   558 TPVPPTGDSEATPVP---PTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGA 634
Cdd:NF033839 283 TPKEPGNKKPSAPKPgmqPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPK 362
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   635 PPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTP---TGDSETAPVPPTGDSGA 711
Cdd:NF033839 363 PEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPqpeKPKPEVKPQPEKPKPEV 442
                        170       180
                 ....*....|....*....|....*.
gi 8176554   712 PPVPPTGDSEAAPVPPTDDSKEAQMP 737
Cdd:NF033839 443 KPQPEKPKPEVKPQPETPKPEVKPQP 468
Mucin-like pfam16058
Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated ...
550-633 6.02e-14

Mucin-like; This region is found repeated at the C-terminus (C-tail) of bile salt-activated lipase, where it is O-glycosylated. It has a biased amino acid composition and is likely to be disordered. The region contains many copies of an approximately 11-residue degenerate repeat.


Pssm-ID: 464997 [Multi-domain]  Cd Length: 94  Bit Score: 67.83  E-value: 6.02e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    550 PTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPT 629
Cdd:pfam16058  11 DPSGSYGEPPRAPSSSYTEPQRDPSSSITEPPADPSSSYTEPPRDPSGSYTEPQRDPSSSSTEPQRDPSSSITEPPRDPS 90

                  ....
gi 8176554    630 GDSG 633
Cdd:pfam16058  91 GSYT 94
PHA03247 PHA03247
large tegument protein UL36; Provisional
559-718 1.20e-13

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 75.36  E-value: 1.20e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    559 PVPPTGDSeATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSG--APPVPPTGDSGAPP 636
Cdd:PHA03247 2691 TVGSLTSL-ADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGpaRPARPPTTAGPPAP 2769
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    637 VPPTGDSGAPP---VPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPP 713
Cdd:PHA03247 2770 APPAAPAAGPPrrlTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPS 2849

                  ....*
gi 8176554    714 VPPTG 718
Cdd:PHA03247 2850 LPLGG 2854
PHA03247 PHA03247
large tegument protein UL36; Provisional
561-737 1.78e-13

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 74.59  E-value: 1.78e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    561 PPTGDSEATPVPPTGDSETAP------------VPPTGDSgAPPVPPTGDSGAPPVPPTGDSGAPPVP-------PTGDS 621
Cdd:PHA03247 2638 PDPHPPPTVPPPERPRDDPAPgrvsrprrarrlGRAAQAS-SPPQRPRRRAARPTVGSLTSLADPPPPpptpepaPHALV 2716
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    622 GAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDS-----GAPPVPPTGDAGPPPVPPTGDSGPPP--VPPTGDSGAPPVTP 694
Cdd:PHA03247 2717 SATPLPPGPAAARQASPALPAAPAPPAVPAGPAtpggpARPARPPTTAGPPAPAPPAAPAAGPPrrLTRPAVASLSESRE 2796
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 8176554    695 TGDSETAPVPPTGDSGAP-PVPPTGDSEAAPVPPTDDSKEAQMP 737
Cdd:PHA03247 2797 SLPSPWDPADPPAAVLAPaAALPPAASPAGPLPPPTSAQPTAPP 2840
PHA03247 PHA03247
large tegument protein UL36; Provisional
557-737 6.20e-13

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 73.05  E-value: 6.20e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    557 ATPVPPTGDSEATPVPPTGDSETAPVPPTgdsgAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGdsgapP 636
Cdd:PHA03247 2678 SPPQRPRRRAARPTVGSLTSLADPPPPPP----TPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAG-----P 2748
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    637 VPPTGDSgAPPVPPTGDSGAPPVPPTGDAGPPP---VPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPP 713
Cdd:PHA03247 2749 ATPGGPA-RPARPPTTAGPPAPAPPAAPAAGPPrrlTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGP 2827
                         170       180
                  ....*....|....*....|....
gi 8176554    714 VPPTGDSEAAPVPPTDDSKEAQMP 737
Cdd:PHA03247 2828 LPPPTSAQPTAPPPPPGPPPPSLP 2851
PHA03247 PHA03247
large tegument protein UL36; Provisional
465-737 7.05e-13

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 72.66  E-value: 7.05e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    465 KPFATPTGYRPQdrtVFKAMIAYWTNFAKTGDPNMGDSAVPTHWEPYTTENSGyleitkkmgsssmkrslrTNFLRYWTL 544
Cdd:PHA03247 2675 QASSPPQRPRRR---AARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPG------------------PAAARQASP 2733
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    545 TYLALPTVTDQEATPVPPTGDSeATPVPPTGDSETAPVPPTGDSGAPP---VPPTGDSGAPPVPPTGDSGAPPVPPTGDS 621
Cdd:PHA03247 2734 ALPAAPAPPAVPAGPATPGGPA-RPARPPTTAGPPAPAPPAAPAAGPPrrlTRPAVASLSESRESLPSPWDPADPPAAVL 2812
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    622 GAPPVPPTGDSGAPPVPPTGDS--GAPPVPPtgdsgaPPVPPTGDAG------------PPPVPPTGDSGPPPVPPTGDS 687
Cdd:PHA03247 2813 APAAALPPAASPAGPLPPPTSAqpTAPPPPP------GPPPPSLPLGgsvapggdvrrrPPSRSPAAKPAAPARPPVRRL 2886
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|
gi 8176554    688 GAPPVTPTgdSETAPVPPtgDSGAPPVPPTGDSEAAPVPPTDDSKEAQMP 737
Cdd:PHA03247 2887 ARPAVSRS--TESFALPP--DQPERPPQPQAPPPPQPQPQPPPPPQPQPP 2932
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
556-732 8.14e-13

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 72.13  E-value: 8.14e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    556 EATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVP--PTGDSGAPPVPPTGDSG 633
Cdd:PHA03307  149 AASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRssPISASASSPAPAPGRSA 228
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    634 APPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSG-APPVTPTGDSETAPVPPTGDSGAP 712
Cdd:PHA03307  229 ADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGpASSSSSPRERSPSPSPSSPGSGPA 308
                         170       180
                  ....*....|....*....|
gi 8176554    713 PVPPTGDSEAAPVPPTDDSK 732
Cdd:PHA03307  309 PSSPRASSSSSSSRESSSSS 328
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
555-731 1.41e-12

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 71.36  E-value: 1.41e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    555 QEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPV-PPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSG 633
Cdd:PHA03307  171 QAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRsSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWG 250
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    634 APPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGP-PPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAP 712
Cdd:PHA03307  251 PENECPLPRPAPITLPTRIWEASGWNGPSSRPGPaSSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTS 330
                         170
                  ....*....|....*....
gi 8176554    713 PVPPTGDSEAAPVPPTDDS 731
Cdd:PHA03307  331 SSSESSRGAAVSPGPSPSR 349
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
547-731 1.86e-12

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 70.97  E-value: 1.86e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    547 LALPTVTDQEATPVPPTGDSEA---TPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGA 623
Cdd:PHA03307  172 AALPLSSPEETARAPSSPPAEPppsTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGP 251
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    624 PPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSG-APPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAP 702
Cdd:PHA03307  252 ENECPLPRPAPITLPTRIWEASGWNGPSSRPGpASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSS 331
                         170       180
                  ....*....|....*....|....*....
gi 8176554    703 VPPTGDSGAPPVPPTGDSEAAPVPPTDDS 731
Cdd:PHA03307  332 SSESSRGAAVSPGPSPSRSPSPSRPPPPA 360
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
564-728 1.93e-12

Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 70.57  E-value: 1.93e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   564 GDSEATPVPPTGDSETAPVP---PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPT 640
Cdd:NF033839 278 GLTQDTPKEPGNKKPSAPKPgmqPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQ 357
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   641 GDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTP---TGDSETAPVPPTGDSGAPPVPPT 717
Cdd:NF033839 358 PEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPqpeKPKPEVKPQPEKPKPEVKPQPEK 437
                        170
                 ....*....|.
gi 8176554   718 GDSEAAPVPPT 728
Cdd:NF033839 438 PKPEVKPQPEK 448
PHA03247 PHA03247
large tegument protein UL36; Provisional
547-738 2.36e-12

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 71.12  E-value: 2.36e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    547 LALPTVTDQEatpVPPTGDSEATPVPPTGDSETAP-VPPtgdSGAPPVPPTGDSGAPPVPPtgdsgaPPVPPTGDSGAPP 625
Cdd:PHA03247 2558 AAPPAAPDRS---VPPPRPAPRPSEPAVTSRARRPdAPP---QSARPRAPVDDRGDPRGPA------PPSPLPPDTHAPD 2625
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    626 VPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPP--VPPTGDAGPPPVPPTGDSgpPPVPPTGDSGAPPVTPTGDSETAPV 703
Cdd:PHA03247 2626 PPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPgrVSRPRRARRLGRAAQASS--PPQRPRRRAARPTVGSLTSLADPPP 2703
                         170       180       190
                  ....*....|....*....|....*....|....*
gi 8176554    704 PPTgdsgAPPVPPTGDSEAAPVPPTDDSKEAQMPA 738
Cdd:PHA03247 2704 PPP----TPEPAPHALVSATPLPPGPAAARQASPA 2734
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
544-739 3.55e-12

Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 69.90  E-value: 3.55e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   544 LTYLAL-PTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPtgdSGAPPVPPTGDSGAPPVPP----- 617
Cdd:PRK12323 358 LRMLAFrPGQSGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAP---AAAAAARAVAAAPARRSPApeala 434
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   618 -----TGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVP---PTGDSGPPPVPPTGDSGA 689
Cdd:PRK12323 435 aarqaSARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADddpPPWEELPPEFASPAPAQP 514
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|
gi 8176554   690 PPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMPAV 739
Cdd:PRK12323 515 DAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAAAATEPVVAPR 564
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
557-711 4.98e-12

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 69.63  E-value: 4.98e-12
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   557 ATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPpTGDSGAPPVPPTGDSGAPP 636
Cdd:PRK07764 644 APGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAP-APAATPPAGQADDPAAQPP 722
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   637 VPPTG-----DSGAPPVPPTGDSGAPPVP-PTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSG 710
Cdd:PRK07764 723 QAAQGasapsPAADDPVPLPPEPDDPPDPaGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDAE 802

                 .
gi 8176554   711 A 711
Cdd:PRK07764 803 E 803
PHA03247 PHA03247
large tegument protein UL36; Provisional
548-734 9.02e-12

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 69.20  E-value: 9.02e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    548 ALPTVTDQEATPVPPTGDSEATPVPPTGDSET-----APVPPTGD--SGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGD 620
Cdd:PHA03247 2817 ALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPslplgGSVAPGGDvrRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTE 2896
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    621 SGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVP-PTGDSGPPPVPPTGDSGA---------- 689
Cdd:PHA03247 2897 SFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTdPAGAGEPSGAVPQPWLGAlvpgrvavpr 2976
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 8176554    690 ---PPVTPTGDSETAPVPPTGDSGAPPVPPTGDS-----EAAPVP--------PTDDSKEA 734
Cdd:PHA03247 2977 frvPQPAPSREAPASSTPPLTGHSLSRVSSWASSlalheETDPPPvslkqtlwPPDDTEDS 3037
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
550-728 3.93e-11

Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 66.33  E-value: 3.93e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   550 PTVTDQEATPVPPTgdseaTPVPPTGDSETAPVPPTgdsGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPT 629
Cdd:NF033839 297 PGMQPSPQPEKKEV-----KPEPETPKPEVKPQLEK---PKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKPEVKPQ 368
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   630 GDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTP---TGDSETAPVPPT 706
Cdd:NF033839 369 PEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPqpeKPKPEVKPQPEK 448
                        170       180
                 ....*....|....*....|..
gi 8176554   707 GDSGAPPVPPTGDSEAAPVPPT 728
Cdd:NF033839 449 PKPEVKPQPETPKPEVKPQPEK 470
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
559-727 4.48e-11

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 66.73  E-value: 4.48e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    559 PVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVP 638
Cdd:PHA03307   84 SRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVA 163
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    639 ---PTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPT-GDSETAPVPPTGDSGAPPV 714
Cdd:PHA03307  164 sdaASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPApGRSAADDAGASSSDSSSSE 243
                         170
                  ....*....|....*.
gi 8176554    715 PP---TGDSEAAPVPP 727
Cdd:PHA03307  244 SSgcgWGPENECPLPR 259
BD-FAE pfam20434
BD-FAE; This family represents a novel bifunctional feruloyl and acetyl xylan esterase (BD-FAE, previously known as bifunctional carbohydrate esterase (CE)), which is active on complex natural xylans and was identified as the basis of a monophyletic clade gathering all homologs identified in PULs (polysaccharide utilization loci) predicted to act on xylan. It adopts an alpha-beta-hydrolase fold with the catalytic triad Ser-Asp-His. This new family of proteins is a new candidate for biomass processing due to its capacity to remove ferulic acid and acetic acid from natural corn and birchwood xylan substrates.
103-232 5.39e-11

Pssm-ID: 466583 [Multi-domain]  Cd Length: 215  Bit Score: 62.97  E-value: 5.39e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    103 LNIWVPQGRKqvsRDLPVMIWIYGGAFLMGSGHGANFLNNYLydGEEIATRGNViVVTFNYRvgplgflSTGDANLPGNy 182
Cdd:pfam20434   1 LDIYLPKNAK---GPYPVVIWIHGGGWNSGDKEADMGFMTNT--VKALLKAGYA-VASINYR-------LSTDAKFPAQ- 66
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 8176554    183 gLRDQHMAIAWVKRNIAAFGGDPNNITLFGESAGGASVSLQTLSPYNKGL 232
Cdd:pfam20434  67 -IQDVKAAIRFLRANAAKYGIDTNKIALMGFSAGGHLALLAGLSNNNKEF 115
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
565-727 5.99e-11

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 66.35  E-value: 5.99e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    565 DSEATPVPPTGDSETAPVPPTGDSG------APPVPPTGDSGAPPVPPTGDSGAPPV-PPTGDSGAPPVPPTGDSGAPPV 637
Cdd:PHA03307   17 GGEFFPRPPATPGDAADDLLSGSQGqlvsdsAELAAVTVVAGAAACDRFEPPTGPPPgPGTEAPANESRSTPTWSLSTLA 96
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    638 PPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPT---GDSGAPPVTPTGDSETAPVPPTGDSGAPPV 714
Cdd:PHA03307   97 PASPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSpgpPPAASPPAAGASPAAVASDAASSRQAALPL 176
                         170
                  ....*....|...
gi 8176554    715 PPTGDSEAAPVPP 727
Cdd:PHA03307  177 SSPEETARAPSSP 189
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
556-731 6.81e-11

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 65.96  E-value: 6.81e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    556 EATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPvpPTGDSGAPPVPPTgDSGAPPVPPTGDSGAPPVPPTGDSGAP 635
Cdd:PHA03307   74 GPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPG--PSSPDPPPPTPPP-ASPPPSPAPDLSEMLRPVGSPGPPPAA 150
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    636 PVPPTGDSGAPPV--PPTGDSGAPPVPPTGDAGPPPVPPtGDSGPPPVPPTGDSGAPPVT--PTGDSETAPVPPTGDSGA 711
Cdd:PHA03307  151 SPPAAGASPAAVAsdAASSRQAALPLSSPEETARAPSSP-PAEPPPSTPPAAASPRPPRRssPISASASSPAPAPGRSAA 229
                         170       180
                  ....*....|....*....|
gi 8176554    712 PPVPPTGDSEAAPVPPTDDS 731
Cdd:PHA03307  230 DDAGASSSDSSSSESSGCGW 249
PHA03247 PHA03247
large tegument protein UL36; Provisional
568-716 1.04e-10

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 65.73  E-value: 1.04e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    568 ATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPpVPP-------------TGDSG--APPVPPTGDSGAP--PVPPTG 630
Cdd:PHA03247 2494 AAPDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEP-VHPrmltwirgleelaSDDAGdpPPPLPPAAPPAAPdrSVPPPR 2572
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    631 DSGAPPVPPTGDSGA-PPVPPTGDSGAPPVPPTGD--AGPPPVPPTGDSGPPPVPPTgdSGAPPVTPTGDSETAPVPPTG 707
Cdd:PHA03247 2573 PAPRPSEPAVTSRARrPDAPPQSARPRAPVDDRGDprGPAPPSPLPPDTHAPDPPPP--SPSPAANEPDPHPPPTVPPPE 2650

                  ....*....
gi 8176554    708 DSGAPPVPP 716
Cdd:PHA03247 2651 RPRDDPAPG 2659
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
556-732 1.12e-10

Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 64.79  E-value: 1.12e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   556 EATPVPP---TGDSEATPVPPTGDSETAPVPPTGDsgaPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDS 632
Cdd:NF033839 306 EKKEVKPepeTPKPEVKPQLEKPKPEVKPQPEKPK---PEVKPQLETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPET 382
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   633 GAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTP---TGDSETAPVPPTGDS 709
Cdd:NF033839 383 PKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPqpeKPKPEVKPQPETPKP 462
                        170       180
                 ....*....|....*....|....*.
gi 8176554   710 GAPPVPPTGDSEAAP---VPPTDDSK 732
Cdd:NF033839 463 EVKPQPEKPKPEVKPqpeKPKPDNSK 488
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
547-693 1.87e-10

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 64.24  E-value: 1.87e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   547 LALPTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAP-PVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPP 625
Cdd:PRK07764 362 MLLPSASDDERGLLARLERLERRLGVAGGAGAPAAAAPSAAAAAPaAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAP 441
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 8176554   626 VPPTGDSGAPPVPPTGDSGAPPVPPTGdSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVT 693
Cdd:PRK07764 442 PSPAGNAPAGGAPSPPPAAAPSAQPAP-APAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGADD 508
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
560-738 1.89e-10

Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 64.81  E-value: 1.89e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    560 VPPTGDSEATPVPPTGDSETAPVPPTG-DSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvpPTGDSGAPPVP 638
Cdd:PHA03307   44 VSDSAELAAVTVVAGAAACDRFEPPTGpPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPG--PSSPDPPPPTP 121
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    639 PTgDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPV--PPTGDSGAPPVTPTGDSETAPVPPtgdsgAPPVPP 716
Cdd:PHA03307  122 PP-ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVAsdAASSRQAALPLSSPEETARAPSSP-----PAEPPP 195
                         170       180
                  ....*....|....*....|..
gi 8176554    717 TGDSEAAPVPPTDDSKEAQMPA 738
Cdd:PHA03307  196 STPPAAASPRPPRRSSPISASA 217
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
584-726 1.90e-10

Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 63.97  E-value: 1.90e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   584 PTGDSGAPPVPPTgdsgAPPVPPTGDSGAP-PVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPT 662
Cdd:PRK14951 366 PAAAAEAAAPAEK----KTPARPEAAAPAAaPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAA 441
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 8176554   663 GDAGPPPVPPtgdsgPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPT--GDSEAAPVP 726
Cdd:PRK14951 442 PAAVALAPAP-----PAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTeeGDVWHATVQ 502
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
550-727 2.32e-10

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 64.40  E-value: 2.32e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    550 PTVTDQEATPVPPtgdseatPVPPTGDSETAPVPPTgdSGAPPVPPtgdSGAPPVPPTGDSGAPPVPPTG--DSGAPPVP 627
Cdd:pfam03154 172 PVLQAQSGAASPP-------SPPPPGTTQAATAGPT--PSAPSVPP---QGSPATSQPPNQTQSTAAPHTliQQTPTLHP 239
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    628 PTGDSGAPPVPPtgdsgAPPVPPTGDSGAPPVPPTGDAGP-PPVPPTGDSGPPPVP-PTGDSGAPPVTPTGDSETAPVPP 705
Cdd:pfam03154 240 QRLPSPHPPLQP-----MTQPPPPSQVSPQPLPQPSLHGQmPPMPHSLQTGPSHMQhPVPPQPFPLTPQSSQSQVPPGPS 314
                         170       180
                  ....*....|....*....|....*.
gi 8176554    706 TGDSG----APPVPPTGDSEAAPVPP 727
Cdd:pfam03154 315 PAAPGqsqqRIHTPPSQSQLQSQQPP 340
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
547-727 2.52e-10

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 64.02  E-value: 2.52e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    547 LALPTVTDQEATPVPPTGDSEATPVPPTGdseTAPVPPtgdSGAPPVPPTGDSGAPPVPPTG--DSGAPPVPPTGDSGAP 624
Cdd:pfam03154 174 LQAQSGAASPPSPPPPGTTQAATAGPTPS---APSVPP---QGSPATSQPPNQTQSTAAPHTliQQTPTLHPQRLPSPHP 247
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    625 PVPPTgdsgAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGD-----SE 699
Cdd:pfam03154 248 PLQPM----TQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAapgqsQQ 323
                         170       180
                  ....*....|....*....|....*...
gi 8176554    700 TAPVPPTGDSGAPPVPPtgdsEAAPVPP 727
Cdd:pfam03154 324 RIHTPPSQSQLQSQQPP----REQPLPP 347
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
548-728 3.05e-10

Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 63.74  E-value: 3.05e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   548 ALPTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVP 627
Cdd:PRK12323 395 AAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAA 474
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   628 PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTG 707
Cdd:PRK12323 475 AAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATADPDDAFETLAPAPAAAPAPRAA 554
                        170       180
                 ....*....|....*....|.
gi 8176554   708 DSGAPPVPPTGDSEAAPVPPT 728
Cdd:PRK12323 555 AATEPVVAPRPPRASASGLPD 575
PHA03247 PHA03247
large tegument protein UL36; Provisional
567-741 4.30e-10

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 63.80  E-value: 4.30e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    567 EATPVPPTGdsetAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPpVPP-------------TGDSGAPPvpptgdsg 633
Cdd:PHA03247 2486 ARFPFAAGA----APDPGGGGPPDPDAPPAPSRLAPAILPDEPVGEP-VHPrmltwirgleelaSDDAGDPP-------- 2552
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    634 aPPVPPTGDSGAP--PVPPTGDSGAPPVPP-TGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSG 710
Cdd:PHA03247 2553 -PPLPPAAPPAAPdrSVPPPRPAPRPSEPAvTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSP 2631
                         170       180       190
                  ....*....|....*....|....*....|.
gi 8176554    711 APPVPPTGDSEAAPVPPTDDSKEAQMPAVIR 741
Cdd:PHA03247 2632 SPAANEPDPHPPPTVPPPERPRDDPAPGRVS 2662
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
550-735 4.40e-10

Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 62.86  E-value: 4.40e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   550 PTVTDQEATPVPPTGDSEATPVPPTGDSETAP--------VPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDS 621
Cdd:NF033839 303 PQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPqpekpkpeVKPQLETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPET 382
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   622 GAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETA 701
Cdd:NF033839 383 PKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKP 462
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*
gi 8176554   702 PVPPTGDSGAPPVPPTGDS-----------EAAPVPPTDDSKEAQ 735
Cdd:NF033839 463 EVKPQPEKPKPEVKPQPEKpkpdnskpqadDKKPSTPNNLSKDKQ 507
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
562-704 4.81e-10

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 62.81  E-value: 4.81e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   562 PTGDSEATPVPPtgdSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTG 641
Cdd:PRK14951 366 PAAAAEAAAPAE---KKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAP 442
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 8176554   642 DSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPtgdSGPPPVPPTGDSGAPPVTPTGDSETAPVP 704
Cdd:PRK14951 443 AAVALAPAPPAQAAPETVAIPVRVAPEPAVA---SAAPAPAAAPAAARLTPTEEGDVWHATVQ 502
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
547-733 5.31e-10

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 62.86  E-value: 5.31e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    547 LALPTVTDQEATPVPPTGDSEATPVPPtgdSETAPVPPTGDSGAPPVPPTGDSGAPPV--PPtgdSGAPP----VPPTGD 620
Cdd:pfam03154 350 LSMPHIKPPPTTPIPQLPNPQSHKHPP---HLSGPSPFQMNSNLPPPPALKPLSSLSThhPP---SAHPPplqlMPQSQQ 423
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    621 SGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVP--PTGDAGPPPVPPTgdSGPPPVPPTGDSGAPPVTPTGDS 698
Cdd:pfam03154 424 LPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPqhPFVPGGPPPITPP--SGPPTSTSSAMPGIQPPSSASVS 501
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 8176554    699 ETAPVPPTGDSGAPPV-----PP--TGDSEAAPVPPTDDSKE 733
Cdd:pfam03154 502 SSGPVPAAVSCPLPPVqikeeALdeAEEPESPPPPPRSPSPE 543
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
567-676 5.56e-10

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 62.52  E-value: 5.56e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   567 EATPVPPTGdseTAPVPPTGDSGAPPVPPTGDSGAPPVPPTgdSGAPPVPPTGDSgAPPVPPTGDSGAPPVPPTgdsgaP 646
Cdd:PRK14950 357 EALLVPVPA---PQPAKPTAAAPSPVRPTPAPSTRPKAAAA--ANIPPKEPVRET-ATPPPVPPRPVAPPVPHT-----P 425
                         90       100       110
                 ....*....|....*....|....*....|
gi 8176554   647 PVPPTGDSGAPPVPPTGDAGPPPVPPTGDS 676
Cdd:PRK14950 426 ESAPKLTRAAIPVDEKPKYTPPAPPKEEEK 455
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
553-731 6.29e-10

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 62.88  E-value: 6.29e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    553 TDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGapPVPPTGDSGAPPvPPTGDSGAPPvPPTGDS 632
Cdd:PHA03307   58 GAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGS--PTPPGPSSPDPP-PPTPPPASPP-PSPAPD 133
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    633 GAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPP-----PVPPTGDSGPPPVPPTGDS-GAPPVTPTGDSETAPVPPT 706
Cdd:PHA03307  134 LSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSrqaalPLSSPEETARAPSSPPAEPpPSTPPAAASPRPPRRSSPI 213
                         170       180
                  ....*....|....*....|....*
gi 8176554    707 GDSGAPPVPPTGDSEAAPVPPTDDS 731
Cdd:PHA03307  214 SASASSPAPAPGRSAADDAGASSSD 238
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
593-730 7.12e-10

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 62.70  E-value: 7.12e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   593 VPPTGDSGAPPVPPTGDSGAPPVPptgdsGAPPVPPTGDSGAPPVPPtgdsGAPPVPPTGDSGAPPVPPTGDAGPPPVPP 672
Cdd:PRK07764 385 LGVAGGAGAPAAAAPSAAAAAPAA-----APAPAAAAPAAAAAPAPA----AAPQPAPAPAPAPAPPSPAGNAPAGGAPS 455
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*...
gi 8176554   673 TGDSGPPPVPPTGdSGAPPVTPTGDSETAPVPPTGDSGAPPVPPtgdSEAAPVPPTDD 730
Cdd:PRK07764 456 PPPAAAPSAQPAP-APAAAPEPTAAPAPAPPAAPAPAAAPAAPA---APAAPAGADDA 509
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
595-730 8.14e-10

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 62.04  E-value: 8.14e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   595 PTGDSGAPPVPPTgdsgAPPVPPTGDSGAP-PVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPT 673
Cdd:PRK14951 366 PAAAAEAAAPAEK----KTPARPEAAAPAAaPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAA 441
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 8176554   674 GDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTgdsgaPPVPPTGDSEAAPVPPTDD 730
Cdd:PRK14951 442 PAAVALAPAPPAQAAPETVAIPVRVAPEPAVAS-----AAPAPAAAPAAARLTPTEE 493
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
582-712 9.28e-10

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 62.31  E-value: 9.28e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   582 VPPTGDSGAPPVPPTGDSGAPPVPptgdsGAPPVPPTGDSGAPPVPPtgdsGAPPVPPTGDSGAPPVPPTGDSGAPPVPP 661
Cdd:PRK07764 385 LGVAGGAGAPAAAAPSAAAAAPAA-----APAPAAAAPAAAAAPAPA----AAPQPAPAPAPAPAPPSPAGNAPAGGAPS 455
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|.
gi 8176554   662 TGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAP 712
Cdd:PRK07764 456 PPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGA 506
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
575-722 1.01e-09

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 61.65  E-value: 1.01e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   575 GDSETAPVPPTGDSGAPPVPPTgdSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDS 654
Cdd:PRK14951 367 AAAAEAAAPAEKKTPARPEAAA--PAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAA 444
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   655 GAPPVPPTGDAGPPPVPPTGDSGPPPVPPTgdsgaPPVTPTGDSETAPVPPT--GDSGAPPVPPTGDSEA 722
Cdd:PRK14951 445 VALAPAPPAQAAPETVAIPVRVAPEPAVAS-----AAPAPAAAPAAARLTPTeeGDVWHATVQQLAAAEA 509
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
548-726 1.24e-09

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 62.11  E-value: 1.24e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    548 ALPTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTgDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVP 627
Cdd:PHA03307   86 STPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPP-ASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVAS 164
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    628 PTGDSGAPPVP-PTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVP--PTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVP 704
Cdd:PHA03307  165 DAASSRQAALPlSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRssPISASASSPAPAPGRSAADDAGASSSDSSSSES 244
                         170       180
                  ....*....|....*....|....*
gi 8176554    705 PTGDSG---APPVPPTGDSEAAPVP 726
Cdd:PHA03307  245 SGCGWGpenECPLPRPAPITLPTRI 269
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
575-737 1.33e-09

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 61.32  E-value: 1.33e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   575 GDSETAPVPPTGDSGAPPVP---PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPT 651
Cdd:NF033839 278 GLTQDTPKEPGNKKPSAPKPgmqPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQ 357
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   652 GDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTP---TGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPT 728
Cdd:NF033839 358 PEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPqpeKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEK 437

                 ....*....
gi 8176554   729 DDSKEAQMP 737
Cdd:NF033839 438 PKPEVKPQP 446
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
550-728 1.33e-09

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 61.73  E-value: 1.33e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    550 PTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSG---APPVPPTGDSGAPPV 626
Cdd:PHA03307  189 PPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGpenECPLPRPAPITLPTR 268
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    627 PptgDSGAPPVPPTGDSG-APPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAppvTPTGDSETAPVPP 705
Cdd:PHA03307  269 I---WEASGWNGPSSRPGpASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSST---SSSSESSRGAAVS 342
                         170       180
                  ....*....|....*....|...
gi 8176554    706 TGDSGAPPVPPTGDSEAAPVPPT 728
Cdd:PHA03307  343 PGPSPSRSPSPSRPPPPADPSSP 365
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
601-741 1.55e-09

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 61.54  E-value: 1.55e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   601 APPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPP 680
Cdd:PRK07764 589 GPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDG 668
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 8176554   681 VPPTGDSGAPPVTPTGDsetAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMPAVIR 741
Cdd:PRK07764 669 WPAKAGGAAPAAPPPAP---APAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQ 726
PHA03247 PHA03247
large tegument protein UL36; Provisional
565-734 1.57e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.88  E-value: 1.57e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    565 DSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGD-----SGAPPVPPTGDSGAPPVPPTGDSG------ 633
Cdd:PHA03247  253 AAPAPPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVwgaalAGAPLALPAPPDPPPPAPAGDAEEeddedg 332
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    634 ----APPVP---------------PT------------GDSGAPPVPP------TGDSGAPPV--PPTGDAGPPPVPPTG 674
Cdd:PHA03247  333 amevVSPLPrprqhyplgfpkrrrPTwtppssledlsaGRHHPKRASLptrkrrSARHAATPFarGPGGDDQTRPAAPVP 412
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 8176554    675 DSGPPPVPPTGDSGAPPvtPTGDSETAPVPPTGDSGAPP---VPPTGDSEAAPVPPTDDSKEA 734
Cdd:PHA03247  413 ASVPTPAPTPVPASAPP--PPATPLPSAEPGSDDGPAPPperQPPAPATEPAPDDPDDATRKA 473
PHA03169 PHA03169
hypothetical protein; Provisional
573-728 1.82e-09

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 60.37  E-value: 1.82e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   573 PTGD-SETAPVPPTGDSGAPPV-----PPTGDSGAPPVPPTGDSGaPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGap 646
Cdd:PHA03169  92 PSGSgSESVGSPTPSPSGSAEElasglSPENTSGSSPESPASHSP-PPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQ-- 168
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   647 pvPPTGDSGAPPVPPTGDA-----GPPPVPPTGDSGPPPVPPtgDSGAPPVTPTGDSetAPVPPTGDSGAPPVPPTGDSE 721
Cdd:PHA03169 169 --PSHEDSPEEPEPPTSEPepdspGPPQSETPTSSPPPQSPP--DEPGEPQSPTPQQ--APSPNTQQAVEHEDEPTEPER 242

                 ....*..
gi 8176554   722 AAPVPPT 728
Cdd:PHA03169 243 EGPPFPG 249
Drf_FH1 pfam06346
Formin Homology Region 1; This region is found in some of the Diaphanous related formins (Drfs) ...
570-727 1.86e-09

Formin Homology Region 1; This region is found in some of the Diaphanous related formins (Drfs). It consists of low complexity repeats of around 12 residues.


Pssm-ID: 461881 [Multi-domain]  Cd Length: 157  Bit Score: 57.19  E-value: 1.86e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    570 PVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTgdSGAPPVPPtgdsgaPPVPPTGDSGAPPVP 649
Cdd:pfam06346   3 PPPLPGDSSTIPLPPGACIPTPPPLPGGGGPPPPPPLPGSAAIPPPPPL--PGGTSIPP------PPPLPGAASIPPPPP 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    650 PTGDSGAPPVPP-TGDAGPPPVPPTGDSG----PPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAP 724
Cdd:pfam06346  75 LPGSTGIPPPPPlPGGAGIPPPPPPLPGGagvpPPPPPLPGGPGIPPPPPFPGGPGIPPPPPGMGMPPPPPFGFGVPAAP 154

                  ...
gi 8176554    725 VPP 727
Cdd:pfam06346 155 VLP 157
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
548-727 1.99e-09

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 61.34  E-value: 1.99e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    548 ALPTVTDQEATPVPPTGDSEATPvPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTgDSGAPPVPPTGDSGAPPVP 627
Cdd:PHA03307  264 TLPTRIWEASGWNGPSSRPGPAS-SSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSR-ESSSSSTSSSSESSRGAAV 341
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    628 PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTG 707
Cdd:PHA03307  342 SPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGA 421
                         170       180
                  ....*....|....*....|....*..
gi 8176554    708 DSGAPPVP-----PTGDS--EAAPVPP 727
Cdd:PHA03307  422 ASGAFYARyplltPSGEPwpGSPPPPP 448
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
557-735 2.85e-09

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 60.63  E-value: 2.85e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   557 ATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAP--PVPPTGDSGAPPVPPTGDSGAPPVPptgdsGA 634
Cdd:PRK07003 372 VPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAaaATRAEAPPAAPAPPATADRGDDAAD-----GD 446
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   635 PPVPPTGDSGAPPVPPTGDSGAPPV--PPTGDAGPPPVPPTGDSGP-PPVPPTGDSGAPPVTptgDSETAPVPPTGDSGA 711
Cdd:PRK07003 447 APVPAKANARASADSRCDERDAQPPadSGSASAPASDAPPDAAFEPaPRAAAPSAATPAAVP---DARAPAAASREDAPA 523
                        170       180
                 ....*....|....*....|....
gi 8176554   712 PPVPPTgdSEAAPVPPTDDSKEAQ 735
Cdd:PRK07003 524 AAAPPA--PEARPPTPAAAAPAAR 545
Abhydrolase_3 pfam07859
alpha/beta hydrolase fold; This catalytic domain is found in a very wide range of enzymes.
121-248 2.89e-09

alpha/beta hydrolase fold; This catalytic domain is found in a very wide range of enzymes.


Pssm-ID: 400284 [Multi-domain]  Cd Length: 208  Bit Score: 57.61  E-value: 2.89e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    121 MIWIYGGAFLMGSghgANFLNNYLydgEEIATRGNVIVVTFNYRVGPlgflstgDANLPGnyGLRDQHMAIAWVKRNIAA 200
Cdd:pfam07859   1 LVYFHGGGFVLGS---ADTHDRLC---RRLAAEAGAVVVSVDYRLAP-------EHPFPA--AYDDAYAALRWLAEQAAE 65
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 8176554    201 FGGDPNNITLFGESAGG--ASVSLQTLSPYNKGLIRAAisqsgVALSPWV 248
Cdd:pfam07859  66 LGADPSRIAVAGDSAGGnlAAAVALRARDEGLPKPAGQ-----VLIYPGT 110
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
564-738 2.91e-09

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 60.57  E-value: 2.91e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    564 GDSEATPVPPTGdseTAPVPPTGDSGAPPVPPTGDSG-APPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGD 642
Cdd:PHA03307  250 GPENECPLPRPA---PITLPTRIWEASGWNGPSSRPGpASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSS 326
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    643 SGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPV------PP 716
Cdd:PHA03307  327 SSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRArrrdatGR 406
                         170       180
                  ....*....|....*....|..
gi 8176554    717 TGDSEAAPVPPTDDSKEAQMPA 738
Cdd:PHA03307  407 FPAGRPRPSPLDAGAASGAFYA 428
PHA03169 PHA03169
hypothetical protein; Provisional
561-712 4.99e-09

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 59.21  E-value: 4.99e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   561 PPTGDSEATPV-----PPTGDSETAPVPPTGDSGaPPVPPTGDSGAPPVPPTGDSGAPPVPPTG-------DSGAPPVPP 628
Cdd:PHA03169 103 PTPSPSGSAEElasglSPENTSGSSPESPASHSP-PPSPPSHPGPHEPAPPESHNPSPNQQPSSflqpsheDSPEEPEPP 181
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   629 TG----DSGAPPVPPTGDSGAPPVPPtGDSGAPPVPPTGDAGPPpvPPTGDSGPPPVPPTGDS--GAPPVTPTGDSET-- 700
Cdd:PHA03169 182 TSepepDSPGPPQSETPTSSPPPQSP-PDEPGEPQSPTPQQAPS--PNTQQAVEHEDEPTEPEreGPPFPGHRSHSYTvv 258
                        170
                 ....*....|..
gi 8176554   701 APVPPTGDSGAP 712
Cdd:PHA03169 259 GWKPSTRPGGVP 270
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
580-689 5.38e-09

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 59.44  E-value: 5.38e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   580 APVPPtgdsgAPPVPPTGDSGAPPVPPTGDSGAPPVPPTgdSGAPPVPPTGDSgAPPVPPTGDSGAPPVPPTgdsgaPPV 659
Cdd:PRK14950 361 VPVPA-----PQPAKPTAAAPSPVRPTPAPSTRPKAAAA--ANIPPKEPVRET-ATPPPVPPRPVAPPVPHT-----PES 427
                         90       100       110
                 ....*....|....*....|....*....|
gi 8176554   660 PPTGDAGPPPVPPTGDSGPPPVPPTGDSGA 689
Cdd:PRK14950 428 APKLTRAAIPVDEKPKYTPPAPPKEEEKAL 457
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
559-703 5.60e-09

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 59.61  E-value: 5.60e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   559 PVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPpTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPptgdSGAPPVP 638
Cdd:PRK07764 668 GWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAP-APAATPPAGQADDPAAQPPQAAQGASAPSPAA----DDPVPLP 742
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 8176554   639 PTGDSGAPPVPPTGDSGAPPVPPtgdAGPPPV--PPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPV 703
Cdd:PRK07764 743 PEPDDPPDPAGAPAQPPPPPAPA---PAAAPAaaPPPSPPSEEEEMAEDDAPSMDDEDRRDAEEVAM 806
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
614-723 1.02e-08

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 58.67  E-value: 1.02e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   614 PVPPtgdsgAPPVPPTGDSGAPPVPPTGDSGAPPVPPTgdSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPtgdsgaPPVT 693
Cdd:PRK14950 362 PVPA-----PQPAKPTAAAPSPVRPTPAPSTRPKAAAA--ANIPPKEPVRETATPPPVPPRPVAPPVPHT------PESA 428
                         90       100       110
                 ....*....|....*....|....*....|
gi 8176554   694 PTGDSETAPVPPTGDSGAPPVPPTGDSEAA 723
Cdd:PRK14950 429 PKLTRAAIPVDEKPKYTPPAPPKEEEKALI 458
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
548-738 1.33e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 58.63  E-value: 1.33e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    548 ALPTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTgdsgAPPVPPTGDSGAPPVP 627
Cdd:pfam03154 193 QAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPM----TQPPPPSQVSPQPLPQ 268
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    628 PTGDSGAPPVPPTGDSGAP----PVPP--------TGDSGAPPVPPTGDAGP----PPVPPTGDSGPPPVPPTgDSGAPP 691
Cdd:pfam03154 269 PSLHGQMPPMPHSLQTGPShmqhPVPPqpfpltpqSSQSQVPPGPSPAAPGQsqqrIHTPPSQSQLQSQQPPR-EQPLPP 347
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*...
gi 8176554    692 vTPTGDSETAPVPPTgdsgapPVPPTGDSEAAPVPPTDDSKEA-QMPA 738
Cdd:pfam03154 348 -APLSMPHIKPPPTT------PIPQLPNPQSHKHPPHLSGPSPfQMNS 388
Chi1 COG3469
Chitinase [Carbohydrate transport and metabolism];
536-732 1.38e-08

Chitinase [Carbohydrate transport and metabolism];


Pssm-ID: 442692 [Multi-domain]  Cd Length: 534  Bit Score: 57.84  E-value: 1.38e-08
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  536 TNFLRYWTLTYLALPTVTDQEATPVPPTGDSEATPVPPTGdseTAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPV 615
Cdd:COG3469  21 TLLGAAATAASVTLTAATATTVVSTTGSVVVAASGSAGSG---TGTTAASSTAATSSTTSTTATATAAAAAATSTSATLV 97
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  616 PPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPT 695
Cdd:COG3469  98 ATSTASGANTGTSTVTTTSTGAGSVTSTTSSTAGSTTTSGASATSSAGSTTTTTTVSGTETATGGTTTTSTTTTTTSAST 177
                       170       180       190
                ....*....|....*....|....*....|....*..
gi 8176554  696 GDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSK 732
Cdd:COG3469 178 TPSATTTATATTASGATTPSATTTATTTGPPTPGLPK 214
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
559-727 1.86e-08

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 58.26  E-value: 1.86e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    559 PVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPtgDSGAPPVPP---TGDSGAP 635
Cdd:PHA03307  220 PAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGP--SSRPGPASSsssPRERSPS 297
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    636 PVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTgdSGPPPVPPtGDSGAPPVTPTGDSETAPVPPTGDSGAPPVP 715
Cdd:PHA03307  298 PSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSES--SRGAAVSP-GPSPSRSPSPSRPPPPADPSSPRKRPRPSRA 374
                         170
                  ....*....|..
gi 8176554    716 PTGDSEAAPVPP 727
Cdd:PHA03307  375 PSSPAASAGRPT 386
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
548-677 2.53e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 57.69  E-value: 2.53e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   548 ALPTVTDQEATPVPPTGDSEATPVPpTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPptgdSGAPPVPPTGDSGAPPVP 627
Cdd:PRK07764 679 AAPPPAPAPAAPAAPAGAAPAQPAP-APAATPPAGQADDPAAQPPQAAQGASAPSPAA----DDPVPLPPEPDDPPDPAG 753
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|
gi 8176554   628 PTGDSGAPPVPPTGDSGAPPvPPTGDSGAPPVPPTGDAGPPPVPPTGDSG 677
Cdd:PRK07764 754 APAQPPPPPAPAPAAAPAAA-PPPSPPSEEEEMAEDDAPSMDDEDRRDAE 802
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
621-739 2.78e-08

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 57.30  E-value: 2.78e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   621 SGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTgDSGPPPVPPTGDSGAPPVTPTGDSET 700
Cdd:PRK07764 383 RRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAP-APAPAPPSPAGNAPAGGAPSPPPAAA 461
                         90       100       110
                 ....*....|....*....|....*....|....*....
gi 8176554   701 APVPPTGdSGAPPVPPTGDSEAAPVPPTDDSKEAQMPAV 739
Cdd:PRK07764 462 PSAQPAP-APAAAPEPTAAPAPAPPAAPAPAAAPAAPAA 499
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
550-740 3.16e-08

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 57.49  E-value: 3.16e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    550 PTVTDQEATPVPPTGDSEATPVPP---TGDSETAPVPPTGDSGAPPVPptgDSGAPPVPPTGDSG-APPVPPTGDSGAPP 625
Cdd:PHA03307  222 PAPGRSAADDAGASSSDSSSSESSgcgWGPENECPLPRPAPITLPTRI---WEASGWNGPSSRPGpASSSSSPRERSPSP 298
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    626 VPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPP 705
Cdd:PHA03307  299 SPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSP 378
                         170       180       190
                  ....*....|....*....|....*....|....*..
gi 8176554    706 TGDSGAPPVPPTGDSEAAPVPPTDDS--KEAQMPAVI 740
Cdd:PHA03307  379 AASAGRPTRRRARAAVAGRARRRDATgrFPAGRPRPS 415
SAP130_C pfam16014
Histone deacetylase complex subunit SAP130 C-terminus;
557-727 3.96e-08

Histone deacetylase complex subunit SAP130 C-terminus;


Pssm-ID: 464973 [Multi-domain]  Cd Length: 371  Bit Score: 56.09  E-value: 3.96e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    557 ATPVppTGDSEATPVPpTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPP-----VPPTGDSGAPPVPPTGD 631
Cdd:pfam16014  31 APPV--TVAVEALPGQ-NSEQQTASASPPSQHPAQAIPTILAPAAPPSQPSVVLSTLPaamavTPPIPASMANVVAPPTQ 107
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    632 SGAPPVPPTGDSGAPP-------VPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPP--VPPtgdsGAPP---------VT 693
Cdd:pfam16014 108 PAASSTAACAVSSVLPeikikqeAEPMDTSQSVPPLTPTSISPALTSLANNLSVPAgdLLP----GASPrkkprkqqhVI 183
                         170       180       190
                  ....*....|....*....|....*....|....
gi 8176554    694 PTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPP 727
Cdd:pfam16014 184 STEEGEMMETNSTDEEKSAPKPLTSRAEKRKSPP 217
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
550-715 4.31e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 56.70  E-value: 4.31e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    550 PTVTDQEATPVPPTGdseATPVPPTGDSETAPVPPTGDSGAPPV----------------------------PPTGDSGA 601
Cdd:pfam03154 188 PPGTTQAATAGPTPS---APSVPPQGSPATSQPPNQTQSTAAPHtliqqtptlhpqrlpsphpplqpmtqppPPSQVSPQ 264
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    602 PPVPPTGDSGAPPVPPTGDSGAPPVP-PTGDSGAPPVPPTGDSGAPPVPPTGDSG----APPVPPTGDAGPPPVPPTGDS 676
Cdd:pfam03154 265 PLPQPSLHGQMPPMPHSLQTGPSHMQhPVPPQPFPLTPQSSQSQVPPGPSPAAPGqsqqRIHTPPSQSQLQSQQPPREQP 344
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|
gi 8176554    677 GPPPVPPTGDSGAPPVTPTGDSET-----------APVPPTGDSGAPPVP 715
Cdd:pfam03154 345 LPPAPLSMPHIKPPPTTPIPQLPNpqshkhpphlsGPSPFQMNSNLPPPP 394
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
559-737 4.65e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 56.70  E-value: 4.65e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    559 PVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVP-PTGDSGAPPVPPtgdSGAPPVPPTGDSGAPPV 637
Cdd:pfam03154 149 PSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAAtAGPTPSAPSVPP---QGSPATSQPPNQTQSTA 225
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    638 PP-----TGDSGAPPVPPTGDS---GAPPVPPTGDAGPPPVPPTGDSGP-PPVPPTGDSGAPPVTPTGDSETAPVPP-TG 707
Cdd:pfam03154 226 APhtliqQTPTLHPQRLPSPHPplqPMTQPPPPSQVSPQPLPQPSLHGQmPPMPHSLQTGPSHMQHPVPPQPFPLTPqSS 305
                         170       180       190
                  ....*....|....*....|....*....|....
gi 8176554    708 DSGAPPVP----PTGDSEAAPVPPTDDSKEAQMP 737
Cdd:pfam03154 306 QSQVPPGPspaaPGQSQQRIHTPPSQSQLQSQQP 339
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
557-656 4.68e-08

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 56.36  E-value: 4.68e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   557 ATPVPPTgDSEATPVPPTGDSETAPVPPTGDSgAPPVPPTGDSgAPPVPPTGDSGAPPVPPTgdsgaPPVPPTGDSGAPP 636
Cdd:PRK14950 366 PQPAKPT-AAAPSPVRPTPAPSTRPKAAAAAN-IPPKEPVRET-ATPPPVPPRPVAPPVPHT-----PESAPKLTRAAIP 437
                         90       100
                 ....*....|....*....|
gi 8176554   637 VPPTGDSGAPPVPPTGDSGA 656
Cdd:PRK14950 438 VDEKPKYTPPAPPKEEEKAL 457
DAP2 COG1506
Dipeptidyl aminopeptidase/acylaminoacyl peptidase [Amino acid transport and metabolism];
118-243 4.88e-08

Dipeptidyl aminopeptidase/acylaminoacyl peptidase [Amino acid transport and metabolism];


Pssm-ID: 441115 [Multi-domain]  Cd Length: 234  Bit Score: 54.64  E-value: 4.88e-08
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  118 LPVMIWIYGGaflmgsghGANFLNNYLYDGEEIATRGnVIVVTFNYRvgplGFlsTGDANLPGNYGLRDQHMAIAWVkrn 197
Cdd:COG1506  23 YPVVVYVHGG--------PGSRDDSFLPLAQALASRG-YAVLAPDYR----GY--GESAGDWGGDEVDDVLAAIDYL--- 84
                        90       100       110       120
                ....*....|....*....|....*....|....*....|....*.
gi 8176554  198 IAAFGGDPNNITLFGESAGGASVSLqtLSPYNKGLIRAAISQSGVA 243
Cdd:COG1506  85 AARPYVDPDRIGIYGHSYGGYMALL--AAARHPDRFKAAVALAGVS 128
SepH NF040712
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ...
566-715 5.31e-08

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 55.54  E-value: 5.31e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   566 SEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPV-PPTGDSGAPPVPPTGDSG 644
Cdd:NF040712 188 IDPDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRRrAGVEQPEDEPVGPGAAPA 267
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 8176554   645 APPVPPTGDSGAPPVPPTGDAGPPPvPPTGDSGPPPVPPTGDSGAPPVTPtgdSETAPVPPTGDSGAPPVP 715
Cdd:NF040712 268 AEPDEATRDAGEPPAPGAAETPEAA-EPPAPAPAAPAAPAAPEAEEPARP---EPPPAPKPKRRRRRASVP 334
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
548-725 5.37e-08

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 56.40  E-value: 5.37e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   548 ALPTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTgdsGAPPVPPTGDS------GAPPVPPTGDS 621
Cdd:PRK07003 379 AVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPP---AAPAPPATADRgddaadGDAPVPAKANA 455
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   622 GAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTgdSETA 701
Cdd:PRK07003 456 RASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPA--PEAR 533
                        170       180
                 ....*....|....*....|....
gi 8176554   702 PVPPTGDSgaPPVPPTGDSEAAPV 725
Cdd:PRK07003 534 PPTPAAAA--PAARAGGAAAALDV 555
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
555-728 5.45e-08

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 56.72  E-value: 5.45e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    555 QEATPVPPTGDSEATPVPPTGdsETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSG- 633
Cdd:PHA03307  252 ENECPLPRPAPITLPTRIWEA--SGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSt 329
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    634 -------APPVPPTGDSGAPPVPPTGDSGAPPVPPTGdAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPvPPT 706
Cdd:PHA03307  330 ssssessRGAAVSPGPSPSRSPSPSRPPPPADPSSPR-KRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDAT-GRF 407
                         170       180
                  ....*....|....*....|..
gi 8176554    707 GDSGAPPVPPTGDSEAAPVPPT 728
Cdd:PHA03307  408 PAGRPRPSPLDAGAASGAFYAR 429
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
584-706 6.81e-08

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 55.84  E-value: 6.81e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   584 PTGDSGAPPvpptGDSGAPPvPPTGDSGAPPVPPT-GDSGAPPVPP-TGDSGAPPVPPTGDSGAPPVPPtgdSGAPPVPP 661
Cdd:PRK14959 373 PSGGGASAP----SGSAAEG-PASGGAATIPTPGTqGPQGTAPAAGmTPSSAAPATPAPSAAPSPRVPW---DDAPPAPP 444
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*
gi 8176554   662 TGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPT 706
Cdd:PRK14959 445 RSGIPPRPAPRMPEASPVPGAPDSVASASDAPPTLGDPSDTAEHT 489
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
606-729 8.89e-08

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 55.46  E-value: 8.89e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   606 PTGDSGAPPvpptGDSGAPPvPPTGDSGAPPVPPT-GDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSgpPPVPPT 684
Cdd:PRK14959 373 PSGGGASAP----SGSAAEG-PASGGAATIPTPGTqGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDA--PPAPPR 445
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*..
gi 8176554   685 gdSGAPPVTPTGDSETAPVP--PTGDSGAPPVPPTGDSEAAPVPPTD 729
Cdd:PRK14959 446 --SGIPPRPAPRMPEASPVPgaPDSVASASDAPPTLGDPSDTAEHTP 490
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
578-738 9.05e-08

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 55.63  E-value: 9.05e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   578 ETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGD-SGAPPVPPTGDSGAP--PVPPTGDSGAPPVPPTGDS 654
Cdd:PRK07003 359 EPAVTGGGAPGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAvTGAAGAALAPKAAAAaaATRAEAPPAAPAPPATADR 438
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   655 ------GAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPT 728
Cdd:PRK07003 439 gddaadGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASR 518
                        170
                 ....*....|
gi 8176554   729 DDSKEAQMPA 738
Cdd:PRK07003 519 EDAPAAAAPP 528
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
550-739 9.71e-08

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 55.45  E-value: 9.71e-08
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  550 PTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPP--------VPPTGDSGAPPVPPTGDSGAPPvPPTGDS 621
Cdd:COG5180 278 PGLPVLEAGSEPQSDAPEAETARPIDVKGVASAPPATRPVRPPggardpgtPRPGQPTERPAGVPEAASDAGQ-PPSAYP 356
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  622 GAPPVPPtGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGdseta 701
Cdd:COG5180 357 PAEEAVP-GKPLEQGAPRPGSSGGDGAPFQPPNGAPQPGLGRRGAPGPPMGAGDLVQAALDGGGRETASLGGAAG----- 430
                       170       180       190
                ....*....|....*....|....*....|....*...
gi 8176554  702 pvPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMPAV 739
Cdd:COG5180 431 --GAGQGPKADFVPGDAESVSGPAGLADQAGAAASTAM 466
dnaA PRK14086
chromosomal replication initiator protein DnaA;
551-732 9.74e-08

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 55.60  E-value: 9.74e-08
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   551 TVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGapPVPPTGDSGAP---------PVPPTGDS 621
Cdd:PRK14086  87 TVDPSAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQ--DQLPTARPAYPayqqrpepgAWPRAADD 164
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   622 GAPPVPPTG-DSGAPPVPPTGDSGAPPV--PPTGDSGAPPVPPTGD---AGPPPVPPTGDSGPPPVPPTGdSGAPPVTPT 695
Cdd:PRK14086 165 YGWQQQRLGfPPRAPYASPASYAPEQERdrEPYDAGRPEYDQRRRDydhPRPDWDRPRRDRTDRPEPPPG-AGHVHRGGP 243
                        170       180       190
                 ....*....|....*....|....*....|....*..
gi 8176554   696 GDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSK 732
Cdd:PRK14086 244 GPPERDDAPVVPIRPSAPGPLAAQPAPAPGPGEPTAR 280
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
552-737 1.06e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 55.63  E-value: 1.06e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   552 VTDQEATPVPPTGDS---EATPVPPTGDSET------APVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSG 622
Cdd:PRK07003 410 LAPKAAAAAAATRAEappAAPAPPATADRGDdaadgdAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAA 489
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   623 APPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPP------PVPPTG------------------DSGP 678
Cdd:PRK07003 490 FEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPtpaaaaPAARAGgaaaaldvlrnagmrvssDRGA 569
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*....
gi 8176554   679 PPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMP 737
Cdd:PRK07003 570 RAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQAAESRGAPPP 628
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
625-740 1.21e-07

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 55.20  E-value: 1.21e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   625 PVPPtgdsgAPPVPPTGDSGAPPVPPTGDSGAPPVPPTgdAGPPPVPPTGDSGPPPVPPTgDSGAPPVTPTgdsetAPVP 704
Cdd:PRK14950 362 PVPA-----PQPAKPTAAAPSPVRPTPAPSTRPKAAAA--ANIPPKEPVRETATPPPVPP-RPVAPPVPHT-----PESA 428
                         90       100       110
                 ....*....|....*....|....*....|....*.
gi 8176554   705 PTGDSGAPPVPPTGDSEaAPVPPTDDSKEAQMPAVI 740
Cdd:PRK14950 429 PKLTRAAIPVDEKPKYT-PPAPPKEEEKALIADGDV 463
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
603-705 1.22e-07

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 55.20  E-value: 1.22e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   603 PVPPtgdsgAPPVPPTGDSGAPPVPPTGDSGAPPVPPTgdSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPtgdsgpPPVP 682
Cdd:PRK14950 362 PVPA-----PQPAKPTAAAPSPVRPTPAPSTRPKAAAA--ANIPPKEPVRETATPPPVPPRPVAPPVPHT------PESA 428
                         90       100
                 ....*....|....*....|...
gi 8176554   683 PTGDSGAPPVtPTGDSETAPVPP 705
Cdd:PRK14950 429 PKLTRAAIPV-DEKPKYTPPAPP 450
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
617-738 1.34e-07

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 55.11  E-value: 1.34e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   617 PTGDSGAPPVPPTgdsgAPPVPPTGDSGAP-PVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTgdSGAPPVTPT 695
Cdd:PRK14951 366 PAAAAEAAAPAEK----KTPARPEAAAPAAaPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPA--AAAPAAAPA 439
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|...
gi 8176554   696 GDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMPA 738
Cdd:PRK14951 440 AAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAA 482
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
592-691 1.40e-07

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 54.82  E-value: 1.40e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   592 PVPPtgdsgAPPVPPTGDSGAPPVPPTGDSGAPPVPPTgdSGAPPVPPTGDSgAPPVPPTGDSGAPPVPPtgdagPPPVP 671
Cdd:PRK14950 362 PVPA-----PQPAKPTAAAPSPVRPTPAPSTRPKAAAA--ANIPPKEPVRET-ATPPPVPPRPVAPPVPH-----TPESA 428
                         90       100
                 ....*....|....*....|..
gi 8176554   672 P--TGDSGPPPVPPTGDSGAPP 691
Cdd:PRK14950 429 PklTRAAIPVDEKPKYTPPAPP 450
PHA03247 PHA03247
large tegument protein UL36; Provisional
554-734 2.15e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 54.94  E-value: 2.15e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    554 DQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGD-----SGAPPVPPTGDSGAPPVPPTGDSG------ 622
Cdd:PHA03247  253 AAPAPPPVVGEGADRAPETARGATGPPPPPEAAAPNGAAAPPDGVwgaalAGAPLALPAPPDPPPPAPAGDAEEeddedg 332
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    623 ----APPVP---------------PT------------GDSGAPPVPP------TGDSGAPPV--PPTGDSGAPPVPPTG 663
Cdd:PHA03247  333 amevVSPLPrprqhyplgfpkrrrPTwtppssledlsaGRHHPKRASLptrkrrSARHAATPFarGPGGDDQTRPAAPVP 412
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 8176554    664 DAGPPPVPPtgdSGPPPVPPTGDSgaPPVTPTGDSETAPVPPTgdSGAPPVPPTGDSEAAPVPPTDDSKEA 734
Cdd:PHA03247  413 ASVPTPAPT---PVPASAPPPPAT--PLPSAEPGSDDGPAPPP--ERQPPAPATEPAPDDPDDATRKALDA 476
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
583-739 2.48e-07

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 54.47  E-value: 2.48e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   583 PPTGDSGAPPvpptgdSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAP--PVPPTGDSGAPPVP 660
Cdd:PRK07003 360 PAVTGGGAPG------GGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAAaaATRAEAPPAAPAPP 433
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   661 PTGDAGPPPVPptgdsGPPPVPPTGDSGAPPVTPTGDSETAPV--PPTGDSGAPPVPPTGDSEAApvPPTDDSKEAQMPA 738
Cdd:PRK07003 434 ATADRGDDAAD-----GDAPVPAKANARASADSRCDERDAQPPadSGSASAPASDAPPDAAFEPA--PRAAAPSAATPAA 506

                 .
gi 8176554   739 V 739
Cdd:PRK07003 507 V 507
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
561-678 3.15e-07

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 53.92  E-value: 3.15e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   561 PPTGdsEATPVPPTGDSETAPVPPT-GDSGAPPVPP-TGDSGAPPVPPTGDSGAPPVPPtgdSGAPPVPPTgdSGAPPVP 638
Cdd:PRK14959 380 APSG--SAAEGPASGGAATIPTPGTqGPQGTAPAAGmTPSSAAPATPAPSAAPSPRVPW---DDAPPAPPR--SGIPPRP 452
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|..
gi 8176554   639 PTGDSGAPPVP--PTGDSGAPPVPPTGDAGPPPVPPTgDSGP 678
Cdd:PRK14959 453 APRMPEASPVPgaPDSVASASDAPPTLGDPSDTAEHT-PSGP 493
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
595-717 4.07e-07

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 53.53  E-value: 4.07e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   595 PTGDSGAPPvpptGDSGAPPvPPTGDSGAPPVPPT-GDSGAPPVPP-TGDSGAPPVPPTGDSGAPPVpPTGDAgpPPVPP 672
Cdd:PRK14959 373 PSGGGASAP----SGSAAEG-PASGGAATIPTPGTqGPQGTAPAAGmTPSSAAPATPAPSAAPSPRV-PWDDA--PPAPP 444
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*
gi 8176554   673 TGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPT 717
Cdd:PRK14959 445 RSGIPPRPAPRMPEASPVPGAPDSVASASDAPPTLGDPSDTAEHT 489
BimA_second NF040983
trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia ...
628-731 4.38e-07

trimeric autotransporter actin-nucleating factor BimA; This HMM describes BimA (Burkholderia intracellular motility A), WP_004266405.1-like proteins in Burkholderia mallei or B. pseudomallei. The term BimA has also been used for WP_011205626.1-like homologs that have a very different N-terminal half.


Pssm-ID: 468913 [Multi-domain]  Cd Length: 382  Bit Score: 52.98  E-value: 4.38e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   628 PTGDSGAP-PVPPtgdsgaPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTgdSGAPPVTPTGDSETAPVPPT 706
Cdd:NF040983  79 PVGDRTLPnKVPP------PPPPPPPPPPPPPTPPPPPPPPPPPPPPSPPPPPPPSPP--PSPPPPTTTPPTRTTPSTTT 150
                         90       100
                 ....*....|....*....|....*
gi 8176554   707 GDSGAPPVPPTGDSEAAPVPPTDDS 731
Cdd:NF040983 151 PTPSMHPIQPTQLPSIPNATPTSGS 175
PHA03378 PHA03378
EBNA-3B; Provisional
550-716 6.11e-07

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 53.15  E-value: 6.11e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   550 PTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPvpptgdSGAPPVPPTGDSGAP-PVPPTGDSGAPPVPP 628
Cdd:PHA03378 664 PTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRP------PAAPPGRAQRPAAATgRARPPAAAPGRARPP 737
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   629 TGDSGAPPvPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGdsgpPPVPPTGDSGAPPVTPtgdseTAPVPPTGD 708
Cdd:PHA03378 738 AAAPGRAR-PPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQA----PPAPQQRPRGAPTPQP-----PPQAGPTSM 807

                 ....*...
gi 8176554   709 SGAPPVPP 716
Cdd:PHA03378 808 QLMPRAAP 815
PRK10263 PRK10263
DNA translocase FtsK; Provisional
558-737 6.14e-07

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 53.17  E-value: 6.14e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    558 TPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPptgdSGAPPVPPTGDSGAPPVPPTgDSGAPPVPPTGDSGAPPV 637
Cdd:PRK10263  341 TQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAP----EGYPQQSQYAQPAVQYNEPL-QQPVQPQQPYYAPAAEQP 415
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    638 PPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPvPPTGDSGAPPVTPTgdsetapVPPTGDSGAPPVPPT 717
Cdd:PRK10263  416 AQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAP-QSTYQTEQTYQQPA-------AQEPLYQQPQPVEQQ 487
                         170       180
                  ....*....|....*....|
gi 8176554    718 GDSEaaPVPPTDDSKEAQMP 737
Cdd:PRK10263  488 PVVE--PEPVVEETKPARPP 505
SepH NF040712
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ...
558-692 6.17e-07

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 52.08  E-value: 6.17e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   558 TPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPV 637
Cdd:NF040712 200 ATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRRRAGVEQPEDEPVGPGAAPAAEPDEATRDAGE 279
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 8176554   638 PPTgdSGAPPVPPTGDSGAP-PVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPV 692
Cdd:NF040712 280 PPA--PGAAETPEAAEPPAPaPAAPAAPAAPEAEEPARPEPPPAPKPKRRRRRASV 333
PHA03247 PHA03247
large tegument protein UL36; Provisional
558-719 6.27e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 53.40  E-value: 6.27e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    558 TPVPPTGDSEATPVPPTGDSETAPVPPTGD-----SGAPPVPPTGDSGAPPVPPTGDSG----------APPVP------ 616
Cdd:PHA03247  268 APETARGATGPPPPPEAAAPNGAAAPPDGVwgaalAGAPLALPAPPDPPPPAPAGDAEEeddedgamevVSPLPrprqhy 347
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    617 PTGDS--------------------------------------GAPPV--PPTGDSGAPPVPPTGDSGAPPVPPTGDSGA 656
Cdd:PHA03247  348 PLGFPkrrrptwtppssledlsagrhhpkraslptrkrrsarhAATPFarGPGGDDQTRPAAPVPASVPTPAPTPVPASA 427
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 8176554    657 PPVPptgdAGPPPVPPTGDSGPPPVPPTGDSGAP---PVTPTGDSETAPVPPTGDSGAPPVPPTGD 719
Cdd:PHA03247  428 PPPP----ATPLPSAEPGSDDGPAPPPERQPPAPatePAPDDPDDATRKALDALRERRPPEPPGAD 489
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
590-716 6.53e-07

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 50.04  E-value: 6.53e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    590 APPVPPTGdsGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvpPTGDSGAPPVPPTGdsgapPVPPTGDAGPPP 669
Cdd:pfam15240  45 GPQGPPPG--GFPPQPPASDDPPGPPPPGGPQQPPPQGGKQKPQGPP--PQGGPRPPPGKPQG-----PPPQGGNQQQGP 115
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*..
gi 8176554    670 VPPTGDSGPPPvpptgdSGAPPVTPTGDSETAPVPPTGDSGAPPVPP 716
Cdd:pfam15240 116 PPPGKPQGPPP------QGGGPPPQGGNQQGPPPPPPGNPQGPPQRP 156
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
556-704 6.98e-07

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 52.08  E-value: 6.98e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   556 EATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPV-PPTGDSGAPPVPPTGDSGA 634
Cdd:NF040712 189 DPDFGRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRRrAGVEQPEDEPVGPGAAPAA 268
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   635 PPVPPTGDSGAPPVPPTGDSGAPPvPPTGDAGPPPVPPTGDSGPPPVPPtgdSGAPPVTPTGDSETAPVP 704
Cdd:NF040712 269 EPDEATRDAGEPPAPGAAETPEAA-EPPAPAPAAPAAPAAPEAEEPARP---EPPPAPKPKRRRRRASVP 334
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
590-738 7.36e-07

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 52.08  E-value: 7.36e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   590 APPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPtgDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPP 669
Cdd:NF040712 193 GRPLRPLATVPRLAREPADARPEEVEPAPAAEGAPATDS--DPAEAGTPDDLASARRRRAGVEQPEDEPVGPGAAPAAEP 270
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 8176554   670 VPPTGDSGPPPVPPTGDSGAPPVTPtgdsETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMPA 738
Cdd:NF040712 271 DEATRDAGEPPAPGAAETPEAAEPP----APAPAAPAAPAAPEAEEPARPEPPPAPKPKRRRRRASVPS 335
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
552-671 9.85e-07

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 51.69  E-value: 9.85e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   552 VTDQEATPVPPTGDSEATPVPPTGDSETAPV--PPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvPPT 629
Cdd:NF040712 217 VEPAPAAEGAPATDSDPAEAGTPDDLASARRrrAGVEQPEDEPVGPGAAPAAEPDEATRDAGEPPAPGAAETPEAA-EPP 295
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|..
gi 8176554   630 GDSGAPPVPPTGDSGAPPVPPtgdSGAPPVPPTGDAGPPPVP 671
Cdd:NF040712 296 APAPAAPAAPAAPEAEEPARP---EPPPAPKPKRRRRRASVP 334
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
550-742 1.05e-06

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 51.99  E-value: 1.05e-06
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  550 PTVTDQEATPV--PPTGDSEATPVPPtgdseTAPVPPTgdsgappVPPTGDSGAPPVPPTGDSGAPPVPPtgdsGAPPVP 627
Cdd:COG5180 338 PAGVPEAASDAgqPPSAYPPAEEAVP-----GKPLEQG-------APRPGSSGGDGAPFQPPNGAPQPGL----GRRGAP 401
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  628 PTGDSGAPPVPPTGDSGappVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTptgdSETAPVPPTG 707
Cdd:COG5180 402 GPPMGAGDLVQAALDGG---GRETASLGGAAGGAGQGPKADFVPGDAESVSGPAGLADQAGAAAST----AMADFVAPVT 474
                       170       180       190
                ....*....|....*....|....*....|....*
gi 8176554  708 DSGAPPVPPTGDSEAAPVPPTDDSKEAQMPAVIRF 742
Cdd:COG5180 475 DATPVDVADVLGVRPDAILGGNVAPASGLDAETRI 509
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
576-738 1.16e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 52.10  E-value: 1.16e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    576 DSETAPVPPTGDSGAPPVPPTGDSGAPPVPPtgdsgAPPVPPTGDSGAPPVPPTGDSGAPPvpptgdSGAPPVPPTGDSG 655
Cdd:PHA03307   17 GGEFFPRPPATPGDAADDLLSGSQGQLVSDS-----AELAAVTVVAGAAACDRFEPPTGPP------PGPGTEAPANESR 85
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    656 APPVPPTGDAGPPPVPPTGDSGPPpvPPTGDSGAPPVTPTGdsETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQ 735
Cdd:PHA03307   86 STPTWSLSTLAPASPAREGSPTPP--GPSSPDPPPPTPPPA--SPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAA 161

                  ...
gi 8176554    736 MPA 738
Cdd:PHA03307  162 VAS 164
PHA03247 PHA03247
large tegument protein UL36; Provisional
547-734 1.20e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 1.20e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    547 LALPTVTDQ-EATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPvPPTGDSGAPPVPptgdSGAPP 625
Cdd:PHA03247 2886 LARPAVSRStESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPL-APTTDPAGAGEP----SGAVP 2960
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    626 VPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTG-----------------DAGPPPVPPTGDSGPPPVPPTGDSG 688
Cdd:PHA03247 2961 QPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGhslsrvsswasslalheETDPPPVSLKQTLWPPDDTEDSDAD 3040
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*.
gi 8176554    689 APPVTPTGDSETAPVPPtgDSGAPPVPPTgdSEAAPVPPTDDSKEA 734
Cdd:PHA03247 3041 SLFDSDSERSDLEALDP--LPPEPHDPFA--HEPDPATPEAGARES 3082
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
595-741 1.43e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 51.79  E-value: 1.43e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   595 PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSgAPPVPPtgdsgAPPVPPTGDAGPPPVPPTG 674
Cdd:PRK07994 361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQ-APAVPL-----PETTSQLLAARQQLQRAQG 434
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 8176554   675 DSGPPPVPPTGDSGAPPVTPTGDsETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQM-PAVIR 741
Cdd:PRK07994 435 ATKAKKSEPAAASRARPVNSALE-RLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVAtPKALK 501
PHA03247 PHA03247
large tegument protein UL36; Provisional
557-653 1.79e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.86  E-value: 1.79e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    557 ATPV--PPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPptgdSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTG---D 631
Cdd:PHA03247  392 ATPFarGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPP----ATPLPSAEPGSDDGPAPPPERQPPAPATEPAPddpD 467
                          90       100
                  ....*....|....*....|..
gi 8176554    632 SGAPPVPPTGDSGAPPVPPTGD 653
Cdd:PHA03247  468 DATRKALDALRERRPPEPPGAD 489
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
571-730 1.79e-06

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 50.54  E-value: 1.79e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   571 VPPTGDSETAPVPPTGDSGAPPVPP-----TGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPV-PPTGDSG 644
Cdd:NF040712 177 VTALDDEARWLIDPDFGRPLRPLATvprlaREPADARPEEVEPAPAAEGAPATDSDPAEAGTPDDLASARRRrAGVEQPE 256
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   645 APPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPvPPTGDSGAPPVTPTGDSETAPVPPtgdSGAPPVPPTGDSEAAP 724
Cdd:NF040712 257 DEPVGPGAAPAAEPDEATRDAGEPPAPGAAETPEAA-EPPAPAPAAPAAPAAPEAEEPARP---EPPPAPKPKRRRRRAS 332

                 ....*.
gi 8176554   725 VPPTDD 730
Cdd:NF040712 333 VPSWDD 338
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
584-735 1.93e-06

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 51.40  E-value: 1.93e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   584 PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPtgdsgAPPVPPtgdSGAPPVPPTGDSGAPP--VPP 661
Cdd:PRK07994 361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPAS-----APQQAP---AVPLPETTSQLLAARQqlQRA 432
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 8176554   662 TGDAGPPPVPPTGDSGPPPVPptgdSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQ 735
Cdd:PRK07994 433 QGATKAKKSEPAAASRARPVN----SALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKK 502
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
559-730 2.00e-06

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 51.22  E-value: 2.00e-06
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  559 PVPPTGDSEATPVPPTGDSETAPVPPtGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVP 638
Cdd:COG5180 233 KVDPPSTSEARSRPATVDAQPEMRPP-ADAKERRRAAIGDTPAAEPPGLPVLEAGSEPQSDAPEAETARPIDVKGVASAP 311
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  639 PTGDSGAPPvPPTGDSGAPP----------VPPTG-DAGPPPVPPTGDSGPPPVPPtGDSGAPPVTPTGDSETAPVPPTG 707
Cdd:COG5180 312 PATRPVRPP-GGARDPGTPRpgqpterpagVPEAAsDAGQPPSAYPPAEEAVPGKP-LEQGAPRPGSSGGDGAPFQPPNG 389
                       170       180
                ....*....|....*....|....*..
gi 8176554  708 D----SGAPPVPPTGDSEAAPVPPTDD 730
Cdd:COG5180 390 ApqpgLGRRGAPGPPMGAGDLVQAALD 416
PRK11633 PRK11633
cell division protein DedD; Provisional
602-725 2.05e-06

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 49.62  E-value: 2.05e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   602 PPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvppTGDSGAPPVPPTgDSGAPPVPPtgDAGPPPVPPTgdsGPPPV 681
Cdd:PRK11633  42 PLVPKPGDRDEPDMMPAATQALPTQPPEGAAEAVR---AGDAAAPSLDPA-TVAPPNTPV--EPEPAPVEPP---KPKPV 112
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....
gi 8176554   682 PPTgdsgAPPVTPTGDSETAPVPPtgdsgaPPVPPTGDSEAAPV 725
Cdd:PRK11633 113 EKP----KPKPKPQQKVEAPPAPK------PEPKPVVEEKAAPT 146
PHA03264 PHA03264
envelope glycoprotein D; Provisional
604-706 2.51e-06

envelope glycoprotein D; Provisional


Pssm-ID: 223029 [Multi-domain]  Cd Length: 416  Bit Score: 50.39  E-value: 2.51e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   604 VPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPP--PVPPTGDSGPPPV 681
Cdd:PHA03264 254 VPPYFEESKGYEPPPAPSGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGgePKPGPPRPAPDAD 333
                         90       100
                 ....*....|....*....|....*
gi 8176554   682 PPTGDSGAPPVTPTGDSETAPVPPT 706
Cdd:PHA03264 334 RPEGWPSLEAITFPPPTPATPAVPR 358
MISS pfam15822
MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic ...
561-715 2.63e-06

MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic MAPK-interacting and spindle-stabilising protein-like proteins. MISS is rich in prolines and has four potential MAPK-phosphorylation sites, a MAPK-docking site, a PEST sequence (PEST motif), and a bipartite nuclear localization signal. The endogenous protein accumulates during mouse meiotic maturation and is found as discrete dots on the MII spindle. MISS is the first example of a physiological MAPK substrate that is stabilized in MII and that specifically regulates MII spindle integrity during CSF arrest.


Pssm-ID: 318115 [Multi-domain]  Cd Length: 238  Bit Score: 49.21  E-value: 2.63e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    561 PPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVP----------PTGDSGAPPVPPTG 630
Cdd:pfam15822  63 APTGMYPSIPLTGPSPGPPAPFPPSGPSCPPPGGPYPAPTVPGPGPIGPYPTPNMPfpelprpygaPTDPAAAAPSGPWG 142
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    631 DSGAPPVPPT--GDSGAP--PVPPTGDSGAPPvPPTGDAGPPPVP----PTGDSGPPP--VPPTGDSGAPPVTPTGDSET 700
Cdd:pfam15822 143 SMSSGPWAPGmgGQYPAPnmPYPSPGPYPAVP-PPQSPGAAPPVPwgtvPPGPWGPPApyPDPTGSYPMPGLYPTPNNPF 221
                         170
                  ....*....|....*
gi 8176554    701 ApvPPTGDSGAPPVP 715
Cdd:pfam15822 222 Q--VPSGPSGAPPMP 234
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
468-717 3.19e-06

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 50.34  E-value: 3.19e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    468 ATPTGYRPQDRTVFKAMIAYWTNFAKTGDPNMGDSAVPTHWEPYT-TENSGYLEITKKMGSS-----SMKRSLRTNFLRY 541
Cdd:pfam17823 166 SAPHAASPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARgISTAATATGHPAAGTAlaavgNSSPAAGTVTAAV 245
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    542 WTLTYLALPTVTDQEATpVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPV-------PPTGDSGAPP 614
Cdd:pfam17823 246 GTVTPAALATLAAAAGT-VASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQAQGPIiqvstdqPVHNTAGEPT 324
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    615 VPPTGDSGAPPVPPTGDSGAPPVPPT-----GDSGAPPVPPTGDSGAPPVPPTG-DAGPPPVPPTGDSGPPPVPPTGDSG 688
Cdd:pfam17823 325 PSPSNTTLEPNTPKSVASTNLAVVTTtkaqaKEPSASPVPVLHTSMIPEVEATSpTTQPSPLLPTQGAAGPGILLAPEQV 404
                         250       260
                  ....*....|....*....|....*....
gi 8176554    689 APPVTPTGDSeTAPVPPTgdSGAPPVPPT 717
Cdd:pfam17823 405 ATEATAGTAS-AGPTPRS--SGDPKTLAM 430
motB PRK12799
flagellar motor protein MotB; Reviewed
597-721 3.43e-06

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 50.10  E-value: 3.43e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   597 GDSGAPPV---PPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPT 673
Cdd:PRK12799 294 DTHGTVPVaavTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVN 373
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*...
gi 8176554   674 GDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGDSE 721
Cdd:PRK12799 374 MQPQPMSTTETQQSSTGNITSTANGPTTSLPAAPASNIPVSPTSRDAQ 421
PHA03264 PHA03264
envelope glycoprotein D; Provisional
615-717 3.92e-06

envelope glycoprotein D; Provisional


Pssm-ID: 223029 [Multi-domain]  Cd Length: 416  Bit Score: 50.00  E-value: 3.92e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   615 VPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPP--PVPPTGDSGAPPV 692
Cdd:PHA03264 254 VPPYFEESKGYEPPPAPSGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGgePKPGPPRPAPDAD 333
                         90       100
                 ....*....|....*....|....*
gi 8176554   693 TPTGDSETAPVPPTGDSGAPPVPPT 717
Cdd:PHA03264 334 RPEGWPSLEAITFPPPTPATPAVPR 358
PRK10263 PRK10263
DNA translocase FtsK; Provisional
550-738 4.41e-06

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 50.47  E-value: 4.41e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    550 PTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPptgdSGAPPVPPTGDSGAP-------PVPPTGDSGAP--------- 613
Cdd:PRK10263  344 PPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAP----EGYPQQSQYAQPAVQyneplqqPVQPQQPYYAPaaeqpaqqp 419
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    614 -----PVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVP---PTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTG 685
Cdd:PRK10263  420 yyapaPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQStyqTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEET 499
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 8176554    686 DSGAPPV-----------------------TPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMPA 738
Cdd:PRK10263  500 KPARPPLyyfeeveekrarereqlaawyqpIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASGVKKATLAT 575
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
557-675 5.62e-06

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 49.71  E-value: 5.62e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   557 ATPVPPT-GDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAP 635
Cdd:PRK14951 379 KTPARPEaAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPE 458
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|..
gi 8176554   636 PV--PPTGDSGAPPVPPTGDSGAPPVPPtgdagPPPVPPTGD 675
Cdd:PRK14951 459 TVaiPVRVAPEPAVASAAPAPAAAPAAA-----RLTPTEEGD 495
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
611-723 6.11e-06

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 49.70  E-value: 6.11e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   611 GAPPVPP--TGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPtgdSGAPPVpptGDAGPPPVPPTGDSGPPPVPPTGDSG 688
Cdd:PLN02217 554 GVPYIPGlfAGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAP---STSPPA---GHLGSPPATPSKIVSPSTSPPASHLG 627
                         90       100       110
                 ....*....|....*....|....*....|....*
gi 8176554   689 APPVTPtgdseTAPVPPTGDSGAPPVPPTGDSEAA 723
Cdd:PLN02217 628 SPSTTP-----SSPESSIKVASTETASPESSIKVA 657
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
639-739 6.45e-06

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 49.71  E-value: 6.45e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   639 PTGDSGAPPVPPTgDSGAPPVPPTGDAGPPPVPPTGdSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTG 718
Cdd:PRK14951 366 PAAAAEAAAPAEK-KTPARPEAAAPAAAPVAQAAAA-PAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPA 443
                         90       100
                 ....*....|....*....|.
gi 8176554   719 DSEAAPVPPTDDSKEAQMPAV 739
Cdd:PRK14951 444 AVALAPAPPAQAAPETVAIPV 464
PRK11633 PRK11633
cell division protein DedD; Provisional
591-703 6.48e-06

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 48.08  E-value: 6.48e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   591 PPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAppvPPTGDSGAPPVPPTgDSGAPPVPPtgdsgAPPVPPTGDAGPPPV 670
Cdd:PRK11633  42 PLVPKPGDRDEPDMMPAATQALPTQPPEGAAEA---VRAGDAAAPSLDPA-TVAPPNTPV-----EPEPAPVEPPKPKPV 112
                         90       100       110
                 ....*....|....*....|....*....|....
gi 8176554   671 P-PTGDSGPPPVPPTGDSGAPPVTPTGDSETAPV 703
Cdd:PRK11633 113 EkPKPKPKPQQKVEAPPAPKPEPKPVVEEKAAPT 146
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
572-684 6.80e-06

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 49.68  E-value: 6.80e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   572 PPTGDSetAPVPPTGDSGAPPVPPT-GDSGAPPVPP-TGDSGAPPVPPTGDSGAPPVPPtgdSGAPPVPPTgdSGAPPVP 649
Cdd:PRK14959 380 APSGSA--AEGPASGGAATIPTPGTqGPQGTAPAAGmTPSSAAPATPAPSAAPSPRVPW---DDAPPAPPR--SGIPPRP 452
                         90       100       110
                 ....*....|....*....|....*....|....*..
gi 8176554   650 PTGDSGAPPVP--PTGDAGPPPVPPTGDSGPPPVPPT 684
Cdd:PRK14959 453 APRMPEASPVPgaPDSVASASDAPPTLGDPSDTAEHT 489
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
612-727 7.01e-06

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 46.95  E-value: 7.01e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    612 APPVPPTGdsGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvpPTGDAGPPPVPPTGDsgPPPVPPTGDSGAPP 691
Cdd:pfam15240  45 GPQGPPPG--GFPPQPPASDDPPGPPPPGGPQQPPPQGGKQKPQGPP--PQGGPRPPPGKPQGP--PPQGGNQQQGPPPP 118
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 8176554    692 VTPTG--DSETAPVPPTGDSGAPPVPPTGDSEAAPVPP 727
Cdd:pfam15240 119 GKPQGppPQGGGPPPQGGNQQGPPPPPPGNPQGPPQRP 156
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
488-678 7.01e-06

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 49.32  E-value: 7.01e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   488 WTNFAKTGDPN--MGDSAVPTHWEPYTTE----NSGYLEITKK-MGSSSMKRslrtnflrywtLTYLALPTVTDQEATPV 560
Cdd:PLN02217 470 WKEYSRTIIMNtfIPDFVPPEGWQPWLGDfglnTLFYSEVQNTgPGAAITKR-----------VTWPGIKKLSDEEILKF 538
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   561 PPT----GDseaTPVPPTGdsetAPVPP---TGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDsgappvPPTGDSG 633
Cdd:PLN02217 539 TPAqyiqGD---AWIPGKG----VPYIPglfAGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTS------PPAGHLG 605
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*
gi 8176554   634 APPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGP 678
Cdd:PLN02217 606 SPPATPSKIVSPSTSPPASHLGSPSTTPSSPESSIKVASTETASP 650
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
622-705 7.14e-06

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 49.23  E-value: 7.14e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   622 GAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPvtPTGDSETA 701
Cdd:NF041121  16 GRAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAP--GAALPVRV 93

                 ....
gi 8176554   702 PVPP 705
Cdd:NF041121  94 PAPP 97
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
645-726 7.23e-06

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 49.42  E-value: 7.23e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   645 APPVPPTGDSGAPPVPP-----TGDAGPPPVPPTgdSGPPPVPPTGDSgAPPVTPTGDSETAPVPPTgdsgaPPVPPTGD 719
Cdd:PRK14950 361 VPVPAPQPAKPTAAAPSpvrptPAPSTRPKAAAA--ANIPPKEPVRET-ATPPPVPPRPVAPPVPHT-----PESAPKLT 432

                 ....*..
gi 8176554   720 SEAAPVP 726
Cdd:PRK14950 433 RAAIPVD 439
PHA03264 PHA03264
envelope glycoprotein D; Provisional
593-696 8.14e-06

envelope glycoprotein D; Provisional


Pssm-ID: 223029 [Multi-domain]  Cd Length: 416  Bit Score: 48.85  E-value: 8.14e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   593 VPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAP--PVPPTGDAGPPPV 670
Cdd:PHA03264 254 VPPYFEESKGYEPPPAPSGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGgePKPGPPRPAPDAD 333
                         90       100       110
                 ....*....|....*....|....*....|..
gi 8176554   671 PPTG------DSGPPPVPPTgdSGAPPVTPTG 696
Cdd:PHA03264 334 RPEGwpsleaITFPPPTPAT--PAVPRARPVI 363
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
557-667 8.65e-06

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 49.29  E-value: 8.65e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   557 ATPVPPTGDSEATPVP----PTGDSETAPVPPTgdSGAPPVPPTGDSGAPPVPPtgdSGAPPVPPTgdSGAPPVPPTGDS 632
Cdd:PRK14959 385 AAEGPASGGAATIPTPgtqgPQGTAPAAGMTPS--SAAPATPAPSAAPSPRVPW---DDAPPAPPR--SGIPPRPAPRMP 457
                         90       100       110
                 ....*....|....*....|....*....|....*..
gi 8176554   633 GAPPVP--PTGDSGAPPVPPTGDSGAPPVPPTgDAGP 667
Cdd:PRK14959 458 EASPVPgaPDSVASASDAPPTLGDPSDTAEHT-PSGP 493
PHA03418 PHA03418
hypothetical E4 protein; Provisional
581-716 9.90e-06

hypothetical E4 protein; Provisional


Pssm-ID: 177646 [Multi-domain]  Cd Length: 230  Bit Score: 47.43  E-value: 9.90e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   581 PVPPTGDSGAPPVPPTGDSGAPPVPPTgdSGAPPVPPTGDSgAPPVPPT----GDSGAPPVPPTGDSGAPPVPPTGDSGA 656
Cdd:PHA03418  34 PLLPAPHHPNPQEDPDKNPSPPPDPPL--TPRPPAQPNGHN-KPPVTKQpggeGTEEDHQAPLAADADDDPRPGKRSKAD 110
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   657 PPVPPTGDAGPPPV-------PPTGDSGPPPvPPTGDSGAPPvtPTGDSETAPVPPTGD---SGAPPVPP 716
Cdd:PHA03418 111 EHGPAPGRAALAPFkldldqdPLHGDPDPPP-GATGGQGEEP--PEGGEESQPPLGEGEgavEGHPPPLP 177
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
580-706 9.97e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 49.00  E-value: 9.97e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   580 APVPPTGDSGAPPVPPTgdsgAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPP--TGDSGAPPVPPTGDSGAP 657
Cdd:PRK14971 360 AQLTQKGDDASGGRGPK----QHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQsaTQPAGTPPTVSVDPPAAV 435
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*....
gi 8176554   658 PVPPTGdAGPPPVPPTGDSGPPPVPPTG-DSGAPPVT---------PTGDSETAPVPPT 706
Cdd:PRK14971 436 PVNPPS-TAPQAVRPAQFKEEKKIPVSKvSSLGPSTLrpiqekaeqATGNIKEAPTGTQ 493
PTZ00429 PTZ00429
beta-adaptin; Provisional
563-662 1.31e-05

beta-adaptin; Provisional


Pssm-ID: 240415 [Multi-domain]  Cd Length: 746  Bit Score: 48.78  E-value: 1.31e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   563 TGDSEATPVPPTgdsetapvPPTGDSGAPPVPPTGDSGAPPVPPTGD-SGAPPVPPTGDSGAPPVPPTGD-SGAPPVPPT 640
Cdd:PTZ00429 613 TEDDDAVELPST--------PSMGTQDGSPAPSAAPAGYDIFEFAGDgTGAPHPVASGSNGAQHADPLGDlFSGLPSTVG 684
                         90       100
                 ....*....|....*....|..
gi 8176554   641 GDSGAPPVPPtgDSGAPPVPPT 662
Cdd:PTZ00429 685 ASSPAFQAAS--GSQAPASPPT 704
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
554-717 1.45e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 48.32  E-value: 1.45e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   554 DQEATPVPPTGDSEATPVPptgdseTAPVPPTgdsgAPPVPPTGDSGAPPVPPtgdSGAPPVPPTGDSGAPP--VPPTGD 631
Cdd:PRK07994 369 EVPPQSAAPAASAQATAAP------TAAVAPP----QAPAVPPPPASAPQQAP---AVPLPETTSQLLAARQqlQRAQGA 435
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   632 SGAPPVPPTGDSGAPPVPPTGDSGApPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGA 711
Cdd:PRK07994 436 TKAKKSEPAAASRARPVNSALERLA-SVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHEKTPELAA 514

                 ....*.
gi 8176554   712 PPVPPT 717
Cdd:PRK07994 515 KLAAEA 520
PHA03264 PHA03264
envelope glycoprotein D; Provisional
571-685 1.50e-05

envelope glycoprotein D; Provisional


Pssm-ID: 223029 [Multi-domain]  Cd Length: 416  Bit Score: 48.08  E-value: 1.50e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   571 VPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAP--PVPPTGDSGAPPV 648
Cdd:PHA03264 254 VPPYFEESKGYEPPPAPSGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGgePKPGPPRPAPDAD 333
                         90       100       110
                 ....*....|....*....|....*....|....*..
gi 8176554   649 PPTGDSGAPPVPptgdaGPPPVPPTgdSGPPPVPPTG 685
Cdd:PHA03264 334 RPEGWPSLEAIT-----FPPPTPAT--PAVPRARPVI 363
PRK11901 PRK11901
hypothetical protein; Reviewed
563-711 1.57e-05

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 47.76  E-value: 1.57e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   563 TGDSEATPVPPTGDSETAPVPPTGDSGAPpvpptGDSGAPPVPPTGDSGAPPVPPTGD--------------------SG 622
Cdd:PRK11901  88 SSGNQSSPSAANNTSDGHDASGVKNTAPP-----QDISAPPISPTPTQAAPPQTPNGQqrielpgnisdalsqqqgqvNA 162
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   623 APPVPPTGDSGAPPVPPTgdsgAPPVPPTGDSGAPPVPPTgdagPPPVPPTgdSGPPPVPPTgdsgAPPVTPTGDSETAP 702
Cdd:PRK11901 163 ASQNAQGNTSTLPTAPAT----VAPSKGAKVPATAETHPT----PPQKPAT--KKPAVNHHK----TATVAVPPATSGKP 228

                 ....*....
gi 8176554   703 VPPTGDSGA 711
Cdd:PRK11901 229 KSGAASARA 237
FAP pfam07174
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ...
645-716 1.71e-05

Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 residues long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.


Pssm-ID: 429334  Cd Length: 301  Bit Score: 47.23  E-value: 1.71e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 8176554    645 APPVPPTgdSGAPPVPPTgdagPPPVPPTgdsgPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPP 716
Cdd:pfam07174  44 APPPPST--ATAPPAPPP----PPPAPAA----PAPPPPPAAPNAPNAPPPPADPNAPPPPPADPNAPPPPA 105
Gag_spuma pfam03276
Spumavirus gag protein;
560-696 1.73e-05

Spumavirus gag protein;


Pssm-ID: 460872 [Multi-domain]  Cd Length: 614  Bit Score: 48.20  E-value: 1.73e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    560 VPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGappvPPTGDSGAPPVPPTgdSGAPPVPP 639
Cdd:pfam03276 186 IPPGASFSGLPSLPAIGGIHLPAIPGIHARAPPGNIARSLGDDIMPSLGDAG----MPQPRFAFHPGNPF--AEAEGHPF 259
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 8176554    640 TGDSG----APPVPPTGDSGAPPVPPTGDAGPPPVPPTgdSGPPPVPPTGDS--GAPPVTPTG 696
Cdd:pfam03276 260 AEAEGerprDIPRAPRIDAPSAPAIPAIQPIAPPMIPP--IGAPIPIPHGASipGEHIRNPRE 320
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
587-688 1.76e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 48.02  E-value: 1.76e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   587 DSGAPPVPPTGdsgappvPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvpptgdSGAPPVPPtgdag 666
Cdd:PRK14954 377 DGGVAPSPAGS-------PDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTP------EQQPPVAR----- 438
                         90       100
                 ....*....|....*....|..
gi 8176554   667 PPPVPPTGDSGPPPVPPTGDSG 688
Cdd:PRK14954 439 SAPLPPSPQASAPRNVASGKPG 460
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
573-727 1.99e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 47.94  E-value: 1.99e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   573 PTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVP-PTGDSGAPPVPPtgdSGAPPVPPTGDSGAPP--VPPTGDSGAPPVP 649
Cdd:PRK07994 366 PEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAvPPPPASAPQQAP---AVPLPETTSQLLAARQqlQRAQGATKAKKSE 442
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 8176554   650 PTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPtgdsGAPPVTPtgdseTAPVPPTGDSGAPPVPPTGDSEAAPVPP 727
Cdd:PRK07994 443 PAAASRARPVNSALERLASVRPAPSALEKAPAKK----EAYRWKA-----TNPVEVKKEPVATPKALKKALEHEKTPE 511
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
547-653 2.04e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 47.79  E-value: 2.04e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   547 LALPTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGA----PPVPPTGDSG 622
Cdd:PRK14951 381 PARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVAlapaPPAQAAPETV 460
                         90       100       110
                 ....*....|....*....|....*....|....*
gi 8176554   623 APPV----PPTGDSGAPPVPPTGDSGAPPVPPTGD 653
Cdd:PRK14951 461 AIPVrvapEPAVASAAPAPAAAPAAARLTPTEEGD 495
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
600-702 2.08e-05

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 47.78  E-value: 2.08e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   600 GAPPVPP--TGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPtgdSGAPPVpptGDSGAPPVPPTGDAGPPPVPPTGDSG 677
Cdd:PLN02217 554 GVPYIPGlfAGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAP---STSPPA---GHLGSPPATPSKIVSPSTSPPASHLG 627
                         90       100
                 ....*....|....*....|....*
gi 8176554   678 PPPVPPTgdSGAPPVTPTGDSETAP 702
Cdd:PLN02217 628 SPSTTPS--SPESSIKVASTETASP 650
PRK12727 PRK12727
flagellar biosynthesis protein FlhF;
562-704 2.08e-05

flagellar biosynthesis protein FlhF;


Pssm-ID: 237182 [Multi-domain]  Cd Length: 559  Bit Score: 47.68  E-value: 2.08e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   562 PTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTG 641
Cdd:PRK12727 117 PVSVPRQAPAAAPVRAASIPSPAAQALAHAAAVRTAPRQEHALSAVPEQLFADFLTTAPVPRAPVQAPVVAAPAPVPAIA 196
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 8176554   642 DSGAPPVPPTGDSGAPPVPP--TGDAGPPPVPPtgdsgPPPVPPTGDSGAPPVTPTGDSETAPVP 704
Cdd:PRK12727 197 AALAAHAAYAQDDDEQLDDDgfDLDDALPQILP-----PAALPPIVVAPAAPAALAAVAAAAPAP 256
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
644-731 2.10e-05

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 47.78  E-value: 2.10e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   644 GAPPVPP--TGDSGAPPVPPTGDAGPPPVPPTGDSG-----PPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPP 716
Cdd:PLN02217 554 GVPYIPGlfAGNPGSTNSTPTGSAASSNTTFSSDSPstvvaPSTSPPAGHLGSPPATPSKIVSPSTSPPASHLGSPSTTP 633
                         90
                 ....*....|....*
gi 8176554   717 TGDSEAAPVPPTDDS 731
Cdd:PLN02217 634 SSPESSIKVASTETA 648
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
588-729 2.17e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 48.24  E-value: 2.17e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    588 SGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSG----APPVPPTG 663
Cdd:PHA03307  762 SLVPAKLAEALALLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGsessGPARPPGA 841
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 8176554    664 DAGPPPVPPTGDSGPPPVPPTGDSGAPPvtptGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTD 729
Cdd:PHA03307  842 AARPPPARSSESSKSKPAAAGGRARGKN----GRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAP 903
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
589-701 2.29e-05

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 47.78  E-value: 2.29e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   589 GAPPVPP--TGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDsgappvPPTGDSGAPPVPPTGDSGAPPVPPTGDAG 666
Cdd:PLN02217 554 GVPYIPGlfAGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTS------PPAGHLGSPPATPSKIVSPSTSPPASHLG 627
                         90       100       110
                 ....*....|....*....|....*....|....*
gi 8176554   667 PPPVPPTGdsgppPVPPTGDSGAPPVTPTGDSETA 701
Cdd:PLN02217 628 SPSTTPSS-----PESSIKVASTETASPESSIKVA 657
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
553-645 2.60e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 47.50  E-value: 2.60e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   553 TDQEATPVPPTGDSEATPVPPTGDSeTAPVPPTGDSgAPPVPPTGDSGAPPVPPTgdsgaPPVPPTGDSGAPPVPPTGDS 632
Cdd:PRK14950 372 TAAAPSPVRPTPAPSTRPKAAAAAN-IPPKEPVRET-ATPPPVPPRPVAPPVPHT-----PESAPKLTRAAIPVDEKPKY 444
                         90
                 ....*....|...
gi 8176554   633 GAPPVPPTGDSGA 645
Cdd:PRK14950 445 TPPAPPKEEEKAL 457
SepH NF040712
septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces ...
548-660 2.75e-05

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 47.07  E-value: 2.75e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   548 ALPTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvPPTGDSGAPPVP 627
Cdd:NF040712 226 APATDSDPAEAGTPDDLASARRRRAGVEQPEDEPVGPGAAPAAEPDEATRDAGEPPAPGAAETPEAA-EPPAPAPAAPAA 304
                         90       100       110
                 ....*....|....*....|....*....|...
gi 8176554   628 PTGDSGAPPVPPtgdSGAPPVPPTGDSGAPPVP 660
Cdd:NF040712 305 PAAPEAEEPARP---EPPPAPKPKRRRRRASVP 334
FAP pfam07174
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ...
634-712 3.02e-05

Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 residues long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.


Pssm-ID: 429334  Cd Length: 301  Bit Score: 46.46  E-value: 3.02e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 8176554    634 APPVPPTgdSGAPPVPPTgdsgAPPVPPTGDAGPPPVPPtgdSGPPPVPPTGDSGAPPVTPTgdSETAPVPPTGDSGAP 712
Cdd:pfam07174  44 APPPPST--ATAPPAPPP----PPPAPAAPAPPPPPAAP---NAPNAPPPPADPNAPPPPPA--DPNAPPPPAVDPNAP 111
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
589-682 3.11e-05

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1), whose C-terminal region suggests restriction enzyme activity (PMID:18456708), and by other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 47.30  E-value: 3.11e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   589 GAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvpptGDAGPP 668
Cdd:NF041121  16 GRAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAP----GAALPV 91
                         90
                 ....*....|....
gi 8176554   669 PVPptgdsGPPPVP 682
Cdd:NF041121  92 RVP-----APPALP 100
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
559-740 3.30e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.45  E-value: 3.30e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    559 PVPPTGDSEATPVPPtGDSETAPVPPTGDSGAPPVPPTGDSGAPP----VPPTGDS----GAPPVPPTGDSGAP------ 624
Cdd:pfam03154 297 PFPLTPQSSQSQVPP-GPSPAAPGQSQQRIHTPPSQSQLQSQQPPreqpLPPAPLSmphiKPPPTTPIPQLPNPqshkhp 375
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    625 -----PVPPTGDSGAPPVPPTGD----------SGAPP----VPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTG 685
Cdd:pfam03154 376 phlsgPSPFQMNSNLPPPPALKPlsslsthhppSAHPPplqlMPQSQQLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQ 455
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 8176554    686 DSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMPAVI 740
Cdd:pfam03154 456 VPSQSPFPQHPFVPGGPPPITPPSGPPTSTSSAMPGIQPPSSASVSSSGPVPAAV 510
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
549-740 3.33e-05

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 47.23  E-value: 3.33e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   549 LPTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSgapPVPPTGDSGAPPVPPTG--DSGAPPVP-PTGDSGAPP 625
Cdd:PLN03209 320 LAKIPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEE---PPQPKAVVPRPLSPYTAyeDLKPPTSPiPTPPSSSPA 396
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   626 VPPTGDSGAPPVPPTGDSgAPPVPPTGDSGAPPVPPTGDAGP--PPV------PPTGdsgPPPVPPTGDSGAPPVTP--T 695
Cdd:PLN03209 397 SSKSVDAVAKPAEPDVVP-SPGSASNVPEVEPAQVEAKKTRPlsPYAryedlkPPTS---PSPTAPTGVSPSVSSTSsvP 472
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*
gi 8176554   696 GDSETAPVPPTGDSGAPpvPPTGDSEAAPVPPTDDSKEAQMPAVI 740
Cdd:PLN03209 473 AVPDTAPATAATDAAAP--PPANMRPLSPYAVYDDLKPPTSPSPA 515
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
561-726 3.36e-05

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 47.23  E-value: 3.36e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   561 PPTgdsEATPVPPTGdseTAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPV-------PPTGDSG 633
Cdd:PLN03209 382 PPT---SPIPTPPSS---SPASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYaryedlkPPTSPSP 455
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   634 APPVP-PTGDSGAPPVPPTGDSgAPPVPPTGDAGPPPVPPTGDSGPPPV----PPTGDSGAPPVTPTGDSETAPVPPTGD 708
Cdd:PLN03209 456 TAPTGvSPSVSSTSSVPAVPDT-APATAATDAAAPPPANMRPLSPYAVYddlkPPTSPSPAAPVGKVAPSSTNEVVKVGN 534
                        170
                 ....*....|....*...
gi 8176554   709 SGAPPVPPTGDSEAAPVP 726
Cdd:PLN03209 535 SAPPTALADEQHHAQPKP 552
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
553-722 3.58e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.47  E-value: 3.58e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    553 TDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSG----APPVPPTGDSGAPPVPPTGDSGAPPVPP 628
Cdd:PHA03307  782 RGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGsessGPARPPGAAARPPPARSSESSKSKPAAA 861
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    629 TGDSGAPPvpptGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTgdSGPPPVPPTGDSGAPPVTPTGDSETAPVPPtGD 708
Cdd:PHA03307  862 GGRARGKN----GRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPA--PRPRPAPRVKLGPMPPGGPDPRGGFRRVPP-GD 934
                         170
                  ....*....|....
gi 8176554    709 SgAPPVPPTGDSEA 722
Cdd:PHA03307  935 L-HTPAPSAAALAA 947
PRK11901 PRK11901
hypothetical protein; Reviewed
585-724 3.72e-05

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 46.60  E-value: 3.72e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   585 TGDSGAPPVPPTGDSGAPPVPPTGDSGAPpvpptGDSGAPPVPPTGDSGAPPVPPTGD--------------------SG 644
Cdd:PRK11901  88 SSGNQSSPSAANNTSDGHDASGVKNTAPP-----QDISAPPISPTPTQAAPPQTPNGQqrielpgnisdalsqqqgqvNA 162
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   645 APPVPPTGDSGAPPVPPTgDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTG-DSGAPPVPPTGDSEAA 723
Cdd:PRK11901 163 ASQNAQGNTSTLPTAPAT-VAPSKGAKVPATAETHPTPPQKPATKKPAVNHHKTATVAVPPATsGKPKSGAASARALSSA 241

                 .
gi 8176554   724 P 724
Cdd:PRK11901 242 P 242
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
556-683 3.78e-05

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 44.64  E-value: 3.78e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    556 EATPVPPTGdsEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPP-----VPPTGDSGAPPvPPTGDSGAPPVPPTG 630
Cdd:pfam15240  44 QGPQGPPPG--GFPPQPPASDDPPGPPPPGGPQQPPPQGGKQKPQGPPpqggpRPPPGKPQGPP-PQGGNQQQGPPPPGK 120
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....
gi 8176554    631 DSGAPPvpptgdSGAPPVPPTGDSGAPPVPPTGDA-GPPPVPPTGdsGPPPVPP 683
Cdd:pfam15240 121 PQGPPP------QGGGPPPQGGNQQGPPPPPPGNPqGPPQRPPQP--GNPQGPP 166
half-pint TIGR01645
poly-U binding splicing factor, half-pint family; The proteins represented by this model ...
575-738 3.84e-05

poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. PUF60 (GP|6176532), in complex with p54 and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.


Pssm-ID: 130706 [Multi-domain]  Cd Length: 612  Bit Score: 46.99  E-value: 3.84e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    575 GDSETAPVP---PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPtGDSGAPPVPPTG----DSGAPPVPPTGDSGAPP 647
Cdd:TIGR01645 279 GKCVTPPDAllqPATVSAIPAAAAVAAAAATAKIMAAEAVAGAAVL-GPRAQSPATPSSslptDIGNKAVVSSAKKEAEE 357
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    648 VPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGdsgapPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPP 727
Cdd:TIGR01645 358 VPPLPQAAPAVVKPGPMEIPTPVPPPGLAIPSLVAPPG-----LVAPTEINPSFLASPRKKMKREKLPVTFGALDDTLAW 432
                         170
                  ....*....|.
gi 8176554    728 TDDSKEAQMPA 738
Cdd:TIGR01645 433 KEPSKEDQTSE 443
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
600-695 3.85e-05

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1), whose C-terminal region suggests restriction enzyme activity (PMID:18456708), and by other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 46.92  E-value: 3.85e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   600 GAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPvpptGDSGPP 679
Cdd:NF041121  16 GRAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAP----GAALPV 91
                         90
                 ....*....|....*.
gi 8176554   680 PVPptgdsgAPPVTPT 695
Cdd:NF041121  92 RVP------APPALPN 101
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
587-738 4.04e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.09  E-value: 4.04e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    587 DSGAPPVPPTGDSGAPPvPPTGDSGAPPVppTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvpptgdSGAPPVPPTGDAG 666
Cdd:PHA03307   15 AEGGEFFPRPPATPGDA-ADDLLSGSQGQ--LVSDSAELAAVTVVAGAAACDRFEPPTGPP------PGPGTEAPANESR 85
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 8176554    667 PPPVPPTGDSGPPPVPPTGDSGAPpvtptGDSETAPVPPTGDSGAP-PVPPTGDSEAAPVPPTDDSKEAQMPA 738
Cdd:PHA03307   86 STPTWSLSTLAPASPAREGSPTPP-----GPSSPDPPPPTPPPASPpPSPAPDLSEMLRPVGSPGPPPAASPP 153
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
604-694 4.32e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 47.19  E-value: 4.32e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    604 VPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTgdsgappvpPTGDAGPPPVPPTGDSGPPPVPP 683
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPK---------PAAAAAAAAAPAAPPAAAAAAAP 108
                          90
                  ....*....|.
gi 8176554    684 TGDSGAPPVTP 694
Cdd:PRK12270  109 AAAAVEDEVTP 119
FAP pfam07174
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ...
612-688 4.46e-05

Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 residues long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.


Pssm-ID: 429334  Cd Length: 301  Bit Score: 46.07  E-value: 4.46e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 8176554    612 APPVPPTGDSGAPPVPPTGDSGAP-PVPPTGDSGAPPVPPtgdsgaPPVPPTGdAGPPPVPPTGDSGPPPVPPTGDSG 688
Cdd:pfam07174  44 APPPPSTATAPPAPPPPPPAPAAPaPPPPPAAPNAPNAPP------PPADPNA-PPPPPADPNAPPPPAVDPNAPEPG 114
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
550-695 4.82e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 47.09  E-value: 4.82e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    550 PTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSG----APPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPP 625
Cdd:PHA03307  790 VRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGsessGPARPPGAAARPPPARSSESSKSKPAAAGGRARGKN 869
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 8176554    626 vpptGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDS---GPPPV-----PPTGDSGAPPVTPT 695
Cdd:PHA03307  870 ----GRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKLGPMppgGPDPRggfrrVPPGDLHTPAPSAA 943
SOBP pfam15279
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ...
572-705 5.73e-05

Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteristic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. The family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus, mutations were found in this protein.


Pssm-ID: 464609 [Multi-domain]  Cd Length: 325  Bit Score: 45.96  E-value: 5.73e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    572 PPTGDSETAPVPPTGDSGAPPVPPTgdSGAPPVPPTGDSGAPPVPPTgdSGAPPVPPtGDSGAPPV----PPTGDSGAPP 647
Cdd:pfam15279 176 KPQQHPPPSPLPAFMEPSSMPPPFL--RPPPSIPQPNSPLSNPMLPG--IGPPPKPP-RNLGPPSNpmhrPPFSPHHPPP 250
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 8176554    648 VPPtgdsgaPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPP 705
Cdd:pfam15279 251 PPT------PPGPPPGLPPPPPRGFTPPFGPPFPPVNMMPNPPEMNFGLPSLAPLVPP 302
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
620-702 5.82e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 46.81  E-value: 5.82e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    620 DSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSE 699
Cdd:PRK12270   37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDE 116

                  ...
gi 8176554    700 TAP 702
Cdd:PRK12270  117 VTP 119
PHA03378 PHA03378
EBNA-3B; Provisional
536-678 6.09e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 46.60  E-value: 6.09e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   536 TNFLRYWTLTYLALPTVTDQEATP--VPPTG----DSEATPVPPTGDSETAPVPPTGDSGAPPvPPTGDSGAPPVPPTGD 609
Cdd:PHA03378 683 TMLPIQWAPGTMQPPPRAPTPMRPpaAPPGRaqrpAAATGRARPPAAAPGRARPPAAAPGRAR-PPAAAPGRARPPAAAP 761
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 8176554   610 SGAPPvpPTGDSGAP-PVPPtgdSGAPPVPPTGDSGAP-PVPPtgdsgaPPVPPTGDAGPPPVPPtGDSGP 678
Cdd:PHA03378 762 GRARP--PAAAPGAPtPQPP---PQAPPAPQQRPRGAPtPQPP------PQAGPTSMQLMPRAAP-GQQGP 820
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
653-735 6.39e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 46.42  E-value: 6.39e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    653 DSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTgdsETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSK 732
Cdd:PRK12270   37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAA---PAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAV 113

                  ...
gi 8176554    733 EAQ 735
Cdd:PRK12270  114 EDE 116
PRK12727 PRK12727
flagellar biosynthesis protein FlhF;
573-736 7.49e-05

flagellar biosynthesis protein FlhF;


Pssm-ID: 237182 [Multi-domain]  Cd Length: 559  Bit Score: 46.14  E-value: 7.49e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   573 PTGDSETAPVPPTGDSGAPPVPPTGDSgAPPVpptgdsgAPPVPPTGDSGAPPVPPTGDSGAPPVPPTgdsGAPPVPPTG 652
Cdd:PRK12727 117 PVSVPRQAPAAAPVRAASIPSPAAQAL-AHAA-------AVRTAPRQEHALSAVPEQLFADFLTTAPV---PRAPVQAPV 185
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   653 DSGAPPVPPTGDAGPPPVPPTGDSGPPPVPP--TGDSGAPPVTPtgdseTAPVPPTgdSGAPPVPPTGDSEAAPVP-PTD 729
Cdd:PRK12727 186 VAAPAPVPAIAAALAAHAAYAQDDDEQLDDDgfDLDDALPQILP-----PAALPPI--VVAPAAPAALAAVAAAAPaPQN 258

                 ....*..
gi 8176554   730 DSKEAQM 736
Cdd:PRK12727 259 DEELKQL 265
Gag_spuma pfam03276
Spumavirus gag protein;
606-727 7.57e-05

Spumavirus gag protein;


Pssm-ID: 460872 [Multi-domain]  Cd Length: 614  Bit Score: 46.28  E-value: 7.57e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    606 PTGDSGAPPVppTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvPPTGDAGPPPVPPTGDSGPPPV---- 681
Cdd:pfam03276 180 PGAQGGIPPG--ASFSGLPSLPAIGGIHLPAIPGIHARAPPGNIARSLGDDIM-PSLGDAGMPQPRFAFHPGNPFAeaeg 256
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 8176554    682 -PPTGDSG----APPVTPTGDSETAPVPPTGDSGAPPVPPtgdSEAAPVPP 727
Cdd:pfam03276 257 hPFAEAEGerprDIPRAPRIDAPSAPAIPAIQPIAPPMIP---PIGAPIPI 304
PHA03201 PHA03201
uracil DNA glycosylase; Provisional
606-694 7.60e-05

uracil DNA glycosylase; Provisional


Pssm-ID: 165468  Cd Length: 318  Bit Score: 45.65  E-value: 7.60e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   606 PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGdaGPPPVppTGDSGPPPVPPTG 685
Cdd:PHA03201   4 ARSRSPSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRG--CPAGV--TFSSSAPPRPPLG 79

                 ....*....
gi 8176554   686 DSGAPPVTP 694
Cdd:PHA03201  80 LDDAPAATP 88
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
549-701 7.61e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 46.30  E-value: 7.61e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    549 LPTVTDQEATPVPPTGDSEATPVPPTGDSETapvPPTGDSGAPPVPPTgdsgapPVPPTGDSGAPPVPPTgdSGAPPVPP 628
Cdd:pfam03154 418 MPQSQQLPPPPAQPPVLTQSQSLPPPAASHP---PTSGLHQVPSQSPF------PQHPFVPGGPPPITPP--SGPPTSTS 486
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 8176554    629 TGDSGAPPVPPTGDSGAPPVPPTGDSgapPVPPTGDAGPPPVPPTGDSGPPPvPPTGDSGAPPV--TPTGDSETA 701
Cdd:pfam03154 487 SAMPGIQPPSSASVSSSGPVPAAVSC---PLPPVQIKEEALDEAEEPESPPP-PPRSPSPEPTVvnTPSHASQSA 557
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
576-677 7.68e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 46.09  E-value: 7.68e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   576 DSETAPVPptgdSGAPPVpptgDSGAPPVPP-TGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvpptgdSGAPPVPPTgds 654
Cdd:PRK14954 377 DGGVAPSP----AGSPDV----KKKAPEPDLpQPDRHPGPAKPEAPGARPAELPSPASAPTP------EQQPPVARS--- 439
                         90       100
                 ....*....|....*....|...
gi 8176554   655 gaPPVPPTGDAGPPPVPPTGDSG 677
Cdd:PRK14954 440 --APLPPSPQASAPRNVASGKPG 460
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
636-713 8.53e-05

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 46.04  E-value: 8.53e-05
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 8176554    636 PVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPP 713
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
557-729 8.71e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 8.71e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    557 ATPVPPTGDSEATPVPP--TGDSETAPVPPTGDSGApPVPPTGDSGAPPVPPTG---DSGAPPVPPTGDSGAPPVPPTGD 631
Cdd:pfam05109 440 AAPNTTTGLPSSTHVPTnlTAPASTGPTVSTADVTS-PTPAGTTSGASPVTPSPsprDNGTESKAPDMTSPTSAVTTPTP 518
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    632 SGAPPVP----PTGDSGAPPVPPTGDSGAPPVPPTGDAGPPP--VPPTGDSGPPPVPPTGDSGApPVTPTGDSETAPVPP 705
Cdd:pfam05109 519 NATSPTPavttPTPNATSPTLGKTSPTSAVTTPTPNATSPTPavTTPTPNATIPTLGKTSPTSA-VTTPTPNATSPTVGE 597
                         170       180
                  ....*....|....*....|....*..
gi 8176554    706 TGDSGAPPVPPTGDSEAAPV---PPTD 729
Cdd:pfam05109 598 TSPQANTTNHTLGGTSSTPVvtsPPKN 624
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
561-685 8.75e-05

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 43.87  E-value: 8.75e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    561 PPTGDSEATPVPPTGDSETAPvPPTGDSGAPPvPPTGDSGAPPVPPTGDSGAPP-----VPPTGDSGAPPvPPTGDSGAP 635
Cdd:pfam15240  38 QSQQGGQGPQGPPPGGFPPQP-PASDDPPGPP-PPGGPQQPPPQGGKQKPQGPPpqggpRPPPGKPQGPP-PQGGNQQQG 114
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 8176554    636 PVPPTGDSGAPPvpptgDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTG 685
Cdd:pfam15240 115 PPPPGKPQGPPP-----QGGGPPPQGGNQQGPPPPPPGNPQGPPQRPPQP 159
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
475-706 9.65e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 9.65e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    475 PQDRTVFKAMIAYWTNFAKTGDPNMGDSAVPTHWEPYTTENSGYLEITKKMGSSSMKRSLRTNFLRYWTLTYLALPTVTD 554
Cdd:pfam05109 600 PQANTTNHTLGGTSSTPVVTSPPKNATSAVTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMPLLTSAHPTGGE 679
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    555 QEATPVPPTGD----SEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTgdsgAPPVPPTGDSGAPPVPPTG 630
Cdd:pfam05109 680 NITQVTPASTSthhvSTSSPAPRPGTTSQASGPGNSSTSTKPGEVNVTKGTPPKNAT----SPQAPSGQKTAVPTVTSTG 755
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    631 ---DSGAPPVPPTGDSGAPPVPPTGDSGAPPVPP-TGDAGPPPVPPTGDSgppPVPPTGDSGAPPVTPTgdSETAPVPPT 706
Cdd:pfam05109 756 gkaNSTTGGKHTTGHGARTSTEPTTDYGGDSTTPrTRYNATTYLPPSTSS---KLRPRWTFTSPPVTTA--QATVPVPPT 830
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
539-728 9.70e-05

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 45.82  E-value: 9.70e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  539 LRYWTLTYLALPTVTDQEATPVPPTGDSEATPVPPTGDSETAPVP---------------PTGDSGAPPVPPTGDSGAPP 603
Cdd:COG5180 154 LLQRSDPILAKDPDGDSASTLPPPAEKLDKVLTEPRDALKDSPEKldrpkvevkdeaqeePPDLTGGADHPRPEAASSPK 233
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  604 VPPTGDSGAPPVPPTGDSGAPPVPPtGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPP 683
Cdd:COG5180 234 VDPPSTSEARSRPATVDAQPEMRPP-ADAKERRRAAIGDTPAAEPPGLPVLEAGSEPQSDAPEAETARPIDVKGVASAPP 312
                       170       180       190       200       210
                ....*....|....*....|....*....|....*....|....*....|...
gi 8176554  684 TGDSGAPP--------VTPTGDSETAPVPPTGDSGAPPvPPTGDSEAAPVPPT 728
Cdd:COG5180 313 ATRPVRPPggardpgtPRPGQPTERPAGVPEAASDAGQ-PPSAYPPAEEAVPG 364
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
629-727 9.97e-05

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID:18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 45.38  E-value: 9.97e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   629 TGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPvPPTGDSGPPPVPPtgdsgappvtptgdsETAPVPPTGD 708
Cdd:NF041121  15 MGRAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPP-APEPAPLPAPYPG---------------SLAPPPPPPP 78
                         90
                 ....*....|....*....
gi 8176554   709 SGAPPVPPTGDSEAAPVPP 727
Cdd:NF041121  79 GPAGAAPGAALPVRVPAPP 97
PLN02217 PLN02217
probable pectinesterase/pectinesterase inhibitor
622-731 9.97e-05

probable pectinesterase/pectinesterase inhibitor


Pssm-ID: 215130 [Multi-domain]  Cd Length: 670  Bit Score: 45.85  E-value: 9.97e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   622 GAPPVPP--TGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDagpppvPPTGDSGPPPVPPT---GDSGAPPVTPTG 696
Cdd:PLN02217 554 GVPYIPGlfAGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTS------PPAGHLGSPPATPSkivSPSTSPPASHLG 627
                         90       100       110
                 ....*....|....*....|....*....|....*
gi 8176554   697 DSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDS 731
Cdd:PLN02217 628 SPSTTPSSPESSIKVASTETASPESSIKVASTESS 662
PTZ00441 PTZ00441
sporozoite surface protein 2 (SSP2); Provisional
554-736 1.04e-04

sporozoite surface protein 2 (SSP2); Provisional


Pssm-ID: 240420 [Multi-domain]  Cd Length: 576  Bit Score: 45.73  E-value: 1.04e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   554 DQEATPVPPtgdsEATPVPptgdsetAPVPPTGDSGAPPvpPTGDSGAPPVPPTG----DSGAPPVPPTGD-SGAPPV-- 626
Cdd:PTZ00441 278 EEEECPVEP----EPLPVP-------APVPPTPEDDNPR--PTDDEFAVPNFNEGldvpDNPQDPVPPPNEgKDGNPNee 344
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   627 ---PPTGDSGA--PPVPPTgdsgaPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAP-PVTPTGDSET 700
Cdd:PTZ00441 345 nlfPPGDDEVPdeSNVPPN-----PPNVPGGSNSEFSSDVENPPNPPNPDIPEQEPNIPEDSNKEVPEDvPMEPEDDRDN 419
                        170       180       190
                 ....*....|....*....|....*....|....*.
gi 8176554   701 APVPPTGDSGappvppTGDSEAAPVPPTDDSKEAQM 736
Cdd:PTZ00441 420 NFNEPKKPEN------KGDGQNEPVIPKPLDNERDQ 449
FAP pfam07174
Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment ...
568-691 1.04e-04

Fibronectin-attachment protein (FAP); This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 residues long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.


Pssm-ID: 429334  Cd Length: 301  Bit Score: 44.92  E-value: 1.04e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    568 ATPVPPTGDSETAPvpptgdsgAPPVPPTgdSGAPPVPPtgdsgaPPVPPTGDSGAPPVPPTGDSGAPPvPPTGDSGAPP 647
Cdd:pfam07174  30 AVALPAVAHADPEP--------APPPPST--ATAPPAPP------PPPPAPAAPAPPPPPAAPNAPNAP-PPPADPNAPP 92
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 8176554    648 vpptgdsgAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPP 691
Cdd:pfam07174  93 --------PPPADPNAPPPPAVDPNAPEPGRIDNAVGGFSYVVP 128
PRK14949 PRK14949
DNA polymerase III subunits gamma and tau; Provisional
602-738 1.08e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237863 [Multi-domain]  Cd Length: 944  Bit Score: 45.87  E-value: 1.08e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   602 PPVPPtgdSGAPPVPPTGDSGAPPVPPTGDSGAPPVP--PTGDSGAPPVPPTGDSGAPPVPPTGDAGPPP--VPPTGDSG 677
Cdd:PRK14949 645 PKTPP---SRAPPASLSKPASSPDASQTSASFDLDPDfeLATHQSVPEAALASGSAPAPPPVPDPYDRPPweEAPEVASA 721
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 8176554   678 P-PPVPPTGDSGAPPVTPTGDSETAPVPPTgdSGAPP-VPPTGDSEAAPVPPTDDSKEAQMPA 738
Cdd:PRK14949 722 NdGPNNAAEGNLSESVEDASNSELQAVEQQ--ATHQPqVQAEAQSPASTTALTQTSSEVQDTE 782
DamX COG3266
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ...
641-732 1.25e-04

Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442497 [Multi-domain]  Cd Length: 455  Bit Score: 45.23  E-value: 1.25e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  641 GDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGP--PPVPPTGDSGAPPVTPT--GDSETAPVPPTGDSGAPPVPP 716
Cdd:COG3266 262 SSASAPATTSLGEQQEVSLPPAVAAQPAAAAAAQPSAValPAAPAAAAAAAAPAEAAapQPTAAKPVVTETAAPAAPAPE 341
                        90
                ....*....|....*.
gi 8176554  717 TGDSEAAPVPPTDDSK 732
Cdd:COG3266 342 AAAAAAAPAAPAVAKK 357
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
566-741 1.25e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 1.25e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    566 SEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSG----APPVPPTGDSGAPPVPPTG 641
Cdd:PHA03307  773 ALLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGsessGPARPPGAAARPPPARSSE 852
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    642 DSGAPPVPPTGDSGAPPvpptGDAGPPPVPPTGDSGPPPvPPTGDSGAPPVTPTGDSEtAPVPPTGDSGAPPVPPTGDSE 721
Cdd:PHA03307  853 SSKSKPAAAGGRARGKN----GRRRPRPPEPRARPGAAA-PPKAAAAAPPAGAPAPRP-RPAPRVKLGPMPPGGPDPRGG 926
                         170       180
                  ....*....|....*....|
gi 8176554    722 AAPVPPTDDSKEAQMPAVIR 741
Cdd:PHA03307  927 FRRVPPGDLHTPAPSAAALA 946
PHA03201 PHA03201
uracil DNA glycosylase; Provisional
636-719 1.28e-04

uracil DNA glycosylase; Provisional


Pssm-ID: 165468  Cd Length: 318  Bit Score: 44.88  E-value: 1.28e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   636 PVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTpTGDSETAPVPPTGDSGAP-PV 714
Cdd:PHA03201   9 PSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRGCPAGV-TFSSSAPPRPPLGLDDAPaAT 87

                 ....*
gi 8176554   715 PPTGD 719
Cdd:PHA03201  88 PPPLD 92
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
603-680 1.35e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 45.65  E-value: 1.35e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 8176554    603 PVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPP 680
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115
PHA03418 PHA03418
hypothetical E4 protein; Provisional
554-687 1.39e-04

hypothetical E4 protein; Provisional


Pssm-ID: 177646 [Multi-domain]  Cd Length: 230  Bit Score: 43.96  E-value: 1.39e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   554 DQEATPVPPTgDSEATPVPPTGDSETAPVPPT------GDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPV- 626
Cdd:PHA03418  47 DPDKNPSPPP-DPPLTPRPPAQPNGHNKPPVTkqpggeGTEEDHQAPLAADADDDPRPGKRSKADEHGPAPGRAALAPFk 125
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 8176554   627 ------PPTGDsgaPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAG----PPPVPPTgdsgPPPVPPTGDS 687
Cdd:PHA03418 126 ldldqdPLHGD---PDPPPGATGGQGEEPPEGGEESQPPLGEGEGAveghPPPLPPA----PEPKPHNGDA 189
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
615-700 1.42e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 45.27  E-value: 1.42e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    615 VPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPtgdsgPPPVPPTGDSGAPPVTP 694
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAA-----PAAPPAAAAAAAPAAAA 112

                  ....*.
gi 8176554    695 TGDSET 700
Cdd:PRK12270  113 VEDEVT 118
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
582-672 1.57e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 45.27  E-value: 1.57e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    582 VPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTgdsgappvpPTGDSGAPPVPPTGDSGAPPVPP 661
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPK---------PAAAAAAAAAPAAPPAAAAAAAP 108
                          90
                  ....*....|.
gi 8176554    662 TGDAGPPPVPP 672
Cdd:PRK12270  109 AAAAVEDEVTP 119
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
565-647 1.63e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 45.27  E-value: 1.63e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    565 DSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPtgdsGAPPVPPTGDSGAPPVPPTGDSG 644
Cdd:PRK12270   37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAA----AAAAAAPAAPPAAAAAAAPAAAA 112

                  ...
gi 8176554    645 APP 647
Cdd:PRK12270  113 VED 115
DamX COG3266
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ...
619-724 1.64e-04

Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442497 [Multi-domain]  Cd Length: 455  Bit Score: 44.84  E-value: 1.64e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  619 GDSGAPPVPPTGDSGAPPVPPTGDsGAPPVPPTG---DSGAPPVPPTGDAGPPPVPPTgdsgpPPVPPTgdsgAPPVTPT 695
Cdd:COG3266 262 SSASAPATTSLGEQQEVSLPPAVA-AQPAAAAAAqpsAVALPAAPAAAAAAAAPAEAA-----APQPTA----AKPVVTE 331
                        90       100
                ....*....|....*....|....*....
gi 8176554  696 GDSETAPVPPTGDSGAPPVPPTGDSEAAP 724
Cdd:COG3266 332 TAAPAAPAPEAAAAAAAPAAPAVAKKLAA 360
PHA03247 PHA03247
large tegument protein UL36; Provisional
545-642 1.69e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 45.31  E-value: 1.69e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    545 TYLALPTVTDQEATPVPPTGDSEATPVPPTGdseTAPVPPTGDsGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTG---DS 621
Cdd:PHA03247  393 TPFARGPGGDDQTRPAAPVPASVPTPAPTPV---PASAPPPPA-TPLPSAEPGSDDGPAPPPERQPPAPATEPAPddpDD 468
                          90       100
                  ....*....|....*....|.
gi 8176554    622 GAPPVPPTGDSGAPPVPPTGD 642
Cdd:PHA03247  469 ATRKALDALRERRPPEPPGAD 489
PHA02682 PHA02682
ORF080 virion core protein; Provisional
562-682 1.74e-04

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 44.08  E-value: 1.74e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   562 PTGDSEATPVPPTgdseTAPVPPTgDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPtgdsgaPPVPPTGDSGAPPVPPTG 641
Cdd:PHA02682  76 PSGQSPLAPSPAC----AAPAPAC-PACAPAAPAPAVTCPAPAPACPPATAPTCPP------PAVCPAPARPAPACPPST 144
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*...
gi 8176554   642 DS--GAPPVP-PTGDSGAPPVPPTGDAGPPPVP----PTGDSGPPPVP 682
Cdd:PHA02682 145 RQcpPAPPLPtPKPAPAAKPIFLHNQLPPPDYPaascPTIETAPAASP 192
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
560-636 1.77e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 45.27  E-value: 1.77e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 8176554    560 VPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPT-GDSGAPPVPPTGDSGAPPVPPTGDSGAPP 636
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKpAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115
PHA03201 PHA03201
uracil DNA glycosylase; Provisional
569-653 1.90e-04

uracil DNA glycosylase; Provisional


Pssm-ID: 165468  Cd Length: 318  Bit Score: 44.11  E-value: 1.90e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   569 TPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvPPTGDSGAPPVPPTGDSGAP-P 647
Cdd:PHA03201   8 SPSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRGCPA-GVTFSSSAPPRPPLGLDDAPaA 86

                 ....*.
gi 8176554   648 VPPTGD 653
Cdd:PHA03201  87 TPPPLD 92
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
598-689 1.91e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 44.88  E-value: 1.91e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    598 DSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSG 677
Cdd:PRK12270   35 DYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVE 114
                          90
                  ....*....|..
gi 8176554    678 PPPVPPTGDSGA 689
Cdd:PRK12270  115 DEVTPLRGAAAA 126
BimA_first NF040984
trimeric autotransporter actin-nucleating factor BimA; BimA (B. pseudomallei intracellular ...
635-704 2.09e-04

trimeric autotransporter actin-nucleating factor BimA; BimA (B. pseudomallei intracellular motility protein A) is a trimeric autotransporter, homologous in its C-terminal half to a number of trimeric autotransporter adhesins. It is a virulence factor that nucleates actin, so that actin polymerization can drive escape by B. pseudomallei out of one cell and into a neighboring cell. HMM NF040983 describes a homolog with similar activity but substantial difference in sequence architecture in the N-terminal region.


Pssm-ID: 468914 [Multi-domain]  Cd Length: 517  Bit Score: 44.48  E-value: 2.09e-04
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 8176554   635 PPVPPTGDSgaPPVPPTGDSGAPPVPPtgdagPPPVPPTGDSGPPPVPPTGDSGA-----PPVTPTGDSETAPVP 704
Cdd:NF040984  42 PPEPPGGTN--IPVPPPMPGGGANIPV-----PPPMPGGGANIPPPPPPPGGIGGatpspPPLTPVNGNPGASTP 109
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
628-731 2.16e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 44.67  E-value: 2.16e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   628 PTGDSGAPPvpptGDSGAPPvppTGDSGAPPVPPTGDAGPPPVPP----TGDSGPPPVPPTGDSGAPPVTPtgdSETAPV 703
Cdd:PRK14959 373 PSGGGASAP----SGSAAEG---PASGGAATIPTPGTQGPQGTAPaagmTPSSAAPATPAPSAAPSPRVPW---DDAPPA 442
                         90       100
                 ....*....|....*....|....*...
gi 8176554   704 PPTgdSGAPPVPPTGDSEAAPVPPTDDS 731
Cdd:PRK14959 443 PPR--SGIPPRPAPRMPEASPVPGAPDS 468
PHA03378 PHA03378
EBNA-3B; Provisional
569-716 2.28e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 44.67  E-value: 2.28e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   569 TPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvpptgdSGAPPVPPTGDSGAP-P 647
Cdd:PHA03378 650 TPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAPTPMRP------PAAPPGRAQRPAAATgR 723
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 8176554   648 VPPTGDSGAPPVPPTGDAGPPPVP---------PTGDSGPPPvPPTGDSGAPpvTPTGDSETAPVPPTGDSGAP-PVPP 716
Cdd:PHA03378 724 ARPPAAAPGRARPPAAAPGRARPPaaapgrarpPAAAPGRAR-PPAAAPGAP--TPQPPPQAPPAPQQRPRGAPtPQPP 799
PTZ00429 PTZ00429
beta-adaptin; Provisional
601-731 2.55e-04

beta-adaptin; Provisional


Pssm-ID: 240415 [Multi-domain]  Cd Length: 746  Bit Score: 44.54  E-value: 2.55e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   601 APPVPPTGDSGAPPVPPTGDSGAPPVPPTGD-SGAPPVPPTGDSGAPPVPPTGD--SGappVPPTGDAGPPPVPPTGDSG 677
Cdd:PTZ00429 621 LPSTPSMGTQDGSPAPSAAPAGYDIFEFAGDgTGAPHPVASGSNGAQHADPLGDlfSG---LPSTVGASSPAFQAASGSQ 697
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....
gi 8176554   678 PPPVPPTgdsgappVTPTGDSETAPVPPTGDSGAPpvpptGDSEAAPVPPTDDS 731
Cdd:PTZ00429 698 APASPPT-------AASAIEDLFANGMGSGSQTVP-----LPISAAPQSADRDT 739
PRK10856 PRK10856
cytoskeleton protein RodZ;
584-697 2.58e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 43.86  E-value: 2.58e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   584 PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPtgdSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPtgdsgapPVPPTG 663
Cdd:PRK10856 163 PLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVAT---APAPAVDPQQNAVVAPSQANVDTAATPAPA-------APATPD 232
                         90       100       110
                 ....*....|....*....|....*....|....
gi 8176554   664 DAGPPPVPPTGDSGPPpvpptGDSGAPPVTPTGD 697
Cdd:PRK10856 233 GAAPLPTDQAGVSTPA-----ADPNALVMNFTAD 261
PHA03201 PHA03201
uracil DNA glycosylase; Provisional
603-686 2.59e-04

uracil DNA glycosylase; Provisional


Pssm-ID: 165468  Cd Length: 318  Bit Score: 43.73  E-value: 2.59e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   603 PVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvPPTGDAGPPPVPPTGDSGPP-PV 681
Cdd:PHA03201   9 PSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRGCPA-GVTFSSSAPPRPPLGLDDAPaAT 87

                 ....*
gi 8176554   682 PPTGD 686
Cdd:PHA03201  88 PPPLD 92
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
642-724 2.60e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 44.50  E-value: 2.60e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    642 DSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGDSE 721
Cdd:PRK12270   37 GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDE 116

                  ...
gi 8176554    722 AAP 724
Cdd:PRK12270  117 VTP 119
PHA03201 PHA03201
uracil DNA glycosylase; Provisional
580-664 2.80e-04

uracil DNA glycosylase; Provisional


Pssm-ID: 165468  Cd Length: 318  Bit Score: 43.73  E-value: 2.80e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   580 APVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvPPTGDSGAPPVPPTGDSGAP-P 658
Cdd:PHA03201   8 SPSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRGCPA-GVTFSSSAPPRPPLGLDDAPaA 86

                 ....*.
gi 8176554   659 VPPTGD 664
Cdd:PHA03201  87 TPPPLD 92
PRK12495 PRK12495
hypothetical protein; Provisional
564-666 2.83e-04

hypothetical protein; Provisional


Pssm-ID: 183558 [Multi-domain]  Cd Length: 226  Bit Score: 42.93  E-value: 2.83e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   564 GDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGD-SGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGD 642
Cdd:PRK12495  76 DDAGDGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSaTDEAATDPPATAAARDGPTPDPTAQPATPDERR 155
                         90       100
                 ....*....|....*....|....
gi 8176554   643 SGAPPVPPTGDSGAPPVPPTGDAG 666
Cdd:PRK12495 156 SPRQRPPVSGEPPTPSTPDAHVAG 179
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
593-673 2.85e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 44.50  E-value: 2.85e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    593 VPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTgdSGAPPVPPTGDSGAPPVPPTGDAGPPPVPP 672
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPK--PAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115

                  .
gi 8176554    673 T 673
Cdd:PRK12270  116 E 116
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
578-671 3.17e-04

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1), whose C-terminal region suggests restriction enzyme activity (PMID:18456708), and by other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 43.84  E-value: 3.17e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   578 ETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvpptGDSGAP 657
Cdd:NF041121  16 GRAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAP----GAALPV 91
                         90
                 ....*....|....
gi 8176554   658 PVPptgdaGPPPVP 671
Cdd:NF041121  92 RVP-----APPALP 100
PRK10856 PRK10856
cytoskeleton protein RodZ;
567-671 3.24e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 43.48  E-value: 3.24e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   567 EATPVPpTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPtgdSGAPPVPPTGDSGAPPVPPTGDSGAPPVPptgdsgaP 646
Cdd:PRK10856 158 SGQSVP-LDTSTTTDPATTPAPAAPVDTTPTNSQTPAVAT---APAPAVDPQQNAVVAPSQANVDTAATPAP-------A 226
                         90       100
                 ....*....|....*....|....*
gi 8176554   647 PVPPTGDSGAPPVPPTGDAGPPPVP 671
Cdd:PRK10856 227 APATPDGAAPLPTDQAGVSTPAADP 251
PHA03321 PHA03321
tegument protein VP11/12; Provisional
588-733 3.27e-04

tegument protein VP11/12; Provisional


Pssm-ID: 223041 [Multi-domain]  Cd Length: 694  Bit Score: 44.18  E-value: 3.27e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   588 SGAPPVPP-TGDSGAPPVPPTGDSGAPP---VPPTGDSGAPPVPPTGDS---------GAPPVPPTGDSGAPPVPPTGDS 654
Cdd:PHA03321 427 SRQPPGAPaPRRDNDPPPPPRARPGSTPacaRRARAQRARDAGPEYVDPlgalrrlpaGAAPPPEPAAAPSPATYYTRMG 506
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 8176554   655 GAPPVPPTGDAGPPPVPPtgDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKE 733
Cdd:PHA03321 507 GGPPRLPPRNRATETLRP--DWGPPAAAPPEQMEDPYLEPDDDRFDRRDGAAAAATSHPREAPAPDDDPIYEGVSDSEE 583
PRK12495 PRK12495
hypothetical protein; Provisional
552-655 3.45e-04

hypothetical protein; Provisional


Pssm-ID: 183558 [Multi-domain]  Cd Length: 226  Bit Score: 42.93  E-value: 3.45e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   552 VTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGD-SGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTG 630
Cdd:PRK12495  75 GDDAGDGAEATAPSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSaTDEAATDPPATAAARDGPTPDPTAQPATPDER 154
                         90       100
                 ....*....|....*....|....*
gi 8176554   631 DSGAPPVPPTGDSGAPPVPPTGDSG 655
Cdd:PRK12495 155 RSPRQRPPVSGEPPTPSTPDAHVAG 179
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
581-658 3.53e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 44.11  E-value: 3.53e-04
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 8176554    581 PVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPP 658
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115
PHA03369 PHA03369
capsid maturational protease; Provisional
566-662 3.95e-04

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 43.83  E-value: 3.95e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   566 SEATPVPPTGDSETAPVPPTGdsGAPPVPPTGDSGAPPVPPTGDSGAPPvPPTGDSGAPPVPPTGDSGAPPVPPTGDSGA 645
Cdd:PHA03369 349 KTASLTAPSRVLAAAAKVAVI--AAPQTHTGPADRQRPQRPDGIPYSVP-ARSPMTAYPPVPQFCGDPGLVSPYNPQSPG 425
                         90
                 ....*....|....*..
gi 8176554   646 PPVPPTGDSGAPPVPPT 662
Cdd:PHA03369 426 TSYGPEPVGPVPPQPTN 442
PHA03264 PHA03264
envelope glycoprotein D; Provisional
560-662 4.19e-04

envelope glycoprotein D; Provisional


Pssm-ID: 223029 [Multi-domain]  Cd Length: 416  Bit Score: 43.46  E-value: 4.19e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   560 VPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAP--PVPPTGDSGAPPV 637
Cdd:PHA03264 254 VPPYFEESKGYEPPPAPSGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGgePKPGPPRPAPDAD 333
                         90       100
                 ....*....|....*....|....*
gi 8176554   638 PPTGDSGAPPVPPTGDSGAPPVPPT 662
Cdd:PHA03264 334 RPEGWPSLEAITFPPPTPATPAVPR 358
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
571-661 4.22e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 43.73  E-value: 4.22e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    571 VPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTgdsgappvpPTGDSGAPPVPPTGDSGAPPVPP 650
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPK---------PAAAAAAAAAPAAPPAAAAAAAP 108
                          90
                  ....*....|.
gi 8176554    651 TGDSGAPPVPP 661
Cdd:PRK12270  109 AAAAVEDEVTP 119
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
563-662 4.25e-04

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1), whose C-terminal region suggests restriction enzyme activity (PMID:18456708), and by other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 43.45  E-value: 4.25e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   563 TGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPvpptGDSGAPPVPPTGDSGAPPVPPTGDsgaPPVPPTGD 642
Cdd:NF041121  15 MGRAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPP----APEPAPLPAPYPGSLAPPPPPPPG---PAGAAPGA 87
                         90       100
                 ....*....|....*....|
gi 8176554   643 SGAPPVPptgdsgAPPVPPT 662
Cdd:NF041121  88 ALPVRVP------APPALPN 101
PHA03378 PHA03378
EBNA-3B; Provisional
552-727 4.59e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 43.90  E-value: 4.59e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   552 VTDQEATPVPPTGDSEAtpvPPTGDSETAPVPPTGDSGAP----------PVPPTGDSgAPPVPPTG------DSGAPPV 615
Cdd:PHA03378 599 VPHPSQTPEPPTTQSHI---PETSAPRQWPMPLRPIPMRPlrmqpitfnvLVFPTPHQ-PPQVEITPykptwtQIGHIPY 674
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   616 PPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPP--VPPTgdSGAPPVPPTGDAGPPPVPPTGDSGPPPVPptgdsgaPPVT 693
Cdd:PHA03378 675 QPSPTGANTMLPIQWAPGTMQPPPRAPTPMRPpaAPPG--RAQRPAAATGRARPPAAAPGRARPPAAAP-------GRAR 745
                        170       180       190
                 ....*....|....*....|....*....|....
gi 8176554   694 PTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPP 727
Cdd:PHA03378 746 PPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPP 779
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
637-727 4.96e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 43.73  E-value: 4.96e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    637 VPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPtgdsETAPVPPtgdsgAPPVPP 716
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAA----PAAPPAA-----AAAAAP 108
                          90
                  ....*....|.
gi 8176554    717 TGDSEAAPVPP 727
Cdd:PRK12270  109 AAAAVEDEVTP 119
PHA03132 PHA03132
thymidine kinase; Provisional
561-741 5.05e-04

thymidine kinase; Provisional


Pssm-ID: 222997 [Multi-domain]  Cd Length: 580  Bit Score: 43.21  E-value: 5.05e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   561 PPTGDSEATpvpptgDSETAPVPPTGDSGAPPVPPTgdsGAPPVPPTGDSGAPPVP--PTGDSGAPPVPPtgdsGAPPVP 638
Cdd:PHA03132  39 PLGSTSEAT------SEDDDDLYPPRETGSGGGVAT---STIYTVPRPPRGPEQTLdkPDSLPASRELPP----GPTPVP 105
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   639 PTGDSGAPPVPPTGDSGAPpvPPTGDAGPPPVPPTGDSGPPPVPPTGDSgaPPVTPTGDSETAPVPPTGDsgappVPPTG 718
Cdd:PHA03132 106 PGGFRGASSPRLGADSTSP--RFLYQVNFPVILAPIGESNSSSEELSEE--EEHSRPPPSESLKVKNGGK-----VYPKG 176
                        170       180
                 ....*....|....*....|...
gi 8176554   719 DSEAAPVPPTDDSKEAQMPAVIR 741
Cdd:PHA03132 177 FSKHKTHKRSEFSGLTKKAARKR 199
flhF PRK06995
flagellar biosynthesis protein FlhF;
576-690 5.24e-04

flagellar biosynthesis protein FlhF;


Pssm-ID: 235904 [Multi-domain]  Cd Length: 484  Bit Score: 43.42  E-value: 5.24e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   576 DSETAPVPPTGDSGAPPVPPtgdsgaPPVPPTGDSGAPPVPPtGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSG 655
Cdd:PRK06995  45 DSDLAALAPPAAAAPAAAQP------PPAAAPAAVSRPAAPA-AEPAPWLVEHAKRLTAQREQLVARAAAPAAPEAQAPA 117
                         90       100       110
                 ....*....|....*....|....*....|....*
gi 8176554   656 APPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAP 690
Cdd:PRK06995 118 APAERAAAENAARRLARAAAAAPRPRVPADAAAAV 152
PRK10856 PRK10856
cytoskeleton protein RodZ;
628-722 5.25e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 42.71  E-value: 5.25e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   628 PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPtgdAGPPPVPPTGDSGPPPVPPTGDSGAPPV-TPTGDSETAPVPPT 706
Cdd:PRK10856 163 PLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVAT---APAPAVDPQQNAVVAPSQANVDTAATPApAAPATPDGAAPLPT 239
                         90
                 ....*....|....*.
gi 8176554   707 GDsgAPPVPPTGDSEA 722
Cdd:PRK10856 240 DQ--AGVSTPAADPNA 253
half-pint TIGR01645
poly-U binding splicing factor, half-pint family; The proteins represented by this model ...
561-684 5.72e-04

poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. PUF60 (GP|6176532), in complex with p54 and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.


Pssm-ID: 130706 [Multi-domain]  Cd Length: 612  Bit Score: 43.14  E-value: 5.72e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    561 PPtgDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPT 640
Cdd:TIGR01645 284 PP--DALLQPATVSAIPAAAAVAAAAATAKIMAAEAVAGAAVLGPRAQSPATPSSSLPTDIGNKAVVSSAKKEAEEVPPL 361
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 8176554    641 GDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPT 684
Cdd:TIGR01645 362 PQAAPAVVKPGPMEIPTPVPPPGLAIPSLVAPPGLVAPTEINPS 405
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
626-716 5.78e-04

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 43.34  E-value: 5.78e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    626 VPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTgdSGPPPVPPTGDSGAPPvtptgdsETAPVPP 705
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPK--PAAAAAAAAAPAAPPA-------AAAAAAP 108
                          90
                  ....*....|.
gi 8176554    706 TGDSGAPPVPP 716
Cdd:PRK12270  109 AAAAVEDEVTP 119
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
550-691 5.87e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 43.38  E-value: 5.87e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   550 PTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAP--PV-------PPTGDSgapPVPPTGD----SGAPPVP 616
Cdd:PLN03209 396 ASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPlsPYaryedlkPPTSPS---PTAPTGVspsvSSTSSVP 472
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 8176554   617 PTGDSgAPPVPPTGDSGAPPVPPTGDSGAPPV----PPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPP 691
Cdd:PLN03209 473 AVPDT-APATAATDAAAPPPANMRPLSPYAVYddlkPPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQP 550
Gag_spuma pfam03276
Spumavirus gag protein;
580-720 6.08e-04

Spumavirus gag protein;


Pssm-ID: 460872 [Multi-domain]  Cd Length: 614  Bit Score: 43.20  E-value: 6.08e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    580 APVPPTGDSGAPPVppTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvPPTGDSGappvPPTGDSGAPPV 659
Cdd:pfam03276 176 AEISPGAQGGIPPG--ASFSGLPSLPAIGGIHLPAIPGIHARAPPGNIARSLGDDIM-PSLGDAG----MPQPRFAFHPG 248
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 8176554    660 PPTGDA-GPPPVPPTGDSG-PPPVPPTGDSGAPPVTPTgdSETAPVPPTGDSGAPPVPPTGDS 720
Cdd:pfam03276 249 NPFAEAeGHPFAEAEGERPrDIPRAPRIDAPSAPAIPA--IQPIAPPMIPPIGAPIPIPHGAS 309
Med25_SD1 pfam11235
Mediator complex subunit 25 synapsin 1; The overall function of the full-length Med25 is ...
571-713 6.31e-04

Mediator complex subunit 25 synapsin 1; The overall function of the full-length Med25 is to efficiently coordinate the transcriptional activation of RAR/RXR (retinoic acid receptor/retinoic X receptor) in higher eukaryotic cells. Human Med25 consists of several domains with different binding properties: the N-terminal VWA domain, this SD1 (synapsin 1) domain from residues 229-381, a PTOV(B) or ACID domain from 395-545, an SD2 domain from residues 564-645, and a C-terminal NR box-containing domain (646-650) from 646-747. The function of the SD domains is unclear.


Pssm-ID: 463244 [Multi-domain]  Cd Length: 157  Bit Score: 40.92  E-value: 6.31e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    571 VPPTGDSETAPVPPTGDSGAPPVPPTG-------DSGAPPVPPT----GDSGAPPVPPTGDSGAPPVPPTG-DSGAPPVP 638
Cdd:pfam11235   1 LPVGGGSAPGPLQSKQPVPLPPAAPSGatlsaapQQPLPPVPPQyqvpGNLSAAQVAAQNAVEAAKNQKAGlGPRFSPIT 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 8176554    639 PTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPP 713
Cdd:pfam11235  81 PLQQAAPGVGPPFSQAPAPQLPPGPPGAPKPVPPASQPSLVSTVAPGSGLAPTAQPGAPSMAGTVAPGGVSGPSP 155
PRK12495 PRK12495
hypothetical protein; Provisional
574-677 7.09e-04

hypothetical protein; Provisional


Pssm-ID: 183558 [Multi-domain]  Cd Length: 226  Bit Score: 41.78  E-value: 7.09e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   574 TGDSETAPVPptGDSGAPPVPPTgDSGAPPVPPTGDSGAPPVPPTGD-SGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTG 652
Cdd:PRK12495  78 AGDGAEATAP--SDAGSQASPDD-DAQPAAEAEAADQSAPPEASSTSaTDEAATDPPATAAARDGPTPDPTAQPATPDER 154
                         90       100
                 ....*....|....*....|....*
gi 8176554   653 DSGAPPVPPTGDAGPPPVPPTGDSG 677
Cdd:PRK12495 155 RSPRQRPPVSGEPPTPSTPDAHVAG 179
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
654-739 7.78e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 42.81  E-value: 7.78e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   654 SGAPPVPPTGDAGPPPVPPtgdSGPPPvpptgdSGAPPVTptgdsetaPVPPTGDSGAPPVPPTGDSEAAPvPPTDDSKE 733
Cdd:PRK14965 379 RGAPAPPSAAWGAPTPAAP---AAPPP------AAAPPVP--------PAAPARPAAARPAPAPAPPAAAA-PPARSADP 440

                 ....*.
gi 8176554   734 AQMPAV 739
Cdd:PRK14965 441 AAAASA 446
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
552-718 7.83e-04

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 42.73  E-value: 7.83e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    552 VTDQEATPVPPTGDSEATPVPPTGDSETAPvpptgdSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTgdSGAPPVPPTGD 631
Cdd:pfam05539 172 VTTSKTTSWPTEVSHPTYPSQVTPQSQPAT------QGHQTATANQRLSSTEPVGTQGTTTSSNPEP--QTEPPPSQRGP 243
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    632 SGAPPVPPTGDSgaPPVPPTGDSGA-PPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSG 710
Cdd:pfam05539 244 SGSPQHPPSTTS--QDQSTTGDGQEhTQRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPP 321

                  ....*...
gi 8176554    711 APPVPPTG 718
Cdd:pfam05539 322 GVQANPTT 329
PRK10856 PRK10856
cytoskeleton protein RodZ;
617-711 7.86e-04

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 42.32  E-value: 7.86e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   617 PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPtgdSGAPPVPPTGDAGPPPVPPTGDSGPPPVP-PTGDSGAPPVTPT 695
Cdd:PRK10856 163 PLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVAT---APAPAVDPQQNAVVAPSQANVDTAATPAPaAPATPDGAAPLPT 239
                         90
                 ....*....|....*.
gi 8176554   696 GDseTAPVPPTGDSGA 711
Cdd:PRK10856 240 DQ--AGVSTPAADPNA 253
PHA02682 PHA02682
ORF080 virion core protein; Provisional
606-714 7.98e-04

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 42.16  E-value: 7.98e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   606 PTGDSGAPPVPPTgdsgAPPVPPTgDSGAPPVP---PTGDSGAPPVPPTGDSGAPP--VPPTGDAGPPPVPPTGDSGPPP 680
Cdd:PHA02682  76 PSGQSPLAPSPAC----AAPAPAC-PACAPAAPapaVTCPAPAPACPPATAPTCPPpaVCPAPARPAPACPPSTRQCPPA 150
                         90       100       110
                 ....*....|....*....|....*....|....
gi 8176554   681 VPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPV 714
Cdd:PHA02682 151 PPLPTPKPAPAAKPIFLHNQLPPPDYPAASCPTI 184
DamX COG3266
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ...
548-647 8.19e-04

Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442497 [Multi-domain]  Cd Length: 455  Bit Score: 42.53  E-value: 8.19e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  548 ALPTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAP-PVPPTgdsgaPPVPPTgdsgAPPVPPTGDSGAPPV 626
Cdd:COG3266 269 TTSLGEQQEVSLPPAVAAQPAAAAAAQPSAVALPAAPAAAAAAAaPAEAA-----APQPTA----AKPVVTETAAPAAPA 339
                        90       100
                ....*....|....*....|.
gi 8176554  627 PPTGDSGAPPVPPTGDSGAPP 647
Cdd:COG3266 340 PEAAAAAAAPAAPAVAKKLAA 360
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
652-741 8.33e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 42.84  E-value: 8.33e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   652 GDSGAPPVPPTGDAGPPPVPPTgdSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDS 731
Cdd:PRK14971 366 GDDASGGRGPKQHIKPVFTQPA--AAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTA 443
                         90
                 ....*....|
gi 8176554   732 KEAQMPAVIR 741
Cdd:PRK14971 444 PQAVRPAQFK 453
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
549-738 8.34e-04

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 42.75  E-value: 8.34e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   549 LPTVTDQEATPVPPTG--DSEATPVPPTGDSETAPVPPTgdsgAPPVPPTGD-SGAPPVPPTGDSGAPPVPPTGDSgAPP 625
Cdd:PTZ00449 572 IPTLSKKPEFPKDPKHpkDPEEPKKPKRPRSAQRPTRPK----SPKLPELLDiPKSPKRPESPKSPKRPPPPQRPS-SPE 646
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   626 VPPTGDSGAPPVPPTgdSGAPPVPPT-------------GDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPV 692
Cdd:PTZ00449 647 RPEGPKIIKSPKPPK--SPKPPFDPKfkekfyddyldaaAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLPPK 724
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|....*.
gi 8176554   693 TPTgdSETAPVPPTGDsgaPPVPPTGDSEAAPVPPTDDSKEAQMPA 738
Cdd:PTZ00449 725 LPR--DEEFPFEPIGD---PDAEQPDDIEFFTPPEEERTFFHETPA 765
PHA03201 PHA03201
uracil DNA glycosylase; Provisional
584-675 8.49e-04

uracil DNA glycosylase; Provisional


Pssm-ID: 165468  Cd Length: 318  Bit Score: 42.19  E-value: 8.49e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   584 PTGDSGAPPVPPtgdSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvPPTGDSGAPPVPPTG 663
Cdd:PHA03201   4 ARSRSPSPPRRP---SPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRGCPA-GVTFSSSAPPRPPLG 79
                         90
                 ....*....|...
gi 8176554   664 DAGPP-PVPPTGD 675
Cdd:PHA03201  80 LDDAPaATPPPLD 92
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
550-737 8.56e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.98  E-value: 8.56e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    550 PTVTDQEATPVPP----TGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVP----PTGDSGAPPVPPTGDS 621
Cdd:pfam05109 466 PTVSTADVTSPTPagttSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPTPNATSPTPavttPTPNATSPTLGKTSPT 545
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    622 GAPPVP-PTGDSGAPPV-PPTGDSGAPPVPPTGDSGApPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSE 699
Cdd:pfam05109 546 SAVTTPtPNATSPTPAVtTPTPNATIPTLGKTSPTSA-VTTPTPNATSPTVGETSPQANTTNHTLGGTSSTPVVTSPPKN 624
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*
gi 8176554    700 TAPVPPTG-------DSGAPPVPPTGDSEAAPvPPTDDSKEAQMP 737
Cdd:pfam05109 625 ATSAVTTGqhnitssSTSSMSLRPSSISETLS-PSTSDNSTSHMP 668
PRK12495 PRK12495
hypothetical protein; Provisional
551-633 8.66e-04

hypothetical protein; Provisional


Pssm-ID: 183558 [Multi-domain]  Cd Length: 226  Bit Score: 41.39  E-value: 8.66e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   551 TVTDQEATPVPPTGDSEATPVPPTGD-SETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPT 629
Cdd:PRK12495  96 PDDDAQPAAEAEAADQSAPPEASSTSaTDEAATDPPATAAARDGPTPDPTAQPATPDERRSPRQRPPVSGEPPTPSTPDA 175

                 ....
gi 8176554   630 GDSG 633
Cdd:PRK12495 176 HVAG 179
PHA03264 PHA03264
envelope glycoprotein D; Provisional
626-728 8.68e-04

envelope glycoprotein D; Provisional


Pssm-ID: 223029 [Multi-domain]  Cd Length: 416  Bit Score: 42.30  E-value: 8.68e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   626 VPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDsGAPPVTPT-GDSETAP-- 702
Cdd:PHA03264 254 VPPYFEESKGYEPPPAPSGGSPAPPGDDRPEAKPEPGPVEDGAPGRETGGEGEGPEPAGRD-GAAGGEPKpGPPRPAPda 332
                         90       100
                 ....*....|....*....|....*.
gi 8176554   703 VPPTGDSGAPPVPPTGDSEAAPVPPT 728
Cdd:PHA03264 333 DRPEGWPSLEAITFPPPTPATPAVPR 358
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
597-738 8.70e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 42.84  E-value: 8.70e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   597 GDSGAPPVPPTgdsgAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTgdsgAPPVPPTgdagPPPVpPTGDS 676
Cdd:PRK14971 366 GDDASGGRGPK----QHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQS----ATQPAGT----PPTV-SVDPP 432
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 8176554   677 GPPPVPPTGdSGAPPVTPTGDSETAPVPPTGDS--GAPPVPPTGDSEAApvpPTDDSKEAQMPA 738
Cdd:PRK14971 433 AAVPVNPPS-TAPQAVRPAQFKEEKKIPVSKVSslGPSTLRPIQEKAEQ---ATGNIKEAPTGT 492
PHA02682 PHA02682
ORF080 virion core protein; Provisional
628-724 8.96e-04

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 41.77  E-value: 8.96e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   628 PTGDSGAPPVPPTgdsgAPPVPPTgDSGAPPVPPTGDAGPPPVPptgdSGPPPVPPTGdsgAPPVTPTGDSETAPVPPTG 707
Cdd:PHA02682  76 PSGQSPLAPSPAC----AAPAPAC-PACAPAAPAPAVTCPAPAP----ACPPATAPTC---PPPAVCPAPARPAPACPPS 143
                         90
                 ....*....|....*..
gi 8176554   708 DSGAPPVPPTGDSEAAP 724
Cdd:PHA02682 144 TRQCPPAPPLPTPKPAP 160
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
616-718 8.99e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 42.62  E-value: 8.99e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   616 PPTGDSGAPPVPPTGDSGAPPVPPtgdsgappvpPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPvpptgdSGAPPVTPt 695
Cdd:PRK14954 376 NDGGVAPSPAGSPDVKKKAPEPDL----------PQPDRHPGPAKPEAPGARPAELPSPASAPTP------EQQPPVAR- 438
                         90       100
                 ....*....|....*....|...
gi 8176554   696 gdseTAPVPPTGDSGAPPVPPTG 718
Cdd:PRK14954 439 ----SAPLPPSPQASAPRNVASG 457
TALPID3 pfam15324
Hedgehog signalling target; TALPID3 is a family of eukaryotic proteins that are targets for ...
545-706 9.51e-04

Hedgehog signalling target; TALPID3 is a family of eukaryotic proteins that are targets for Hedgehog signalling. Mutations in this gene noticed first in chickens lead to multiple abnormalities of development.


Pssm-ID: 434634 [Multi-domain]  Cd Length: 1288  Bit Score: 42.95  E-value: 9.51e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554     545 TYLALPTVTDQEATP-VPPTGDSEATPVPpTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPpTGDSGA 623
Cdd:pfam15324  982 TLLPTPVPTPQPTPPcSPPSPLKEPSPVK-TPDSSPCVSEHDFFPVKEIPPEKGADTGPAVSLVITPTVTPIA-TPPPAA 1059
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554     624 PPVPPTGDSGAPPVPptgdSGAPPVP-PTGDSGAP-----PVPPTGDAGPPPV--------------PPTGDSGPPPV-- 681
Cdd:pfam15324 1060 TPTPPLSENSIDKLK----SPSPELPkPWEDSDLPleeenPNSEQEELHPRAVvmsvardeepesvvLPASPPEPKPLap 1135
                          170       180
                   ....*....|....*....|....*.
gi 8176554     682 -PPTGDSGAPPVTPTGDSETAPVPPT 706
Cdd:pfam15324 1136 pPLGAAPPSPPQSPSSSSSTLESSSS 1161
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
605-696 9.71e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 42.62  E-value: 9.71e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   605 PPTGDSGAPPVPPTGDSGAPPVPP-TGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvpptgdAGPPPVPPtgdsgPPPVPP 683
Cdd:PRK14954 376 NDGGVAPSPAGSPDVKKKAPEPDLpQPDRHPGPAKPEAPGARPAELPSPASAPTP------EQQPPVAR-----SAPLPP 444
                         90
                 ....*....|...
gi 8176554   684 TGDSGAPPVTPTG 696
Cdd:PRK14954 445 SPQASAPRNVASG 457
PRK12438 PRK12438
hypothetical protein; Provisional
621-679 1.05e-03

hypothetical protein; Provisional


Pssm-ID: 171499 [Multi-domain]  Cd Length: 991  Bit Score: 42.54  E-value: 1.05e-03
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....*....
gi 8176554   621 SGAPPVPPTGDSGAPPvpPTGdsGAPPVPPtgdsgaPPVPPTGDAGPPPVPPTGDSGPP 679
Cdd:PRK12438 899 TGRVATAPGGDAASAP--PPG--AGPPAPP------QAVPPPRTTQPPAAPPRGPDVPP 947
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
648-728 1.06e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 42.57  E-value: 1.06e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    648 VPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTgdSETAPVPPTGDSGAPPVPPTGDSEAAPVPP 727
Cdd:PRK12270   38 PGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPK--PAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115

                  .
gi 8176554    728 T 728
Cdd:PRK12270  116 E 116
SAP130_C pfam16014
Histone deacetylase complex subunit SAP130 C-terminus;
595-724 1.06e-03

Histone deacetylase complex subunit SAP130 C-terminus;


Pssm-ID: 464973 [Multi-domain]  Cd Length: 371  Bit Score: 42.23  E-value: 1.06e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    595 PTGDsGAPPVPPTGDSGAPPVppTGDSGAPPVPpTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGP------- 667
Cdd:pfam16014  15 PATE-GAKPKPDIHVAVAPPV--TVAVEALPGQ-NSEQQTASASPPSQHPAQAIPTILAPAAPPSQPSVVLSTlpaamav 90
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 8176554    668 -PPVPPT--GDSGPPPVPPTGDSGAPPVTPT------------GDSETAPVPPTGDSGAPPVPPTGDSEAAP 724
Cdd:pfam16014  91 tPPIPASmaNVVAPPTQPAASSTAACAVSSVlpeikikqeaepMDTSQSVPPLTPTSISPALTSLANNLSVP 162
DamX COG3266
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ...
547-658 1.07e-03

Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442497 [Multi-domain]  Cd Length: 455  Bit Score: 42.14  E-value: 1.07e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  547 LALPTVTDQEATPVPPTGDSEATPVP-PTGDSETAPVPPTGDsGAPPVPPTGDSGAP--PVPPTGDSGAP-PVPPT--GD 620
Cdd:COG3266 244 LVLLLLIIGSALKAPSQASSASAPATtSLGEQQEVSLPPAVA-AQPAAAAAAQPSAValPAAPAAAAAAAaPAEAAapQP 322
                        90       100       110
                ....*....|....*....|....*....|....*...
gi 8176554  621 SGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPP 658
Cdd:COG3266 323 TAAKPVVTETAAPAAPAPEAAAAAAAPAAPAVAKKLAA 360
SOBP pfam15279
Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual ...
561-716 1.10e-03

Sine oculis-binding protein; SOBP is associated with syndromic and nonsyndromic intellectual disability. It carries a zinc-finger of the zf-C2H2 type at the N-terminus, and a highly characteristic C-terminal PhPhPhPhPhPh motif. The deduced 873-amino acid protein contains an N-terminal nuclear localization signal (NLS), followed by 2 FCS-type zinc finger motifs, a proline-rich region (PR1), a putative RNA-binding motif region, and a C-terminal NLS embedded in a second proline-rich motif. SOBP is expressed in various human tissues, including developing mouse brain at embryonic day 14. In postnatal and adult mouse brain, SOBP is expressed in all neurons, with intense staining in the limbic system. Highest expression is in layer V cortical neurons, hippocampus, pyriform cortex, dorsomedial nucleus of thalamus, amygdala, and hypothalamus. Postnatal expression of SOBP in the limbic system corresponds to a time of active synaptogenesis. The family is also referred to as Jackson circler, JXC1. In seven affected siblings from a consanguineous Israeli Arab family with mental retardation, anterior maxillary protrusion, and strabismus, mutations were found in this protein.


Pssm-ID: 464609 [Multi-domain]  Cd Length: 325  Bit Score: 41.72  E-value: 1.10e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    561 PPTGDSEATPVPPTGDSET--APVPPTGDSGAPPVPPTGD---------------SGAPPV---------PPTGDSGAPP 614
Cdd:pfam15279 106 SPTSSNSSKPLISVASSSKllAPKPHEPPSLPPPPLPPKKgrrhrpglhpplgrpPGSPPMsmtprgllgKPQQHPPPSP 185
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    615 VPPTGDSGAPPVPPTgdSGAPPVPPTGDSGAPPVPPTgdSGAPPVPPtGDAGPPPVPPTGDS-GPPPVPPTGDSGAPPVT 693
Cdd:pfam15279 186 LPAFMEPSSMPPPFL--RPPPSIPQPNSPLSNPMLPG--IGPPPKPP-RNLGPPSNPMHRPPfSPHHPPPPPTPPGPPPG 260
                         170       180
                  ....*....|....*....|...
gi 8176554    694 PTGDSETAPVPPTGdsgaPPVPP 716
Cdd:pfam15279 261 LPPPPPRGFTPPFG----PPFPP 279
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
577-727 1.11e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.47  E-value: 1.11e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    577 SETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGA 656
Cdd:PHA03307  762 SLVPAKLAEALALLEPAEPQRGAGSSPPVRAEAAFRRPGRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGA 841
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 8176554    657 PPVPPTGDAGPPPVPptgDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPP 727
Cdd:PHA03307  842 AARPPPARSSESSKS---KPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAP 909
PHA03419 PHA03419
E4 protein; Provisional
574-685 1.15e-03

E4 protein; Provisional


Pssm-ID: 223079 [Multi-domain]  Cd Length: 200  Bit Score: 40.70  E-value: 1.15e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   574 TGDSETAPVPPTGDSGAPPVPPTGDSGaPPVPPTGDSGAPPVPPTG------DSGAPPVPPTGDSGAPPVP---PTGDSG 644
Cdd:PHA03419  47 TGYPFCPPTTPHPSSQPPPCPPSPGHP-PQTNDTHEKDLALQPPPGgkkkekKKKETEKPAQGGEKPDQGPeakGEGEGH 125
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*
gi 8176554   645 APPVPPTGDsgaPPVPPTG----DAGPPPVPptgdsGPPPVPPTG 685
Cdd:PHA03419 126 EPEDPPPED---TPPPPGGegevEGGPSPGP-----GPGPLDQEG 162
PRK07994 PRK07994
DNA polymerase III subunits gamma and tau; Validated
548-673 1.20e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236138 [Multi-domain]  Cd Length: 647  Bit Score: 42.16  E-value: 1.20e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   548 ALPTVTDQEATPVPPTGDSEATPVPPTgdSETAPVPPTGDSGAPPVPPTGDSGAPP--VPPTGDSGAPPVPPTGDSGAPP 625
Cdd:PRK07994 374 SAAPAASAQATAAPTAAVAPPQAPAVP--PPPASAPQQAPAVPLPETTSQLLAARQqlQRAQGATKAKKSEPAAASRARP 451
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*...
gi 8176554   626 VPPTGDSGApPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPT 673
Cdd:PRK07994 452 VNSALERLA-SVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPK 498
PRK10856 PRK10856
cytoskeleton protein RodZ;
555-656 1.23e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 41.55  E-value: 1.23e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   555 QEATPVPpTGDSEATPVPPTgdseTAPVPPTGDSGAP-PVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVP-PTGDS 632
Cdd:PRK10856 157 NSGQSVP-LDTSTTTDPATT----PAPAAPVDTTPTNsQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPaAPATP 231
                         90       100
                 ....*....|....*....|....
gi 8176554   633 GAPPVPPTGDsgAPPVPPTGDSGA 656
Cdd:PRK10856 232 DGAAPLPTDQ--AGVSTPAADPNA 253
PRK14954 PRK14954
DNA polymerase III subunits gamma and tau; Provisional
631-724 1.26e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184918 [Multi-domain]  Cd Length: 620  Bit Score: 42.24  E-value: 1.26e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   631 DSGAPPVPPTGdsgappvPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPvtptgdSETAPVPPTgdsg 710
Cdd:PRK14954 377 DGGVAPSPAGS-------PDVKKKAPEPDLPQPDRHPGPAKPEAPGARPAELPSPASAPTP------EQQPPVARS---- 439
                         90
                 ....*....|....
gi 8176554   711 aPPVPPTGDSEAAP 724
Cdd:PRK14954 440 -APLPPSPQASAPR 452
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
557-651 1.34e-03

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID: 18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 41.91  E-value: 1.34e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   557 ATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPvPPTGDSGAPPVPptGDSGAPPVPPTGDSGAPPvpptGDSGAPP 636
Cdd:NF041121  20 APPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPP-APEPAPLPAPYP--GSLAPPPPPPPGPAGAAP----GAALPVR 92
                         90
                 ....*....|....*
gi 8176554   637 VPptgdsgAPPVPPT 651
Cdd:NF041121  93 VP------APPALPN 101
Gag_spuma pfam03276
Spumavirus gag protein;
589-713 1.35e-03

Spumavirus gag protein;


Pssm-ID: 460872 [Multi-domain]  Cd Length: 614  Bit Score: 42.04  E-value: 1.35e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    589 GAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAP----------- 657
Cdd:pfam03276 182 AQGGIPPGASFSGLPSLPAIGGIHLPAIPGIHARAPPGNIARSLGDDIMPSLGDAGMPQPRFAFHPGNPfaeaeghpfae 261
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 8176554    658 ---------PVPPTGDAGPPPVPPTGDSGPPPVPPTgdSGAPPVTPTGDSETAPvPPTGDSGAPP 713
Cdd:pfam03276 262 aegerprdiPRAPRIDAPSAPAIPAIQPIAPPMIPP--IGAPIPIPHGASIPGE-HIRNPREEPI 323
Not5 COG5665
CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];
571-734 1.39e-03

CCR4-NOT transcriptional regulation complex, NOT5 subunit [Transcription];


Pssm-ID: 444384 [Multi-domain]  Cd Length: 874  Bit Score: 41.96  E-value: 1.39e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  571 VPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGappvpPTGDSGAPPVPPTGDSGA--PPVPPTGDSGAPPV 648
Cdd:COG5665 244 ATPPATPATEEKSSQQPKSQPTSPSGGTTPPSTNQLTTSNT-----PTSTAKAQPQPPTKKQPAkePPSDTASGNPSAPS 318
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  649 PPTgDSGAPPVPPTGDAGPPPVP-------PTGDSGPPPVPPT-GDSGAPPVTPT-----------GDSETAPVPPTGDS 709
Cdd:COG5665 319 VLI-NSDSPTSEDPATASVPTTEettafttPSSVPSTPAEKDTpATDLATPVSPTppetsvdkkvsPDSATSSTKSEKEG 397
                       170       180
                ....*....|....*....|....*...
gi 8176554  710 GAP--PVPP-TGDSEAAPVPPTDDSKEA 734
Cdd:COG5665 398 GTAssPMPPnIAIGAKDDVDATDPSQEA 425
PRK10856 PRK10856
cytoskeleton protein RodZ;
633-725 1.40e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 41.55  E-value: 1.40e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   633 GAPPVPPTGDSGAPPvpptGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVP--PTGDSG 710
Cdd:PRK10856 158 SGQSVPLDTSTTTDP----ATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPaaPATPDG 233
                         90
                 ....*....|....*
gi 8176554   711 APPVPPTGDSEAAPV 725
Cdd:PRK10856 234 AAPLPTDQAGVSTPA 248
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
553-639 1.59e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 42.18  E-value: 1.59e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    553 TDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPvPPTGDSGAPPVPPTGDSGAPPVPPTGDS 632
Cdd:PRK12270   34 ADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPP-KPAAAAAAAAAPAAPPAAAAAAAPAAAA 112

                  ....*..
gi 8176554    633 GAPPVPP 639
Cdd:PRK12270  113 VEDEVTP 119
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
621-696 1.77e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 41.65  E-value: 1.77e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   621 SGAPPVPPTGDSGAPPVPPtgdSGAPPvpptgdSGAPPVPPTGDAGP---PPVP-PTGDSGPPPVPPTGDSGAPPVTPTG 696
Cdd:PRK14965 379 RGAPAPPSAAWGAPTPAAP---AAPPP------AAAPPVPPAAPARPaaaRPAPaPAPPAAAAPPARSADPAAAASAGDR 449
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
548-634 1.79e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 41.72  E-value: 1.79e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   548 ALPTVTDQEATPVPPTGDSEATPVPPtgdsETAPVPP-TGDSGAPPVPPTgdsgaPPVPPTGDSGAPPVPPTGDSGAPPV 626
Cdd:PRK14950 379 VRPTPAPSTRPKAAAAANIPPKEPVR----ETATPPPvPPRPVAPPVPHT-----PESAPKLTRAAIPVDEKPKYTPPAP 449

                 ....*...
gi 8176554   627 PPTGDSGA 634
Cdd:PRK14950 450 PKEEEKAL 457
PHA03325 PHA03325
nuclear-egress-membrane-like protein; Provisional
557-695 1.83e-03

nuclear-egress-membrane-like protein; Provisional


Pssm-ID: 223044  Cd Length: 418  Bit Score: 41.41  E-value: 1.83e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   557 ATPVPPTGDSEATPVPPTGD-SETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAppVPPTGDSGAPPVPPTGDSGAP 635
Cdd:PHA03325 278 RRAGAMRAAAGETADLADDDgSEHSDPEPLPASLPPPPVRRPRVKHPEAGKEEPDGA--RNAEAKEPAQPATSTSSKGSS 355
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 8176554   636 PVPP--TGDSGappvppTGDSGAPPVPPTGDA--GPPPVPPTGDSGPPPVPPTGDSGAPPVTPT 695
Cdd:PHA03325 356 SAQNkdSGSTG------PGSSLAAASSFLEDDdfGSPPLDLTTSLRHMPSPSVTSAPEPPSIPL 413
KLF17_N cd21574
N-terminal domain of Kruppel-like factor 17; Kruppel-like factor 17 (KLF17), or Krueppel-like ...
601-729 1.85e-03

N-terminal domain of Kruppel-like factor 17; Kruppel-like factor 17 (KLF17), or Krueppel-like factor 17, is a protein that, in humans, is encoded by the KLF17 gene and acts as a tumor suppressor. It negatively regulates epithelial-mesenchymal transition and metastasis in breast cancer. KLF17 is thought to be the human ortholog of the mouse gene, zinc finger protein 393 (Zfp393), although it has diverged significantly. KLF17 can regulate gene transcription from CACCC-box elements. It belongs to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domain of KLF17.


Pssm-ID: 410567  Cd Length: 286  Bit Score: 40.83  E-value: 1.85e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  601 APPVPPTGDSGAPPVPPTGDSGAPPVPPTGdSGAPPVPPTGdsgaPPVPPTgdSGAPPVPPTGdagpppVPPTGDSGPPP 680
Cdd:cd21574 111 SPSQPGMMIFKGPQMMPLGEPNIPGVAMTF-SGNLRMPPSG----LPVSAS--SGIPMMSHIR------APTMPYSGPPT 177
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|.
gi 8176554  681 VPPTGDSGAPPV--TPTgdsetapVPPTgdsGAPPVPPtgdSEAAPVPPTD 729
Cdd:cd21574 178 VPSNRDSLTPKMllAPT-------MPST---EAQAVLP---SLAQMLPPRD 215
PRK12438 PRK12438
hypothetical protein; Provisional
610-668 1.91e-03

hypothetical protein; Provisional


Pssm-ID: 171499 [Multi-domain]  Cd Length: 991  Bit Score: 41.77  E-value: 1.91e-03
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....*....
gi 8176554   610 SGAPPVPPTGDSGAPpvPPTGdsGAPPVPPtgdsgaPPVPPTGDSGAPPVPPTGDAGPP 668
Cdd:PRK12438 899 TGRVATAPGGDAASA--PPPG--AGPPAPP------QAVPPPRTTQPPAAPPRGPDVPP 947
PHA03369 PHA03369
capsid maturational protease; Provisional
610-723 1.93e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 41.52  E-value: 1.93e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   610 SGAPPVPPTGDSGAPPVPPTGdsGAPPVPPTGDSGAPPVPPTGDSGAPPvPPTGDAGPPPVPPTGDSGPP-----PVPPT 684
Cdd:PHA03369 349 KTASLTAPSRVLAAAAKVAVI--AAPQTHTGPADRQRPQRPDGIPYSVP-ARSPMTAYPPVPQFCGDPGLvspynPQSPG 425
                         90       100       110
                 ....*....|....*....|....*....|....*....
gi 8176554   685 GDSGAPPVTPTGDSETAPVpPTGDSGAPPVPPTGDSEAA 723
Cdd:PHA03369 426 TSYGPEPVGPVPPQPTNPY-VMPISMANMVYPGHPQEHG 463
PHA02669 PHA02669
hypothetical protein; Provisional
596-737 1.94e-03

hypothetical protein; Provisional


Pssm-ID: 177451  Cd Length: 210  Bit Score: 40.39  E-value: 1.94e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   596 TGDSGAPPVPPTGDSGAPPVPPTGDS-GAPPVPPTGDSGAPPVPPTGDSGAPPVPPT-----GDSGAPPVPPTGDAGPPP 669
Cdd:PHA02669  55 TLDSTIGPCTISRDMGFGCSRWDSDTeDGDTVSTTSTSGGGTLSRVWVGGGPRFQHPmyenfCGNGTHRHSPSNDPGYHS 134
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 8176554   670 vPPTGDSGPPPVPPTgdsgaPPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMP 737
Cdd:PHA02669 135 -RETLCSGPPRQANI-----PPVTPYPDEVSVGVGSGPSTEHGHYEGDGPEQDLEPEPVQIEVTVQGP 196
motB PRK12799
flagellar motor protein MotB; Reviewed
563-687 2.14e-03

flagellar motor protein MotB; Reviewed


Pssm-ID: 183756 [Multi-domain]  Cd Length: 421  Bit Score: 41.24  E-value: 2.14e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   563 TGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPtGDSGAPPVPPTGD 642
Cdd:PRK12799 296 HGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLP-GTVALPAAEPVNM 374
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*.
gi 8176554   643 SGAPPVPPTGDSGAPPVPPTGDAGPP-PVPPTGDSGPPPVPPTGDS 687
Cdd:PRK12799 375 QPQPMSTTETQQSSTGNITSTANGPTtSLPAAPASNIPVSPTSRDA 420
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
548-727 2.24e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 41.37  E-value: 2.24e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   548 ALPTVTDQEATPVPPTGDSEatPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSgAPPVPPTG-------- 619
Cdd:PRK07003 472 ADSGSASAPASDAPPDAAFE--PAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEA-RPPTPAAAapaaragg 548
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   620 -----------------DSGAPPVPPTGDSGAPPVPPTGDSGAPPVP-PT----------GDSGAPPVPPTGDAGPPPvP 671
Cdd:PRK07003 549 aaaaldvlrnagmrvssDRGARAAAAAKPAAAPAAAPKPAAPRVAVQvPTpraraatgdaPPNGAARAEQAAESRGAP-P 627
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 8176554   672 PTGDSGPPPVPPTG--------DSGAPPVTPTGDSETAPVPPTGDSGAPPVpptgdsEAAPVPP 727
Cdd:PRK07003 628 PWEDIPPDDYVPLSadegfggpDDGFVPVFDSGPDDVRVAPKPADAPAPPV------DTRPLPP 685
PHA03201 PHA03201
uracil DNA glycosylase; Provisional
639-729 2.25e-03

uracil DNA glycosylase; Provisional


Pssm-ID: 165468  Cd Length: 318  Bit Score: 40.65  E-value: 2.25e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   639 PTGDSGAPPVPPtgdSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVtPTGDSETAPVPPTGDSGAPPVPPTG 718
Cdd:PHA03201   4 ARSRSPSPPRRP---SPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAA-PRRRPRGCPAGVTFSSSAPPRPPLG 79
                         90
                 ....*....|...
gi 8176554   719 --DSEAAPVPPTD 729
Cdd:PHA03201  80 ldDAPAATPPPLD 92
PHA03201 PHA03201
uracil DNA glycosylase; Provisional
558-642 2.25e-03

uracil DNA glycosylase; Provisional


Pssm-ID: 165468  Cd Length: 318  Bit Score: 40.65  E-value: 2.25e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   558 TPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvPPTGDSGAPPVPPTGDSGAP-P 636
Cdd:PHA03201   8 SPSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRRRPRGCPA-GVTFSSSAPPRPPLGLDDAPaA 86

                 ....*.
gi 8176554   637 VPPTGD 642
Cdd:PHA03201  87 TPPPLD 92
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
548-738 2.30e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.51  E-value: 2.30e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   548 ALPTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPtgdsgAPPVPPTGDSGAPPVP 627
Cdd:PRK07764 419 AAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPT-----AAPAPAPPAAPAPAAA 493
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   628 PtgDSGAPPVPPTGDSGA-------------------------------------------------------------- 645
Cdd:PRK07764 494 P--AAPAAPAAPAGADDAatlrerwpeilaavpkrsrktwaillpeatvlgvrgdtlvlgfstgglarrfaspgnaevlv 571
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   646 ------------------PPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTG 707
Cdd:PRK07764 572 talaeelggdwqveavvgPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPE 651
                        250       260       270
                 ....*....|....*....|....*....|...
gi 8176554   708 DSGAPPV--PPTGDSEAAPVPPTDDSKEAQMPA 738
Cdd:PRK07764 652 HHPKHVAvpDASDGGDGWPAKAGGAAPAAPPPA 684
PRK12438 PRK12438
hypothetical protein; Provisional
632-693 2.31e-03

hypothetical protein; Provisional


Pssm-ID: 171499 [Multi-domain]  Cd Length: 991  Bit Score: 41.39  E-value: 2.31e-03
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 8176554   632 SGAPPVPPTGDSGAPpvPPTGdsGAPPVPptgdagPPPVPPTGDSGPPPVPPTGdSGAPPVT 693
Cdd:PRK12438 899 TGRVATAPGGDAASA--PPPG--AGPPAP------PQAVPPPRTTQPPAAPPRG-PDVPPAA 949
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
620-702 2.31e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 41.41  E-value: 2.31e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    620 DSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPptGDSGPPPVPPTGDSGAPPVTPTGDSE 699
Cdd:PRK12270   35 DYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKP--AAAAAAAAAPAAPPAAAAAAAPAAAA 112

                  ...
gi 8176554    700 TAP 702
Cdd:PRK12270  113 VED 115
PHA03291 PHA03291
envelope glycoprotein I; Provisional
566-673 2.36e-03

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 41.09  E-value: 2.36e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   566 SEATPVPPTG--DSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPP---VPPTGDSGAPPVPPTGDSGAPPVPPT 640
Cdd:PHA03291 168 AEGTLAAPPLgeGSADGSCDPALPLSAPRLGPADVFVPATPRPTPRTTASPettPTPSTTTSPPSTTIPAPSTTIAAPQA 247
                         90       100       110
                 ....*....|....*....|....*....|...
gi 8176554   641 GDSGAPPVPPtgdsgAPPVPPTGDAGPPPVPPT 673
Cdd:PHA03291 248 GTTPEAEGTP-----APPTPGGGEAPPANATPA 275
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
567-726 2.42e-03

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 41.44  E-value: 2.42e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    567 EATPVPPTGDSE--TAPVPPTGDSGAPPVPP--TGDSGAPPVPPTGDSGAPpVPPTGDSGAPPVPPTgdsgappvPPTGD 642
Cdd:pfam05109 426 ESTTTSPTLNTTgfAAPNTTTGLPSSTHVPTnlTAPASTGPTVSTADVTSP-TPAGTTSGASPVTPS--------PSPRD 496
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    643 SG----APPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGApPVTPTGDSeTAPVP----PTGDSGAPPV 714
Cdd:pfam05109 497 NGteskAPDMTSPTSAVTTPTPNATSPTPAVTTPTPNATSPTLGKTSPTSA-VTTPTPNA-TSPTPavttPTPNATIPTL 574
                         170
                  ....*....|..
gi 8176554    715 PPTGDSEAAPVP 726
Cdd:pfam05109 575 GKTSPTSAVTTP 586
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
584-726 2.47e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 40.70  E-value: 2.47e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   584 PTGDSGAPPVPPTGDSGAPpvpptGDSGAPPVPPTGDSGAPPVPPTGDSGAPP--VPPTGDSGAPPVPPTGDSGAPPVPP 661
Cdd:PTZ00436 211 PSGKKSAKAAAPAKAAAAP-----AKAAAPPAKAAAAPAKAAAAPAKAAAPPAkaAAPPAKAAAPPAKAAAPPAKAAAPP 285
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 8176554   662 TGDAGPPPVPPTGDSgPPPVPPTGDSGAPPVTPTGDSETApVPPTGDSGAPPVPPTGDSEAAPVP 726
Cdd:PTZ00436 286 AKAAAPPAKAAAAPA-KAAAAPAKAAAAPAKAAAPPAKAA-APPAKAATPPAKAAAPPAKAAAAP 348
dnaA PRK14086
chromosomal replication initiator protein DnaA;
601-729 2.52e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 41.35  E-value: 2.52e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   601 APPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGD----AGPPPVPPTGDS 676
Cdd:PRK14086  85 AITVDPSAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAyqqrPEPGAWPRAADD 164
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 8176554   677 GPPPVPPTG-DSGAPPVTPTGDS-ETAPVPPTGDSGAPPVPPTGDSEAAPVPPTD 729
Cdd:PRK14086 165 YGWQQQRLGfPPRAPYASPASYApEQERDREPYDAGRPEYDQRRRDYDHPRPDWD 219
PHA03269 PHA03269
envelope glycoprotein C; Provisional
543-680 2.53e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 41.25  E-value: 2.53e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   543 TLTYLALPTVTDQEATPVPPTGDSEATPVP-----PTGDSETAPVPPTGDSGAPPVPPtgDSGAPPVPPTGDSGAPPVPP 617
Cdd:PHA03269  11 TIACINLIIANLNTNIPIPELHTSAATQKPdpapaPHQAASRAPDPAVAPTSAASRKP--DLAQAPTPAASEKFDPAPAP 88
                         90       100       110       120       130       140
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 8176554   618 TGDSGAPPVPPTGDSGAppVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPP 680
Cdd:PHA03269  89 HQAASRAPDPAVAPQLA--AAPKPDAAEAFTSAAQAHEAPADAGTSAASKKPDPAAHTQHSPP 149
FrsA COG1073
Fermentation-respiration switch esterase FrsA, DUF1100 family [Signal transduction mechanisms]; ...
127-243 2.67e-03

Fermentation-respiration switch esterase FrsA, DUF1100 family [Signal transduction mechanisms];


Pssm-ID: 440691 [Multi-domain]  Cd Length: 253  Bit Score: 40.28  E-value: 2.67e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  127 GAFLMGSGHGANfLNNYLYDGEEIATRGnVIVVTFNYRvgplGF-LSTGDanlPGNYGLRDQHMAIAWVKRNIAAFGGDP 205
Cdd:COG1073  38 PAVVVAHGNGGV-KEQRALYAQRLAELG-FNVLAFDYR----GYgESEGE---PREEGSPERRDARAAVDYLRTLPGVDP 108
                        90       100       110
                ....*....|....*....|....*....|....*...
gi 8176554  206 NNITLFGESAGGAsVSLQTLSPYNKglIRAAISQSGVA 243
Cdd:COG1073 109 ERIGLLGISLGGG-YALNAAATDPR--VKAVILDSPFT 143
PHA03169 PHA03169
hypothetical protein; Provisional
619-733 2.71e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 40.72  E-value: 2.71e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   619 GDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPpvPPTGDAGP-PPVPPTGDSGPPPVPPTG-------DSGAP 690
Cdd:PHA03169 100 VGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPP--SPPSHPGPhEPAPPESHNPSPNQQPSSflqpsheDSPEE 177
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*..
gi 8176554   691 PVTPTG----DSETAPVPPTGDSGAPPVPPtGDSEAAPVPPTDDSKE 733
Cdd:PHA03169 178 PEPPTSepepDSPGPPQSETPTSSPPPQSP-PDEPGEPQSPTPQQAP 223
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
640-734 2.97e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 41.10  E-value: 2.97e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   640 TGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVpptgdsGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGD 719
Cdd:PRK14948 512 SQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPP------PPPPTATQASSNAPAQIPADSSPPPPIPEEPT 585
                         90
                 ....*....|....*
gi 8176554   720 SEAAPVPPTDDSKEA 734
Cdd:PRK14948 586 PSPTKDSSPEEIDKA 600
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
550-625 3.16e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 41.03  E-value: 3.16e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 8176554    550 PTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPT-GDSGAPPVPPTGDSGAPPVPPTGDSGAPP 625
Cdd:PRK12270   39 GSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKpAAAAAAAAAPAAPPAAAAAAAPAAAAVED 115
PLN02983 PLN02983
biotin carboxyl carrier protein of acetyl-CoA carboxylase
567-664 3.30e-03

biotin carboxyl carrier protein of acetyl-CoA carboxylase


Pssm-ID: 215533 [Multi-domain]  Cd Length: 274  Bit Score: 40.21  E-value: 3.30e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   567 EATPVPPTgdseTAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTgdsgaPPVPPTGDSGAPPV--PPTGDSG 644
Cdd:PLN02983 139 EALPQPPP----PAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPAS-----PPPAKAPKSSHPPLksPMAGTFY 209
                         90       100
                 ....*....|....*....|
gi 8176554   645 APPVPptgdsGAPPVPPTGD 664
Cdd:PLN02983 210 RSPAP-----GEPPFVKVGD 224
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
588-676 3.35e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 40.88  E-value: 3.35e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   588 SGAPPVPPTGDSGAPPVPPTG--DSGAPPVpptgdsgaPPVPPTGDSGAPPVPptgdsgaPPVPPtgdsgAPPVPPTGDA 665
Cdd:PRK14965 379 RGAPAPPSAAWGAPTPAAPAAppPAAAPPV--------PPAAPARPAAARPAP-------APAPP-----AAAAPPARSA 438
                         90
                 ....*....|.
gi 8176554   666 GPPPVPPTGDS 676
Cdd:PRK14965 439 DPAAAASAGDR 449
PHA03369 PHA03369
capsid maturational protease; Provisional
557-651 3.39e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 40.75  E-value: 3.39e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   557 ATPVPPTGDSEATPVPPTGdSETAPvPPTGDSGAPPVPPTGDSGAPPvPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPP 636
Cdd:PHA03369 351 ASLTAPSRVLAAAAKVAVI-AAPQT-HTGPADRQRPQRPDGIPYSVP-ARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTS 427
                         90
                 ....*....|....*
gi 8176554   637 VPPTGDSGAPPVPPT 651
Cdd:PHA03369 428 YGPEPVGPVPPQPTN 442
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
561-684 3.62e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 40.68  E-value: 3.62e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   561 PPTGDSeatPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGappvPPTGDSGAPPVPPT 640
Cdd:PLN03209 449 PPTSPS---PTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLK----PPTSPSPAAPVGKV 521
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|
gi 8176554   641 GDSGAPPVPPTGDSGAPPVPPTGD--AGPPPVP----PTGDSGPPPVPPT 684
Cdd:PLN03209 522 APSSTNEVVKVGNSAPPTALADEQhhAQPKPRPlspyTMYEDLKPPTSPT 571
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
579-654 3.77e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 40.50  E-value: 3.77e-03
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 8176554   579 TAPVPPTGDSGAPPVPptgdsgAPPvPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPvPPTGDSGAPPVPPTGDS 654
Cdd:PRK14965 382 PAPPSAAWGAPTPAAP------AAP-PPAAAPPVPPAAPARPAAARPAPAPAPPAAAA-PPARSADPAAAASAGDR 449
GGN pfam15685
Gametogenetin; GGN is a family of proteins largely found in mammals. It reacts with POG in the ...
608-727 3.81e-03

Gametogenetin; GGN is a family of proteins largely found in mammals. It reacts with POG in the maturation of sperm and is expressed virtually only in the testis. It is found to be associated with the intracellular membrane, binds with GGNBP1 and may be involved in vesicular trafficking.


Pssm-ID: 434857 [Multi-domain]  Cd Length: 668  Bit Score: 40.52  E-value: 3.81e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    608 GDSGAPPVPPTGDSGAPPVPPTGDSGAPPvppTGDSGAPPVPPT---------GDSGAPPVPPTGDAGP----PPVPPTG 674
Cdd:pfam15685 324 GCSGGPAAPASHARALPPPAYTTFPGSKP---KFDWVSPPDGPErhfrfngagGGIGAPRRRAAALSGPwgspPPPPGKA 400
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 8176554    675 DSGPPPVPPTGDSGAPPV----TPTGDSETAPVPPTGDSGAP-PVPPTGDSEAAPVPP 727
Cdd:pfam15685 401 HPIPGPRRPAPALLAPPMfifpAPTNGEPVRPGPPAPQALLPrPPPPTPPATPPPVPP 458
PHA03369 PHA03369
capsid maturational protease; Provisional
588-684 3.88e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 40.75  E-value: 3.88e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   588 SGAPPVPPTGDSGAPPVPPtgdSGAPPVPPTGdsGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAP-------PVP 660
Cdd:PHA03369 349 KTASLTAPSRVLAAAAKVA---VIAAPQTHTG--PADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPglvspynPQS 423
                         90       100
                 ....*....|....*....|....
gi 8176554   661 PTGDAGPPPVPPTgdsgpPPVPPT 684
Cdd:PHA03369 424 PGTSYGPEPVGPV-----PPQPTN 442
rad23 TIGR00601
UV excision repair protein Rad23; All proteins in this family for which functions are known ...
640-720 3.89e-03

UV excision repair protein Rad23; All proteins in this family for which functions are known are components of a multiprotein complex used for targeting nucleotide excision repair to specific parts of the genome. In humans, Rad23 complexes with the XPC protein. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273167 [Multi-domain]  Cd Length: 378  Bit Score: 40.26  E-value: 3.89e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    640 TGDSGAPPVPPTGdsgappvPPTGDAGPPPVPPTGDSGPPPvpptgdSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGD 719
Cdd:TIGR00601  81 TGKVAPPAATPTS-------APTPTPSPPASPASGMSAAPA------SAVEEKSPSEESATATAPESPSTSVPSSGSDAA 147

                  .
gi 8176554    720 S 720
Cdd:TIGR00601 148 S 148
KLF9_13_N-like cd21975
Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like ...
576-691 3.98e-03

Kruppel-like factor (KLF) 9, KLF13, KLF14, KLF16, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved alpha-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF9, KLF13, KLF14, KLF16, and similar proteins.


Pssm-ID: 409240 [Multi-domain]  Cd Length: 163  Bit Score: 38.90  E-value: 3.98e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  576 DSETAPVPPTGDSGAP--PVPPTGDSGAPPVPPTGDSGAPPV-----------------PPTGDSGAPPVPPTGdsgaPP 636
Cdd:cd21975  24 DPEGAGLAAGLDVRATreVAKGPGPPGPAWKPDGADSPGLVTaaphllaanvlaplrgpSVEGSSLESGDADMG----SD 99
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|....*
gi 8176554  637 VPPTGDSGAPPVPPTGDSGAPPVPPTgdagPPPVPPTGDSGPPPVPPTGDSGAPP 691
Cdd:cd21975 100 SDVAPASGAAASTSPESSSDAASSPS----PLSLLHPGEAGLEPERPRPRVRRGV 150
PRK14965 PRK14965
DNA polymerase III subunits gamma and tau; Provisional
632-738 4.15e-03

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237871 [Multi-domain]  Cd Length: 576  Bit Score: 40.50  E-value: 4.15e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   632 SGAPPVPPTGDSGAPPVPPtgdsgAPPvPPTGDAGPPPVPPTGDSGPPPVP-PTGDSGAPPvtPTGDSETAPVPPTGDS- 709
Cdd:PRK14965 379 RGAPAPPSAAWGAPTPAAP-----AAP-PPAAAPPVPPAAPARPAAARPAPaPAPPAAAAP--PARSADPAAAASAGDRw 450
                         90       100       110
                 ....*....|....*....|....*....|....*..
gi 8176554   710 --------GAPPVPPTGDSEAAPVPPTDDSKEAQMPA 738
Cdd:PRK14965 451 rafvafvkGKKPALGASLEQGSPLGVSAGLLEIGFPE 487
SepH NF040712
septation protein SepH; Septation protein H (SepH) was firstly characterized in Streptomyces ...
623-741 4.22e-03

septation protein SepH; Septation protein H (SepH) was first characterized in Streptomyces venezuelae, and homologs were identified in Mycobacterium smegmatis. SepH contains an N-terminal DUF3071 domain and a conserved C-terminal region. It binds directly to cell division protein FtsZ to stimulate the assembly of FtsZ protofilaments.


Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  Bit Score: 40.14  E-value: 4.22e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   623 APPVPPTGDSGAPPVPPTGDSGAPPVPPTgdsgAPPVPPTGDAGPPPVPPTGD--SGPPPVPPTGDSGAPPVTPTGDSET 700
Cdd:NF040712 193 GRPLRPLATVPRLAREPADARPEEVEPAP----AAEGAPATDSDPAEAGTPDDlaSARRRRAGVEQPEDEPVGPGAAPAA 268
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|.
gi 8176554   701 APVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMPAVIR 741
Cdd:NF040712 269 EPDEATRDAGEPPAPGAAETPEAAEPPAPAPAAPAAPAAPE 309
PRK10856 PRK10856
cytoskeleton protein RodZ;
550-634 4.41e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 40.01  E-value: 4.41e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   550 PTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVP-PTGDSGAPPVPPTGDsgAPPVPP 628
Cdd:PRK10856 170 TDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPaAPATPDGAAPLPTDQ--AGVSTP 247

                 ....*.
gi 8176554   629 TGDSGA 634
Cdd:PRK10856 248 AADPNA 253
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
664-738 4.60e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 40.64  E-value: 4.60e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 8176554    664 DAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMPA 738
Cdd:PRK12270   35 DYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAAAAAAPA 109
DUF3729 pfam12526
Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins ...
601-682 4.62e-03

Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins in this family are typically between 145 and 1707 amino acids in length. The family is found in association with pfam01443, pfam01661, pfam05417, pfam01660, pfam00978. There is a single completely conserved residue L that may be functionally important.


Pssm-ID: 372164 [Multi-domain]  Cd Length: 115  Bit Score: 37.75  E-value: 4.62e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    601 APPVPPTGDSGAPPVPPTGDSGAPPvPPTGDSGAPPVPPTGDSGAP-PVPPTGDSGAPPVPPTgdagpPPVPPTGDSGPP 679
Cdd:pfam12526  30 SPPESAHPDPPPPVGDPRPPVVDTP-PPVSAVWVLPPPSEPAAPEPdLVPPVTGPAGPPSPLA-----PPAPAQKPPLPP 103

                  ...
gi 8176554    680 PVP 682
Cdd:pfam12526 104 PRP 106
MISS pfam15822
MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic ...
603-728 4.64e-03

MAPK-interacting and spindle-stabilising protein-like; MISS is a family of eukaryotic MAPK-interacting and spindle-stabilising protein-like proteins. MISS is rich in prolines and has four potential MAPK-phosphorylation sites, a MAPK-docking site, a PEST sequence (PEST motif) and a bipartite nuclear localization signal. The endogenous protein accumulates during mouse meiotic maturation and is found as discrete dots on the MII spindle. MISS is the first example of a physiological MAPK-substrate that is stabilized in MII that specifically regulates MII spindle integrity during the CSF arrest.


Pssm-ID: 318115 [Multi-domain]  Cd Length: 238  Bit Score: 39.58  E-value: 4.64e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    603 PVPPTGDSGAPPVPPTGDSGAPPVPPTGdsgappVPPTGDSGAPPVPPTGDSGAPPVPPTGDA--GPPPVPPTGDSGPPP 680
Cdd:pfam15822  21 PKPGQPPQGWPGSNPWNNPSAPPAVPSG------LPPSTAPSTVPFGPAPTGMYPSIPLTGPSpgPPAPFPPSGPSCPPP 94
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 8176554    681 VPPTGDSGAPPVTPTGDSETAPVP----------PTGDSGAPPVPPTGDSEAAPVPPT 728
Cdd:pfam15822  95 GGPYPAPTVPGPGPIGPYPTPNMPfpelprpygaPTDPAAAAPSGPWGSMSSGPWAPG 152
PHA03418 PHA03418
hypothetical E4 protein; Provisional
570-709 4.65e-03

hypothetical E4 protein; Provisional


Pssm-ID: 177646 [Multi-domain]  Cd Length: 230  Bit Score: 39.34  E-value: 4.65e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   570 PVPPTGDSETAPVPPTGDSGAPPVPPTgdSGAPPVPPTGDSgAPPVPPT----GDSGAPPVPPTGDSGAPPVPPTGDSGA 645
Cdd:PHA03418  34 PLLPAPHHPNPQEDPDKNPSPPPDPPL--TPRPPAQPNGHN-KPPVTKQpggeGTEEDHQAPLAADADDDPRPGKRSKAD 110
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   646 PPVPPTGDSGAPPV-------PPTGDAGPPP--------VPPTGDSGPPPVPPTGD---SGAPPVTPTGDSetaPVPPTG 707
Cdd:PHA03418 111 EHGPAPGRAALAPFkldldqdPLHGDPDPPPgatggqgeEPPEGGEESQPPLGEGEgavEGHPPPLPPAPE---PKPHNG 187

                 ..
gi 8176554   708 DS 709
Cdd:PHA03418 188 DA 189
PRK11633 PRK11633
cell division protein DedD; Provisional
560-681 4.77e-03

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 39.22  E-value: 4.77e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   560 VPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPvppTGDSGAPPVPPTgDSGAPPVPPtgdsgAPPVPPTGDSGAPPVPP 639
Cdd:PRK11633  44 VPKPGDRDEPDMMPAATQALPTQPPEGAAEAVR---AGDAAAPSLDPA-TVAPPNTPV-----EPEPAPVEPPKPKPVEK 114
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|..
gi 8176554   640 tgdsgaPPVPPTGDSGAPPVPPTgdagPPPVPPTGDSGPPPV 681
Cdd:PRK11633 115 ------PKPKPKPQQKVEAPPAP----KPEPKPVVEEKAAPT 146
PRK11901 PRK11901
hypothetical protein; Reviewed
596-732 5.06e-03

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 39.67  E-value: 5.06e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   596 TGDSGAPPVPPTGDSGAPPVPPTGDSGAPpvpptGDSGAPPVPPTGDSGAPPVPPTGD--------------------SG 655
Cdd:PRK11901  88 SSGNQSSPSAANNTSDGHDASGVKNTAPP-----QDISAPPISPTPTQAAPPQTPNGQqrielpgnisdalsqqqgqvNA 162
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 8176554   656 APPVPPTGDAGPPPVPPTgdsgpppVPPTGDSGAPPVTPTgdsetAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSK 732
Cdd:PRK11901 163 ASQNAQGNTSTLPTAPAT-------VAPSKGAKVPATAET-----HPTPPQKPATKKPAVNHHKTATVAVPPATSGK 227
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
587-727 5.08e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 39.93  E-value: 5.08e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   587 DSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPP--VPPTGDSGAPPvpptGDSGAPPVPPTGDSGAPPVPPTGD 664
Cdd:PTZ00436 192 DAAAAAAAKQKAAAKKAAAPSGKKSAKAAAPAKAAAAPAkaAAPPAKAAAAP----AKAAAAPAKAAAPPAKAAAPPAKA 267
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 8176554   665 AGPPP---VPPTGDSGPPP---VPPTGDSGAPPVTPTGDSETAPVP-----PTGDSGAPPVPPTGDSEAAPVPP 727
Cdd:PTZ00436 268 AAPPAkaaAPPAKAAAPPAkaaAPPAKAAAAPAKAAAAPAKAAAAPakaaaPPAKAAAPPAKAATPPAKAAAPP 341
PHA03418 PHA03418
hypothetical E4 protein; Provisional
592-726 5.13e-03

hypothetical E4 protein; Provisional


Pssm-ID: 177646 [Multi-domain]  Cd Length: 230  Bit Score: 39.34  E-value: 5.13e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   592 PVPPTGDSGAPPVPPTGDSGAPPVPPTgdSGAPPVPPTGDSGAPPVPPTGDSGappvppTGDSGAPPVPPTGDAGPPPVP 671
Cdd:PHA03418  34 PLLPAPHHPNPQEDPDKNPSPPPDPPL--TPRPPAQPNGHNKPPVTKQPGGEG------TEEDHQAPLAADADDDPRPGK 105
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 8176554   672 PTGDSGPPPVPPTGDSGAPPVTPTGDS-ETAPVPPTGDSGAP-PVPPTGDSEAAPVP 726
Cdd:PHA03418 106 RSKADEHGPAPGRAALAPFKLDLDQDPlHGDPDPPPGATGGQgEEPPEGGEESQPPL 162
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
601-717 5.54e-03

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. The KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in the kinetoplastid mitochondrial pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1, which are specific for ligation post-U-insertion. KREPA2 is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 39.85  E-value: 5.54e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  601 APPVPPTGDSGAPPVPPTGDsgappVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPP 680
Cdd:cd23959 133 AQVAPPKAEPQTAPVTPFGQ-----LPMFGQHPPPAKPLPAAAAAQQSSASPGEVASPFASGTVSASPFATATDTAPSSG 207
                        90       100       110
                ....*....|....*....|....*....|....*...
gi 8176554  681 VPPTGDSGAPPVTPTG-DSETAPVPPTGDSGAPPVPPT 717
Cdd:cd23959 208 APDGFPAEASAPSPFAaPASAASFPAAPVANGEAATPT 245
PHA03291 PHA03291
envelope glycoprotein I; Provisional
550-640 5.62e-03

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 39.94  E-value: 5.62e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   550 PTVTDQEATPVPPTgdseATPVPPtgdsETAPVPPTGDSGAPPVPPtGDSGAPPVPPTGDSGAPPVPPtgdsgAPPVPPT 629
Cdd:PHA03291 199 PADVFVPATPRPTP----RTTASP----ETTPTPSTTTSPPSTTIP-APSTTIAAPQAGTTPEAEGTP-----APPTPGG 264
                         90
                 ....*....|.
gi 8176554   630 GDSGAPPVPPT 640
Cdd:PHA03291 265 GEAPPANATPA 275
PTZ00429 PTZ00429
beta-adaptin; Provisional
554-670 5.85e-03

beta-adaptin; Provisional


Pssm-ID: 240415 [Multi-domain]  Cd Length: 746  Bit Score: 39.92  E-value: 5.85e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   554 DQEATPVPPT---GDSEATPVPPTGDSETAPVPPTGD-SGAPPVPPTGDSGAPPVPPTGD-SGAPPVPPTGDS------- 621
Cdd:PTZ00429 615 DDDAVELPSTpsmGTQDGSPAPSAAPAGYDIFEFAGDgTGAPHPVASGSNGAQHADPLGDlFSGLPSTVGASSpafqaas 694
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|.
gi 8176554   622 --GAPPVPPTgdsgapPVPPTGDSGAPpvPPTGDSGAPPVPPTgdAGPPPV 670
Cdd:PTZ00429 695 gsQAPASPPT------AASAIEDLFAN--GMGSGSQTVPLPIS--AAPQSA 735
PRK10856 PRK10856
cytoskeleton protein RodZ;
644-742 5.89e-03

cytoskeleton protein RodZ;


Pssm-ID: 236776 [Multi-domain]  Cd Length: 331  Bit Score: 39.62  E-value: 5.89e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   644 GAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPvpptgdsgAPPVTPTGDSE----TAPVPPTGDSGAPPVP-PTG 718
Cdd:PRK10856 158 SGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAV--------ATAPAPAVDPQqnavVAPSQANVDTAATPAPaAPA 229
                         90       100
                 ....*....|....*....|....*....
gi 8176554   719 DSEAAPVPPTDDSKEAQMPA-----VIRF 742
Cdd:PRK10856 230 TPDGAAPLPTDQAGVSTPAAdpnalVMNF 258
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
660-739 6.16e-03

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID:18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 39.60  E-value: 6.16e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   660 PPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSGAPPVPPTGDSEAAPVPPTDDSKEAQMPAV 739
Cdd:NF041121  17 RAAAPPSPEGPAPTAASQPATPPPPAAPPSPPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPGPAGAAPGAALPVRVPAP 96
DUF3729 pfam12526
Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins ...
623-693 6.36e-03

Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins in this family are typically between 145 and 1707 amino acids in length. The family is found in association with pfam01443, pfam01661, pfam05417, pfam01660, pfam00978. There is a single completely conserved residue L that may be functionally important.


Pssm-ID: 372164 [Multi-domain]  Cd Length: 115  Bit Score: 36.98  E-value: 6.36e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 8176554    623 APPVPPTGDSGAPPVPPTGDSGAPPvPPTGDSGAPPVPPTGDAGPP-PVPPTGDSGPPPVPPTGDSGAPPVT 693
Cdd:pfam12526  30 SPPESAHPDPPPPVGDPRPPVVDTP-PPVSAVWVLPPPSEPAAPEPdLVPPVTGPAGPPSPLAPPAPAQKPP 100
kgd PRK12270
multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine ...
564-650 6.40e-03

multifunctional oxoglutarate decarboxylase/oxoglutarate dehydrogenase thiamine pyrophosphate-binding subunit/dihydrolipoyllysine-residue succinyltransferase subunit;


Pssm-ID: 237030 [Multi-domain]  Cd Length: 1228  Bit Score: 39.87  E-value: 6.40e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    564 GDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPptgdsgAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDS 643
Cdd:PRK12270   39 GSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAP------AAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAA 112

                  ....*..
gi 8176554    644 GAPPVPP 650
Cdd:PRK12270  113 VEDEVTP 119
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
555-684 6.73e-03

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 39.38  E-value: 6.73e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    555 QEATPV-----PPTGDSEATPVPPTGDSETAPVPPTGDSGA-PPVPPTGDSGAPPVPPTGDSGAPPVPPtgDSGAPPVPP 628
Cdd:pfam13254 201 KEVTPVglmrsPAPGGHSKSPSVSGISADSSPTKEEPSEEAdTLSTDKEQSPAPTSASEPPPKTKELPK--DSEEPAAPS 278
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 8176554    629 TGDSGAPPVPPTGDSGAPPVP--PTGDSGAPPVPPTGDAGPPPVPPTGDSGPPPVPPT 684
Cdd:pfam13254 279 KSAEASTEKKEPDTESSPETSseKSAPSLLSPVSKASIDKPLSSPDRDPLSPKPKPQS 336
PHA03369 PHA03369
capsid maturational protease; Provisional
621-734 7.13e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 39.60  E-value: 7.13e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   621 SGAPPVPPTGDSGAPPVPPTGdsGAPPVPPTGDSGAPPVPPTGD--AGPPPVPPTgdsGPPPVPPTGDSGAPPVTPTGDS 698
Cdd:PHA03369 349 KTASLTAPSRVLAAAAKVAVI--AAPQTHTGPADRQRPQRPDGIpySVPARSPMT---AYPPVPQFCGDPGLVSPYNPQS 423
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|
gi 8176554   699 ETAPVPPTGDSGAPPVPPT----GDSEAAPVPPTDDSKEA 734
Cdd:PHA03369 424 PGTSYGPEPVGPVPPQPTNpyvmPISMANMVYPGHPQEHG 463
KLF10_11_N cd21974
N-terminal domain of Kruppel-like factor (KLF) 10, KLF11, and similar proteins; This subfamily ...
558-714 7.40e-03

N-terminal domain of Kruppel-like factor (KLF) 10, KLF11, and similar proteins; This subfamily is composed of Kruppel-like factor or Krueppel-like factor (KLF) 10, KLF11, and similar proteins. KLF10 was first identified in human osteoblasts and plays a role in mediating estrogen (E2) signaling in bone and skeletal homeostasis and a regulatory role in tumor formation and metastasis. KLF11 is involved in cell growth, apoptosis, cellular inflammation and differentiation, endometriosis, and cholesterol, prostaglandin, neurotransmitter, fat, and sugar metabolism. KLF9, KLF10, KLF11, KLF13, KLF14, and KLF16 share a conserved α-helical motif AA/VXXL that mediates their binding to Sin3A and their activities as transcriptional repressors. KLF10/11 belong to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domain of KLF10, KLF11, and similar proteins.


Pssm-ID: 409243 [Multi-domain]  Cd Length: 229  Bit Score: 38.76  E-value: 7.40e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  558 TPVPPTGDSEATPvPPTGDSETAPV----PPTgdsgAPPV--PPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPP--- 628
Cdd:cd21974  32 TPSSDSSDEDDAP-ESPKDFHSLSSlcmtPPY----SPPFfeASHSPSVASLHPPSAASSQPPPEPESSEPPAASPQraq 106
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  629 -------TGDSGAPP--------VPPTGDSGAPPVPPTGDSGAPPVPPTGDAGPPPVPPTGDSG---------PPPVPPT 684
Cdd:cd21974 107 atsvirhTADPVPVSpppvlcqmLPVSSSSGVIVAFLKAPQQPSPQPQKPALPQPQVVLVGGQVpqgpvmlvvPQPAVPQ 186
                       170       180       190
                ....*....|....*....|....*....|....*...
gi 8176554  685 GDSGAPPVTPTGdseT-----APVP---PTGDSGAPPV 714
Cdd:cd21974 187 PYVQPTVVTPGG---TkllpiAPAPgfiPSGQSSAPQP 221
PHA03291 PHA03291
envelope glycoprotein I; Provisional
600-723 7.74e-03

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 39.17  E-value: 7.74e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   600 GAPPVPPTGDSGAPPVPPTGDSGA--PPVPPTgdsgAPPVPPTGDSGAPPVPPTGDSGAPPvpptgDAGPPPVPPTGDSG 677
Cdd:PHA03291 162 GLAAFPAEGTLAAPPLGEGSADGScdPALPLS----APRLGPADVFVPATPRPTPRTTASP-----ETTPTPSTTTSPPS 232
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|....*..
gi 8176554   678 PPPVPPTGDSGAPPVTPTGDSETAPVPPTGDSG-APPVPPTGDSEAA 723
Cdd:PHA03291 233 TTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGeAPPANATPAPEAS 279
PRK12438 PRK12438
hypothetical protein; Provisional
599-652 7.79e-03

hypothetical protein; Provisional


Pssm-ID: 171499 [Multi-domain]  Cd Length: 991  Bit Score: 39.84  E-value: 7.79e-03
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....
gi 8176554   599 SGAPPVPPTGDSGAPpvPPTGdsGAPPVPPtgdsgaPPVPPTGDSGAPPVPPTG 652
Cdd:PRK12438 899 TGRVATAPGGDAASA--PPPG--AGPPAPP------QAVPPPRTTQPPAAPPRG 942
PRK12438 PRK12438
hypothetical protein; Provisional
588-641 7.79e-03

hypothetical protein; Provisional


Pssm-ID: 171499 [Multi-domain]  Cd Length: 991  Bit Score: 39.84  E-value: 7.79e-03
                         10        20        30        40        50
                 ....*....|....*....|....*....|....*....|....*....|....
gi 8176554   588 SGAPPVPPTGDSGAPpvPPTGdsGAPPVPPtgdsgaPPVPPTGDSGAPPVPPTG 641
Cdd:PRK12438 899 TGRVATAPGGDAASA--PPPG--AGPPAPP------QAVPPPRTTQPPAAPPRG 942
DUF3729 pfam12526
Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins ...
590-671 7.94e-03

Protein of unknown function (DUF3729); This family of proteins is found in viruses. Proteins in this family are typically between 145 and 1707 amino acids in length. The family is found in association with pfam01443, pfam01661, pfam05417, pfam01660, pfam00978. There is a single completely conserved residue L that may be functionally important.


Pssm-ID: 372164 [Multi-domain]  Cd Length: 115  Bit Score: 36.98  E-value: 7.94e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    590 APPVPPTGDSGAPPVPPTGDSGAPPvPPTGDSGAPPVPPTGDSGAP-PVPPTGDSGAPPVPPtgdsgAPPVPPTGDAGPP 668
Cdd:pfam12526  30 SPPESAHPDPPPPVGDPRPPVVDTP-PPVSAVWVLPPPSEPAAPEPdLVPPVTGPAGPPSPL-----APPAPAQKPPLPP 103

                  ...
gi 8176554    669 PVP 671
Cdd:pfam12526 104 PRP 106
NUT pfam12881
NUT protein; This family includes the NUT protein. The gene encoding NUT protein (Nuclear ...
568-713 7.95e-03

NUT protein; This family includes the NUT protein. The gene encoding NUT protein (Nuclear Testis protein) is found fused to the BRD3 or BRD4 genes in some aggressive types of carcinoma, due to chromosomal translocations. Proteins of the BRD family contain two bromodomains that bind transcriptionally active chromatin through associations with acetylated histones H3 and H4. Such proteins are crucial for the regulation of cell cycle progression. On the other hand, little is known about NUT protein. NUT is known to have a Nuclear Export Sequence (NES) as well as a Nuclear Localization Signal (NLS), both located towards the C-terminal end of the protein. A fused NUT-GFP protein showed either cytoplasmic or nuclear localization, suggesting that it is subject to nuclear/cytoplasmic shuttling. Consistent with this possibility, treatment with leptomycin B, an inhibitor of CRM1-dependent nuclear export, resulted in redistribution of NUT-GFP to the nucleus. Inspection of NUT revealed a C-terminal sequence similar to known nuclear export sequences (NES), which are often regulated by phosphorylation. This family carries some natively unstructured sequence.


Pssm-ID: 432850  Cd Length: 717  Bit Score: 39.46  E-value: 7.95e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    568 ATPVPPTGDSeTAPVPPTGDSGAPPVPPTGDSGAPPVPPT--------GDSGAPPVPP--------TGDSGAPPVP---- 627
Cdd:pfam12881  14 ALPFPPPTPG-PAHQPPWGQPPPPLMTASFPPGSPLVLSAlprtplvaGDGGSGPSGAgacnvivqVRTEGRPVQPpqtq 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    628 -------------PTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDAG-----PPPVPPTGDSGPPPVPPtGDSGA 689
Cdd:pfam12881  93 tfvltqaplnwsaPGALCGGAQCPAPLFLAAPAVETIVPAPAVGGTQAGEGGwipglPPPAPPPAAQLAPIVSP-VNAGP 171
                         170       180
                  ....*....|....*....|....
gi 8176554    690 PPVTPTGDSetapVPPTGDSGAPP 713
Cdd:pfam12881 172 QPHGASREG----SLATSQAKASP 191
SAV_2336_NTERM NF041121
SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 ...
542-640 7.98e-03

SAV_2336 family N-terminal domain; This HMM describes an N-terminal domain shared by SAV_2336 (BAC70047.1) whose C-terminal region suggests restriction enzyme activity (PMID:18456708), and with other proteins with unrelated C-terminal regions. A member protein was also identified in a kanamycin biosynthetic gene cluster (PMID:16766657), while N-terminal regions of two other member proteins were named Trypco1 in a bioinformatic study (PMID:32101166) of predicted bacterial conflict systems.


Pssm-ID: 469044 [Multi-domain]  Cd Length: 473  Bit Score: 39.60  E-value: 7.98e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   542 WTLTYLALPTVTDQEATPVPPTGDSEATPVPPTGDS--ETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGDsgaPPVPPTG 619
Cdd:NF041121  10 WLAAQMGRAAAPPSPEGPAPTAASQPATPPPPAAPPspPGDPPEPPAPEPAPLPAPYPGSLAPPPPPPPG---PAGAAPG 86
                         90       100
                 ....*....|....*....|.
gi 8176554   620 DSGAPPVPptgdsgAPPVPPT 640
Cdd:NF041121  87 AALPVRVP------APPALPN 101
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
586-730 8.08e-03

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 39.38  E-value: 8.08e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    586 GDSGAP-PVPPTGDSGAPPvpPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPtgD 664
Cdd:pfam13254 195 GRPNSFkEVTPVGLMRSPA--PGGHSKSPSVSGISADSSPTKEEPSEEADTLSTDKEQSPAPTSASEPPPKTKELPK--D 270
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 8176554    665 AGPPPVPPTGDSGPPPVPPTGDSGAP--PVTPTGDSETAPVPPTGDSgAPPVPPTGD---SEAAPVPPTDD 730
Cdd:pfam13254 271 SEEPAAPSKSAEASTEKKEPDTESSPetSSEKSAPSLLSPVSKASID-KPLSSPDRDplsPKPKPQSPPKD 340
DamX COG3266
Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell ...
556-691 8.35e-03

Cell division protein DamX, binds to the septal ring, contains C-terminal SPOR domain [Cell cycle control, cell division, chromosome partitioning];


Pssm-ID: 442497 [Multi-domain]  Cd Length: 455  Bit Score: 39.45  E-value: 8.35e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554  556 EATPVPPTGDSEATPVPPTGDSETAPvPPTGDSGAPPVPPTGDSGAPPVPPTGDsGAPPVPPTGDSGAP--PVPPTGDSG 633
Cdd:COG3266 233 AAGAAEVLTARLVLLLLIIGSALKAP-SQASSASAPATTSLGEQQEVSLPPAVA-AQPAAAAAAQPSAValPAAPAAAAA 310
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|....*....
gi 8176554  634 AP-PVPPTgdsgaPPVPPTgdsgAPPVPPTGDAGPPPVPPTGDSGPPPVPPTGDSGAPP 691
Cdd:COG3266 311 AAaPAEAA-----APQPTA----AKPVVTETAAPAAPAPEAAAAAAAPAAPAVAKKLAA 360
PTZ00436 PTZ00436
60S ribosomal protein L19-like protein; Provisional
598-738 8.72e-03

60S ribosomal protein L19-like protein; Provisional


Pssm-ID: 185616 [Multi-domain]  Cd Length: 357  Bit Score: 39.16  E-value: 8.72e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554   598 DSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGAPP---VPPTGDSGAP--PVPPTGDSGAPP----VPPTGDAGPP 668
Cdd:PTZ00436 192 DAAAAAAAKQKAAAKKAAAPSGKKSAKAAAPAKAAAAPAkaaAPPAKAAAAPakAAAAPAKAAAPPakaaAPPAKAAAPP 271
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 8176554   669 ---PVPPTGDSGPP---PVPPTGDSGAPPVTPTGDSETAPVPptGDSGAPPVPPTGDSEAAPVPPtddSKEAQMPA 738
Cdd:PTZ00436 272 akaAAPPAKAAAPPakaAAPPAKAAAAPAKAAAAPAKAAAAP--AKAAAPPAKAAAPPAKAATPP---AKAAAPPA 342
half-pint TIGR01645
poly-U binding splicing factor, half-pint family; The proteins represented by this model ...
539-656 8.94e-03

poly-U binding splicing factor, half-pint family; The proteins represented by this model contain three RNA recognition motifs (rrm: pfam00076) and have been characterized as poly-pyrimidine tract binding proteins associated with RNA splicing factors. PUF60 (GP|6176532), in complex with p54 and in the presence of U2AF, facilitates association of U2 snRNP with pre-mRNA.


Pssm-ID: 130706 [Multi-domain]  Cd Length: 612  Bit Score: 39.28  E-value: 8.94e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 8176554    539 LRYWTLTYLALPTVTDQEATPVPPTGDSEATPVPPTGDSETAPVPPTGDSGAPPVPPTGDSGAPPVPPTGdSGAPPVPPT 618
Cdd:TIGR01645 326 PRAQSPATPSSSLPTDIGNKAVVSSAKKEAEEVPPLPQAAPAVVKPGPMEIPTPVPPPGLAIPSLVAPPG-LVAPTEINP 404
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 8176554    619 GDSGAPPVPPTGDSGAPPVPPTGDSGAPPVPPTGDSGA 656
Cdd:TIGR01645 405 SFLASPRKKMKREKLPVTFGALDDTLAWKEPSKEDQTS 442
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
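
The hit list above can be checked against the E-value threshold programmatically. The following is a minimal sketch, not part of the CD-Search output: it assumes each hit carries a summary line in the "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ..." layout seen on this page, and that a hit is kept when its E-value is at or below the threshold. The function names parse_summary and passes_threshold are hypothetical helpers introduced for illustration.

```python
import re

# Regular expression for the per-hit summary line as it appears on this page.
# The "[Multi-domain]" tag is optional (the NUT hit, for example, lacks it).
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<kind>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Parse one summary line into a dict, or return None if it does not match."""
    match = SUMMARY.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match.group("pssm_id")),
        "multi_domain": match.group("kind") == "Multi-domain",
        "cd_length": int(match.group("cd_length")),
        "bit_score": float(match.group("bit_score")),
        "e_value": float(match.group("e_value")),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumed rule: keep a hit whose E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = ("Pssm-ID: 468676 [Multi-domain]  Cd Length: 346  "
            "Bit Score: 40.14  E-value: 4.22e-03")
    hit = parse_summary(line)
    print(hit, passes_threshold(hit))
```

Applied to every summary line on the page, this filter would be expected to keep all listed hits, since the page only reports hits that already met the 0.01 threshold.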

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.