|
Name |
Accession |
Description |
Interval |
E-value |
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
1-1355 |
0e+00 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 2635.54 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1 MSQEYTEDKEVTLTKLSSGRRLLEALLILIVLFAVWLMAALLSFNPSDPSWSQTAWHEPIHNLGGMPGAWLADTLFFIFG 80
Cdd:PRK10263 1 MSQEYTEDKEVTLTKLSSGRRLLEALLILIVLFAVWLMAALLSFNPSDPSWSQTAWHEPIHNLGGMPGAWLADTLFFIFG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 81 VMAYTIPVIIVGGCWFAWRHQSSDEYIDYFAVSLRIIGVLALILTSCGLAAINADDIWYFASGGVIGSLLSTTLQPLLHS 160
Cdd:PRK10263 81 VMAYTIPVIIVGGCWFAWRHQSSDEYIDYFAVSLRIIGVLALILTSCGLAAINADDIWYFASGGVIGSLLSTTLQPLLHS 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 161 SGGTIALLCVWAAGLTLFTGWSWVTIAEKLGGWILNILTFASNRTRRDDTWVDEDEYEDDEEYEDENHGKQHESRRARIL 240
Cdd:PRK10263 161 SGGTIALLCVWAAGLTLFTGWSWVTIAEKLGGWILNILTFASNRTRRDDTWVDEDEYEDDEEYEDENHGKQHESRRARIL 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 241 RGALARRKRLAEKFINPMGRQTDAALFSGKRMDDDEEIIYTARGVAADPDDVLFSGNRATQPEYDEYDPLLNGAPITEPV 320
Cdd:PRK10263 241 RGALARRKRLAEKFINPMGRQTDAALFSGKRMDDDEEITYTARGVAADPDDVLFSGNRATQPEYDEYDPLLNGAPITEPV 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 321 AVAAAATTATQSWAAPVEPVTQTPPVASVDVPPSQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQP 400
Cdd:PRK10263 321 AVAAAATTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQP 400
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 401 VQPQQPYYAPAAEQPAQQPYYAPAAEQPVQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQ 480
Cdd:PRK10263 401 VQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQ 480
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 481 PQSVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAA 560
Cdd:PRK10263 481 PQPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAA 560
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 561 VSPLASGVKKATLATGAAATVAAPVFSLANSGGPRPQVKEGIGPQLPRPKRIRVPTRRELASYGIKLPSQRAAEEKAREA 640
Cdd:PRK10263 561 VSPLASGVKKATLATGAAATVAAPVFSLANSGGPRPQVKEGIGPQLPRPKRIRVPTRRELASYGIKLPSQRAAEEKAREA 640
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 641 QRNQYDSGDQYNDDEIDAMQQDELARQFAQTQQQRYGEQYQHDVPVNAEDADAAAEAELARQFAQTQQQRYSGEQPAGAN 720
Cdd:PRK10263 641 QRNQYDSGDQYNDDEIDAMQQDELARQFAQTQQQRYGEQYQHDVPVNAEDADAAAEAELARQFAQTQQQRYSGEQPAGAN 720
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 721 PFSLDDFEFSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVPPQQQYQQPQQPVAPQPQYQQPQQQVAP 800
Cdd:PRK10263 721 PFSLDDFEFSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAP 800
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 801 QPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQQQDTLLHPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVE 880
Cdd:PRK10263 801 QPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVE 880
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 881 PVDTFALEQMARLVEARLADFRIKADVVNYSPGPVITRFELNLAPGVKAARISNLSRDLARSLSTVAVRVVEVIPGKPYV 960
Cdd:PRK10263 881 PVDTFALEQMARLVEARLADFRIKADVVNYSPGPVITRFELNLAPGVKAARISNLSRDLARSLSTVAVRVVEVIPGKPYV 960
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 961 GLELPNKKRQTVYLREVLDNAKFRDNPSPLTVVLGKDIAGEPVVADLAKMPHLLVAGTTGSGKSVGVNAMILSMLYKAQP 1040
Cdd:PRK10263 961 GLELPNKKRQTVYLREVLDNAKFRDNPSPLTVVLGKDIAGEPVVADLAKMPHLLVAGTTGSGKSVGVNAMILSMLYKAQP 1040
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1041 EDVRFIMIDPKMLELSVYEGIPHLLTEVVTDMKDAANALRWCVNEMERRYKLMSALGVRNLAGYNEKIAEADRMMRPIPD 1120
Cdd:PRK10263 1041 EDVRFIMIDPKMLELSVYEGIPHLLTEVVTDMKDAANALRWCVNEMERRYKLMSALGVRNLAGYNEKIAEADRMMRPIPD 1120
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1121 PYWKPGDSMDAQHPVLKKEPYIVVLVDEFADLMMTVGKKVEELIARLAQKARAAGIHLVLATQRPSVDVITGLIKANIPT 1200
Cdd:PRK10263 1121 PYWKPGDSMDAQHPVLKKEPYIVVLVDEFADLMMTVGKKVEELIARLAQKARAAGIHLVLATQRPSVDVITGLIKANIPT 1200
|
1210 1220 1230 1240 1250 1260 1270 1280
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1201 RIAFTVSSKIDSRTILDQAGAESLLGMGDMLYSGPNSTLPVRVHGAFVRDQEVHAVVQDWKARGRPQYVDGITSDSESEG 1280
Cdd:PRK10263 1201 RIAFTVSSKIDSRTILDQAGAESLLGMGDMLYSGPNSTLPVRVHGAFVRDQEVHAVVQDWKARGRPQYVDGITSDSESEG 1280
|
1290 1300 1310 1320 1330 1340 1350
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 659985388 1281 GAGGFDGAEELDPLFDQAVQFVTEKRKASISGVQRQFRIGYNRAARIIEQMEAQGIVSEQGHNGNREVLAPPPFD 1355
Cdd:PRK10263 1281 GAGGFDGAEELDPLFDQAVQFVTEKRKASISGVQRQFRIGYNRAARIIEQMEAQGIVSEQGHNGNREVLAPPPFD 1355
|
|
| FtsK |
COG1674 |
DNA segregation ATPase FtsK/SpoIIIE or related protein [Cell cycle control, cell division, ... |
864-1353 |
0e+00 |
|
DNA segregation ATPase FtsK/SpoIIIE or related protein [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 441280 [Multi-domain] Cd Length: 611 Bit Score: 922.78 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 864 TPLPSLDLLTPPPSEVEPVDTFALEQMARLVEARLADFRIKADVVNYSPGPVITRFELNLAPGVKAARISNLSRDLARSL 943
Cdd:COG1674 135 AVLPPLDLLDPPPPKKEKIDEEELEENARLLEETLEDFGVEAKVVGVTPGPVVTRYEIEPAPGVKVSKITNLADDIALAL 214
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 944 STVAVRVVEVIPGKPYVGLELPNKKRQTVYLREVLDNAKFRDNPSPLTVVLGKDIAGEPVVADLAKMPHLLVAGTTGSGK 1023
Cdd:COG1674 215 AAKSVRIEAPIPGKSAVGIEVPNKKRETVYLREVLESDEFQNSKSPLPIALGKDISGEPVVADLAKMPHLLIAGATGSGK 294
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1024 SVGVNAMILSMLYKAQPEDVRFIMIDPKMLELSVYEGIPHLLTEVVTDMKDAANALRWCVNEMERRYKLMSALGVRNLAG 1103
Cdd:COG1674 295 SVCINAMILSLLYKATPDEVRLILIDPKMVELSVYNGIPHLLTPVVTDPKKAANALKWAVREMERRYKLFAKAGVRNIAG 374
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1104 YNEKIAEADRMMRPipdpywkpgdsmdaqHPVLKKEPYIVVLVDEFADLMMTVGKKVEELIARLAQKARAAGIHLVLATQ 1183
Cdd:COG1674 375 YNEKVREAKAKGEE---------------EEGLEPLPYIVVIIDELADLMMVAGKEVEEAIARLAQKARAAGIHLILATQ 439
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1184 RPSVDVITGLIKANIPTRIAFTVSSKIDSRTILDQAGAESLLGMGDMLYSGPNSTLPVRVHGAFVRDQEVHAVVQDWKAR 1263
Cdd:COG1674 440 RPSVDVITGLIKANIPSRIAFAVSSKIDSRTILDQGGAEKLLGRGDMLFLPPGASKPIRVQGAFVSDEEVERVVDFLKSQ 519
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1264 GRPQYVDGITSDSESEGGAGgfdGAEELDPLFDQAVQFVTEKRKASISGVQRQFRIGYNRAARIIEQMEAQGIVSEQGHN 1343
Cdd:COG1674 520 GEPEYIEEILEEEEEEDEGG---DDDEDDELFDEAVELVVETQKASTSLLQRRLRIGYNRAARLIDQMEERGIVGPAEGS 596
|
490
....*....|
gi 659985388 1344 GNREVLAPPP 1353
Cdd:COG1674 597 KPREVLVSPE 606
|
|
| FtsK_SpoIIIE |
pfam01580 |
FtsK/SpoIIIE family; FtsK has extensive sequence similarity to wide variety of proteins from ... |
974-1186 |
1.10e-92 |
|
FtsK/SpoIIIE family; FtsK has extensive sequence similarity to wide variety of proteins from prokaryotes and plasmids, termed the FtsK/SpoIIIE family. This domain contains a putative ATP binding P-loop motif. It is found in the FtsK cell division protein from E. coli and the stage III sporulation protein E SpoIIIE, which has roles in regulation of prespore specific gene expression in B. subtilis. A mutation in FtsK causes a temperature sensitive block in cell division and it is involved in peptidoglycan synthesis or modification. The SpoIIIE protein is implicated in intercellular chromosomal DNA transfer.
Pssm-ID: 279863 [Multi-domain] Cd Length: 219 Bit Score: 297.75 E-value: 1.10e-92
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 974 LREVLDNAKFRDNPSPLTVVLGKDIAGEPVVADLAKMP-HLLVAGTTGSGKSVGVNAMILSMLYKAQPEDVRFIMIDPKM 1052
Cdd:pfam01580 1 LLEVLESKPFDTDYSRLPIALGKDISGNPEVFDLKKMPvHLLIAGATGSGKSVALNTLILSLAYMHTPEEVQLYCIDPKM 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1053 LELSVYEGIPHLLT-EVVTDMKDAANALRWCVNEMERRYKLMSALGVRNLAGYNEKIAEADRMMRPIPDPYWKPGDSMDA 1131
Cdd:pfam01580 81 GELSAYEDIPHLLSvPVATDPKRALRALEWLVDEMERRYALFRALGVRSIAGYNGEIAEDPLDGFGDVFLVIYGVHVMCT 160
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 659985388 1132 QHPVLKKEPYIVVLVDEFADLMMTVGKK----VEELIARLAQKARAAGIHLVLATQRPS 1186
Cdd:pfam01580 161 AGRWLEILPYLVVIVDERAELRLAAPKDsemrVEDAIVRLAQKGRAAGIHLLLATQRPS 219
|
|
| Ftsk_gamma |
smart00843 |
This domain directs oriented DNA translocation and forms a winged helix structure; Mutated ... |
1289-1351 |
8.24e-33 |
|
This domain directs oriented DNA translocation and forms a winged helix structure; Mutated proteins with substitutions in the FtsK gamma DNA-recognition helix are impaired in DNA binding.
Pssm-ID: 197911 [Multi-domain] Cd Length: 63 Bit Score: 121.37 E-value: 8.24e-33
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 659985388 1289 EELDPLFDQAVQFVTEKRKASISGVQRQFRIGYNRAARIIEQMEAQGIVSEQGHNGNREVLAP 1351
Cdd:smart00843 1 EEEDELYDEAVELVIETQKASTSLLQRRLRIGYNRAARLIDQLEEEGIVGPANGSKPREVLVT 63
|
|
| T7_EssCb_Firm |
TIGR03928 |
type VII secretion protein EssC, C-terminal domain; This model describes the C-terminal domain, ... |
995-1216 |
1.59e-22 |
|
type VII secretion protein EssC, C-terminal domain; This model describes the C-terminal domain, or longer subunit, of the Firmicutes type VII secretion protein EssC. This protein (homologous to EccC in Actinobacteria) and the WXG100 target proteins are the only homologous parts of type VII secretion between Firmicutes and Actinobacteria. [Protein fate, Protein and peptide secretion and trafficking]
Pssm-ID: 274860 [Multi-domain] Cd Length: 1296 Bit Score: 105.45 E-value: 1.59e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 995 GKDiagEPVVADL---AKMPHLLVAGTTGSGKSVGVNAMILSMLYKAQPEDVRFIMIDPK---MLELsvYEGIPHLLTeV 1068
Cdd:TIGR03928 454 GKD---DIVYLNLhekAHGPHGLVAGTTGSGKSEILQTYILSLAVNFHPHEVAFLLIDYKgggMANL--FKNLPHLLG-T 527
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1069 VTDMkDAANALRWCVN---EMERRYKLMSALGVRNLAGYNEKiaeadrmmrpipdpYW--KPGDSMdaqhpvlkkePYIV 1143
Cdd:TIGR03928 528 ITNL-DGAQSMRALASikaELKKRQRLFGENNVNHINQYQKL--------------YKqgKAKEPM----------PHLF 582
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 659985388 1144 VLVDEFADLmmtvgkKVE--ELIARLAQKA---RAAGIHLVLATQRPSvDVITGLIKANIPTRIAFTVSSKIDSRTIL 1216
Cdd:TIGR03928 583 LISDEFAEL------KSEqpEFMKELVSTArigRSLGVHLILATQKPS-GVVDDQIWSNSRFKLALKVQDASDSNEIL 653
|
|
| TrwB_TraG_TraD_VirD4 |
cd01127 |
TrwB/TraG/TraD/VirD4 family of bacterial conjugation proteins; The TraG/TraD/VirD4 family are ... |
1012-1204 |
5.86e-09 |
|
TrwB/TraG/TraD/VirD4 family of bacterial conjugation proteins; The TraG/TraD/VirD4 family are bacterial conjugation proteins involved in type IV secretion (T4S) systems, versatile bacterial secretion systems mediating transport of protein and/or DNA. They are present in gram-negative and gram-positive bacteria, as well as archaea. They form hexameric rings and belong to the RecA-like NTPases superfamily, which also includes the NTP binding domain of F1 and V1 H(+)ATPases, DnaB and related helicases as well as bacterial RecA and related eukaryotic and archaeal recombinases.
Pssm-ID: 410871 [Multi-domain] Cd Length: 144 Bit Score: 56.07 E-value: 5.86e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1012 HLLVAGTTGSGKSVGVNAMILSMLykaqPEDVRFIMIDPKMlELSVyegiphllteVVTDMKDAANALRwcvnemerryK 1091
Cdd:cd01127 1 NTLVLGTTGSGKTTSIVIPLLDQA----ARGGSVIITDPKG-ELFL----------VIPDRDDSFAALR----------A 55
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1092 LMSALgvrnLAGYNEKIAEADRMMRPIPdpywkpgdsmdaqhpvlkkepyIVVLVDEFADLMmtvgkKVEELIARLAqKA 1171
Cdd:cd01127 56 LFFNQ----LFRALTELASLSPGRLPRR----------------------VWFILDEFANLG-----RIPNLPNLLA-TG 103
|
170 180 190
....*....|....*....|....*....|....*....
gi 659985388 1172 RAAGIHLVLATQ------RPSVDVITGLIKANIPTRIAF 1204
Cdd:cd01127 104 RKRGISVVLILQslaqleAVYGKDGAQTILGNCNTKLYL 142
|
|
| PspC_subgroup_1 |
NF033838 |
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... |
346-450 |
9.24e-04 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 43.46 E-value: 9.24e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 346 VASVDVPPSQPTVAWQPVPGPQTGEPviAPAPEGYPQQSQYAQPAVQYNEPLqqpvqpqqpyYAPAAEQPaqqpyYAPAA 425
Cdd:NF033838 406 AAEEDKVKEKPAEQPQPAPAPQPEKP--APKPEKPAEQPKAEKPADQQAEED----------YARRSEEE-----YNRLT 468
|
90 100
....*....|....*....|....*
gi 659985388 426 EQPVQQPYYAPAPEQPVAGnaWQAE 450
Cdd:NF033838 469 QQQPPKTEKPAQPSTPKTG--WKQE 491
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
336-546 |
2.57e-03 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 42.06 E-value: 2.57e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 336 PVEPVTQTPPVASVDVPPSQPTVAWQPVPGPQTGEPVIAPAPEGypqqsqyAQPAVQyneplqqpvqpqqpyyaPAAEQP 415
Cdd:NF033839 284 PKEPGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEK-------PKPEVK-----------------PQPEKP 339
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 416 aqQPYYAPAAEQPvqQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQ-QPQSVEQQPVVEPEP 494
Cdd:NF033839 340 --KPEVKPQLETP--KPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKpQPEKPKPEVKPQPEK 415
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 659985388 495 VVEETKPARPPlyyfEEVEEKRAREReqlaawyqPIPEPVKEPEPIKSSLKA 546
Cdd:NF033839 416 PKPEVKPQPEK----PKPEVKPQPEK--------PKPEVKPQPEKPKPEVKP 455
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
1-1355 |
0e+00 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 2635.54 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1 MSQEYTEDKEVTLTKLSSGRRLLEALLILIVLFAVWLMAALLSFNPSDPSWSQTAWHEPIHNLGGMPGAWLADTLFFIFG 80
Cdd:PRK10263 1 MSQEYTEDKEVTLTKLSSGRRLLEALLILIVLFAVWLMAALLSFNPSDPSWSQTAWHEPIHNLGGMPGAWLADTLFFIFG 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 81 VMAYTIPVIIVGGCWFAWRHQSSDEYIDYFAVSLRIIGVLALILTSCGLAAINADDIWYFASGGVIGSLLSTTLQPLLHS 160
Cdd:PRK10263 81 VMAYTIPVIIVGGCWFAWRHQSSDEYIDYFAVSLRIIGVLALILTSCGLAAINADDIWYFASGGVIGSLLSTTLQPLLHS 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 161 SGGTIALLCVWAAGLTLFTGWSWVTIAEKLGGWILNILTFASNRTRRDDTWVDEDEYEDDEEYEDENHGKQHESRRARIL 240
Cdd:PRK10263 161 SGGTIALLCVWAAGLTLFTGWSWVTIAEKLGGWILNILTFASNRTRRDDTWVDEDEYEDDEEYEDENHGKQHESRRARIL 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 241 RGALARRKRLAEKFINPMGRQTDAALFSGKRMDDDEEIIYTARGVAADPDDVLFSGNRATQPEYDEYDPLLNGAPITEPV 320
Cdd:PRK10263 241 RGALARRKRLAEKFINPMGRQTDAALFSGKRMDDDEEITYTARGVAADPDDVLFSGNRATQPEYDEYDPLLNGAPITEPV 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 321 AVAAAATTATQSWAAPVEPVTQTPPVASVDVPPSQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQP 400
Cdd:PRK10263 321 AVAAAATTATQSWAAPVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQP 400
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 401 VQPQQPYYAPAAEQPAQQPYYAPAAEQPVQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQ 480
Cdd:PRK10263 401 VQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQ 480
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 481 PQSVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAA 560
Cdd:PRK10263 481 PQPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAA 560
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 561 VSPLASGVKKATLATGAAATVAAPVFSLANSGGPRPQVKEGIGPQLPRPKRIRVPTRRELASYGIKLPSQRAAEEKAREA 640
Cdd:PRK10263 561 VSPLASGVKKATLATGAAATVAAPVFSLANSGGPRPQVKEGIGPQLPRPKRIRVPTRRELASYGIKLPSQRAAEEKAREA 640
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 641 QRNQYDSGDQYNDDEIDAMQQDELARQFAQTQQQRYGEQYQHDVPVNAEDADAAAEAELARQFAQTQQQRYSGEQPAGAN 720
Cdd:PRK10263 641 QRNQYDSGDQYNDDEIDAMQQDELARQFAQTQQQRYGEQYQHDVPVNAEDADAAAEAELARQFAQTQQQRYSGEQPAGAN 720
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 721 PFSLDDFEFSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVPPQQQYQQPQQPVAPQPQYQQPQQQVAP 800
Cdd:PRK10263 721 PFSLDDFEFSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAP 800
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 801 QPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQQQDTLLHPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVE 880
Cdd:PRK10263 801 QPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPLLMRNGDSRPLHKPTTPLPSLDLLTPPPSEVE 880
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 881 PVDTFALEQMARLVEARLADFRIKADVVNYSPGPVITRFELNLAPGVKAARISNLSRDLARSLSTVAVRVVEVIPGKPYV 960
Cdd:PRK10263 881 PVDTFALEQMARLVEARLADFRIKADVVNYSPGPVITRFELNLAPGVKAARISNLSRDLARSLSTVAVRVVEVIPGKPYV 960
|
970 980 990 1000 1010 1020 1030 1040
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 961 GLELPNKKRQTVYLREVLDNAKFRDNPSPLTVVLGKDIAGEPVVADLAKMPHLLVAGTTGSGKSVGVNAMILSMLYKAQP 1040
Cdd:PRK10263 961 GLELPNKKRQTVYLREVLDNAKFRDNPSPLTVVLGKDIAGEPVVADLAKMPHLLVAGTTGSGKSVGVNAMILSMLYKAQP 1040
|
1050 1060 1070 1080 1090 1100 1110 1120
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1041 EDVRFIMIDPKMLELSVYEGIPHLLTEVVTDMKDAANALRWCVNEMERRYKLMSALGVRNLAGYNEKIAEADRMMRPIPD 1120
Cdd:PRK10263 1041 EDVRFIMIDPKMLELSVYEGIPHLLTEVVTDMKDAANALRWCVNEMERRYKLMSALGVRNLAGYNEKIAEADRMMRPIPD 1120
|
1130 1140 1150 1160 1170 1180 1190 1200
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1121 PYWKPGDSMDAQHPVLKKEPYIVVLVDEFADLMMTVGKKVEELIARLAQKARAAGIHLVLATQRPSVDVITGLIKANIPT 1200
Cdd:PRK10263 1121 PYWKPGDSMDAQHPVLKKEPYIVVLVDEFADLMMTVGKKVEELIARLAQKARAAGIHLVLATQRPSVDVITGLIKANIPT 1200
|
1210 1220 1230 1240 1250 1260 1270 1280
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1201 RIAFTVSSKIDSRTILDQAGAESLLGMGDMLYSGPNSTLPVRVHGAFVRDQEVHAVVQDWKARGRPQYVDGITSDSESEG 1280
Cdd:PRK10263 1201 RIAFTVSSKIDSRTILDQAGAESLLGMGDMLYSGPNSTLPVRVHGAFVRDQEVHAVVQDWKARGRPQYVDGITSDSESEG 1280
|
1290 1300 1310 1320 1330 1340 1350
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 659985388 1281 GAGGFDGAEELDPLFDQAVQFVTEKRKASISGVQRQFRIGYNRAARIIEQMEAQGIVSEQGHNGNREVLAPPPFD 1355
Cdd:PRK10263 1281 GAGGFDGAEELDPLFDQAVQFVTEKRKASISGVQRQFRIGYNRAARIIEQMEAQGIVSEQGHNGNREVLAPPPFD 1355
|
|
| FtsK |
COG1674 |
DNA segregation ATPase FtsK/SpoIIIE or related protein [Cell cycle control, cell division, ... |
864-1353 |
0e+00 |
|
DNA segregation ATPase FtsK/SpoIIIE or related protein [Cell cycle control, cell division, chromosome partitioning];
Pssm-ID: 441280 [Multi-domain] Cd Length: 611 Bit Score: 922.78 E-value: 0e+00
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 864 TPLPSLDLLTPPPSEVEPVDTFALEQMARLVEARLADFRIKADVVNYSPGPVITRFELNLAPGVKAARISNLSRDLARSL 943
Cdd:COG1674 135 AVLPPLDLLDPPPPKKEKIDEEELEENARLLEETLEDFGVEAKVVGVTPGPVVTRYEIEPAPGVKVSKITNLADDIALAL 214
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 944 STVAVRVVEVIPGKPYVGLELPNKKRQTVYLREVLDNAKFRDNPSPLTVVLGKDIAGEPVVADLAKMPHLLVAGTTGSGK 1023
Cdd:COG1674 215 AAKSVRIEAPIPGKSAVGIEVPNKKRETVYLREVLESDEFQNSKSPLPIALGKDISGEPVVADLAKMPHLLIAGATGSGK 294
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1024 SVGVNAMILSMLYKAQPEDVRFIMIDPKMLELSVYEGIPHLLTEVVTDMKDAANALRWCVNEMERRYKLMSALGVRNLAG 1103
Cdd:COG1674 295 SVCINAMILSLLYKATPDEVRLILIDPKMVELSVYNGIPHLLTPVVTDPKKAANALKWAVREMERRYKLFAKAGVRNIAG 374
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1104 YNEKIAEADRMMRPipdpywkpgdsmdaqHPVLKKEPYIVVLVDEFADLMMTVGKKVEELIARLAQKARAAGIHLVLATQ 1183
Cdd:COG1674 375 YNEKVREAKAKGEE---------------EEGLEPLPYIVVIIDELADLMMVAGKEVEEAIARLAQKARAAGIHLILATQ 439
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1184 RPSVDVITGLIKANIPTRIAFTVSSKIDSRTILDQAGAESLLGMGDMLYSGPNSTLPVRVHGAFVRDQEVHAVVQDWKAR 1263
Cdd:COG1674 440 RPSVDVITGLIKANIPSRIAFAVSSKIDSRTILDQGGAEKLLGRGDMLFLPPGASKPIRVQGAFVSDEEVERVVDFLKSQ 519
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1264 GRPQYVDGITSDSESEGGAGgfdGAEELDPLFDQAVQFVTEKRKASISGVQRQFRIGYNRAARIIEQMEAQGIVSEQGHN 1343
Cdd:COG1674 520 GEPEYIEEILEEEEEEDEGG---DDDEDDELFDEAVELVVETQKASTSLLQRRLRIGYNRAARLIDQMEERGIVGPAEGS 596
|
490
....*....|
gi 659985388 1344 GNREVLAPPP 1353
Cdd:COG1674 597 KPREVLVSPE 606
|
|
| FtsK_SpoIIIE |
pfam01580 |
FtsK/SpoIIIE family; FtsK has extensive sequence similarity to wide variety of proteins from ... |
974-1186 |
1.10e-92 |
|
FtsK/SpoIIIE family; FtsK has extensive sequence similarity to wide variety of proteins from prokaryotes and plasmids, termed the FtsK/SpoIIIE family. This domain contains a putative ATP binding P-loop motif. It is found in the FtsK cell division protein from E. coli and the stage III sporulation protein E SpoIIIE, which has roles in regulation of prespore specific gene expression in B. subtilis. A mutation in FtsK causes a temperature sensitive block in cell division and it is involved in peptidoglycan synthesis or modification. The SpoIIIE protein is implicated in intercellular chromosomal DNA transfer.
Pssm-ID: 279863 [Multi-domain] Cd Length: 219 Bit Score: 297.75 E-value: 1.10e-92
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 974 LREVLDNAKFRDNPSPLTVVLGKDIAGEPVVADLAKMP-HLLVAGTTGSGKSVGVNAMILSMLYKAQPEDVRFIMIDPKM 1052
Cdd:pfam01580 1 LLEVLESKPFDTDYSRLPIALGKDISGNPEVFDLKKMPvHLLIAGATGSGKSVALNTLILSLAYMHTPEEVQLYCIDPKM 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1053 LELSVYEGIPHLLT-EVVTDMKDAANALRWCVNEMERRYKLMSALGVRNLAGYNEKIAEADRMMRPIPDPYWKPGDSMDA 1131
Cdd:pfam01580 81 GELSAYEDIPHLLSvPVATDPKRALRALEWLVDEMERRYALFRALGVRSIAGYNGEIAEDPLDGFGDVFLVIYGVHVMCT 160
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 659985388 1132 QHPVLKKEPYIVVLVDEFADLMMTVGKK----VEELIARLAQKARAAGIHLVLATQRPS 1186
Cdd:pfam01580 161 AGRWLEILPYLVVIVDERAELRLAAPKDsemrVEDAIVRLAQKGRAAGIHLLLATQRPS 219
|
|
| FtsK_4TM |
pfam13491 |
4TM region of DNA translocase FtsK/SpoIIIE; 4TM_FtsK is the integral membrane domain of the ... |
20-191 |
8.38e-57 |
|
4TM region of DNA translocase FtsK/SpoIIIE; 4TM_FtsK is the integral membrane domain of the FtsK DNA tranlocases. During sporulation in Bacillus subtilis, the SpoIIIE protein is believed to form a translocation pore at the leading edge of the nearly closed septum. The E. coli FtsK protein is homologous to SpoIIIE, and can free chromosomes trapped in vegetative septa.
Pssm-ID: 463896 Cd Length: 171 Bit Score: 193.96 E-value: 8.38e-57
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 20 RRLLEALLILIVLFAVWLMAALLSFNPSDPSWSQT-AWHEPIHNLGGMPGAWLADTLFFIFGVMAYTIPVIIVGGCWFAW 98
Cdd:pfam13491 2 RLLRELLGLALLLLGLFLLLALVSYSPADPSWSTSgSGAAPVHNWGGRFGAWLADLLLQLFGYSAWLLPVALLYWGWRLF 81
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 99 RHQSSDEYidyfavSLRIIGVLALILTSCGLAAIN---ADDIWYFASGGVIGSLLSTTLQPLLHSSGGTIALLCVWAAGL 175
Cdd:pfam13491 82 RRRSLERR------WLRLLGFLLLLLASSALFALRlpsLEFGLPGGAGGVIGRLLANALVTLLGFTGATLLLLALLAIGL 155
|
170
....*....|....*.
gi 659985388 176 TLFTGWSWVTIAEKLG 191
Cdd:pfam13491 156 SLVTGFSWLALAERLG 171
|
|
| FtsK_alpha |
pfam17854 |
FtsK alpha domain; FtsK is a DNA translocase that coordinates chromosome segregation and cell ... |
866-966 |
1.24e-40 |
|
FtsK alpha domain; FtsK is a DNA translocase that coordinates chromosome segregation and cell division in bacteria. In addition to its role as activator of XerCD site-specific recombination, FtsK can translocate double-stranded DNA (dsDNA) rapidly and directionally and reverse direction. FtsK can be split into three domains called alpha (this entry), beta and gamma. The alpha and beta domains contain the core ATPase machinery of the DNA translocase.
Pssm-ID: 436096 [Multi-domain] Cd Length: 101 Bit Score: 144.98 E-value: 1.24e-40
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 866 LPSLDLLTPPPSEVEPVDTFALEQMARLVEARLADFRIKADVVNYSPGPVITRFELNLAPGVKAARISNLSRDLARSLST 945
Cdd:pfam17854 1 LPPLDLLEPPPTSSQKVDEEELEETAEKLEETLAEFGIEAKVVGVTPGPVVTLYELEPAPGVKVSKITNLSDDLALALSA 80
|
90 100
....*....|....*....|.
gi 659985388 946 VAVRVVEVIPGKPYVGLELPN 966
Cdd:pfam17854 81 PSIRIVAPIPGKSTIGIEVPN 101
|
|
| FtsK_gamma |
pfam09397 |
Ftsk gamma domain; This domain directs oriented DNA translocation and forms a winged helix ... |
1289-1351 |
3.34e-33 |
|
Ftsk gamma domain; This domain directs oriented DNA translocation and forms a winged helix structure. Mutated proteins with substitutions in the FtsK gamma DNA-recognition helix are impaired in DNA binding.
Pssm-ID: 462786 [Multi-domain] Cd Length: 63 Bit Score: 122.48 E-value: 3.34e-33
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 659985388 1289 EELDPLFDQAVQFVTEKRKASISGVQRQFRIGYNRAARIIEQMEAQGIVSEQGHNGNREVLAP 1351
Cdd:pfam09397 1 EEEDELYEEAVEIVIETGKASTSLLQRRLRIGYNRAARLIDQLEEEGIVGPADGSKPREVLIT 63
|
|
| Ftsk_gamma |
smart00843 |
This domain directs oriented DNA translocation and forms a winged helix structure; Mutated ... |
1289-1351 |
8.24e-33 |
|
This domain directs oriented DNA translocation and forms a winged helix structure; Mutated proteins with substitutions in the FtsK gamma DNA-recognition helix are impaired in DNA binding.
Pssm-ID: 197911 [Multi-domain] Cd Length: 63 Bit Score: 121.37 E-value: 8.24e-33
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 659985388 1289 EELDPLFDQAVQFVTEKRKASISGVQRQFRIGYNRAARIIEQMEAQGIVSEQGHNGNREVLAP 1351
Cdd:smart00843 1 EEEDELYDEAVELVIETQKASTSLLQRRLRIGYNRAARLIDQLEEEGIVGPANGSKPREVLVT 63
|
|
| T7_EssCb_Firm |
TIGR03928 |
type VII secretion protein EssC, C-terminal domain; This model describes the C-terminal domain, ... |
995-1216 |
1.59e-22 |
|
type VII secretion protein EssC, C-terminal domain; This model describes the C-terminal domain, or longer subunit, of the Firmicutes type VII secretion protein EssC. This protein (homologous to EccC in Actinobacteria) and the WXG100 target proteins are the only homologous parts of type VII secretion between Firmicutes and Actinobacteria. [Protein fate, Protein and peptide secretion and trafficking]
Pssm-ID: 274860 [Multi-domain] Cd Length: 1296 Bit Score: 105.45 E-value: 1.59e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 995 GKDiagEPVVADL---AKMPHLLVAGTTGSGKSVGVNAMILSMLYKAQPEDVRFIMIDPK---MLELsvYEGIPHLLTeV 1068
Cdd:TIGR03928 454 GKD---DIVYLNLhekAHGPHGLVAGTTGSGKSEILQTYILSLAVNFHPHEVAFLLIDYKgggMANL--FKNLPHLLG-T 527
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1069 VTDMkDAANALRWCVN---EMERRYKLMSALGVRNLAGYNEKiaeadrmmrpipdpYW--KPGDSMdaqhpvlkkePYIV 1143
Cdd:TIGR03928 528 ITNL-DGAQSMRALASikaELKKRQRLFGENNVNHINQYQKL--------------YKqgKAKEPM----------PHLF 582
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 659985388 1144 VLVDEFADLmmtvgkKVE--ELIARLAQKA---RAAGIHLVLATQRPSvDVITGLIKANIPTRIAFTVSSKIDSRTIL 1216
Cdd:TIGR03928 583 LISDEFAEL------KSEqpEFMKELVSTArigRSLGVHLILATQKPS-GVVDDQIWSNSRFKLALKVQDASDSNEIL 653
|
|
| T7SS_EccC_a |
TIGR03924 |
type VII secretion protein EccCa; This model represents the N-terminal domain or EccCa subunit ... |
984-1248 |
4.27e-18 |
|
type VII secretion protein EccCa; This model represents the N-terminal domain or EccCa subunit of the type VII secretion protein EccC as found in the Actinobacteria. Type VII secretion is defined more broadly as including secretion systems for ESAT-6-like proteins in the Firmicutes as well as in the Actinobacteria, but this family does not show close homologs in the Firmicutes. [Protein fate, Protein and peptide secretion and trafficking]
Pssm-ID: 274858 [Multi-domain] Cd Length: 658 Bit Score: 90.03 E-value: 4.27e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 984 RDNPSPLTVVLGKDIAGEPVVADLAK-----M-PHLLVAGTTGSGKSVGVNAMILSMLYKAQPEDVRFIMIDPK----ML 1053
Cdd:TIGR03924 403 RPGRDRLRVPIGVGDDGEPVELDLKEsaeggMgPHGLCIGATGSGKSELLRTLVLGLAATHSPEQLNLVLVDFKggatFL 482
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1054 ELsvyEGIPHlLTEVVTDMKDAAN-------ALRwcvNEMERRYKLMSALG-VRNLAGYNEKIAEadrmmrpipdpywkp 1125
Cdd:TIGR03924 483 GL---EGLPH-VSAVITNLADEAPlvdrmqdALA---GEMNRRQELLRAAGnFANVAEYEKARAA--------------- 540
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1126 GDSMDaqhPVlkkePYIVVLVDEFADLMmtvGKKVE--ELIARLAQKARAAGIHLVLATQRPSVDVITGLiKANIPTRIA 1203
Cdd:TIGR03924 541 GADLP---PL----PALFVVVDEFSELL---SQHPDfaDLFVAIGRLGRSLGVHLLLASQRLDEGRLRGL-ESHLSYRIG 609
|
250 260 270 280
....*....|....*....|....*....|....*....|....*...
gi 659985388 1204 FTVSSKIDSRTILDQAGA---ESLLGMGdMLYSGPNStlPVRVHGAFV 1248
Cdd:TIGR03924 610 LKTFSASESRAVLGVPDAyhlPSTPGAG-YLKVDTAE--PVRFRAAYV 654
|
|
| T7_EssCb_Firm |
TIGR03928 |
type VII secretion protein EssC, C-terminal domain; This model describes the C-terminal domain, ... |
964-1216 |
4.83e-15 |
|
type VII secretion protein EssC, C-terminal domain; This model describes the C-terminal domain, or longer subunit, of the Firmicutes type VII secretion protein EssC. This protein (homologous to EccC in Actinobacteria) and the WXG100 target proteins are the only homologous parts of type VII secretion between Firmicutes and Actinobacteria. [Protein fate, Protein and peptide secretion and trafficking]
Pssm-ID: 274860 [Multi-domain] Cd Length: 1296 Bit Score: 80.80 E-value: 4.83e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 964 LPNKkrqtVYLREVLDNAKFRD---NPSPLTVVLG-KDI----AGEPVVADLAKMPHLLVAGTTGSGKSVGVNAMILSML 1035
Cdd:TIGR03928 760 LEEK----IYLDDLHAVEFDKLwskPKEPLQATIGlLDDpelqSQEPLTLDLSKDGHLAIFGSPGYGKSTFLQTLIMSLA 835
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1036 YKAQPEDVRFIMIDPKMLELSVYEGIPHlLTEVVT--DMKDAANALRWCVNEMERRYKLMSALGVRNLAGYNEKIAEadr 1113
Cdd:TIGR03928 836 RQHSPEQLHFYLFDFGTNGLLPLKKLPH-VADYFTldEEEKIEKLIRRIKKEIDRRKKLFSEYGVASISMYNKASGE--- 911
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1114 mmrpipdpywkpgdsmdaqhpvlkKEPYIVVLVDEFadlmMTVGK-----KVEELIARLAQKARAAGIHLVL-ATQRPSV 1187
Cdd:TIGR03928 912 ------------------------KLPQIVIIIDNY----DAVKEepfyeDFEELLIQLAREGASLGIYLVMtAGRQNAV 963
|
250 260
....*....|....*....|....*....
gi 659985388 1188 DVItglIKANIPTRIAFTVSSKIDSRTIL 1216
Cdd:TIGR03928 964 RMP---LMNNIKTKIALYLIDKSEYRSIV 989
|
|
| HerA |
COG0433 |
Archaeal DNA helicase HerA or a related bacterial ATPase, contains HAS-barrel and ATPase ... |
991-1244 |
6.09e-10 |
|
Archaeal DNA helicase HerA or a related bacterial ATPase, contains HAS-barrel and ATPase domains [Replication, recombination and repair];
Pssm-ID: 440202 [Multi-domain] Cd Length: 388 Bit Score: 62.70 E-value: 6.09e-10
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 991 TVVLGKDI-AGEPVVADLAKM--PHLLVAGTTGSGKSVGVnAMILSMLYKAqpeDVRFIMIDPK------------MLEL 1055
Cdd:COG0433 25 GILIGKLLsPGVPVYLDLDKLlnRHILILGATGSGKSNTL-QVLLEELSRA---GVPVLVFDPHgeysglaepgaeRADV 100
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1056 SVY---------------------------------EGIPHLLTEVVTDMKDAANA------LRWCVNEME---RRYKLM 1093
Cdd:COG0433 101 GVFdpgagrplpinpwdlfataselgplllsrldlnDTQRGVLREALRLADDKGLLlldlkdLIALLEEGEelgEEYGNV 180
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1094 SALGVRNLAGYNEKIAEADRMMRP----IPDPYWKPG---------------------------DSMDAQHPVLKKEPYI 1142
Cdd:COG0433 181 SAASAGALLRRLESLESADGLFGEpgldLEDLLRTDGrvtvidlsglpeelqstfvlwllrelfEARPEVGDADDRKLPL 260
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1143 VVLVDEFADLMMTVGKKVEELIARLAQKARAAGIHLVLATQRPSvDVITGlIKANIPTRIAFTVSSKIDSRTI------L 1216
Cdd:COG0433 261 VLVIDEAHLLAPAAPSALLEILERIAREGRKFGVGLILATQRPS-DIDED-VLSQLGTQIILRLFNPRDQKAVkaaaetL 338
|
330 340 350
....*....|....*....|....*....|
gi 659985388 1217 DQAGAESL--LGMGDMLYSGPNSTLPVRVH 1244
Cdd:COG0433 339 SEDLLERLpsLGTGEALVLGEGIPLPVLVK 368
|
|
| TrwB_TraG_TraD_VirD4 |
cd01127 |
TrwB/TraG/TraD/VirD4 family of bacterial conjugation proteins; The TraG/TraD/VirD4 family are ... |
1012-1204 |
5.86e-09 |
|
TrwB/TraG/TraD/VirD4 family of bacterial conjugation proteins; The TraG/TraD/VirD4 family are bacterial conjugation proteins involved in type IV secretion (T4S) systems, versatile bacterial secretion systems mediating transport of protein and/or DNA. They are present in gram-negative and gram-positive bacteria, as well as archaea. They form hexameric rings and belong to the RecA-like NTPases superfamily, which also includes the NTP binding domain of F1 and V1 H(+)ATPases, DnaB and related helicases as well as bacterial RecA and related eukaryotic and archaeal recombinases.
Pssm-ID: 410871 [Multi-domain] Cd Length: 144 Bit Score: 56.07 E-value: 5.86e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1012 HLLVAGTTGSGKSVGVNAMILSMLykaqPEDVRFIMIDPKMlELSVyegiphllteVVTDMKDAANALRwcvnemerryK 1091
Cdd:cd01127 1 NTLVLGTTGSGKTTSIVIPLLDQA----ARGGSVIITDPKG-ELFL----------VIPDRDDSFAALR----------A 55
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1092 LMSALgvrnLAGYNEKIAEADRMMRPIPdpywkpgdsmdaqhpvlkkepyIVVLVDEFADLMmtvgkKVEELIARLAqKA 1171
Cdd:cd01127 56 LFFNQ----LFRALTELASLSPGRLPRR----------------------VWFILDEFANLG-----RIPNLPNLLA-TG 103
|
170 180 190
....*....|....*....|....*....|....*....
gi 659985388 1172 RAAGIHLVLATQ------RPSVDVITGLIKANIPTRIAF 1204
Cdd:cd01127 104 RKRGISVVLILQslaqleAVYGKDGAQTILGNCNTKLYL 142
|
|
| T7SS_EccC_b |
TIGR03925 |
type VII secretion protein EccCb; This model represents the C-terminal domain or EccCb subunit ... |
894-1074 |
2.81e-07 |
|
type VII secretion protein EccCb; This model represents the C-terminal domain or EccCb subunit of the type VII secretion protein EccC as found in the Actinobacteria. Type VII secretion is defined more broadly as including secretion systems for ESAT-6-like proteins in the Firmicutes as well as in the Actinobacteria, but this family does not show close homologs in the Firmicutes. [Protein fate, Protein and peptide secretion and trafficking]
Pssm-ID: 274859 [Multi-domain] Cd Length: 566 Bit Score: 55.00 E-value: 2.81e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 894 VEARLAD-------FRIKADVVNYSPGPVITRFELNLAPGVKAARISNLSRDLARSLSTVAVRVVEVIPGKPYVGLeLPn 966
Cdd:TIGR03925 248 IELRLGDpmdseidRRAAARVPAGRPGRGLTPDGLHMLIALPRLDGIASVDDLGTRGLVAVIRDVWGGPPAPPVRL-LP- 325
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 967 kkrQTVYLREVLDnakfRDNPSPLTVVLGKDIAG-EPVVADLAKMPHLLVAGTTGSGKSVGVNAMILSMLYKAQPEDVRF 1045
Cdd:TIGR03925 326 ---ARLPLSALPA----GGGAPRLRVPLGLGESDlAPVYVDFAESPHLLIFGDSESGKTTLLRTIARGIVRRYSPDQARL 398
|
170 180 190
....*....|....*....|....*....|....*..
gi 659985388 1046 IMIDPK--MLE------LSVYEGIPHLLTEVVTDMKD 1074
Cdd:TIGR03925 399 VVVDYRrtLLGavpedyLAGYAATSAALTELIAALAA 435
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
308-476 |
4.39e-07 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 54.94 E-value: 4.39e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 308 DPLLNGAPITEPVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPPSQPTVAWQpVPG------PQTGEPVIAPAPEGYP 381
Cdd:PHA03247 2803 DPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSV-APGgdvrrrPPSRSPAAKPAAPARP 2881
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 382 QQSQYAQPAVQYN-EPLQQPVQPQQPYYAPAAEQPAQQPYYAPAAEQPV---QQPYYAPAPEQPVAGNAWQAEEQQSTFA 457
Cdd:PHA03247 2882 PVRRLARPAVSRStESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQpppPPPPRPQPPLAPTTDPAGAGEPSGAVPQ 2961
|
170
....*....|....*....
gi 659985388 458 PQSTYQTEQTYQQPAAQEP 476
Cdd:PHA03247 2962 PWLGALVPGRVAVPRFRVP 2980
|
|
| VirB4 |
COG3451 |
Type IV secretory pathway, VirB4 component [Intracellular trafficking, secretion, and ... |
992-1050 |
4.32e-06 |
|
Type IV secretory pathway, VirB4 component [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442674 [Multi-domain] Cd Length: 546 Bit Score: 51.10 E-value: 4.32e-06
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 659985388 992 VVLGKDIAGEPVVADLAKM---PHLLVAGTTGSGKSVGVNAMILSMLYKaqpeDVRFIMIDP 1050
Cdd:COG3451 183 IYLLNTRSGTPVFFDFHDGldnGNTLILGPSGSGKSFLLKLLLLQLLRY----GARIVIFDP 240
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
336-570 |
1.61e-05 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 49.68 E-value: 1.61e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 336 PVEPVTQTPPVASVDvpPSQPTVAWQPVP------GPQTGEPVIAPAPEGYPQ------QSQYAQPA-VQYNEPLQQPVQ 402
Cdd:PHA03378 606 PEPPTTQSHIPETSA--PRQWPMPLRPIPmrplrmQPITFNVLVFPTPHQPPQveitpyKPTWTQIGhIPYQPSPTGANT 683
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 403 PQQPYYAPAAEQPAqqpyyaPAAEQPVQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQ 482
Cdd:PHA03378 684 MLPIQWAPGTMQPP------PRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPP 757
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 483 SVEQQPVVEPEPVVEETKPARPPlyyfeeveEKRAREREQLAAWYQPIPEPVKEPEPIKSSLKAPSVAAVPPVEAAAAVs 562
Cdd:PHA03378 758 AAAPGRARPPAAAPGAPTPQPPP--------QAPPAPQQRPRGAPTPQPPPQAGPTSMQLMPRAAPGQQGPTKQILRQL- 828
|
....*...
gi 659985388 563 pLASGVKK 570
Cdd:PHA03378 829 -LTGGVKR 835
|
|
| T7SS_EccC_b |
TIGR03925 |
type VII secretion protein EccCb; This model represents the C-terminal domain or EccCb subunit ... |
1012-1213 |
1.78e-05 |
|
type VII secretion protein EccCb; This model represents the C-terminal domain or EccCb subunit of the type VII secretion protein EccC as found in the Actinobacteria. Type VII secretion is defined more broadly as including secretion systems for ESAT-6-like proteins in the Firmicutes as well as in the Actinobacteria, but this family does not show close homologs in the Firmicutes. [Protein fate, Protein and peptide secretion and trafficking]
Pssm-ID: 274859 [Multi-domain] Cd Length: 566 Bit Score: 49.22 E-value: 1.78e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1012 HLLVAGTTGSGKSVGVNAMILSMLYKAQPEDVRFIMIDPKMLELSVYEGIPHLlTEVVTDMkdAANALRWCVNEME---- 1087
Cdd:TIGR03925 81 HVAIVGAPQSGKSTALRTLILALALTHTPEEVQFYCLDFGGGGLASLADLPHV-GGVAGRL--DPERVRRTVAEVEgllr 157
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 1088 RRYKLMSALGVRNLAGYnekiaeadRMMRPIPDPYWKP-GDsmdaqhpvlkkepyiVVLV--------DEFADLmmtvgk 1158
Cdd:TIGR03925 158 RRERLFRTHGIDSMAQY--------RARRAAGRLPEDPfGD---------------VFLVidgwgtlrQDFEDL------ 208
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|....*....
gi 659985388 1159 kvEELIARLAQKARAAGIHLVLATQRPSvdVITGLIKANIPTRIAF----TVSSKIDSR 1213
Cdd:TIGR03925 209 --EDKVTDLAARGLAYGVHVVLTASRWS--EIRPALRDLIGTRIELrlgdPMDSEIDRR 263
|
|
| PRK07994 |
PRK07994 |
DNA polymerase III subunits gamma and tau; Validated |
334-476 |
3.12e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236138 [Multi-domain] Cd Length: 647 Bit Score: 48.32 E-value: 3.12e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 334 AAPVEPVTQTPPVASVDVPPSQPTVA----WQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYA 409
Cdd:PRK07994 366 PEPEVPPQSAAPAASAQATAAPTAAVappqAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSEPAA 445
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 659985388 410 PAAEQPAQQPYYAPAAEQPVQQpyyaPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEP 476
Cdd:PRK07994 446 ASRARPVNSALERLASVRPAPS----ALEKAPAKKEAYRWKATNPVEVKKEPVATPKALKKALEHEK 508
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
331-522 |
5.03e-05 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 47.67 E-value: 5.03e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 331 QSWAAPVEPVTQTPPVASVDVPPSQPTVAWQPVPGPQTGEP---VIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPY 407
Cdd:PRK07764 596 GGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPaeaSAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGG 675
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 408 YAPAAEQPAQQPYYAPA-AEQPVQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQSVEQ 486
Cdd:PRK07764 676 AAPAAPPPAPAPAAPAApAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAP 755
|
170 180 190
....*....|....*....|....*....|....*.
gi 659985388 487 QPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQ 522
Cdd:PRK07764 756 AQPPPPPAPAPAAAPAAAPPPSPPSEEEEMAEDDAP 791
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
309-539 |
2.42e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 45.70 E-value: 2.42e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 309 PLLNGAPItePVAVAAAATTATQSWAAPVEPVTQTPPV--ASVDVPPSQPTVAWQPVPGPQTGePVIAPAPEGYPQQSQY 386
Cdd:PHA03247 2714 ALVSATPL--PPGPAAARQASPALPAAPAPPAVPAGPAtpGGPARPARPPTTAGPPAPAPPAA-PAAGPPRRLTRPAVAS 2790
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 387 AQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAAEQPVQQPYYAPAPEQPVAgnawQAEEQQSTFAPQSTYQTEQ 466
Cdd:PHA03247 2791 LSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPP----PSLPLGGSVAPGGDVRRRP 2866
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 659985388 467 TYQQPAAQEPLYQQPQSVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEP 539
Cdd:PHA03247 2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
334-455 |
2.91e-04 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 45.36 E-value: 2.91e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 334 AAPVEPVTQTPPVASVD-----VPPSQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQP-AVQYNEPLQQPVQPQQPY 407
Cdd:PRK07764 375 LARLERLERRLGVAGGAgapaaAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPaPAPAPPSPAGNAPAGGAP 454
|
90 100 110 120
....*....|....*....|....*....|....*....|....*....
gi 659985388 408 YAPAAEQPAQQPYYAP-AAEQPVQQPyyAPAPEQPVAGNAWQAEEQQST 455
Cdd:PRK07764 455 SPPPAAAPSAQPAPAPaAAPEPTAAP--APAPPAAPAPAAAPAAPAAPA 501
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
331-505 |
3.25e-04 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 45.14 E-value: 3.25e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 331 QSWAAPVEPVTQT------------PPVASVDVPPSQPTVAWQPVPGP-------------QTGEPVIAP--APEGYPQQ 383
Cdd:pfam03154 222 QSTAAPHTLIQQTptlhpqrlpsphPPLQPMTQPPPPSQVSPQPLPQPslhgqmppmphslQTGPSHMQHpvPPQPFPLT 301
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 384 SQYAQPAVQYNEPLQQPVQPQQPYYAPAAE---QPAQQPYYAPAAEQPVQQPYYAPAPEQPVAG-NAWQAEEQQSTFAPQ 459
Cdd:pfam03154 302 PQSSQSQVPPGPSPAAPGQSQQRIHTPPSQsqlQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQlPNPQSHKHPPHLSGP 381
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|.
gi 659985388 460 STYQTEQTYQQPAAQEPL-----YQQPQSVEQQPVVEPEPVVEETKPARPP 505
Cdd:pfam03154 382 SPFQMNSNLPPPPALKPLsslstHHPPSAHPPPLQLMPQSQQLPPPPAQPP 432
|
|
| PAT1 |
pfam09770 |
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ... |
352-482 |
4.96e-04 |
|
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.
Pssm-ID: 401645 [Multi-domain] Cd Length: 846 Bit Score: 44.64 E-value: 4.96e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 352 PPSQPTVAWQPVPGPQTGEPVI------------APAPEGYPQQSQYAQPAvqyneplqqpvqpqqPYYAPAAEQPAQQP 419
Cdd:pfam09770 173 PAPAPQPAAQPASLPAPSRKMMsleeveaamraqAKKPAQQPAPAPAQPPA---------------APPAQQAQQQQQFP 237
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 420 YYAPAAEQPVQQPYYAP--------------------APEQPVAGNAWQAEEQQstfAPQSTYQTEQTYQQP----AAQE 475
Cdd:pfam09770 238 PQIQQQQQPQQQPQQPQqhpgqghpvtilqrpqspqpDPAQPSIQPQAQQFHQQ---PPPVPVQPTQILQNPnrlsAARV 314
|
....*..
gi 659985388 476 PLYQQPQ 482
Cdd:pfam09770 315 GYPQNPQ 321
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
300-441 |
5.32e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 44.67 E-value: 5.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 300 TQPEYDEYDPLLNGAPITEPVavaaaattatqSWAapvePVT-QTPPVASVDVPPSQ--------PTVAWQPVPGPQtGE 370
Cdd:PHA03378 667 TQIGHIPYQPSPTGANTMLPI-----------QWA----PGTmQPPPRAPTPMRPPAappgraqrPAAATGRARPPA-AA 730
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 659985388 371 PVIAPAPEGYPQQSQYAQPA-VQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAaeqPVQQPYYAPAPEQP 441
Cdd:PHA03378 731 PGRARPPAAAPGRARPPAAApGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPA---PQQRPRGAPTPQPP 799
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
314-543 |
6.15e-04 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 44.54 E-value: 6.15e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 314 APITEPVAVAAAATTATQSWAAPVEPVTQTPPVASVDVPPSQPTVAWQ-----PVPGPQTGEPVIAPAPEGYPQQSQYAQ 388
Cdd:PHA03247 2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAaspagPLPPPTSAQPTAPPPPPGPPPPSLPLG 2853
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 389 PAVQYNEPLQQPVQPQQPYYAPAAeqPAQQPYYAPAAEQPVQQPYYAPAPEQPvagnawqAEEQQSTFAPQSTYQTEQTY 468
Cdd:PHA03247 2854 GSVAPGGDVRRRPPSRSPAAKPAA--PARPPVRRLARPAVSRSTESFALPPDQ-------PERPPQPQAPPPPQPQPQPP 2924
|
170 180 190 200 210 220 230
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 659985388 469 QQPAAQEPLYQQPQSVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRAREREQLAAWYQPIPEPVKEPEPIKSS 543
Cdd:PHA03247 2925 PPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGH 2999
|
|
| PspC_subgroup_1 |
NF033838 |
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ... |
346-450 |
9.24e-04 |
|
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.
Pssm-ID: 468201 [Multi-domain] Cd Length: 684 Bit Score: 43.46 E-value: 9.24e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 346 VASVDVPPSQPTVAWQPVPGPQTGEPviAPAPEGYPQQSQYAQPAVQYNEPLqqpvqpqqpyYAPAAEQPaqqpyYAPAA 425
Cdd:NF033838 406 AAEEDKVKEKPAEQPQPAPAPQPEKP--APKPEKPAEQPKAEKPADQQAEED----------YARRSEEE-----YNRLT 468
|
90 100
....*....|....*....|....*
gi 659985388 426 EQPVQQPYYAPAPEQPVAGnaWQAE 450
Cdd:NF033838 469 QQQPPKTEKPAQPSTPKTG--WKQE 491
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
334-446 |
1.83e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 42.67 E-value: 1.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 334 AAPVEPVTQTPPVASVDVPPSQPTVAWQPVPGPQTG-EPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAP-A 411
Cdd:PRK07764 393 APAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAApQPAPAPAPAPAPPSPAGNAPAGGAPSPPPAAAPSAQPAPAPaA 472
|
90 100 110
....*....|....*....|....*....|....*
gi 659985388 412 AEQPAQQPYYAPAAEQPVQQPyyAPAPEQPVAGNA 446
Cdd:PRK07764 473 APEPTAAPAPAPPAAPAPAAA--PAAPAAPAAPAG 505
|
|
| PspC_subgroup_2 |
NF033839 |
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ... |
336-546 |
2.57e-03 |
|
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.
Pssm-ID: 468202 [Multi-domain] Cd Length: 557 Bit Score: 42.06 E-value: 2.57e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 336 PVEPVTQTPPVASVDVPPSQPTVAWQPVPGPQTGEPVIAPAPEGypqqsqyAQPAVQyneplqqpvqpqqpyyaPAAEQP 415
Cdd:NF033839 284 PKEPGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEK-------PKPEVK-----------------PQPEKP 339
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 416 aqQPYYAPAAEQPvqQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQ-QPQSVEQQPVVEPEP 494
Cdd:NF033839 340 --KPEVKPQLETP--KPEVKPQPEKPKPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKpQPEKPKPEVKPQPEK 415
|
170 180 190 200 210
....*....|....*....|....*....|....*....|....*....|..
gi 659985388 495 VVEETKPARPPlyyfEEVEEKRAREReqlaawyqPIPEPVKEPEPIKSSLKA 546
Cdd:NF033839 416 PKPEVKPQPEK----PKPEVKPQPEK--------PKPEVKPQPEKPKPEVKP 455
|
|
| PRK12323 |
PRK12323 |
DNA polymerase III subunit gamma/tau; |
334-506 |
2.70e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237057 [Multi-domain] Cd Length: 700 Bit Score: 42.17 E-value: 2.70e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 334 AAPVEPVTqtPPVASVdVPPSQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAE 413
Cdd:PRK12323 372 AGPATAAA--APVAQP-APAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAP 448
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 414 QPAQQPYYAPAAEQPVQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQSVEQQPVVEPE 493
Cdd:PRK12323 449 APAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPD 528
|
170
....*....|...
gi 659985388 494 PVVEETKPARPPL 506
Cdd:PRK12323 529 PATADPDDAFETL 541
|
|
| DUF4106 |
pfam13388 |
Protein of unknown function (DUF4106); This family of proteins are found in large numbers in ... |
410-483 |
3.12e-03 |
|
Protein of unknown function (DUF4106); This family of proteins are found in large numbers in the Trichomonas vaginalis proteome. The function of this protein is unknown.
Pssm-ID: 404296 Cd Length: 431 Bit Score: 41.80 E-value: 3.12e-03
10 20 30 40 50 60 70
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 659985388 410 PAAEQPAQQPYYAPAAEQPVQQpyyaPAPEQPvagnawqAEEQQSTFAPQStyqteQTYQQPAAQEPLYQQPQS 483
Cdd:pfam13388 203 PTVQNPAQQPTVQNPAQQPTQQ----PTVQNP-------AQQQNPAQQPPP-----QPAQQPTVQNPAQQQPQT 260
|
|
| dnaA |
PRK14086 |
chromosomal replication initiator protein DnaA; |
338-471 |
4.96e-03 |
|
chromosomal replication initiator protein DnaA;
Pssm-ID: 237605 [Multi-domain] Cd Length: 617 Bit Score: 41.35 E-value: 4.96e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 338 EPVTQTPPVASVDVPPSQPTVAWQPVPGPQtgePVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAeqpaq 417
Cdd:PRK14086 107 EPELPRPGRRPYEGYGGPRADDRPPGLPRQ---DQLPTARPAYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRA----- 178
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*
gi 659985388 418 qPYYAPAAEQPVQQPYYAP-APEQPVAGNAWQAEEQQstfAPQSTYQTEQTYQQP 471
Cdd:PRK14086 179 -PYASPASYAPEQERDREPyDAGRPEYDQRRRDYDHP---RPDWDRPRRDRTDRP 229
|
|
| VirD4 |
COG3505 |
Type IV secretory pathway, VirD4 component, TraG/TraD family ATPase [Intracellular trafficking, ... |
1012-1051 |
5.47e-03 |
|
Type IV secretory pathway, VirD4 component, TraG/TraD family ATPase [Intracellular trafficking, secretion, and vesicular transport];
Pssm-ID: 442728 [Multi-domain] Cd Length: 402 Bit Score: 40.74 E-value: 5.47e-03
10 20 30 40
....*....|....*....|....*....|....*....|
gi 659985388 1012 HLLVAGTTGSGKSVGVnamILSMLYKAQPEDvRFIMIDPK 1051
Cdd:COG3505 1 HVLVIGPTGSGKTVGL---VIPNLTQLARGE-SVVVTDPK 36
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
296-457 |
8.09e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 40.74 E-value: 8.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 296 GNRATQPEYDEYDPLLNGAPITEPVAVAAAATTATQSWAAPVePVTQTPPVASVDVPPSQPTVAWQPVPGPQTGEPVIAP 375
Cdd:PRK07764 630 AGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPA-KAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAAT 708
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 659985388 376 APEGyPQQSQYAQPAVQYNEPLQQPVQPQQPyyAPAAEQPAQQPYYAPAAEQPVQQPYYAPAPEQPVAGNAWQAEEQQST 455
Cdd:PRK07764 709 PPAG-QADDPAAQPPQAAQGASAPSPAADDP--VPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEM 785
|
..
gi 659985388 456 FA 457
Cdd:PRK07764 786 AE 787
|
|
|