|
Name |
Accession |
Description |
Interval |
E-value |
| ATF7IP_BD |
pfam16788 |
ATF-interacting protein binding domain; ATF7IP-BD is a short conserved region of activating ... |
564-779 |
1.12e-76 |
|
ATF-interacting protein binding domain; ATF7IP-BD is a short conserved region of activating transcription factor 7-interacting protein 1 found in higher eukaryotes. This domain appears to bind several key proteins such as TFIIE-alpha and TFIIE-beta as well the transcriptional regulator Sp1 which are part of the transcriptional machinery.
Pssm-ID: 465271 [Multi-domain] Cd Length: 214 Bit Score: 252.29 E-value: 1.12e-76
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 564 NVQSKRRRYMEEeyeaeFQVKITAKGDINQKLQKVIQWLLEEKLCALQCAVFDKTLAELKTRVEKIECNKRHKTVLTELQ 643
Cdd:pfam16788 1 KENVKRMKTSEQ-----INENICVALEKQTALLEQVKHLIEQEICSINYKLFDKKLKELNERVEKTECRKKHEAIATELQ 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 644 AKIARLTKRFEAAKEDLKKrhehpPNPPVSPGKTVND--VNSNNNMSYRNAGTVRQMLESKRNVSESAPpsFQTPVNTVS 721
Cdd:pfam16788 76 AKIARLTKRFKAALEDLKK-----CLPPNSPSSNAASkvANSNTINLYRNAGSVRSMLESKRSVGESSP--FQPPEKASK 148
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 317373420 722 STNLVTPPAVVSSQPKLQTPVTSGSLT----ATSVLPAPNTATVV---ATTQVPSGNPQPT-ISLQ 779
Cdd:pfam16788 149 KINLTSPQNEVVSESNNQDDVMLISVEspnlTTPVTSNPTDTRKVtsgNSSNSPSAETEVMaVEKK 214
|
|
| fn3_4 |
pfam16794 |
Fibronectin-III type domain; |
1160-1260 |
6.58e-49 |
|
Fibronectin-III type domain;
Pssm-ID: 465273 [Multi-domain] Cd Length: 101 Bit Score: 168.68 E-value: 6.58e-49
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1160 LPQKPHLKLARVQsqNGIVLSWSVLEVDRSCATVDSYHLYAYHEEPSATVPS-QWKKIGEVKALPLPMACTLTQFVSGSK 1238
Cdd:pfam16794 2 PPQKPTLKLARVP--TGIVLSWNMPDLDPKYAPVESYHLFAYQENTSTTPSTdSWKKIGDVKALPLPMACTLSQFKAGQR 79
|
90 100
....*....|....*....|..
gi 317373420 1239 YYFAVRAKDIYGRFGPFCDPQS 1260
Cdd:pfam16794 80 YYFAVRAVDIHGRYGPFSDPKT 101
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
822-1158 |
4.84e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.11 E-value: 4.84e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 822 PPTVSGLTK--NPVSLPSLPNPTKPNNVPSVPSPSIQRNPTASAAPLGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYS 899
Cdd:PHA03247 2689 RPTVGSLTSlaDPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPA 2768
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 900 PSTNRGPiqmkipisafstsSAAEQNSNTTPRIENQTNKTIDASVSKKAADSTSQCGKATGSDSsgvidltmddeesgAS 979
Cdd:PHA03247 2769 PAPPAAP-------------AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP--------------PA 2821
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 980 QDPKklnhTPVSTMSSSQPVSRPLQPIQPAPPLQPSG-VPTSGPSQTTIHLLPTAPTTVNVTHRPVTQVTtRLPVPRAP- 1057
Cdd:PHA03247 2822 ASPA----GPLPPPTSAQPTAPPPPPGPPPPSLPLGGsVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLA-RPAVSRSTe 2896
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1058 --ANHQVVYTTLPAPPAQAPLRGTVMQAPAVR---QVNPQNSVTVRVPQTTTYVVNNGLTLGSTGPQLTvHHRP-----P 1127
Cdd:PHA03247 2897 sfALPPDQPERPPQPQAPPPPQPQPQPPPPPQpqpPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG-ALVPgrvavP 2975
|
330 340 350
....*....|....*....|....*....|.
gi 317373420 1128 QVHTEPPRPVHPAPLPEAPQPQRLPPEAAST 1158
Cdd:PHA03247 2976 RFRVPQPAPSREAPASSTPPLTGHSLSRVSS 3006
|
|
| PTZ00341 |
PTZ00341 |
Ring-infected erythrocyte surface antigen; Provisional |
319-574 |
4.54e-08 |
|
Ring-infected erythrocyte surface antigen; Provisional
Pssm-ID: 173534 [Multi-domain] Cd Length: 1136 Bit Score: 57.87 E-value: 4.54e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 319 KNGADEKLEQIQSKDSLDEKNKADNNIDAN-EETLEtddtticsdrppEN-EKKVEEDIitelalgEDAISSSMEIDQGE 396
Cdd:PTZ00341 929 KNQNENVPEHLKEHAEANIEEDAEENVEEDaEENVE------------ENvEENVEENV-------EENVEENVEENVEE 989
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 397 KNEDETSADLVETINENViEDNKSENILENTDSMETDEIIPILEKLAPSEDEltcfsktsllPIDETNPDLEEKMESSFg 476
Cdd:PTZ00341 990 NVEENVEENVEENIEENV-EENVEENIEENVEEYDEENVEEVEENVEEYDEE----------NVEEIEENAEENVEENI- 1057
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 477 spskQESSESLPKEaflvlsDEEDIsgEKDESEVISQNetcspaeVESNEKDNKPEEEEQVIHEDDERPSEKNEFSRRKR 556
Cdd:PTZ00341 1058 ----EENIEEYDEE------NVEEI--EENIEENIEEN-------VEENVEENVEEIEENVEENVEENAEENAEENAEEN 1118
|
250
....*....|....*...
gi 317373420 557 SKSEDMDNVQSKRRRYME 574
Cdd:PTZ00341 1119 AEEYDDENPEEHNEEYDE 1136
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
708-1149 |
6.28e-07 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 54.00 E-value: 6.28e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 708 SAPPSFQTPVNTVSSTNLVTPPAVVSSQP-KLQTPVTSGSLTATSVLPAPNTATVVATTQVPSGNPQPTISLQPLPVILH 786
Cdd:pfam03154 143 STSPSIPSPQDNESDSDSSAQQQILQTQPpVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQ 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 787 VPVAVSSqpqLLQSHPGTLVTNQPSGNVEFISVQSPPTVSGLTKNPVSLPSLPNPTKPNNVPSVPSPSIQRNPTASAA-P 865
Cdd:pfam03154 223 STAAPHT---LIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPfP 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 866 LGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYSPSTNRGPIQMKIPISAFSTSSAAEQNSNTTPRIENQTNKTIDASVS 945
Cdd:pfam03154 300 LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLS 379
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 946 KKAADSTSqcgkatgsdssgvidltmddeesgaSQDPKKLNHTPVSTMSSSQPVSRPLQPIQPAPPLQPSGVPTSGPSQT 1025
Cdd:pfam03154 380 GPSPFQMN-------------------------SNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVL 434
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1026 TIHLLPTAPTTVNVTHRPVTQVTTRLPVPRAPANHQVVYTTLPA--PPAQAPLRGTVMQAPAVRQVnpqnSVTVRVPQTT 1103
Cdd:pfam03154 435 TQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPsgPPTSTSSAMPGIQPPSSASV----SSSGPVPAAV 510
|
410 420 430 440
....*....|....*....|....*....|....*....|....*.
gi 317373420 1104 TYVVnngltlgstgPQLTVHHRPPQvhtEPPRPVHPAPLPEAPQPQ 1149
Cdd:pfam03154 511 SCPL----------PPVQIKEEALD---EAEEPESPPPPPRSPSPE 543
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
122-433 |
1.87e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 49.14 E-value: 1.87e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 122 VSKLPAEPVSGDPAPGDLDA------GDPASGVLASGDSTSGDPTSSEP-SSSDAASGDATSGDAPSGDVSPGDATSGDA 194
Cdd:NF033609 544 VPEQPDEPGEIEPIPEDSDSdpgsdsGSDSSNSDSGSDSGSDSTSDSGSdSASDSDSASDSDSASDSDSASDSDSASDSD 623
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 195 TADDLSSGDPTSSDPIPGEPVPVEPISGDCAADDIASSEITSVDLASGAPASTDPASDDLASGDLSSSELASDDLATGEL 274
Cdd:NF033609 624 SASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 703
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 275 ASDELTSESTFDRTFEPKSVPVCEPVPEIDNiEPSSNKDDDFLEKNGADEKLEQIQSKDSlDEKNKADNNIDANEETLET 354
Cdd:NF033609 704 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSD 781
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 317373420 355 DDTTICSDRPPENEKKVEEDIITELALGEDAISSSmEIDQGEKNEDETSADLVETINENVIEDNKSENILENTDSMETD 433
Cdd:NF033609 782 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESD 859
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
110-287 |
4.10e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 48.24 E-value: 4.10e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 110 EPLSPHNITPEPVSKLPAEPVSGDPAPGDLDAGD------PASGVLASGDSTSGdPTSSEPSSSDAASGDATSGDAPSGD 183
Cdd:PHA03307 78 EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPsspdppPPTPPPASPPPSPA-PDLSEMLRPVGSPGPPPAASPPAAG 156
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 184 VSPGDATSGDAT---ADDLSSGDPTSSDPI--PGEPVPVEPISGDCAADDIASSEITSVDLASGAPASTDPASDDLASGD 258
Cdd:PHA03307 157 ASPAAVASDAASsrqAALPLSSPEETARAPssPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASS 236
|
170 180
....*....|....*....|....*....
gi 317373420 259 LSSSELASDDLATGELASDELTSESTFDR 287
Cdd:PHA03307 237 SDSSSSESSGCGWGPENECPLPRPAPITL 265
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
324-662 |
2.25e-03 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 42.35 E-value: 2.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 324 EKLEQIQSKdsLDEKNKAdnnIDANEETLETDDTTIcsdrppENEKKVEEDIITELALG-EDAISSSMEIDQGEKNEDET 402
Cdd:TIGR02168 684 EKIEELEEK--IAELEKA---LAELRKELEELEEEL------EQLRKELEELSRQISALrKDLARLEAEVEQLEERIAQL 752
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 403 SADLVETINENVIEDNK----SENILENTDSMETDEiipilEKLAPSEDELTCFSKTsllpIDETNPDLEEKMESSFgsp 478
Cdd:TIGR02168 753 SKELTELEAEIEELEERleeaEEELAEAEAEIEELE-----AQIEQLKEELKALREA----LDELRAELTLLNEEAA--- 820
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 479 SKQESSESLPKEAFLVLSDEEDISGE-KDESEVISQNEtcspAEVESnEKDNKPEEEEQVIHEDDERpSEKNEFSRRKRS 557
Cdd:TIGR02168 821 NLRERLESLERRIAATERRLEDLEEQiEELSEDIESLA----AEIEE-LEELIEELESELEALLNER-ASLEEALALLRS 894
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 558 KSEDMDNVQ---SKRRRYMEEEYEAefqvKITAKGDINQKLQKVIQWL--LEEKLCALQCAVFDKTLAELKTRVEKIECN 632
Cdd:TIGR02168 895 ELEELSEELrelESKRSELRRELEE----LREKLAQLELRLEGLEVRIdnLQERLSEEYSLTLEEAEALENKIEDDEEEA 970
|
330 340 350 360
....*....|....*....|....*....|....*....|....*..
gi 317373420 633 KRHktvLTELQAKIARL--------------TKRFE---AAKEDLKK 662
Cdd:TIGR02168 971 RRR---LKRLENKIKELgpvnlaaieeyeelKERYDfltAQKEDLTE 1014
|
|
| MDN1 |
COG5271 |
Midasin, AAA ATPase with vWA domain, involved in ribosome maturation [Translation, ribosomal ... |
80-552 |
4.01e-03 |
|
Midasin, AAA ATPase with vWA domain, involved in ribosome maturation [Translation, ribosomal structure and biogenesis];
Pssm-ID: 444083 [Multi-domain] Cd Length: 1028 Bit Score: 41.54 E-value: 4.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 80 DPEGSKAEWKETPCILSVNVKNKQDDDLNCEPLSPHNITPEPVSKLPAEPVSGDPAPGDLDAGDPASGVLASGDSTSGDP 159
Cdd:COG5271 274 ATDDADGLEAAEDDALDAELTAAQAADPESDDDADDSTLAALEGAAEDTEIATADELAAADDEDDDDSAAEDAAEEAATA 353
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 160 TSSEPSSSDAASGDATSGDAPSGDVSPGDATSGDATADDLSSGDPTSSDPIPGEPVPVEPISGDCAADDIASSEITSVDL 239
Cdd:COG5271 354 EDSAAEDTQDAEDEAAGEAADESEGADTDAAADEADAAADDSADDEEASADGGTSPTSDTDEEEEEADEDASAGETEDES 433
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 240 ASGAPASTDPASDDLASGDLSSSELASDDLATGELASDELTSESTFDRTFEPKSVPVCEPVPEIDNIEPSSNKDD----- 314
Cdd:COG5271 434 TDVTSAEDDIATDEEADSLADEEEEAEAELDTEEDTESAEEDADGDEATDEDDASDDGDEEEAEEDAEAEADSDEltaee 513
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 315 ---DFLEKNGADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTICSDRPPENEKKVEEDIITELALGEDAISSSME 391
Cdd:COG5271 514 tsaDDGADTDAAADPEDSDEDALEDETEGEENAPGSDQDADETDEPEATAEEDEPDEAEAETEDATENADADETEESADE 593
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 392 IDQGEKNEDETSADLVETINENVIEDNKSENILENTDSMETDEIIPILEKLAPSEDELTCFSKTSLLPIDETNPDLEEKM 471
Cdd:COG5271 594 SEEAEASEDEAAEEEEADDDEADADADGAADEEETEEEAAEDEAAEPETDASEAADEDADAETEAEASADESEEEAEDES 673
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 472 ESSfgSPSKQESSESLPKEAflvLSDEEDISGEKDESEVISQNETCSPAEVESNEKDNKPEEEEQVIHEDDERPSEKNEF 551
Cdd:COG5271 674 ETS--SEDAEEDADAAAAEA---SDDEEETEEADEDAETASEEADAEEADTEADGTAEEAEEAAEEAESADEEAASLPDE 748
|
.
gi 317373420 552 S 552
Cdd:COG5271 749 A 749
|
|
| COG1340 |
COG1340 |
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; |
532-665 |
9.66e-03 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 39.51 E-value: 9.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 532 EEEEQVIHEDDERPSEKNEFSRRKRSKSEDMDNVQSKRRRYMEE--EYEAEfqvkitaKGDINQKLQKVIQWLLEEKLCA 609
Cdd:COG1340 29 EKRDELNEELKELAEKRDELNAQVKELREEAQELREKRDELNEKvkELKEE-------RDELNEKLNELREELDELRKEL 101
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 317373420 610 LQCAVFDKTLAELKTRVEKIEcnKRHKT-VLT-----ELQAKIARLTKRFEAAKEDLKKRHE 665
Cdd:COG1340 102 AELNKAGGSIDKLRKEIERLE--WRQQTeVLSpeeekELVEKIKELEKELEKAKKALEKNEK 161
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| ATF7IP_BD |
pfam16788 |
ATF-interacting protein binding domain; ATF7IP-BD is a short conserved region of activating ... |
564-779 |
1.12e-76 |
|
ATF-interacting protein binding domain; ATF7IP-BD is a short conserved region of activating transcription factor 7-interacting protein 1 found in higher eukaryotes. This domain appears to bind several key proteins such as TFIIE-alpha and TFIIE-beta as well the transcriptional regulator Sp1 which are part of the transcriptional machinery.
Pssm-ID: 465271 [Multi-domain] Cd Length: 214 Bit Score: 252.29 E-value: 1.12e-76
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 564 NVQSKRRRYMEEeyeaeFQVKITAKGDINQKLQKVIQWLLEEKLCALQCAVFDKTLAELKTRVEKIECNKRHKTVLTELQ 643
Cdd:pfam16788 1 KENVKRMKTSEQ-----INENICVALEKQTALLEQVKHLIEQEICSINYKLFDKKLKELNERVEKTECRKKHEAIATELQ 75
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 644 AKIARLTKRFEAAKEDLKKrhehpPNPPVSPGKTVND--VNSNNNMSYRNAGTVRQMLESKRNVSESAPpsFQTPVNTVS 721
Cdd:pfam16788 76 AKIARLTKRFKAALEDLKK-----CLPPNSPSSNAASkvANSNTINLYRNAGSVRSMLESKRSVGESSP--FQPPEKASK 148
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 317373420 722 STNLVTPPAVVSSQPKLQTPVTSGSLT----ATSVLPAPNTATVV---ATTQVPSGNPQPT-ISLQ 779
Cdd:pfam16788 149 KINLTSPQNEVVSESNNQDDVMLISVEspnlTTPVTSNPTDTRKVtsgNSSNSPSAETEVMaVEKK 214
|
|
| fn3_4 |
pfam16794 |
Fibronectin-III type domain; |
1160-1260 |
6.58e-49 |
|
Fibronectin-III type domain;
Pssm-ID: 465273 [Multi-domain] Cd Length: 101 Bit Score: 168.68 E-value: 6.58e-49
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1160 LPQKPHLKLARVQsqNGIVLSWSVLEVDRSCATVDSYHLYAYHEEPSATVPS-QWKKIGEVKALPLPMACTLTQFVSGSK 1238
Cdd:pfam16794 2 PPQKPTLKLARVP--TGIVLSWNMPDLDPKYAPVESYHLFAYQENTSTTPSTdSWKKIGDVKALPLPMACTLSQFKAGQR 79
|
90 100
....*....|....*....|..
gi 317373420 1239 YYFAVRAKDIYGRFGPFCDPQS 1260
Cdd:pfam16794 80 YYFAVRAVDIHGRYGPFSDPKT 101
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
822-1158 |
4.84e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.11 E-value: 4.84e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 822 PPTVSGLTK--NPVSLPSLPNPTKPNNVPSVPSPSIQRNPTASAAPLGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYS 899
Cdd:PHA03247 2689 RPTVGSLTSlaDPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPA 2768
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 900 PSTNRGPiqmkipisafstsSAAEQNSNTTPRIENQTNKTIDASVSKKAADSTSQCGKATGSDSsgvidltmddeesgAS 979
Cdd:PHA03247 2769 PAPPAAP-------------AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP--------------PA 2821
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 980 QDPKklnhTPVSTMSSSQPVSRPLQPIQPAPPLQPSG-VPTSGPSQTTIHLLPTAPTTVNVTHRPVTQVTtRLPVPRAP- 1057
Cdd:PHA03247 2822 ASPA----GPLPPPTSAQPTAPPPPPGPPPPSLPLGGsVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLA-RPAVSRSTe 2896
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1058 --ANHQVVYTTLPAPPAQAPLRGTVMQAPAVR---QVNPQNSVTVRVPQTTTYVVNNGLTLGSTGPQLTvHHRP-----P 1127
Cdd:PHA03247 2897 sfALPPDQPERPPQPQAPPPPQPQPQPPPPPQpqpPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG-ALVPgrvavP 2975
|
330 340 350
....*....|....*....|....*....|.
gi 317373420 1128 QVHTEPPRPVHPAPLPEAPQPQRLPPEAAST 1158
Cdd:PHA03247 2976 RFRVPQPAPSREAPASSTPPLTGHSLSRVSS 3006
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
820-1174 |
4.96e-09 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 61.11 E-value: 4.96e-09
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 820 QSPPTVSGLTKNPVSLPSLPNPTKPNNVPSVPSPSiQRNPTASAAPLGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYS 899
Cdd:PHA03247 2595 SARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPP-SPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRA 2673
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 900 P---STNRGPIQMKIPISAFS-TSSAAEQNSNTTPriENQTNKTIDASVSKKAADSTSQCGKATGSDSSgvidltmddee 975
Cdd:PHA03247 2674 AqasSPPQRPRRRAARPTVGSlTSLADPPPPPPTP--EPAPHALVSATPLPPGPAAARQASPALPAAPA----------- 2740
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 976 sgasqdPKKLNHTPVSTMSSSQPVSRPLQ--PIQPAPPLQPSG-------VPTSGPSQTTIHLLPTAPTTVNVThRPVTQ 1046
Cdd:PHA03247 2741 ------PPAVPAGPATPGGPARPARPPTTagPPAPAPPAAPAAgpprrltRPAVASLSESRESLPSPWDPADPP-AAVLA 2813
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1047 VTTRLPVPRAPANHQVVYTT-LPAPPAQAP--------LRGTVMQAPAVRQVNPQNSvTVRVPQTTTYVVNNGLTlgstG 1117
Cdd:PHA03247 2814 PAAALPPAASPAGPLPPPTSaQPTAPPPPPgppppslpLGGSVAPGGDVRRRPPSRS-PAAKPAAPARPPVRRLA----R 2888
|
330 340 350 360 370
....*....|....*....|....*....|....*....|....*....|....*..
gi 317373420 1118 PQLTvhhRPPQVHTEPPRPVHPAPLPEAPQPQRLPPEAASTSLPQKPHLKLARVQSQ 1174
Cdd:PHA03247 2889 PAVS---RSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
|
|
| PTZ00341 |
PTZ00341 |
Ring-infected erythrocyte surface antigen; Provisional |
319-574 |
4.54e-08 |
|
Ring-infected erythrocyte surface antigen; Provisional
Pssm-ID: 173534 [Multi-domain] Cd Length: 1136 Bit Score: 57.87 E-value: 4.54e-08
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 319 KNGADEKLEQIQSKDSLDEKNKADNNIDAN-EETLEtddtticsdrppEN-EKKVEEDIitelalgEDAISSSMEIDQGE 396
Cdd:PTZ00341 929 KNQNENVPEHLKEHAEANIEEDAEENVEEDaEENVE------------ENvEENVEENV-------EENVEENVEENVEE 989
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 397 KNEDETSADLVETINENViEDNKSENILENTDSMETDEIIPILEKLAPSEDEltcfsktsllPIDETNPDLEEKMESSFg 476
Cdd:PTZ00341 990 NVEENVEENVEENIEENV-EENVEENIEENVEEYDEENVEEVEENVEEYDEE----------NVEEIEENAEENVEENI- 1057
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 477 spskQESSESLPKEaflvlsDEEDIsgEKDESEVISQNetcspaeVESNEKDNKPEEEEQVIHEDDERPSEKNEFSRRKR 556
Cdd:PTZ00341 1058 ----EENIEEYDEE------NVEEI--EENIEENIEEN-------VEENVEENVEEIEENVEENVEENAEENAEENAEEN 1118
|
250
....*....|....*...
gi 317373420 557 SKSEDMDNVQSKRRRYME 574
Cdd:PTZ00341 1119 AEEYDDENPEEHNEEYDE 1136
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
708-1149 |
6.28e-07 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 54.00 E-value: 6.28e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 708 SAPPSFQTPVNTVSSTNLVTPPAVVSSQP-KLQTPVTSGSLTATSVLPAPNTATVVATTQVPSGNPQPTISLQPLPVILH 786
Cdd:pfam03154 143 STSPSIPSPQDNESDSDSSAQQQILQTQPpVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQ 222
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 787 VPVAVSSqpqLLQSHPGTLVTNQPSGNVEFISVQSPPTVSGLTKNPVSLPSLPNPTKPNNVPSVPSPSIQRNPTASAA-P 865
Cdd:pfam03154 223 STAAPHT---LIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPfP 299
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 866 LGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYSPSTNRGPIQMKIPISAFSTSSAAEQNSNTTPRIENQTNKTIDASVS 945
Cdd:pfam03154 300 LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLS 379
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 946 KKAADSTSqcgkatgsdssgvidltmddeesgaSQDPKKLNHTPVSTMSSSQPVSRPLQPIQPAPPLQPSGVPTSGPSQT 1025
Cdd:pfam03154 380 GPSPFQMN-------------------------SNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVL 434
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1026 TIHLLPTAPTTVNVTHRPVTQVTTRLPVPRAPANHQVVYTTLPA--PPAQAPLRGTVMQAPAVRQVnpqnSVTVRVPQTT 1103
Cdd:pfam03154 435 TQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPsgPPTSTSSAMPGIQPPSSASV----SSSGPVPAAV 510
|
410 420 430 440
....*....|....*....|....*....|....*....|....*.
gi 317373420 1104 TYVVnngltlgstgPQLTVHHRPPQvhtEPPRPVHPAPLPEAPQPQ 1149
Cdd:pfam03154 511 SCPL----------PPVQIKEEALD---EAEEPESPPPPPRSPSPE 543
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
319-684 |
2.44e-06 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 52.45 E-value: 2.44e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 319 KNGADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTICSD--RPPENEKKVEEDIITELALGEDAISSSMEIDQGE 396
Cdd:PTZ00121 1437 KKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEeaKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAK 1516
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 397 KNEDETSADLVETINE--------NVIEDNKSENILENTDSMETDEIIPILEKLAPSEDELTCFSKTSLLPIDEtNPDLE 468
Cdd:PTZ00121 1517 KAEEAKKADEAKKAEEakkadeakKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAE-EARIE 1595
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 469 EKMEssFGSPSKQESSESLPKEAFLVLSDEEdISGEKDESEVISQNETCSPAEVESNEKDNKPEEEEQVIHEDDERPSE- 547
Cdd:PTZ00121 1596 EVMK--LYEEEKKMKAEEAKKAEEAKIKAEE-LKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEe 1672
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 548 ---------KNEFSRRKRS-----KSEDMDNVQSKRRRYMEEEYEAEfQVKitakgdinqKLQKVIQWLLEEklcALQCA 613
Cdd:PTZ00121 1673 dkkkaeeakKAEEDEKKAAealkkEAEEAKKAEELKKKEAEEKKKAE-ELK---------KAEEENKIKAEE---AKKEA 1739
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 317373420 614 VFDKTLAElKTRVEKIECNKRHKTVLTELQAKIARLTKRFEAAKEDLKKRHEhppNPPVSPGKTVNDVNSN 684
Cdd:PTZ00121 1740 EEDKKKAE-EAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDE---KRRMEVDKKIKDIFDN 1806
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
792-1172 |
5.64e-06 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 50.92 E-value: 5.64e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 792 SSQPQLLQSHPGTLVTNQPSgnvefiSVQSPPTVSGLTKNPVSLPSLPNPTKPNNV-PSVPSPSIQRNPTASAAPL---G 867
Cdd:pfam03154 161 SAQQQILQTQPPVLQAQSGA------ASPPSPPPPGTTQAATAGPTPSAPSVPPQGsPATSQPPNQTQSTAAPHTLiqqT 234
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 868 TTLAVQAVPTAHSIVQ-ATRTSLPTVGPSGLYSPSTNRGPIQ-MKIPISAFSTSSAAEQNSNTTPRIENQTNKTIDASVS 945
Cdd:pfam03154 235 PTLHPQRLPSPHPPLQpMTQPPPPSQVSPQPLPQPSLHGQMPpMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPS 314
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 946 KKAADSTSQCGKATGSDSSGvidltmddeesgASQDPKKLNHTPVSTMSSSQPVSRPLQPIQPAPPLQPSGVPT--SGPS 1023
Cdd:pfam03154 315 PAAPGQSQQRIHTPPSQSQL------------QSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPhlSGPS 382
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1024 QTTIHL-LPTAPTTvnvthRPVTQVTTRLPVPRAPANHQVVYTT--LPAPPAQAPLRGTVMQAPAVRQVNPQNSVTVRVP 1100
Cdd:pfam03154 383 PFQMNSnLPPPPAL-----KPLSSLSTHHPPSAHPPPLQLMPQSqqLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVP 457
|
330 340 350 360 370 380 390
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 317373420 1101 QTTTYVVNNGLTLGStgpqltvhhrPPQVHTEPPRPVHPAPLPEAPQPQRLPPeAASTSLPQKPHLKLARVQ 1172
Cdd:pfam03154 458 SQSPFPQHPFVPGGP----------PPITPPSGPPTSTSSAMPGIQPPSSASV-SSSGPVPAAVSCPLPPVQ 518
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
708-1024 |
9.15e-06 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 50.32 E-value: 9.15e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 708 SAPPSFQTPVNTVSSTNLVTPPAVVSSQPKLQTPVTSGSLTATSVLPAPNTATVVATTQVPSGNPQPTISLQPLPVILHV 787
Cdd:PHA03247 2764 AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPP 2843
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 788 PVAVSSQPQLLQSHPGTLVTNQPSGNVEFISVQSP--PTVSGLTKNPVSLPSLPNPTKPNNVPSVPSPSIQRNPTASA-- 863
Cdd:PHA03247 2844 GPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAParPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPqp 2923
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 864 -APLGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYSPSTNRGPIQMKIPISAFSTSSAAEqnSNTTPRIENQTNKTIDA 942
Cdd:PHA03247 2924 pPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAP--SREAPASSTPPLTGHSL 3001
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 943 SVSKKAADSTSQCGKATGSDSSGVIDLTMDDEESGASQDPKKLNHTPVSTMSSsqpvsrpLQPIQPAPPLQPSGVPTSGP 1022
Cdd:PHA03247 3002 SRVSSWASSLALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEA-------LDPLPPEPHDPFAHEPDPAT 3074
|
..
gi 317373420 1023 SQ 1024
Cdd:PHA03247 3075 PE 3076
|
|
| MSCRAMM_ClfA |
NF033609 |
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ... |
122-433 |
1.87e-05 |
|
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
Pssm-ID: 468110 [Multi-domain] Cd Length: 934 Bit Score: 49.14 E-value: 1.87e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 122 VSKLPAEPVSGDPAPGDLDA------GDPASGVLASGDSTSGDPTSSEP-SSSDAASGDATSGDAPSGDVSPGDATSGDA 194
Cdd:NF033609 544 VPEQPDEPGEIEPIPEDSDSdpgsdsGSDSSNSDSGSDSGSDSTSDSGSdSASDSDSASDSDSASDSDSASDSDSASDSD 623
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 195 TADDLSSGDPTSSDPIPGEPVPVEPISGDCAADDIASSEITSVDLASGAPASTDPASDDLASGDLSSSELASDDLATGEL 274
Cdd:NF033609 624 SASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 703
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 275 ASDELTSESTFDRTFEPKSVPVCEPVPEIDNiEPSSNKDDDFLEKNGADEKLEQIQSKDSlDEKNKADNNIDANEETLET 354
Cdd:NF033609 704 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSD 781
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 317373420 355 DDTTICSDRPPENEKKVEEDIITELALGEDAISSSmEIDQGEKNEDETSADLVETINENVIEDNKSENILENTDSMETD 433
Cdd:NF033609 782 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESD 859
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
110-287 |
4.10e-05 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 48.24 E-value: 4.10e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 110 EPLSPHNITPEPVSKLPAEPVSGDPAPGDLDAGD------PASGVLASGDSTSGdPTSSEPSSSDAASGDATSGDAPSGD 183
Cdd:PHA03307 78 EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPsspdppPPTPPPASPPPSPA-PDLSEMLRPVGSPGPPPAASPPAAG 156
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 184 VSPGDATSGDAT---ADDLSSGDPTSSDPI--PGEPVPVEPISGDCAADDIASSEITSVDLASGAPASTDPASDDLASGD 258
Cdd:PHA03307 157 ASPAAVASDAASsrqAALPLSSPEETARAPssPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASS 236
|
170 180
....*....|....*....|....*....
gi 317373420 259 LSSSELASDDLATGELASDELTSESTFDR 287
Cdd:PHA03307 237 SDSSSSESSGCGWGPENECPLPRPAPITL 265
|
|
| Herpes_BLLF1 |
pfam05109 |
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ... |
708-1191 |
6.55e-05 |
|
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.
Pssm-ID: 282904 [Multi-domain] Cd Length: 886 Bit Score: 47.22 E-value: 6.55e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 708 SAPPSFQTPVNTVSSTNLVTP------PAVVSSQPKLQTPVTSGSLTATSVL--PAPNTATVVATTQVPSGNPQ------ 773
Cdd:pfam05109 422 SKAPESTTTSPTLNTTGFAAPntttglPSSTHVPTNLTAPASTGPTVSTADVtsPTPAGTTSGASPVTPSPSPRdngtes 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 774 --PTISLQPLPVILHVPVAVSSQPQLLQSHPGTlvTNQPSGNVEFISVQSPPTVSGLTKNPVSLPSLPNPTKPNNVPSVP 851
Cdd:pfam05109 502 kaPDMTSPTSAVTTPTPNATSPTPAVTTPTPNA--TSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSP 579
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 852 SPSIQR-NPTASAAPLGTTlAVQAVPTAHSIVQATRTSLPTVGPSGLYSPSTNRgpiqmKIPISAFSTSSAAEQNSNTTP 930
Cdd:pfam05109 580 TSAVTTpTPNATSPTVGET-SPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTG-----QHNITSSSTSSMSLRPSSISE 653
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 931 RIENQTNKtidasvskkaaDSTSQCGKATGSDSSGVIDLTMDDEESgasqdpkklnhTPVSTMSSSQPVSRPLQPIQPAP 1010
Cdd:pfam05109 654 TLSPSTSD-----------NSTSHMPLLTSAHPTGGENITQVTPAS-----------TSTHHVSTSSPAPRPGTTSQASG 711
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1011 PlqpsgvptsGPSQTTihllpTAPTTVNVTH-RPVTQVTTrlpvPRAPANHQVVYTTLPAPPAQA-PLRGTVMQAPAVRQ 1088
Cdd:pfam05109 712 P---------GNSSTS-----TKPGEVNVTKgTPPKNATS----PQAPSGQKTAVPTVTSTGGKAnSTTGGKHTTGHGAR 773
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1089 VNPQNSVTVRVPQTTTYVVNNGLTLgsTGPQLTVHHRPPQVHTEPPRPVHPAPLPeapqpqrLPPeaasTSLPQKPHLKL 1168
Cdd:pfam05109 774 TSTEPTTDYGGDSTTPRTRYNATTY--LPPSTSSKLRPRWTFTSPPVTTAQATVP-------VPP----TSQPRFSNLSM 840
|
490 500
....*....|....*....|...
gi 317373420 1169 ARVQSQNGIVLSWSVLEVDRSCA 1191
Cdd:pfam05109 841 LVLQWASLAVLTLLLLLVMADCA 863
|
|
| PRK13108 |
PRK13108 |
prolipoprotein diacylglyceryl transferase; Reviewed |
142-291 |
7.16e-05 |
|
prolipoprotein diacylglyceryl transferase; Reviewed
Pssm-ID: 237284 [Multi-domain] Cd Length: 460 Bit Score: 46.90 E-value: 7.16e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 142 GDPASGVLASGDSTSGDPTSSEPSSSDAASGDATSGDAPSGDvsPGDATSGDATADDLSS-------------------- 201
Cdd:PRK13108 278 GREAPGALRGSEYVVDEALEREPAELAAAAVASAASAVGPVG--PGEPNQPDDVAEAVKAevaevtdevaaesvvqvadr 355
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 202 -GDPTSSDPIPGEPVPVEPISGDCAADDIASSEitsVDLASGAPASTDPAsdDLASGDLSSSELASDDLATGELAS---D 277
Cdd:PRK13108 356 dGESTPAVEETSEADIEREQPGDLAGQAPAAHQ---VDAEAASAAPEEPA--ALASEAHDETEPEVPEKAAPIPDPakpD 430
|
170
....*....|....
gi 317373420 278 ELTSESTFDRTFEP 291
Cdd:PRK13108 431 ELAVAGPGDDPAEP 444
|
|
| Atrophin-1 |
pfam03154 |
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ... |
963-1164 |
7.59e-05 |
|
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.
Pssm-ID: 460830 [Multi-domain] Cd Length: 991 Bit Score: 47.07 E-value: 7.59e-05
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 963 SSGVIDLTMDDEESGASQDPKKLNHTPVSTMSSSQPVSRPLQPIQPAPPLQPSGVPTSGPSQTTIHLLPTAPTTVNVTHR 1042
Cdd:pfam03154 144 TSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQS 223
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1043 PVT-----QVTTRLPVPRAPANHQVVY-TTLPAPPAQAP--------LRGTVMQAPAVRQVNPQNSVTVRVPQTTTYVVN 1108
Cdd:pfam03154 224 TAAphtliQQTPTLHPQRLPSPHPPLQpMTQPPPPSQVSpqplpqpsLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQ 303
|
170 180 190 200 210 220
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 317373420 1109 NGLTLGSTGPQLTV--------HHRPPQVHTEPPRPVHPAPLPEAPQPQRLPPEAASTSLPQKP 1164
Cdd:pfam03154 304 SSQSQVPPGPSPAApgqsqqriHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLP 367
|
|
| PTZ00341 |
PTZ00341 |
Ring-infected erythrocyte surface antigen; Provisional |
321-578 |
1.94e-04 |
|
Ring-infected erythrocyte surface antigen; Provisional
Pssm-ID: 173534 [Multi-domain] Cd Length: 1136 Bit Score: 45.93 E-value: 1.94e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 321 GADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTICSDRPPENEKKVEEDIitelalgEDAISSSMEIDQGEKNED 400
Cdd:PTZ00341 897 GGGKKDKKAKKKDAKDLSGNIAHEINLINKELKNQNENVPEHLKEHAEANIEEDA-------EENVEEDAEENVEENVEE 969
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 401 ETSADLVETINENViEDNKSENILENTDSMETDEIIPILEklapsedeltcfsktsllpiDETNPDLEEKMESSFGSPSK 480
Cdd:PTZ00341 970 NVEENVEENVEENV-EENVEENVEENVEENVEENIEENVE--------------------ENVEENIEENVEEYDEENVE 1028
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 481 QESSESLPKEAFLVLSDEEDIsgEKDESEVISQN----ETCSPAEVESNEKDNKPEEEEQVIHEDDERPSEKNEFSRRKR 556
Cdd:PTZ00341 1029 EVEENVEEYDEENVEEIEENA--EENVEENIEENieeyDEENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEEN 1106
|
250 260
....*....|....*....|..
gi 317373420 557 SKSEDMDNVQSKRRRYMEEEYE 578
Cdd:PTZ00341 1107 AEENAEENAEENAEEYDDENPE 1128
|
|
| PHA03378 |
PHA03378 |
EBNA-3B; Provisional |
976-1162 |
2.32e-04 |
|
EBNA-3B; Provisional
Pssm-ID: 223065 [Multi-domain] Cd Length: 991 Bit Score: 45.83 E-value: 2.32e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 976 SGASQDPKKLNHTPVSTMSSSQPVsrPLQPIQPAP------PLQPSGVPTsgPSQT-TIHLLPTAPTTVNVTHRPV---- 1044
Cdd:PHA03378 603 SQTPEPPTTQSHIPETSAPRQWPM--PLRPIPMRPlrmqpiTFNVLVFPT--PHQPpQVEITPYKPTWTQIGHIPYqpsp 678
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1045 TQVTTRLPVPRAPANHQvvyttlpaPPAQAPLRGTVMQAPAVRQVNPQNSVT-VRVPQTTTYVVN--NGLTLGSTGPQLT 1121
Cdd:PHA03378 679 TGANTMLPIQWAPGTMQ--------PPPRAPTPMRPPAAPPGRAQRPAAATGrARPPAAAPGRARppAAAPGRARPPAAA 750
|
170 180 190 200
....*....|....*....|....*....|....*....|....*....
gi 317373420 1122 -VHHRPPQVHTEPPRPVHPAPLPEAPQPQ-------RLPPEAASTSLPQ 1162
Cdd:PHA03378 751 pGRARPPAAAPGRARPPAAAPGAPTPQPPpqappapQQRPRGAPTPQPP 799
|
|
| PRK03918 |
PRK03918 |
DNA double-strand break repair ATPase Rad50; |
366-663 |
2.84e-04 |
|
DNA double-strand break repair ATPase Rad50;
Pssm-ID: 235175 [Multi-domain] Cd Length: 880 Bit Score: 45.44 E-value: 2.84e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 366 ENEKKVEEDIITELALGEDAISSSMEIDQGEKNEDETSADLVETINENVIEDNKSENILENTDS--METDEIIPILEKLA 443
Cdd:PRK03918 165 KNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKevKELEELKEEIEELE 244
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 444 PSEDELTCFSKTSLLPIDETNPDLEEKMESSFGSPSKQESSESLPKEA--------FLVLSDEEDISGEKDESEVISQ-- 513
Cdd:PRK03918 245 KELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAeeyiklseFYEEYLDELREIEKRLSRLEEEin 324
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 514 --NETCSPAEvESNEKDNKPEEEEQVIHEDDERPSEKNEFSRRKRSKSEDMDNVQSKRRRYMEEEYEAEFQVKITAKGDI 591
Cdd:PRK03918 325 giEERIKELE-EKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEI 403
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 317373420 592 NQKLQKVIQWL--LEEKLCALQCAVFDKTLAELKTRVEKIECNKRH-KTVLTELQAKIARLTKR---FEAAKEDLKKR 663
Cdd:PRK03918 404 EEEISKITARIgeLKKEIKELKKAIEELKKAKGKCPVCGRELTEEHrKELLEEYTAELKRIEKElkeIEEKERKLRKE 481
|
|
| PTZ00121 |
PTZ00121 |
MAEBL; Provisional |
317-600 |
3.35e-04 |
|
MAEBL; Provisional
Pssm-ID: 173412 [Multi-domain] Cd Length: 2084 Bit Score: 45.13 E-value: 3.35e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 317 LEKNGADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTicsdRPPENEKKVEEDIITELALGEDAISSSMEIDQGE 396
Cdd:PTZ00121 1680 AKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEEL----KKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE 1755
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 397 KN-------EDETSADLVETINENVIEDNKSENilENTDSMETDEIIPilEKLAPSEDELTCFSKTSLLPIDETNPDLEE 469
Cdd:PTZ00121 1756 KKkiahlkkEEEKKAEEIRKEKEAVIEEELDEE--DEKRRMEVDKKIK--DIFDNFANIIEGGKEGNLVINDSKEMEDSA 1831
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 470 KMESSFGSPSKQESSESLPKEAFlvlsDEEDISGEKDESEVISqnetcspaeveSNEKDNKPEEEEQVIHEDDERPSEKN 549
Cdd:PTZ00121 1832 IKEVADSKNMQLEEADAFEKHKF----NKNNENGEDGNKEADF-----------NKEKDLKEDDEEEIEEADEIEKIDKD 1896
|
250 260 270 280 290
....*....|....*....|....*....|....*....|....*....|.
gi 317373420 550 EFSRRKRSKSEDMDNVQSKRRRYMEEEYEaefqvkitaKGDINQKLQKVIQ 600
Cdd:PTZ00121 1897 DIEREIPNNNMAGKNNDIIDDKLDKDEYI---------KRDAEETREEIIK 1938
|
|
| PRK07764 |
PRK07764 |
DNA polymerase III subunits gamma and tau; Validated |
126-252 |
1.66e-03 |
|
DNA polymerase III subunits gamma and tau; Validated
Pssm-ID: 236090 [Multi-domain] Cd Length: 824 Bit Score: 42.67 E-value: 1.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 126 PAEPVSGDPAPGDLDAGDPASGVLASGDSTSGDPTSSEPSSSDAASGDATSGDAPSGDVSPGDATSGDATADDLS---SG 202
Cdd:PRK07764 649 APEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQaaqGA 728
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|
gi 317373420 203 DPTSSDPIPGEPVPVEPISGDCAADDIASSEITSVDLASGAPASTDPASD 252
Cdd:PRK07764 729 SAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSP 778
|
|
| PHA03247 |
PHA03247 |
large tegument protein UL36; Provisional |
1001-1164 |
2.21e-03 |
|
large tegument protein UL36; Provisional
Pssm-ID: 223021 [Multi-domain] Cd Length: 3151 Bit Score: 42.62 E-value: 2.21e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1001 RPLQPIQPAPPLQPSGVPTSGPSQTTIHLLPTAPttvnvtHRPVTQVttrlPVPRAPANHQVVYTTLPAPPAQAPLRgtv 1080
Cdd:PHA03247 2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDT------HAPDPPP----PSPSPAANEPDPHPPPTVPPPERPRD--- 2654
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1081 mqAPAVRQVNPQNSVTVRVPQTTTYVVNNGLTLGSTGP---QLTVHHRPPqvhtEPPRPVHPAPLPEAPQ-PQRLPPEAA 1156
Cdd:PHA03247 2655 --DPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPtvgSLTSLADPP----PPPPTPEPAPHALVSAtPLPPGPAAA 2728
|
....*...
gi 317373420 1157 STSLPQKP 1164
Cdd:PHA03247 2729 RQASPALP 2736
|
|
| SMC_prok_B |
TIGR02168 |
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ... |
324-662 |
2.25e-03 |
|
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]
Pssm-ID: 274008 [Multi-domain] Cd Length: 1179 Bit Score: 42.35 E-value: 2.25e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 324 EKLEQIQSKdsLDEKNKAdnnIDANEETLETDDTTIcsdrppENEKKVEEDIITELALG-EDAISSSMEIDQGEKNEDET 402
Cdd:TIGR02168 684 EKIEELEEK--IAELEKA---LAELRKELEELEEEL------EQLRKELEELSRQISALrKDLARLEAEVEQLEERIAQL 752
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 403 SADLVETINENVIEDNK----SENILENTDSMETDEiipilEKLAPSEDELTCFSKTsllpIDETNPDLEEKMESSFgsp 478
Cdd:TIGR02168 753 SKELTELEAEIEELEERleeaEEELAEAEAEIEELE-----AQIEQLKEELKALREA----LDELRAELTLLNEEAA--- 820
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 479 SKQESSESLPKEAFLVLSDEEDISGE-KDESEVISQNEtcspAEVESnEKDNKPEEEEQVIHEDDERpSEKNEFSRRKRS 557
Cdd:TIGR02168 821 NLRERLESLERRIAATERRLEDLEEQiEELSEDIESLA----AEIEE-LEELIEELESELEALLNER-ASLEEALALLRS 894
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 558 KSEDMDNVQ---SKRRRYMEEEYEAefqvKITAKGDINQKLQKVIQWL--LEEKLCALQCAVFDKTLAELKTRVEKIECN 632
Cdd:TIGR02168 895 ELEELSEELrelESKRSELRRELEE----LREKLAQLELRLEGLEVRIdnLQERLSEEYSLTLEEAEALENKIEDDEEEA 970
|
330 340 350 360
....*....|....*....|....*....|....*....|....*..
gi 317373420 633 KRHktvLTELQAKIARL--------------TKRFE---AAKEDLKK 662
Cdd:TIGR02168 971 RRR---LKRLENKIKELgpvnlaaieeyeelKERYDfltAQKEDLTE 1014
|
|
| DUF5585 |
pfam17823 |
Family of unknown function (DUF5585); This is a family of unknown function found in chordata. |
860-1164 |
2.71e-03 |
|
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
Pssm-ID: 465521 [Multi-domain] Cd Length: 506 Bit Score: 41.87 E-value: 2.71e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 860 TASAAPLGTTLAVQAvpTAHSIVQATRTSLPTVGPSGlySPSTNRGpiqmkipiSAFSTSSAAEQNSNTTPRIEN-QTNK 938
Cdd:pfam17823 87 TAEHTPHGTDLSEPA--TREGAADGAASRALAAAASS--SPSSAAQ--------SLPAAIAALPSEAFSAPRAAAcRANA 154
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 939 TIDASVSKKAADSTSQCGKATGSDSSGVidlTMDDEESGASQDPKKLNHTPVSTMSSSQPVS-RPLQPIQPAPPLQPSGV 1017
Cdd:pfam17823 155 SAAPRAAIAAASAPHAASPAPRTAASST---TAASSTTAASSAPTTAASSAPATLTPARGIStAATATGHPAAGTALAAV 231
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1018 PTSGPSQTTIHL---------LPTAPTTVNVTHRPVTQVTTRLPVPR--APANHQVVYTTL--PAPPAQAPLRGTVMQAP 1084
Cdd:pfam17823 232 GNSSPAAGTVTAavgtvtpaaLATLAAAAGTVASAAGTINMGDPHARrlSPAKHMPSDTMArnPAAPMGAQAQGPIIQVS 311
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1085 AVRQV-------NPQNSVTVRVPQTTTYVVNNGLTLGSTgpqLTVHHRPPQVHTEPPRPVHPAPLPEA----PQPQRLPP 1153
Cdd:pfam17823 312 TDQPVhntagepTPSPSNTTLEPNTPKSVASTNLAVVTT---TKAQAKEPSASPVPVLHTSMIPEVEAtsptTQPSPLLP 388
|
330
....*....|...
gi 317373420 1154 E--AASTSLPQKP 1164
Cdd:pfam17823 389 TqgAAGPGILLAP 401
|
|
| PRK10263 |
PRK10263 |
DNA translocase FtsK; Provisional |
994-1170 |
3.16e-03 |
|
DNA translocase FtsK; Provisional
Pssm-ID: 236669 [Multi-domain] Cd Length: 1355 Bit Score: 41.99 E-value: 3.16e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 994 SSSQPVSRPLQPIQPAPPLqPSgvPTSGPSQTTIHLLPT-APTTVNVTHRPVTQVTTRLPVPRAPANHQVVYTTLPAPPA 1072
Cdd:PRK10263 328 TATQSWAAPVEPVTQTPPV-AS--VDVPPAQPTVAWQPVpGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQ 404
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1073 QAPLRGTVMQAPAVRQVNPQNSVTVRVPQTTTYVVNNGLTLGSTGPQLTVHHRPPQVH---TEPPRPVHPAPLPEAPQPq 1149
Cdd:PRK10263 405 QPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYqteQTYQQPAAQEPLYQQPQP- 483
|
170 180
....*....|....*....|.
gi 317373420 1150 rLPPEAASTSLPQKPHLKLAR 1170
Cdd:PRK10263 484 -VEQQPVVEPEPVVEETKPAR 503
|
|
| PHA02664 |
PHA02664 |
hypothetical protein; Provisional |
126-282 |
3.68e-03 |
|
hypothetical protein; Provisional
Pssm-ID: 177447 Cd Length: 534 Bit Score: 41.52 E-value: 3.68e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 126 PAEP----VSGDPApgdLDAGDPASGVLASGDSTSGdptssePSSSDAASGDATSgdapsgdvSPGDATSGDATADDLSS 201
Cdd:PHA02664 368 PAEPaalfVDGNEV---IAAGAAAAMIAAAERAANG------ARGSPMAAPEEGR--------AAAAAAAANAPADQDVE 430
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 202 GDPtSSDPIPGEPVPVEPISGDCAADDIASSEIT-----------SVDLASGAPASTDPASDDLASGDLSSSELASDDLA 270
Cdd:PHA02664 431 AEA-HDEFDQDPGAPAHADRADSDEDDMDEQESGderadgeddsdSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDGG 509
|
170
....*....|..
gi 317373420 271 TGELASDELTSE 282
Cdd:PHA02664 510 VGQAIEEEEEEE 521
|
|
| Mplasa_alph_rch |
TIGR04523 |
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ... |
302-662 |
3.86e-03 |
|
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.
Pssm-ID: 275316 [Multi-domain] Cd Length: 745 Bit Score: 41.54 E-value: 3.86e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 302 EIDNIEPSSNKDDdfleKNGADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTICSDRppENEKKVEEDIITELAL 381
Cdd:TIGR04523 125 ELNKLEKQKKENK----KNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEK--LNIQKNIDKIKNKLLK 198
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 382 GEDAISSSMEIDQGEK-------NEDETSADLVETINENVIEDNKSENILENTDSM---ETDEIIPILEKLAPSEDELTC 451
Cdd:TIGR04523 199 LELLLSNLKKKIQKNKslesqisELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQlnqLKDEQNKIKKQLSEKQKELEQ 278
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 452 FSKTsllpIDETNPDLEE-KMESsfgSPSKQESSESLPKEaflVLSDEEDISGEKDESE--------VISQ-NETCSPAE 521
Cdd:TIGR04523 279 NNKK----IKELEKQLNQlKSEI---SDLNNQKEQDWNKE---LKSELKNQEKKLEEIQnqisqnnkIISQlNEQISQLK 348
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 522 VESNEKDNKPEEEEQVIHE-DDERPSEKNEfsrrKRSKSEDMDNVQSKRRRY-----MEEEYEAEFQVKITAKGDINQKL 595
Cdd:TIGR04523 349 KELTNSESENSEKQRELEEkQNEIEKLKKE----NQSYKQEIKNLESQINDLeskiqNQEKLNQQKDEQIKKLQQEKELL 424
|
330 340 350 360 370 380
....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 317373420 596 QKVIQWLLEEKLcalqcaVFDKTLAELKTR--VEKIECNKrHKTVLTELQAKIARLTKRFEAAKEDLKK 662
Cdd:TIGR04523 425 EKEIERLKETII------KNNSEIKDLTNQdsVKELIIKN-LDNTRESLETQLKVLSRSINKIKQNLEQ 486
|
|
| MDN1 |
COG5271 |
Midasin, AAA ATPase with vWA domain, involved in ribosome maturation [Translation, ribosomal ... |
80-552 |
4.01e-03 |
|
Midasin, AAA ATPase with vWA domain, involved in ribosome maturation [Translation, ribosomal structure and biogenesis];
Pssm-ID: 444083 [Multi-domain] Cd Length: 1028 Bit Score: 41.54 E-value: 4.01e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 80 DPEGSKAEWKETPCILSVNVKNKQDDDLNCEPLSPHNITPEPVSKLPAEPVSGDPAPGDLDAGDPASGVLASGDSTSGDP 159
Cdd:COG5271 274 ATDDADGLEAAEDDALDAELTAAQAADPESDDDADDSTLAALEGAAEDTEIATADELAAADDEDDDDSAAEDAAEEAATA 353
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 160 TSSEPSSSDAASGDATSGDAPSGDVSPGDATSGDATADDLSSGDPTSSDPIPGEPVPVEPISGDCAADDIASSEITSVDL 239
Cdd:COG5271 354 EDSAAEDTQDAEDEAAGEAADESEGADTDAAADEADAAADDSADDEEASADGGTSPTSDTDEEEEEADEDASAGETEDES 433
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 240 ASGAPASTDPASDDLASGDLSSSELASDDLATGELASDELTSESTFDRTFEPKSVPVCEPVPEIDNIEPSSNKDD----- 314
Cdd:COG5271 434 TDVTSAEDDIATDEEADSLADEEEEAEAELDTEEDTESAEEDADGDEATDEDDASDDGDEEEAEEDAEAEADSDEltaee 513
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 315 ---DFLEKNGADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTICSDRPPENEKKVEEDIITELALGEDAISSSME 391
Cdd:COG5271 514 tsaDDGADTDAAADPEDSDEDALEDETEGEENAPGSDQDADETDEPEATAEEDEPDEAEAETEDATENADADETEESADE 593
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 392 IDQGEKNEDETSADLVETINENVIEDNKSENILENTDSMETDEIIPILEKLAPSEDELTCFSKTSLLPIDETNPDLEEKM 471
Cdd:COG5271 594 SEEAEASEDEAAEEEEADDDEADADADGAADEEETEEEAAEDEAAEPETDASEAADEDADAETEAEASADESEEEAEDES 673
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 472 ESSfgSPSKQESSESLPKEAflvLSDEEDISGEKDESEVISQNETCSPAEVESNEKDNKPEEEEQVIHEDDERPSEKNEF 551
Cdd:COG5271 674 ETS--SEDAEEDADAAAAEA---SDDEEETEEADEDAETASEEADAEEADTEADGTAEEAEEAAEEAESADEEAASLPDE 748
|
.
gi 317373420 552 S 552
Cdd:COG5271 749 A 749
|
|
| PRK14971 |
PRK14971 |
DNA polymerase III subunit gamma/tau; |
992-1091 |
4.64e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 237874 [Multi-domain] Cd Length: 614 Bit Score: 41.30 E-value: 4.64e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 992 TMSSSQPVSRPLQPI--QPAPPLQPSGVPTSGPSQTTIHLLPTAPTTVNVTHRPVTQVTTRLPVPRApanhqvvyttLPA 1069
Cdd:PRK14971 368 DASGGRGPKQHIKPVftQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAA----------VPV 437
|
90 100
....*....|....*....|..
gi 317373420 1070 PPAQAPLRGTVMQAPAVRQVNP 1091
Cdd:PRK14971 438 NPPSTAPQAVRPAQFKEEKKIP 459
|
|
| PRK07003 |
PRK07003 |
DNA polymerase III subunit gamma/tau; |
121-313 |
5.09e-03 |
|
DNA polymerase III subunit gamma/tau;
Pssm-ID: 235906 [Multi-domain] Cd Length: 830 Bit Score: 41.37 E-value: 5.09e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 121 PVSKLPAEPVSGDPAPGDLDAGDPASGVLASGDSTSGDPtSSEPSSSDAASGDATSGDAPSGDVSPgdATSGDATADDLS 200
Cdd:PRK07003 368 PGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAA-GAALAPKAAAAAAATRAEAPPAAPAP--PATADRGDDAAD 444
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 201 SGDPTSSDpipgEPVPVEPisgDCAADDIASSEITSVDLASGAPASTDPASddlASGDLSSSELASDDLATGELASDELT 280
Cdd:PRK07003 445 GDAPVPAK----ANARASA---DSRCDERDAQPPADSGSASAPASDAPPDA---AFEPAPRAAAPSAATPAAVPDARAPA 514
|
170 180 190
....*....|....*....|....*....|...
gi 317373420 281 SESTFDRtfepkSVPVCEPVPEIDNIEPSSNKD 313
Cdd:PRK07003 515 AASREDA-----PAAAAPPAPEARPPTPAAAAP 542
|
|
| PHA03307 |
PHA03307 |
transcriptional regulator ICP4; Provisional |
655-1076 |
7.14e-03 |
|
transcriptional regulator ICP4; Provisional
Pssm-ID: 223039 [Multi-domain] Cd Length: 1352 Bit Score: 40.92 E-value: 7.14e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 655 AAKEDLKKRHEHPPNPPVSPGKTVNDVNSNNNMSYRNAGTVRQMLESKRNVSESAPPSFQTPVNTvSSTNLVTPPAVVSS 734
Cdd:PHA03307 56 VAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPT-PPPASPPPSPAPDL 134
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 735 QPKLQTPVTSGSLTATSVLPAPNTATVVATTQVPSGNPQPTISLQPLPVilHVPVAVSSQPQLLQSHPGTLVTNQPSGNV 814
Cdd:PHA03307 135 SEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETA--RAPSSPPAEPPPSTPPAAASPRPPRRSSP 212
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 815 EFISVQSPPtvsgltknpvslPSLPNPTKPNNVPSVPSPSIQRNPTASAAPLGTTlavqAVPTAHSIVQATR--TSLPTV 892
Cdd:PHA03307 213 ISASASSPA------------PAPGRSAADDAGASSSDSSSSESSGCGWGPENEC----PLPRPAPITLPTRiwEASGWN 276
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 893 GPSGLYSPSTNRGPIQMKIPISAFSTSSAAEqnSNTTPRIENQTNKTIDASVSKKAADSTSQCGKATGSDSSgvidltmd 972
Cdd:PHA03307 277 GPSSRPGPASSSSSPRERSPSPSPSSPGSGP--APSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPS-------- 346
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 973 DEESGASQDPkklnhtPVSTMSSSQPVSRPLQPIQPAPPlQPSGVPTSgpsqttihllPTAPTTVNVTHRPvTQVTTRLP 1052
Cdd:PHA03307 347 PSRSPSPSRP------PPPADPSSPRKRPRPSRAPSSPA-ASAGRPTR----------RRARAAVAGRARR-RDATGRFP 408
|
410 420
....*....|....*....|....
gi 317373420 1053 VPRAPANHQVVYTTLPAPPAQAPL 1076
Cdd:PHA03307 409 AGRPRPSPLDAGAASGAFYARYPL 432
|
|
| COG1340 |
COG1340 |
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown]; |
532-665 |
9.66e-03 |
|
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
Pssm-ID: 440951 [Multi-domain] Cd Length: 297 Bit Score: 39.51 E-value: 9.66e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 532 EEEEQVIHEDDERPSEKNEFSRRKRSKSEDMDNVQSKRRRYMEE--EYEAEfqvkitaKGDINQKLQKVIQWLLEEKLCA 609
Cdd:COG1340 29 EKRDELNEELKELAEKRDELNAQVKELREEAQELREKRDELNEKvkELKEE-------RDELNEKLNELREELDELRKEL 101
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|..
gi 317373420 610 LQCAVFDKTLAELKTRVEKIEcnKRHKT-VLT-----ELQAKIARLTKRFEAAKEDLKKRHE 665
Cdd:COG1340 102 AELNKAGGSIDKLRKEIERLE--WRQQTeVLSpeeekELVEKIKELEKELEKAKKALEKNEK 161
|
|
|