NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|317373420|sp|Q6VMQ6|]
View 

RecName: Full=Activating transcription factor 7-interacting protein 1; AltName: Full=ATF-interacting protein; Short=ATF-IP; AltName: Full=ATF7-interacting protein; AltName: Full=ATFa-associated modulator; Short=hAM; AltName: Full=MBD1-containing chromatin-associated factor 1; AltName: Full=P621

Protein Classification

ATF7IP_BD and fn3_4 domain-containing protein( domain architecture ID 11245579)

protein containing domains ATF7IP_BD, PHA03247, and fn3_4

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
ATF7IP_BD pfam16788
ATF-interacting protein binding domain; ATF7IP-BD is a short conserved region of activating ...
564-779 1.12e-76

ATF-interacting protein binding domain; ATF7IP-BD is a short conserved region of activating transcription factor 7-interacting protein 1 found in higher eukaryotes. This domain appears to bind several key proteins such as TFIIE-alpha and TFIIE-beta as well the transcriptional regulator Sp1 which are part of the transcriptional machinery.


:

Pssm-ID: 465271 [Multi-domain]  Cd Length: 214  Bit Score: 252.29  E-value: 1.12e-76
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   564 NVQSKRRRYMEEeyeaeFQVKITAKGDINQKLQKVIQWLLEEKLCALQCAVFDKTLAELKTRVEKIECNKRHKTVLTELQ 643
Cdd:pfam16788    1 KENVKRMKTSEQ-----INENICVALEKQTALLEQVKHLIEQEICSINYKLFDKKLKELNERVEKTECRKKHEAIATELQ 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   644 AKIARLTKRFEAAKEDLKKrhehpPNPPVSPGKTVND--VNSNNNMSYRNAGTVRQMLESKRNVSESAPpsFQTPVNTVS 721
Cdd:pfam16788   76 AKIARLTKRFKAALEDLKK-----CLPPNSPSSNAASkvANSNTINLYRNAGSVRSMLESKRSVGESSP--FQPPEKASK 148
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 317373420   722 STNLVTPPAVVSSQPKLQTPVTSGSLT----ATSVLPAPNTATVV---ATTQVPSGNPQPT-ISLQ 779
Cdd:pfam16788  149 KINLTSPQNEVVSESNNQDDVMLISVEspnlTTPVTSNPTDTRKVtsgNSSNSPSAETEVMaVEKK 214
fn3_4 pfam16794
Fibronectin-III type domain;
1160-1260 6.58e-49

Fibronectin-III type domain;


:

Pssm-ID: 465273 [Multi-domain]  Cd Length: 101  Bit Score: 168.68  E-value: 6.58e-49
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  1160 LPQKPHLKLARVQsqNGIVLSWSVLEVDRSCATVDSYHLYAYHEEPSATVPS-QWKKIGEVKALPLPMACTLTQFVSGSK 1238
Cdd:pfam16794    2 PPQKPTLKLARVP--TGIVLSWNMPDLDPKYAPVESYHLFAYQENTSTTPSTdSWKKIGDVKALPLPMACTLSQFKAGQR 79
                           90       100
                   ....*....|....*....|..
gi 317373420  1239 YYFAVRAKDIYGRFGPFCDPQS 1260
Cdd:pfam16794   80 YYFAVRAVDIHGRYGPFSDPKT 101
PHA03247 super family cl33720
large tegument protein UL36; Provisional
822-1158 4.84e-09

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.11  E-value: 4.84e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  822 PPTVSGLTK--NPVSLPSLPNPTKPNNVPSVPSPSIQRNPTASAAPLGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYS 899
Cdd:PHA03247 2689 RPTVGSLTSlaDPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPA 2768
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  900 PSTNRGPiqmkipisafstsSAAEQNSNTTPRIENQTNKTIDASVSKKAADSTSQCGKATGSDSsgvidltmddeesgAS 979
Cdd:PHA03247 2769 PAPPAAP-------------AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP--------------PA 2821
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  980 QDPKklnhTPVSTMSSSQPVSRPLQPIQPAPPLQPSG-VPTSGPSQTTIHLLPTAPTTVNVTHRPVTQVTtRLPVPRAP- 1057
Cdd:PHA03247 2822 ASPA----GPLPPPTSAQPTAPPPPPGPPPPSLPLGGsVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLA-RPAVSRSTe 2896
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1058 --ANHQVVYTTLPAPPAQAPLRGTVMQAPAVR---QVNPQNSVTVRVPQTTTYVVNNGLTLGSTGPQLTvHHRP-----P 1127
Cdd:PHA03247 2897 sfALPPDQPERPPQPQAPPPPQPQPQPPPPPQpqpPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG-ALVPgrvavP 2975
                         330       340       350
                  ....*....|....*....|....*....|.
gi 317373420 1128 QVHTEPPRPVHPAPLPEAPQPQRLPPEAAST 1158
Cdd:PHA03247 2976 RFRVPQPAPSREAPASSTPPLTGHSLSRVSS 3006
PTZ00341 super family cl31759
Ring-infected erythrocyte surface antigen; Provisional
319-574 4.54e-08

Ring-infected erythrocyte surface antigen; Provisional


The actual alignment was detected with superfamily member PTZ00341:

Pssm-ID: 173534 [Multi-domain]  Cd Length: 1136  Bit Score: 57.87  E-value: 4.54e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  319 KNGADEKLEQIQSKDSLDEKNKADNNIDAN-EETLEtddtticsdrppEN-EKKVEEDIitelalgEDAISSSMEIDQGE 396
Cdd:PTZ00341  929 KNQNENVPEHLKEHAEANIEEDAEENVEEDaEENVE------------ENvEENVEENV-------EENVEENVEENVEE 989
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  397 KNEDETSADLVETINENViEDNKSENILENTDSMETDEIIPILEKLAPSEDEltcfsktsllPIDETNPDLEEKMESSFg 476
Cdd:PTZ00341  990 NVEENVEENVEENIEENV-EENVEENIEENVEEYDEENVEEVEENVEEYDEE----------NVEEIEENAEENVEENI- 1057
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  477 spskQESSESLPKEaflvlsDEEDIsgEKDESEVISQNetcspaeVESNEKDNKPEEEEQVIHEDDERPSEKNEFSRRKR 556
Cdd:PTZ00341 1058 ----EENIEEYDEE------NVEEI--EENIEENIEEN-------VEENVEENVEEIEENVEENVEENAEENAEENAEEN 1118
                         250
                  ....*....|....*...
gi 317373420  557 SKSEDMDNVQSKRRRYME 574
Cdd:PTZ00341 1119 AEEYDDENPEEHNEEYDE 1136
MSCRAMM_ClfA super family cl41352
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
122-433 1.87e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


The actual alignment was detected with superfamily member NF033609:

Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 49.14  E-value: 1.87e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  122 VSKLPAEPVSGDPAPGDLDA------GDPASGVLASGDSTSGDPTSSEP-SSSDAASGDATSGDAPSGDVSPGDATSGDA 194
Cdd:NF033609  544 VPEQPDEPGEIEPIPEDSDSdpgsdsGSDSSNSDSGSDSGSDSTSDSGSdSASDSDSASDSDSASDSDSASDSDSASDSD 623
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  195 TADDLSSGDPTSSDPIPGEPVPVEPISGDCAADDIASSEITSVDLASGAPASTDPASDDLASGDLSSSELASDDLATGEL 274
Cdd:NF033609  624 SASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 703
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  275 ASDELTSESTFDRTFEPKSVPVCEPVPEIDNiEPSSNKDDDFLEKNGADEKLEQIQSKDSlDEKNKADNNIDANEETLET 354
Cdd:NF033609  704 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSD 781
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 317373420  355 DDTTICSDRPPENEKKVEEDIITELALGEDAISSSmEIDQGEKNEDETSADLVETINENVIEDNKSENILENTDSMETD 433
Cdd:NF033609  782 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESD 859
 
Name Accession Description Interval E-value
ATF7IP_BD pfam16788
ATF-interacting protein binding domain; ATF7IP-BD is a short conserved region of activating ...
564-779 1.12e-76

ATF-interacting protein binding domain; ATF7IP-BD is a short conserved region of activating transcription factor 7-interacting protein 1 found in higher eukaryotes. This domain appears to bind several key proteins such as TFIIE-alpha and TFIIE-beta as well the transcriptional regulator Sp1 which are part of the transcriptional machinery.


Pssm-ID: 465271 [Multi-domain]  Cd Length: 214  Bit Score: 252.29  E-value: 1.12e-76
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   564 NVQSKRRRYMEEeyeaeFQVKITAKGDINQKLQKVIQWLLEEKLCALQCAVFDKTLAELKTRVEKIECNKRHKTVLTELQ 643
Cdd:pfam16788    1 KENVKRMKTSEQ-----INENICVALEKQTALLEQVKHLIEQEICSINYKLFDKKLKELNERVEKTECRKKHEAIATELQ 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   644 AKIARLTKRFEAAKEDLKKrhehpPNPPVSPGKTVND--VNSNNNMSYRNAGTVRQMLESKRNVSESAPpsFQTPVNTVS 721
Cdd:pfam16788   76 AKIARLTKRFKAALEDLKK-----CLPPNSPSSNAASkvANSNTINLYRNAGSVRSMLESKRSVGESSP--FQPPEKASK 148
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 317373420   722 STNLVTPPAVVSSQPKLQTPVTSGSLT----ATSVLPAPNTATVV---ATTQVPSGNPQPT-ISLQ 779
Cdd:pfam16788  149 KINLTSPQNEVVSESNNQDDVMLISVEspnlTTPVTSNPTDTRKVtsgNSSNSPSAETEVMaVEKK 214
fn3_4 pfam16794
Fibronectin-III type domain;
1160-1260 6.58e-49

Fibronectin-III type domain;


Pssm-ID: 465273 [Multi-domain]  Cd Length: 101  Bit Score: 168.68  E-value: 6.58e-49
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  1160 LPQKPHLKLARVQsqNGIVLSWSVLEVDRSCATVDSYHLYAYHEEPSATVPS-QWKKIGEVKALPLPMACTLTQFVSGSK 1238
Cdd:pfam16794    2 PPQKPTLKLARVP--TGIVLSWNMPDLDPKYAPVESYHLFAYQENTSTTPSTdSWKKIGDVKALPLPMACTLSQFKAGQR 79
                           90       100
                   ....*....|....*....|..
gi 317373420  1239 YYFAVRAKDIYGRFGPFCDPQS 1260
Cdd:pfam16794   80 YYFAVRAVDIHGRYGPFSDPKT 101
PHA03247 PHA03247
large tegument protein UL36; Provisional
822-1158 4.84e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.11  E-value: 4.84e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  822 PPTVSGLTK--NPVSLPSLPNPTKPNNVPSVPSPSIQRNPTASAAPLGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYS 899
Cdd:PHA03247 2689 RPTVGSLTSlaDPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPA 2768
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  900 PSTNRGPiqmkipisafstsSAAEQNSNTTPRIENQTNKTIDASVSKKAADSTSQCGKATGSDSsgvidltmddeesgAS 979
Cdd:PHA03247 2769 PAPPAAP-------------AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP--------------PA 2821
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  980 QDPKklnhTPVSTMSSSQPVSRPLQPIQPAPPLQPSG-VPTSGPSQTTIHLLPTAPTTVNVTHRPVTQVTtRLPVPRAP- 1057
Cdd:PHA03247 2822 ASPA----GPLPPPTSAQPTAPPPPPGPPPPSLPLGGsVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLA-RPAVSRSTe 2896
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1058 --ANHQVVYTTLPAPPAQAPLRGTVMQAPAVR---QVNPQNSVTVRVPQTTTYVVNNGLTLGSTGPQLTvHHRP-----P 1127
Cdd:PHA03247 2897 sfALPPDQPERPPQPQAPPPPQPQPQPPPPPQpqpPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG-ALVPgrvavP 2975
                         330       340       350
                  ....*....|....*....|....*....|.
gi 317373420 1128 QVHTEPPRPVHPAPLPEAPQPQRLPPEAAST 1158
Cdd:PHA03247 2976 RFRVPQPAPSREAPASSTPPLTGHSLSRVSS 3006
PTZ00341 PTZ00341
Ring-infected erythrocyte surface antigen; Provisional
319-574 4.54e-08

Ring-infected erythrocyte surface antigen; Provisional


Pssm-ID: 173534 [Multi-domain]  Cd Length: 1136  Bit Score: 57.87  E-value: 4.54e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  319 KNGADEKLEQIQSKDSLDEKNKADNNIDAN-EETLEtddtticsdrppEN-EKKVEEDIitelalgEDAISSSMEIDQGE 396
Cdd:PTZ00341  929 KNQNENVPEHLKEHAEANIEEDAEENVEEDaEENVE------------ENvEENVEENV-------EENVEENVEENVEE 989
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  397 KNEDETSADLVETINENViEDNKSENILENTDSMETDEIIPILEKLAPSEDEltcfsktsllPIDETNPDLEEKMESSFg 476
Cdd:PTZ00341  990 NVEENVEENVEENIEENV-EENVEENIEENVEEYDEENVEEVEENVEEYDEE----------NVEEIEENAEENVEENI- 1057
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  477 spskQESSESLPKEaflvlsDEEDIsgEKDESEVISQNetcspaeVESNEKDNKPEEEEQVIHEDDERPSEKNEFSRRKR 556
Cdd:PTZ00341 1058 ----EENIEEYDEE------NVEEI--EENIEENIEEN-------VEENVEENVEEIEENVEENVEENAEENAEENAEEN 1118
                         250
                  ....*....|....*...
gi 317373420  557 SKSEDMDNVQSKRRRYME 574
Cdd:PTZ00341 1119 AEEYDDENPEEHNEEYDE 1136
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
708-1149 6.28e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 54.00  E-value: 6.28e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   708 SAPPSFQTPVNTVSSTNLVTPPAVVSSQP-KLQTPVTSGSLTATSVLPAPNTATVVATTQVPSGNPQPTISLQPLPVILH 786
Cdd:pfam03154  143 STSPSIPSPQDNESDSDSSAQQQILQTQPpVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQ 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   787 VPVAVSSqpqLLQSHPGTLVTNQPSGNVEFISVQSPPTVSGLTKNPVSLPSLPNPTKPNNVPSVPSPSIQRNPTASAA-P 865
Cdd:pfam03154  223 STAAPHT---LIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPfP 299
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   866 LGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYSPSTNRGPIQMKIPISAFSTSSAAEQNSNTTPRIENQTNKTIDASVS 945
Cdd:pfam03154  300 LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLS 379
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   946 KKAADSTSqcgkatgsdssgvidltmddeesgaSQDPKKLNHTPVSTMSSSQPVSRPLQPIQPAPPLQPSGVPTSGPSQT 1025
Cdd:pfam03154  380 GPSPFQMN-------------------------SNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVL 434
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  1026 TIHLLPTAPTTVNVTHRPVTQVTTRLPVPRAPANHQVVYTTLPA--PPAQAPLRGTVMQAPAVRQVnpqnSVTVRVPQTT 1103
Cdd:pfam03154  435 TQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPsgPPTSTSSAMPGIQPPSSASV----SSSGPVPAAV 510
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*.
gi 317373420  1104 TYVVnngltlgstgPQLTVHHRPPQvhtEPPRPVHPAPLPEAPQPQ 1149
Cdd:pfam03154  511 SCPL----------PPVQIKEEALD---EAEEPESPPPPPRSPSPE 543
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
122-433 1.87e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 49.14  E-value: 1.87e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  122 VSKLPAEPVSGDPAPGDLDA------GDPASGVLASGDSTSGDPTSSEP-SSSDAASGDATSGDAPSGDVSPGDATSGDA 194
Cdd:NF033609  544 VPEQPDEPGEIEPIPEDSDSdpgsdsGSDSSNSDSGSDSGSDSTSDSGSdSASDSDSASDSDSASDSDSASDSDSASDSD 623
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  195 TADDLSSGDPTSSDPIPGEPVPVEPISGDCAADDIASSEITSVDLASGAPASTDPASDDLASGDLSSSELASDDLATGEL 274
Cdd:NF033609  624 SASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 703
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  275 ASDELTSESTFDRTFEPKSVPVCEPVPEIDNiEPSSNKDDDFLEKNGADEKLEQIQSKDSlDEKNKADNNIDANEETLET 354
Cdd:NF033609  704 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSD 781
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 317373420  355 DDTTICSDRPPENEKKVEEDIITELALGEDAISSSmEIDQGEKNEDETSADLVETINENVIEDNKSENILENTDSMETD 433
Cdd:NF033609  782 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESD 859
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
110-287 4.10e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 48.24  E-value: 4.10e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  110 EPLSPHNITPEPVSKLPAEPVSGDPAPGDLDAGD------PASGVLASGDSTSGdPTSSEPSSSDAASGDATSGDAPSGD 183
Cdd:PHA03307   78 EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPsspdppPPTPPPASPPPSPA-PDLSEMLRPVGSPGPPPAASPPAAG 156
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  184 VSPGDATSGDAT---ADDLSSGDPTSSDPI--PGEPVPVEPISGDCAADDIASSEITSVDLASGAPASTDPASDDLASGD 258
Cdd:PHA03307  157 ASPAAVASDAASsrqAALPLSSPEETARAPssPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASS 236
                         170       180
                  ....*....|....*....|....*....
gi 317373420  259 LSSSELASDDLATGELASDELTSESTFDR 287
Cdd:PHA03307  237 SDSSSSESSGCGWGPENECPLPRPAPITL 265
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
324-662 2.25e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 42.35  E-value: 2.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   324 EKLEQIQSKdsLDEKNKAdnnIDANEETLETDDTTIcsdrppENEKKVEEDIITELALG-EDAISSSMEIDQGEKNEDET 402
Cdd:TIGR02168  684 EKIEELEEK--IAELEKA---LAELRKELEELEEEL------EQLRKELEELSRQISALrKDLARLEAEVEQLEERIAQL 752
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   403 SADLVETINENVIEDNK----SENILENTDSMETDEiipilEKLAPSEDELTCFSKTsllpIDETNPDLEEKMESSFgsp 478
Cdd:TIGR02168  753 SKELTELEAEIEELEERleeaEEELAEAEAEIEELE-----AQIEQLKEELKALREA----LDELRAELTLLNEEAA--- 820
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   479 SKQESSESLPKEAFLVLSDEEDISGE-KDESEVISQNEtcspAEVESnEKDNKPEEEEQVIHEDDERpSEKNEFSRRKRS 557
Cdd:TIGR02168  821 NLRERLESLERRIAATERRLEDLEEQiEELSEDIESLA----AEIEE-LEELIEELESELEALLNER-ASLEEALALLRS 894
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   558 KSEDMDNVQ---SKRRRYMEEEYEAefqvKITAKGDINQKLQKVIQWL--LEEKLCALQCAVFDKTLAELKTRVEKIECN 632
Cdd:TIGR02168  895 ELEELSEELrelESKRSELRRELEE----LREKLAQLELRLEGLEVRIdnLQERLSEEYSLTLEEAEALENKIEDDEEEA 970
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*..
gi 317373420   633 KRHktvLTELQAKIARL--------------TKRFE---AAKEDLKK 662
Cdd:TIGR02168  971 RRR---LKRLENKIKELgpvnlaaieeyeelKERYDfltAQKEDLTE 1014
MDN1 COG5271
Midasin, AAA ATPase with vWA domain, involved in ribosome maturation [Translation, ribosomal ...
80-552 4.01e-03

Midasin, AAA ATPase with vWA domain, involved in ribosome maturation [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444083 [Multi-domain]  Cd Length: 1028  Bit Score: 41.54  E-value: 4.01e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   80 DPEGSKAEWKETPCILSVNVKNKQDDDLNCEPLSPHNITPEPVSKLPAEPVSGDPAPGDLDAGDPASGVLASGDSTSGDP 159
Cdd:COG5271   274 ATDDADGLEAAEDDALDAELTAAQAADPESDDDADDSTLAALEGAAEDTEIATADELAAADDEDDDDSAAEDAAEEAATA 353
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  160 TSSEPSSSDAASGDATSGDAPSGDVSPGDATSGDATADDLSSGDPTSSDPIPGEPVPVEPISGDCAADDIASSEITSVDL 239
Cdd:COG5271   354 EDSAAEDTQDAEDEAAGEAADESEGADTDAAADEADAAADDSADDEEASADGGTSPTSDTDEEEEEADEDASAGETEDES 433
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  240 ASGAPASTDPASDDLASGDLSSSELASDDLATGELASDELTSESTFDRTFEPKSVPVCEPVPEIDNIEPSSNKDD----- 314
Cdd:COG5271   434 TDVTSAEDDIATDEEADSLADEEEEAEAELDTEEDTESAEEDADGDEATDEDDASDDGDEEEAEEDAEAEADSDEltaee 513
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  315 ---DFLEKNGADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTICSDRPPENEKKVEEDIITELALGEDAISSSME 391
Cdd:COG5271   514 tsaDDGADTDAAADPEDSDEDALEDETEGEENAPGSDQDADETDEPEATAEEDEPDEAEAETEDATENADADETEESADE 593
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  392 IDQGEKNEDETSADLVETINENVIEDNKSENILENTDSMETDEIIPILEKLAPSEDELTCFSKTSLLPIDETNPDLEEKM 471
Cdd:COG5271   594 SEEAEASEDEAAEEEEADDDEADADADGAADEEETEEEAAEDEAAEPETDASEAADEDADAETEAEASADESEEEAEDES 673
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  472 ESSfgSPSKQESSESLPKEAflvLSDEEDISGEKDESEVISQNETCSPAEVESNEKDNKPEEEEQVIHEDDERPSEKNEF 551
Cdd:COG5271   674 ETS--SEDAEEDADAAAAEA---SDDEEETEEADEDAETASEEADAEEADTEADGTAEEAEEAAEEAESADEEAASLPDE 748

                  .
gi 317373420  552 S 552
Cdd:COG5271   749 A 749
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
532-665 9.66e-03

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 39.51  E-value: 9.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  532 EEEEQVIHEDDERPSEKNEFSRRKRSKSEDMDNVQSKRRRYMEE--EYEAEfqvkitaKGDINQKLQKVIQWLLEEKLCA 609
Cdd:COG1340    29 EKRDELNEELKELAEKRDELNAQVKELREEAQELREKRDELNEKvkELKEE-------RDELNEKLNELREELDELRKEL 101
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 317373420  610 LQCAVFDKTLAELKTRVEKIEcnKRHKT-VLT-----ELQAKIARLTKRFEAAKEDLKKRHE 665
Cdd:COG1340   102 AELNKAGGSIDKLRKEIERLE--WRQQTeVLSpeeekELVEKIKELEKELEKAKKALEKNEK 161
 
Name Accession Description Interval E-value
ATF7IP_BD pfam16788
ATF-interacting protein binding domain; ATF7IP-BD is a short conserved region of activating ...
564-779 1.12e-76

ATF-interacting protein binding domain; ATF7IP-BD is a short conserved region of activating transcription factor 7-interacting protein 1 found in higher eukaryotes. This domain appears to bind several key proteins such as TFIIE-alpha and TFIIE-beta as well the transcriptional regulator Sp1 which are part of the transcriptional machinery.


Pssm-ID: 465271 [Multi-domain]  Cd Length: 214  Bit Score: 252.29  E-value: 1.12e-76
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   564 NVQSKRRRYMEEeyeaeFQVKITAKGDINQKLQKVIQWLLEEKLCALQCAVFDKTLAELKTRVEKIECNKRHKTVLTELQ 643
Cdd:pfam16788    1 KENVKRMKTSEQ-----INENICVALEKQTALLEQVKHLIEQEICSINYKLFDKKLKELNERVEKTECRKKHEAIATELQ 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   644 AKIARLTKRFEAAKEDLKKrhehpPNPPVSPGKTVND--VNSNNNMSYRNAGTVRQMLESKRNVSESAPpsFQTPVNTVS 721
Cdd:pfam16788   76 AKIARLTKRFKAALEDLKK-----CLPPNSPSSNAASkvANSNTINLYRNAGSVRSMLESKRSVGESSP--FQPPEKASK 148
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 317373420   722 STNLVTPPAVVSSQPKLQTPVTSGSLT----ATSVLPAPNTATVV---ATTQVPSGNPQPT-ISLQ 779
Cdd:pfam16788  149 KINLTSPQNEVVSESNNQDDVMLISVEspnlTTPVTSNPTDTRKVtsgNSSNSPSAETEVMaVEKK 214
fn3_4 pfam16794
Fibronectin-III type domain;
1160-1260 6.58e-49

Fibronectin-III type domain;


Pssm-ID: 465273 [Multi-domain]  Cd Length: 101  Bit Score: 168.68  E-value: 6.58e-49
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  1160 LPQKPHLKLARVQsqNGIVLSWSVLEVDRSCATVDSYHLYAYHEEPSATVPS-QWKKIGEVKALPLPMACTLTQFVSGSK 1238
Cdd:pfam16794    2 PPQKPTLKLARVP--TGIVLSWNMPDLDPKYAPVESYHLFAYQENTSTTPSTdSWKKIGDVKALPLPMACTLSQFKAGQR 79
                           90       100
                   ....*....|....*....|..
gi 317373420  1239 YYFAVRAKDIYGRFGPFCDPQS 1260
Cdd:pfam16794   80 YYFAVRAVDIHGRYGPFSDPKT 101
PHA03247 PHA03247
large tegument protein UL36; Provisional
822-1158 4.84e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.11  E-value: 4.84e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  822 PPTVSGLTK--NPVSLPSLPNPTKPNNVPSVPSPSIQRNPTASAAPLGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYS 899
Cdd:PHA03247 2689 RPTVGSLTSlaDPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPA 2768
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  900 PSTNRGPiqmkipisafstsSAAEQNSNTTPRIENQTNKTIDASVSKKAADSTSQCGKATGSDSsgvidltmddeesgAS 979
Cdd:PHA03247 2769 PAPPAAP-------------AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALP--------------PA 2821
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  980 QDPKklnhTPVSTMSSSQPVSRPLQPIQPAPPLQPSG-VPTSGPSQTTIHLLPTAPTTVNVTHRPVTQVTtRLPVPRAP- 1057
Cdd:PHA03247 2822 ASPA----GPLPPPTSAQPTAPPPPPGPPPPSLPLGGsVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLA-RPAVSRSTe 2896
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1058 --ANHQVVYTTLPAPPAQAPLRGTVMQAPAVR---QVNPQNSVTVRVPQTTTYVVNNGLTLGSTGPQLTvHHRP-----P 1127
Cdd:PHA03247 2897 sfALPPDQPERPPQPQAPPPPQPQPQPPPPPQpqpPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLG-ALVPgrvavP 2975
                         330       340       350
                  ....*....|....*....|....*....|.
gi 317373420 1128 QVHTEPPRPVHPAPLPEAPQPQRLPPEAAST 1158
Cdd:PHA03247 2976 RFRVPQPAPSREAPASSTPPLTGHSLSRVSS 3006
PHA03247 PHA03247
large tegument protein UL36; Provisional
820-1174 4.96e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 61.11  E-value: 4.96e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  820 QSPPTVSGLTKNPVSLPSLPNPTKPNNVPSVPSPSiQRNPTASAAPLGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYS 899
Cdd:PHA03247 2595 SARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPP-SPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRA 2673
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  900 P---STNRGPIQMKIPISAFS-TSSAAEQNSNTTPriENQTNKTIDASVSKKAADSTSQCGKATGSDSSgvidltmddee 975
Cdd:PHA03247 2674 AqasSPPQRPRRRAARPTVGSlTSLADPPPPPPTP--EPAPHALVSATPLPPGPAAARQASPALPAAPA----------- 2740
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  976 sgasqdPKKLNHTPVSTMSSSQPVSRPLQ--PIQPAPPLQPSG-------VPTSGPSQTTIHLLPTAPTTVNVThRPVTQ 1046
Cdd:PHA03247 2741 ------PPAVPAGPATPGGPARPARPPTTagPPAPAPPAAPAAgpprrltRPAVASLSESRESLPSPWDPADPP-AAVLA 2813
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1047 VTTRLPVPRAPANHQVVYTT-LPAPPAQAP--------LRGTVMQAPAVRQVNPQNSvTVRVPQTTTYVVNNGLTlgstG 1117
Cdd:PHA03247 2814 PAAALPPAASPAGPLPPPTSaQPTAPPPPPgppppslpLGGSVAPGGDVRRRPPSRS-PAAKPAAPARPPVRRLA----R 2888
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 317373420 1118 PQLTvhhRPPQVHTEPPRPVHPAPLPEAPQPQRLPPEAASTSLPQKPHLKLARVQSQ 1174
Cdd:PHA03247 2889 PAVS---RSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPP 2942
PTZ00341 PTZ00341
Ring-infected erythrocyte surface antigen; Provisional
319-574 4.54e-08

Ring-infected erythrocyte surface antigen; Provisional


Pssm-ID: 173534 [Multi-domain]  Cd Length: 1136  Bit Score: 57.87  E-value: 4.54e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  319 KNGADEKLEQIQSKDSLDEKNKADNNIDAN-EETLEtddtticsdrppEN-EKKVEEDIitelalgEDAISSSMEIDQGE 396
Cdd:PTZ00341  929 KNQNENVPEHLKEHAEANIEEDAEENVEEDaEENVE------------ENvEENVEENV-------EENVEENVEENVEE 989
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  397 KNEDETSADLVETINENViEDNKSENILENTDSMETDEIIPILEKLAPSEDEltcfsktsllPIDETNPDLEEKMESSFg 476
Cdd:PTZ00341  990 NVEENVEENVEENIEENV-EENVEENIEENVEEYDEENVEEVEENVEEYDEE----------NVEEIEENAEENVEENI- 1057
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  477 spskQESSESLPKEaflvlsDEEDIsgEKDESEVISQNetcspaeVESNEKDNKPEEEEQVIHEDDERPSEKNEFSRRKR 556
Cdd:PTZ00341 1058 ----EENIEEYDEE------NVEEI--EENIEENIEEN-------VEENVEENVEEIEENVEENVEENAEENAEENAEEN 1118
                         250
                  ....*....|....*...
gi 317373420  557 SKSEDMDNVQSKRRRYME 574
Cdd:PTZ00341 1119 AEEYDDENPEEHNEEYDE 1136
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
708-1149 6.28e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 54.00  E-value: 6.28e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   708 SAPPSFQTPVNTVSSTNLVTPPAVVSSQP-KLQTPVTSGSLTATSVLPAPNTATVVATTQVPSGNPQPTISLQPLPVILH 786
Cdd:pfam03154  143 STSPSIPSPQDNESDSDSSAQQQILQTQPpVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQ 222
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   787 VPVAVSSqpqLLQSHPGTLVTNQPSGNVEFISVQSPPTVSGLTKNPVSLPSLPNPTKPNNVPSVPSPSIQRNPTASAA-P 865
Cdd:pfam03154  223 STAAPHT---LIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQPfP 299
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   866 LGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYSPSTNRGPIQMKIPISAFSTSSAAEQNSNTTPRIENQTNKTIDASVS 945
Cdd:pfam03154  300 LTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPHLS 379
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   946 KKAADSTSqcgkatgsdssgvidltmddeesgaSQDPKKLNHTPVSTMSSSQPVSRPLQPIQPAPPLQPSGVPTSGPSQT 1025
Cdd:pfam03154  380 GPSPFQMN-------------------------SNLPPPPALKPLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVL 434
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  1026 TIHLLPTAPTTVNVTHRPVTQVTTRLPVPRAPANHQVVYTTLPA--PPAQAPLRGTVMQAPAVRQVnpqnSVTVRVPQTT 1103
Cdd:pfam03154  435 TQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPFVPGGPPPITPPsgPPTSTSSAMPGIQPPSSASV----SSSGPVPAAV 510
                          410       420       430       440
                   ....*....|....*....|....*....|....*....|....*.
gi 317373420  1104 TYVVnngltlgstgPQLTVHHRPPQvhtEPPRPVHPAPLPEAPQPQ 1149
Cdd:pfam03154  511 SCPL----------PPVQIKEEALD---EAEEPESPPPPPRSPSPE 543
PTZ00121 PTZ00121
MAEBL; Provisional
319-684 2.44e-06

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 52.45  E-value: 2.44e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  319 KNGADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTICSD--RPPENEKKVEEDIITELALGEDAISSSMEIDQGE 396
Cdd:PTZ00121 1437 KKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEeaKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAK 1516
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  397 KNEDETSADLVETINE--------NVIEDNKSENILENTDSMETDEIIPILEKLAPSEDELTCFSKTSLLPIDEtNPDLE 468
Cdd:PTZ00121 1517 KAEEAKKADEAKKAEEakkadeakKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAE-EARIE 1595
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  469 EKMEssFGSPSKQESSESLPKEAFLVLSDEEdISGEKDESEVISQNETCSPAEVESNEKDNKPEEEEQVIHEDDERPSE- 547
Cdd:PTZ00121 1596 EVMK--LYEEEKKMKAEEAKKAEEAKIKAEE-LKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEe 1672
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  548 ---------KNEFSRRKRS-----KSEDMDNVQSKRRRYMEEEYEAEfQVKitakgdinqKLQKVIQWLLEEklcALQCA 613
Cdd:PTZ00121 1673 dkkkaeeakKAEEDEKKAAealkkEAEEAKKAEELKKKEAEEKKKAE-ELK---------KAEEENKIKAEE---AKKEA 1739
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 317373420  614 VFDKTLAElKTRVEKIECNKRHKTVLTELQAKIARLTKRFEAAKEDLKKRHEhppNPPVSPGKTVNDVNSN 684
Cdd:PTZ00121 1740 EEDKKKAE-EAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDE---KRRMEVDKKIKDIFDN 1806
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
792-1172 5.64e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 50.92  E-value: 5.64e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   792 SSQPQLLQSHPGTLVTNQPSgnvefiSVQSPPTVSGLTKNPVSLPSLPNPTKPNNV-PSVPSPSIQRNPTASAAPL---G 867
Cdd:pfam03154  161 SAQQQILQTQPPVLQAQSGA------ASPPSPPPPGTTQAATAGPTPSAPSVPPQGsPATSQPPNQTQSTAAPHTLiqqT 234
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   868 TTLAVQAVPTAHSIVQ-ATRTSLPTVGPSGLYSPSTNRGPIQ-MKIPISAFSTSSAAEQNSNTTPRIENQTNKTIDASVS 945
Cdd:pfam03154  235 PTLHPQRLPSPHPPLQpMTQPPPPSQVSPQPLPQPSLHGQMPpMPHSLQTGPSHMQHPVPPQPFPLTPQSSQSQVPPGPS 314
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   946 KKAADSTSQCGKATGSDSSGvidltmddeesgASQDPKKLNHTPVSTMSSSQPVSRPLQPIQPAPPLQPSGVPT--SGPS 1023
Cdd:pfam03154  315 PAAPGQSQQRIHTPPSQSQL------------QSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLPNPQSHKHPPhlSGPS 382
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  1024 QTTIHL-LPTAPTTvnvthRPVTQVTTRLPVPRAPANHQVVYTT--LPAPPAQAPLRGTVMQAPAVRQVNPQNSVTVRVP 1100
Cdd:pfam03154  383 PFQMNSnLPPPPAL-----KPLSSLSTHHPPSAHPPPLQLMPQSqqLPPPPAQPPVLTQSQSLPPPAASHPPTSGLHQVP 457
                          330       340       350       360       370       380       390
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 317373420  1101 QTTTYVVNNGLTLGStgpqltvhhrPPQVHTEPPRPVHPAPLPEAPQPQRLPPeAASTSLPQKPHLKLARVQ 1172
Cdd:pfam03154  458 SQSPFPQHPFVPGGP----------PPITPPSGPPTSTSSAMPGIQPPSSASV-SSSGPVPAAVSCPLPPVQ 518
PHA03247 PHA03247
large tegument protein UL36; Provisional
708-1024 9.15e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 9.15e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  708 SAPPSFQTPVNTVSSTNLVTPPAVVSSQPKLQTPVTSGSLTATSVLPAPNTATVVATTQVPSGNPQPTISLQPLPVILHV 787
Cdd:PHA03247 2764 AGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPP 2843
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  788 PVAVSSQPQLLQSHPGTLVTNQPSGNVEFISVQSP--PTVSGLTKNPVSLPSLPNPTKPNNVPSVPSPSIQRNPTASA-- 863
Cdd:PHA03247 2844 GPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAParPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPqp 2923
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  864 -APLGTTLAVQAVPTAHSIVQATRTSLPTVGPSGLYSPSTNRGPIQMKIPISAFSTSSAAEqnSNTTPRIENQTNKTIDA 942
Cdd:PHA03247 2924 pPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAP--SREAPASSTPPLTGHSL 3001
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  943 SVSKKAADSTSQCGKATGSDSSGVIDLTMDDEESGASQDPKKLNHTPVSTMSSsqpvsrpLQPIQPAPPLQPSGVPTSGP 1022
Cdd:PHA03247 3002 SRVSSWASSLALHEETDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEA-------LDPLPPEPHDPFAHEPDPAT 3074

                  ..
gi 317373420 1023 SQ 1024
Cdd:PHA03247 3075 PE 3076
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
122-433 1.87e-05

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 49.14  E-value: 1.87e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  122 VSKLPAEPVSGDPAPGDLDA------GDPASGVLASGDSTSGDPTSSEP-SSSDAASGDATSGDAPSGDVSPGDATSGDA 194
Cdd:NF033609  544 VPEQPDEPGEIEPIPEDSDSdpgsdsGSDSSNSDSGSDSGSDSTSDSGSdSASDSDSASDSDSASDSDSASDSDSASDSD 623
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  195 TADDLSSGDPTSSDPIPGEPVPVEPISGDCAADDIASSEITSVDLASGAPASTDPASDDLASGDLSSSELASDDLATGEL 274
Cdd:NF033609  624 SASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 703
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  275 ASDELTSESTFDRTFEPKSVPVCEPVPEIDNiEPSSNKDDDFLEKNGADEKLEQIQSKDSlDEKNKADNNIDANEETLET 354
Cdd:NF033609  704 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSD 781
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 317373420  355 DDTTICSDRPPENEKKVEEDIITELALGEDAISSSmEIDQGEKNEDETSADLVETINENVIEDNKSENILENTDSMETD 433
Cdd:NF033609  782 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS-DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESD 859
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
110-287 4.10e-05

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 48.24  E-value: 4.10e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  110 EPLSPHNITPEPVSKLPAEPVSGDPAPGDLDAGD------PASGVLASGDSTSGdPTSSEPSSSDAASGDATSGDAPSGD 183
Cdd:PHA03307   78 EAPANESRSTPTWSLSTLAPASPAREGSPTPPGPsspdppPPTPPPASPPPSPA-PDLSEMLRPVGSPGPPPAASPPAAG 156
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  184 VSPGDATSGDAT---ADDLSSGDPTSSDPI--PGEPVPVEPISGDCAADDIASSEITSVDLASGAPASTDPASDDLASGD 258
Cdd:PHA03307  157 ASPAAVASDAASsrqAALPLSSPEETARAPssPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASS 236
                         170       180
                  ....*....|....*....|....*....
gi 317373420  259 LSSSELASDDLATGELASDELTSESTFDR 287
Cdd:PHA03307  237 SDSSSSESSGCGWGPENECPLPRPAPITL 265
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
708-1191 6.55e-05

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 47.22  E-value: 6.55e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   708 SAPPSFQTPVNTVSSTNLVTP------PAVVSSQPKLQTPVTSGSLTATSVL--PAPNTATVVATTQVPSGNPQ------ 773
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFAAPntttglPSSTHVPTNLTAPASTGPTVSTADVtsPTPAGTTSGASPVTPSPSPRdngtes 501
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   774 --PTISLQPLPVILHVPVAVSSQPQLLQSHPGTlvTNQPSGNVEFISVQSPPTVSGLTKNPVSLPSLPNPTKPNNVPSVP 851
Cdd:pfam05109  502 kaPDMTSPTSAVTTPTPNATSPTPAVTTPTPNA--TSPTLGKTSPTSAVTTPTPNATSPTPAVTTPTPNATIPTLGKTSP 579
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   852 SPSIQR-NPTASAAPLGTTlAVQAVPTAHSIVQATRTSLPTVGPSGLYSPSTNRgpiqmKIPISAFSTSSAAEQNSNTTP 930
Cdd:pfam05109  580 TSAVTTpTPNATSPTVGET-SPQANTTNHTLGGTSSTPVVTSPPKNATSAVTTG-----QHNITSSSTSSMSLRPSSISE 653
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   931 RIENQTNKtidasvskkaaDSTSQCGKATGSDSSGVIDLTMDDEESgasqdpkklnhTPVSTMSSSQPVSRPLQPIQPAP 1010
Cdd:pfam05109  654 TLSPSTSD-----------NSTSHMPLLTSAHPTGGENITQVTPAS-----------TSTHHVSTSSPAPRPGTTSQASG 711
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  1011 PlqpsgvptsGPSQTTihllpTAPTTVNVTH-RPVTQVTTrlpvPRAPANHQVVYTTLPAPPAQA-PLRGTVMQAPAVRQ 1088
Cdd:pfam05109  712 P---------GNSSTS-----TKPGEVNVTKgTPPKNATS----PQAPSGQKTAVPTVTSTGGKAnSTTGGKHTTGHGAR 773
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  1089 VNPQNSVTVRVPQTTTYVVNNGLTLgsTGPQLTVHHRPPQVHTEPPRPVHPAPLPeapqpqrLPPeaasTSLPQKPHLKL 1168
Cdd:pfam05109  774 TSTEPTTDYGGDSTTPRTRYNATTY--LPPSTSSKLRPRWTFTSPPVTTAQATVP-------VPP----TSQPRFSNLSM 840
                          490       500
                   ....*....|....*....|...
gi 317373420  1169 ARVQSQNGIVLSWSVLEVDRSCA 1191
Cdd:pfam05109  841 LVLQWASLAVLTLLLLLVMADCA 863
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
142-291 7.16e-05

prolipoprotein diacylglyceryl transferase; Reviewed


Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 46.90  E-value: 7.16e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  142 GDPASGVLASGDSTSGDPTSSEPSSSDAASGDATSGDAPSGDvsPGDATSGDATADDLSS-------------------- 201
Cdd:PRK13108  278 GREAPGALRGSEYVVDEALEREPAELAAAAVASAASAVGPVG--PGEPNQPDDVAEAVKAevaevtdevaaesvvqvadr 355
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  202 -GDPTSSDPIPGEPVPVEPISGDCAADDIASSEitsVDLASGAPASTDPAsdDLASGDLSSSELASDDLATGELAS---D 277
Cdd:PRK13108  356 dGESTPAVEETSEADIEREQPGDLAGQAPAAHQ---VDAEAASAAPEEPA--ALASEAHDETEPEVPEKAAPIPDPakpD 430
                         170
                  ....*....|....
gi 317373420  278 ELTSESTFDRTFEP 291
Cdd:PRK13108  431 ELAVAGPGDDPAEP 444
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
963-1164 7.59e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.07  E-value: 7.59e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   963 SSGVIDLTMDDEESGASQDPKKLNHTPVSTMSSSQPVSRPLQPIQPAPPLQPSGVPTSGPSQTTIHLLPTAPTTVNVTHR 1042
Cdd:pfam03154  144 TSPSIPSPQDNESDSDSSAQQQILQTQPPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPNQTQS 223
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  1043 PVT-----QVTTRLPVPRAPANHQVVY-TTLPAPPAQAP--------LRGTVMQAPAVRQVNPQNSVTVRVPQTTTYVVN 1108
Cdd:pfam03154  224 TAAphtliQQTPTLHPQRLPSPHPPLQpMTQPPPPSQVSpqplpqpsLHGQMPPMPHSLQTGPSHMQHPVPPQPFPLTPQ 303
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 317373420  1109 NGLTLGSTGPQLTV--------HHRPPQVHTEPPRPVHPAPLPEAPQPQRLPPEAASTSLPQKP 1164
Cdd:pfam03154  304 SSQSQVPPGPSPAApgqsqqriHTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPPTTPIPQLP 367
PTZ00341 PTZ00341
Ring-infected erythrocyte surface antigen; Provisional
321-578 1.94e-04

Ring-infected erythrocyte surface antigen; Provisional


Pssm-ID: 173534 [Multi-domain]  Cd Length: 1136  Bit Score: 45.93  E-value: 1.94e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  321 GADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTICSDRPPENEKKVEEDIitelalgEDAISSSMEIDQGEKNED 400
Cdd:PTZ00341  897 GGGKKDKKAKKKDAKDLSGNIAHEINLINKELKNQNENVPEHLKEHAEANIEEDA-------EENVEEDAEENVEENVEE 969
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  401 ETSADLVETINENViEDNKSENILENTDSMETDEIIPILEklapsedeltcfsktsllpiDETNPDLEEKMESSFGSPSK 480
Cdd:PTZ00341  970 NVEENVEENVEENV-EENVEENVEENVEENVEENIEENVE--------------------ENVEENIEENVEEYDEENVE 1028
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  481 QESSESLPKEAFLVLSDEEDIsgEKDESEVISQN----ETCSPAEVESNEKDNKPEEEEQVIHEDDERPSEKNEFSRRKR 556
Cdd:PTZ00341 1029 EVEENVEEYDEENVEEIEENA--EENVEENIEENieeyDEENVEEIEENIEENIEENVEENVEENVEEIEENVEENVEEN 1106
                         250       260
                  ....*....|....*....|..
gi 317373420  557 SKSEDMDNVQSKRRRYMEEEYE 578
Cdd:PTZ00341 1107 AEENAEENAEENAEEYDDENPE 1128
PHA03378 PHA03378
EBNA-3B; Provisional
976-1162 2.32e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 45.83  E-value: 2.32e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  976 SGASQDPKKLNHTPVSTMSSSQPVsrPLQPIQPAP------PLQPSGVPTsgPSQT-TIHLLPTAPTTVNVTHRPV---- 1044
Cdd:PHA03378  603 SQTPEPPTTQSHIPETSAPRQWPM--PLRPIPMRPlrmqpiTFNVLVFPT--PHQPpQVEITPYKPTWTQIGHIPYqpsp 678
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1045 TQVTTRLPVPRAPANHQvvyttlpaPPAQAPLRGTVMQAPAVRQVNPQNSVT-VRVPQTTTYVVN--NGLTLGSTGPQLT 1121
Cdd:PHA03378  679 TGANTMLPIQWAPGTMQ--------PPPRAPTPMRPPAAPPGRAQRPAAATGrARPPAAAPGRARppAAAPGRARPPAAA 750
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*....
gi 317373420 1122 -VHHRPPQVHTEPPRPVHPAPLPEAPQPQ-------RLPPEAASTSLPQ 1162
Cdd:PHA03378  751 pGRARPPAAAPGRARPPAAAPGAPTPQPPpqappapQQRPRGAPTPQPP 799
PRK03918 PRK03918
DNA double-strand break repair ATPase Rad50;
366-663 2.84e-04

DNA double-strand break repair ATPase Rad50;


Pssm-ID: 235175 [Multi-domain]  Cd Length: 880  Bit Score: 45.44  E-value: 2.84e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  366 ENEKKVEEDIITELALGEDAISSSMEIDQGEKNEDETSADLVETINENVIEDNKSENILENTDS--METDEIIPILEKLA 443
Cdd:PRK03918  165 KNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPELREELEKLEKevKELEELKEEIEELE 244
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  444 PSEDELTCFSKTSLLPIDETNPDLEEKMESSFGSPSKQESSESLPKEA--------FLVLSDEEDISGEKDESEVISQ-- 513
Cdd:PRK03918  245 KELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKELKELKEKAeeyiklseFYEEYLDELREIEKRLSRLEEEin 324
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  514 --NETCSPAEvESNEKDNKPEEEEQVIHEDDERPSEKNEFSRRKRSKSEDMDNVQSKRRRYMEEEYEAEFQVKITAKGDI 591
Cdd:PRK03918  325 giEERIKELE-EKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELEELEKAKEEI 403
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 317373420  592 NQKLQKVIQWL--LEEKLCALQCAVFDKTLAELKTRVEKIECNKRH-KTVLTELQAKIARLTKR---FEAAKEDLKKR 663
Cdd:PRK03918  404 EEEISKITARIgeLKKEIKELKKAIEELKKAKGKCPVCGRELTEEHrKELLEEYTAELKRIEKElkeIEEKERKLRKE 481
PTZ00121 PTZ00121
MAEBL; Provisional
317-600 3.35e-04

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 45.13  E-value: 3.35e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  317 LEKNGADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTicsdRPPENEKKVEEDIITELALGEDAISSSMEIDQGE 396
Cdd:PTZ00121 1680 AKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEEL----KKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEE 1755
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  397 KN-------EDETSADLVETINENVIEDNKSENilENTDSMETDEIIPilEKLAPSEDELTCFSKTSLLPIDETNPDLEE 469
Cdd:PTZ00121 1756 KKkiahlkkEEEKKAEEIRKEKEAVIEEELDEE--DEKRRMEVDKKIK--DIFDNFANIIEGGKEGNLVINDSKEMEDSA 1831
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  470 KMESSFGSPSKQESSESLPKEAFlvlsDEEDISGEKDESEVISqnetcspaeveSNEKDNKPEEEEQVIHEDDERPSEKN 549
Cdd:PTZ00121 1832 IKEVADSKNMQLEEADAFEKHKF----NKNNENGEDGNKEADF-----------NKEKDLKEDDEEEIEEADEIEKIDKD 1896
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|.
gi 317373420  550 EFSRRKRSKSEDMDNVQSKRRRYMEEEYEaefqvkitaKGDINQKLQKVIQ 600
Cdd:PTZ00121 1897 DIEREIPNNNMAGKNNDIIDDKLDKDEYI---------KRDAEETREEIIK 1938
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
126-252 1.66e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.67  E-value: 1.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  126 PAEPVSGDPAPGDLDAGDPASGVLASGDSTSGDPTSSEPSSSDAASGDATSGDAPSGDVSPGDATSGDATADDLS---SG 202
Cdd:PRK07764  649 APEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQaaqGA 728
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 317373420  203 DPTSSDPIPGEPVPVEPISGDCAADDIASSEITSVDLASGAPASTDPASD 252
Cdd:PRK07764  729 SAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSP 778
PHA03247 PHA03247
large tegument protein UL36; Provisional
1001-1164 2.21e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 42.62  E-value: 2.21e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1001 RPLQPIQPAPPLQPSGVPTSGPSQTTIHLLPTAPttvnvtHRPVTQVttrlPVPRAPANHQVVYTTLPAPPAQAPLRgtv 1080
Cdd:PHA03247 2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDT------HAPDPPP----PSPSPAANEPDPHPPPTVPPPERPRD--- 2654
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1081 mqAPAVRQVNPQNSVTVRVPQTTTYVVNNGLTLGSTGP---QLTVHHRPPqvhtEPPRPVHPAPLPEAPQ-PQRLPPEAA 1156
Cdd:PHA03247 2655 --DPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPtvgSLTSLADPP----PPPPTPEPAPHALVSAtPLPPGPAAA 2728

                  ....*...
gi 317373420 1157 STSLPQKP 1164
Cdd:PHA03247 2729 RQASPALP 2736
SMC_prok_B TIGR02168
chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of ...
324-662 2.25e-03

chromosome segregation protein SMC, common bacterial type; SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle. [Cellular processes, Cell division, DNA metabolism, Chromosome-associated proteins]


Pssm-ID: 274008 [Multi-domain]  Cd Length: 1179  Bit Score: 42.35  E-value: 2.25e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   324 EKLEQIQSKdsLDEKNKAdnnIDANEETLETDDTTIcsdrppENEKKVEEDIITELALG-EDAISSSMEIDQGEKNEDET 402
Cdd:TIGR02168  684 EKIEELEEK--IAELEKA---LAELRKELEELEEEL------EQLRKELEELSRQISALrKDLARLEAEVEQLEERIAQL 752
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   403 SADLVETINENVIEDNK----SENILENTDSMETDEiipilEKLAPSEDELTCFSKTsllpIDETNPDLEEKMESSFgsp 478
Cdd:TIGR02168  753 SKELTELEAEIEELEERleeaEEELAEAEAEIEELE-----AQIEQLKEELKALREA----LDELRAELTLLNEEAA--- 820
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   479 SKQESSESLPKEAFLVLSDEEDISGE-KDESEVISQNEtcspAEVESnEKDNKPEEEEQVIHEDDERpSEKNEFSRRKRS 557
Cdd:TIGR02168  821 NLRERLESLERRIAATERRLEDLEEQiEELSEDIESLA----AEIEE-LEELIEELESELEALLNER-ASLEEALALLRS 894
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   558 KSEDMDNVQ---SKRRRYMEEEYEAefqvKITAKGDINQKLQKVIQWL--LEEKLCALQCAVFDKTLAELKTRVEKIECN 632
Cdd:TIGR02168  895 ELEELSEELrelESKRSELRRELEE----LREKLAQLELRLEGLEVRIdnLQERLSEEYSLTLEEAEALENKIEDDEEEA 970
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....*..
gi 317373420   633 KRHktvLTELQAKIARL--------------TKRFE---AAKEDLKK 662
Cdd:TIGR02168  971 RRR---LKRLENKIKELgpvnlaaieeyeelKERYDfltAQKEDLTE 1014
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
860-1164 2.71e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.87  E-value: 2.71e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   860 TASAAPLGTTLAVQAvpTAHSIVQATRTSLPTVGPSGlySPSTNRGpiqmkipiSAFSTSSAAEQNSNTTPRIEN-QTNK 938
Cdd:pfam17823   87 TAEHTPHGTDLSEPA--TREGAADGAASRALAAAASS--SPSSAAQ--------SLPAAIAALPSEAFSAPRAAAcRANA 154
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   939 TIDASVSKKAADSTSQCGKATGSDSSGVidlTMDDEESGASQDPKKLNHTPVSTMSSSQPVS-RPLQPIQPAPPLQPSGV 1017
Cdd:pfam17823  155 SAAPRAAIAAASAPHAASPAPRTAASST---TAASSTTAASSAPTTAASSAPATLTPARGIStAATATGHPAAGTALAAV 231
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  1018 PTSGPSQTTIHL---------LPTAPTTVNVTHRPVTQVTTRLPVPR--APANHQVVYTTL--PAPPAQAPLRGTVMQAP 1084
Cdd:pfam17823  232 GNSSPAAGTVTAavgtvtpaaLATLAAAAGTVASAAGTINMGDPHARrlSPAKHMPSDTMArnPAAPMGAQAQGPIIQVS 311
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  1085 AVRQV-------NPQNSVTVRVPQTTTYVVNNGLTLGSTgpqLTVHHRPPQVHTEPPRPVHPAPLPEA----PQPQRLPP 1153
Cdd:pfam17823  312 TDQPVhntagepTPSPSNTTLEPNTPKSVASTNLAVVTT---TKAQAKEPSASPVPVLHTSMIPEVEAtsptTQPSPLLP 388
                          330
                   ....*....|...
gi 317373420  1154 E--AASTSLPQKP 1164
Cdd:pfam17823  389 TqgAAGPGILLAP 401
PRK10263 PRK10263
DNA translocase FtsK; Provisional
994-1170 3.16e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.99  E-value: 3.16e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  994 SSSQPVSRPLQPIQPAPPLqPSgvPTSGPSQTTIHLLPT-APTTVNVTHRPVTQVTTRLPVPRAPANHQVVYTTLPAPPA 1072
Cdd:PRK10263  328 TATQSWAAPVEPVTQTPPV-AS--VDVPPAQPTVAWQPVpGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQ 404
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420 1073 QAPLRGTVMQAPAVRQVNPQNSVTVRVPQTTTYVVNNGLTLGSTGPQLTVHHRPPQVH---TEPPRPVHPAPLPEAPQPq 1149
Cdd:PRK10263  405 QPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYqteQTYQQPAAQEPLYQQPQP- 483
                         170       180
                  ....*....|....*....|.
gi 317373420 1150 rLPPEAASTSLPQKPHLKLAR 1170
Cdd:PRK10263  484 -VEQQPVVEPEPVVEETKPAR 503
PHA02664 PHA02664
hypothetical protein; Provisional
126-282 3.68e-03

hypothetical protein; Provisional


Pssm-ID: 177447  Cd Length: 534  Bit Score: 41.52  E-value: 3.68e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  126 PAEP----VSGDPApgdLDAGDPASGVLASGDSTSGdptssePSSSDAASGDATSgdapsgdvSPGDATSGDATADDLSS 201
Cdd:PHA02664  368 PAEPaalfVDGNEV---IAAGAAAAMIAAAERAANG------ARGSPMAAPEEGR--------AAAAAAAANAPADQDVE 430
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  202 GDPtSSDPIPGEPVPVEPISGDCAADDIASSEIT-----------SVDLASGAPASTDPASDDLASGDLSSSELASDDLA 270
Cdd:PHA02664  431 AEA-HDEFDQDPGAPAHADRADSDEDDMDEQESGderadgeddsdSSYSYSTTSSEDESDSADDSWGDESDSGIEHDDGG 509
                         170
                  ....*....|..
gi 317373420  271 TGELASDELTSE 282
Cdd:PHA02664  510 VGQAIEEEEEEE 521
Mplasa_alph_rch TIGR04523
helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of ...
302-662 3.86e-03

helix-rich Mycoplasma protein; Members of this family occur strictly within a subset of Mycoplasma species. Members average 750 amino acids in length, including signal peptide. Sequences are predicted (Jpred 3) to be almost entirely alpha-helical. These sequences show strong periodicity (consistent with long alpha helical structures) and low complexity rich in D,E,N,Q, and K. Genes encoding these proteins are often found in tandem. The function is unknown.


Pssm-ID: 275316 [Multi-domain]  Cd Length: 745  Bit Score: 41.54  E-value: 3.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   302 EIDNIEPSSNKDDdfleKNGADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTICSDRppENEKKVEEDIITELAL 381
Cdd:TIGR04523  125 ELNKLEKQKKENK----KNIDKFLTEIKKKEKELEKLNNKYNDLKKQKEELENELNLLEKEK--LNIQKNIDKIKNKLLK 198
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   382 GEDAISSSMEIDQGEK-------NEDETSADLVETINENVIEDNKSENILENTDSM---ETDEIIPILEKLAPSEDELTC 451
Cdd:TIGR04523  199 LELLLSNLKKKIQKNKslesqisELKKQNNQLKDNIEKKQQEINEKTTEISNTQTQlnqLKDEQNKIKKQLSEKQKELEQ 278
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   452 FSKTsllpIDETNPDLEE-KMESsfgSPSKQESSESLPKEaflVLSDEEDISGEKDESE--------VISQ-NETCSPAE 521
Cdd:TIGR04523  279 NNKK----IKELEKQLNQlKSEI---SDLNNQKEQDWNKE---LKSELKNQEKKLEEIQnqisqnnkIISQlNEQISQLK 348
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   522 VESNEKDNKPEEEEQVIHE-DDERPSEKNEfsrrKRSKSEDMDNVQSKRRRY-----MEEEYEAEFQVKITAKGDINQKL 595
Cdd:TIGR04523  349 KELTNSESENSEKQRELEEkQNEIEKLKKE----NQSYKQEIKNLESQINDLeskiqNQEKLNQQKDEQIKKLQQEKELL 424
                          330       340       350       360       370       380
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 317373420   596 QKVIQWLLEEKLcalqcaVFDKTLAELKTR--VEKIECNKrHKTVLTELQAKIARLTKRFEAAKEDLKK 662
Cdd:TIGR04523  425 EKEIERLKETII------KNNSEIKDLTNQdsVKELIIKN-LDNTRESLETQLKVLSRSINKIKQNLEQ 486
MDN1 COG5271
Midasin, AAA ATPase with vWA domain, involved in ribosome maturation [Translation, ribosomal ...
80-552 4.01e-03

Midasin, AAA ATPase with vWA domain, involved in ribosome maturation [Translation, ribosomal structure and biogenesis];


Pssm-ID: 444083 [Multi-domain]  Cd Length: 1028  Bit Score: 41.54  E-value: 4.01e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420   80 DPEGSKAEWKETPCILSVNVKNKQDDDLNCEPLSPHNITPEPVSKLPAEPVSGDPAPGDLDAGDPASGVLASGDSTSGDP 159
Cdd:COG5271   274 ATDDADGLEAAEDDALDAELTAAQAADPESDDDADDSTLAALEGAAEDTEIATADELAAADDEDDDDSAAEDAAEEAATA 353
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  160 TSSEPSSSDAASGDATSGDAPSGDVSPGDATSGDATADDLSSGDPTSSDPIPGEPVPVEPISGDCAADDIASSEITSVDL 239
Cdd:COG5271   354 EDSAAEDTQDAEDEAAGEAADESEGADTDAAADEADAAADDSADDEEASADGGTSPTSDTDEEEEEADEDASAGETEDES 433
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  240 ASGAPASTDPASDDLASGDLSSSELASDDLATGELASDELTSESTFDRTFEPKSVPVCEPVPEIDNIEPSSNKDD----- 314
Cdd:COG5271   434 TDVTSAEDDIATDEEADSLADEEEEAEAELDTEEDTESAEEDADGDEATDEDDASDDGDEEEAEEDAEAEADSDEltaee 513
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  315 ---DFLEKNGADEKLEQIQSKDSLDEKNKADNNIDANEETLETDDTTICSDRPPENEKKVEEDIITELALGEDAISSSME 391
Cdd:COG5271   514 tsaDDGADTDAAADPEDSDEDALEDETEGEENAPGSDQDADETDEPEATAEEDEPDEAEAETEDATENADADETEESADE 593
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  392 IDQGEKNEDETSADLVETINENVIEDNKSENILENTDSMETDEIIPILEKLAPSEDELTCFSKTSLLPIDETNPDLEEKM 471
Cdd:COG5271   594 SEEAEASEDEAAEEEEADDDEADADADGAADEEETEEEAAEDEAAEPETDASEAADEDADAETEAEASADESEEEAEDES 673
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  472 ESSfgSPSKQESSESLPKEAflvLSDEEDISGEKDESEVISQNETCSPAEVESNEKDNKPEEEEQVIHEDDERPSEKNEF 551
Cdd:COG5271   674 ETS--SEDAEEDADAAAAEA---SDDEEETEEADEDAETASEEADAEEADTEADGTAEEAEEAAEEAESADEEAASLPDE 748

                  .
gi 317373420  552 S 552
Cdd:COG5271   749 A 749
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
992-1091 4.64e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 41.30  E-value: 4.64e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  992 TMSSSQPVSRPLQPI--QPAPPLQPSGVPTSGPSQTTIHLLPTAPTTVNVTHRPVTQVTTRLPVPRApanhqvvyttLPA 1069
Cdd:PRK14971  368 DASGGRGPKQHIKPVftQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAA----------VPV 437
                          90       100
                  ....*....|....*....|..
gi 317373420 1070 PPAQAPLRGTVMQAPAVRQVNP 1091
Cdd:PRK14971  438 NPPSTAPQAVRPAQFKEEKKIP 459
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
121-313 5.09e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 41.37  E-value: 5.09e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  121 PVSKLPAEPVSGDPAPGDLDAGDPASGVLASGDSTSGDPtSSEPSSSDAASGDATSGDAPSGDVSPgdATSGDATADDLS 200
Cdd:PRK07003  368 PGGGVPARVAGAVPAPGARAAAAVGASAVPAVTAVTGAA-GAALAPKAAAAAAATRAEAPPAAPAP--PATADRGDDAAD 444
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  201 SGDPTSSDpipgEPVPVEPisgDCAADDIASSEITSVDLASGAPASTDPASddlASGDLSSSELASDDLATGELASDELT 280
Cdd:PRK07003  445 GDAPVPAK----ANARASA---DSRCDERDAQPPADSGSASAPASDAPPDA---AFEPAPRAAAPSAATPAAVPDARAPA 514
                         170       180       190
                  ....*....|....*....|....*....|...
gi 317373420  281 SESTFDRtfepkSVPVCEPVPEIDNIEPSSNKD 313
Cdd:PRK07003  515 AASREDA-----PAAAAPPAPEARPPTPAAAAP 542
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
655-1076 7.14e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 40.92  E-value: 7.14e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  655 AAKEDLKKRHEHPPNPPVSPGKTVNDVNSNNNMSYRNAGTVRQMLESKRNVSESAPPSFQTPVNTvSSTNLVTPPAVVSS 734
Cdd:PHA03307   56 VAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPT-PPPASPPPSPAPDL 134
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  735 QPKLQTPVTSGSLTATSVLPAPNTATVVATTQVPSGNPQPTISLQPLPVilHVPVAVSSQPQLLQSHPGTLVTNQPSGNV 814
Cdd:PHA03307  135 SEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETA--RAPSSPPAEPPPSTPPAAASPRPPRRSSP 212
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  815 EFISVQSPPtvsgltknpvslPSLPNPTKPNNVPSVPSPSIQRNPTASAAPLGTTlavqAVPTAHSIVQATR--TSLPTV 892
Cdd:PHA03307  213 ISASASSPA------------PAPGRSAADDAGASSSDSSSSESSGCGWGPENEC----PLPRPAPITLPTRiwEASGWN 276
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  893 GPSGLYSPSTNRGPIQMKIPISAFSTSSAAEqnSNTTPRIENQTNKTIDASVSKKAADSTSQCGKATGSDSSgvidltmd 972
Cdd:PHA03307  277 GPSSRPGPASSSSSPRERSPSPSPSSPGSGP--APSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPS-------- 346
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  973 DEESGASQDPkklnhtPVSTMSSSQPVSRPLQPIQPAPPlQPSGVPTSgpsqttihllPTAPTTVNVTHRPvTQVTTRLP 1052
Cdd:PHA03307  347 PSRSPSPSRP------PPPADPSSPRKRPRPSRAPSSPA-ASAGRPTR----------RRARAAVAGRARR-RDATGRFP 408
                         410       420
                  ....*....|....*....|....
gi 317373420 1053 VPRAPANHQVVYTTLPAPPAQAPL 1076
Cdd:PHA03307  409 AGRPRPSPLDAGAASGAFYARYPL 432
COG1340 COG1340
Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];
532-665 9.66e-03

Uncharacterized coiled-coil protein, contains DUF342 domain [Function unknown];


Pssm-ID: 440951 [Multi-domain]  Cd Length: 297  Bit Score: 39.51  E-value: 9.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 317373420  532 EEEEQVIHEDDERPSEKNEFSRRKRSKSEDMDNVQSKRRRYMEE--EYEAEfqvkitaKGDINQKLQKVIQWLLEEKLCA 609
Cdd:COG1340    29 EKRDELNEELKELAEKRDELNAQVKELREEAQELREKRDELNEKvkELKEE-------RDELNEKLNELREELDELRKEL 101
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 317373420  610 LQCAVFDKTLAELKTRVEKIEcnKRHKT-VLT-----ELQAKIARLTKRFEAAKEDLKKRHE 665
Cdd:COG1340   102 AELNKAGGSIDKLRKEIERLE--WRQQTeVLSpeeekELVEKIKELEKELEKAKKALEKNEK 161
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH