NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|410225316|gb|JAA09877|]
View 

suppressor of Ty 5 homolog [Pan troglodytes]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
NGN_Euk cd09888
Eukaryotic N-Utilization Substance G (NusG) N-terminal (NGN) domain, including plant KTF1 (KOW ...
178-265 4.33e-44

Eukaryotic N-Utilization Substance G (NusG) N-terminal (NGN) domain, including plant KTF1 (KOW domain-containing Transcription Factor 1); The N-Utilization Substance G (NusG) protein and its eukaryotic homolog, Spt5, are involved in transcription elongation and termination. NusG contains an NGN domain at its N-terminus and Kyrpides Ouzounis and Woese (KOW) repeats at its C-terminus. Spt5 forms an Spt4-Spt5 complex that is an essential RNA polymerase II elongation factor. NusG was originally discovered as an N-dependent antitermination enhancing activity in Escherichia coli, and has a variety of functions such as its involvement in RNA polymerase elongation and Rho-termination in bacteria. Orthologs of the NusG gene exist in all bacteria, but their functions and requirements are different. Spt5-like is homologous to the Spt5 proteins present in all eukaryotes, which is unique as it encodes a protein with an additional long carboxy-terminal extension that contains WG/GW motifs. Spt5-like, or KTF1 (KOW domain-containing Transcription Factor 1), is a RNA-directed DNA methylation (RdDM) pathway effector in plants.


:

Pssm-ID: 193577 [Multi-domain]  Cd Length: 86  Bit Score: 154.23  E-value: 4.33e-44
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  178 NLWTVKCKIGEERATAISLMRKFIAYQFTDTPLQIKSVVAPEHVKGYIYVEAYKQTHVKQAIEGVGNLRLgyWNQQMVPI 257
Cdd:cd09888     1 KLWAVKCKPGKEREIVISLMRKFLDLQRTGNPLGIKSVFARDGLKGYIYIEARKEAHVKDAIEGLRGVYL--NTIKLVPI 78

                  ....*...
gi 410225316  258 KEMTDVLK 265
Cdd:cd09888    79 KEMPDVLS 86
CTD smart01104
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
772-889 1.91e-28

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.


:

Pssm-ID: 215026 [Multi-domain]  Cd Length: 121  Bit Score: 110.69  E-value: 1.91e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316    772 GSQTPMYG-SGSRTPMYGSQTP----LQDGSRTPHYGSQTPPHDG--SRTPAQSGAWdPNNPNTPSRAEEEYEYAFDDEP 844
Cdd:smart01104    1 GGRTPAWGaSGSKTPAWGSRTPgtaaGGAPTARGGSGSRTPAWGGagSRTPAWGGAG-PTGSRTPAWGGASAWGNKSSEG 79
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|....*..
gi 410225316    845 TPSPQA--YGGTPNPQTPGYpdpssPQVNPQYNPQTPGTPAMYNTDQ 889
Cdd:smart01104   80 SASSWAagPGGAYGAPTPGY-----GGTPSAYGPATPGGGAMAGSAS 121
KOW_Spt5_3 cd06083
KOW domain of Spt5, repeat 3; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
471-521 7.09e-28

KOW domain of Spt5, repeat 3; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


:

Pssm-ID: 240507  Cd Length: 51  Bit Score: 106.46  E-value: 7.09e-28
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 410225316  471 YFKMGDHVKVIAGRFEGDTGLIVRVEENFVILFSDLTMHELKVLPRDLQLC 521
Cdd:cd06083     1 HFKVGDHVKVISGRHEGETGLVVKVEDDVVTVFSDLTMRELKVFPRDLQLS 51
KOW_Spt5_6 cd06086
KOW domain of Spt5, repeat 6; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
1028-1084 2.13e-26

KOW domain of Spt5, repeat 6; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


:

Pssm-ID: 240510  Cd Length: 58  Bit Score: 102.59  E-value: 2.13e-26
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 410225316 1028 EHLEPITPTKNNKVKVILGEDREATGVLLSIDGEDGIVRMDLDEQLKILNLRFLGKL 1084
Cdd:cd06086     1 EHLEPVPPEKGDRVKVIKGEDRGSTGELISIDGADGIVKMDSDGDIKILPMNFLAKL 57
KOW_Spt5_2 cd06082
KOW domain of Spt5, repeat 2; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
420-470 1.09e-25

KOW domain of Spt5, repeat 2; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


:

Pssm-ID: 240506  Cd Length: 51  Bit Score: 100.27  E-value: 1.09e-25
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 410225316  420 FQPGDNVEVCEGELINLQGKILSVDGNKITIMPKHEDLKDMLEFPAQELRK 470
Cdd:cd06082     1 FQPGDNVEVIEGELKGLQGKVESVDGDIVTIMPKHEDLKEPLEFPAKELRK 51
KOW_Spt5_5 cd06085
KOW domain of Spt5, repeat 5; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
702-751 5.16e-25

KOW domain of Spt5, repeat 5; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


:

Pssm-ID: 240509  Cd Length: 52  Bit Score: 98.33  E-value: 5.16e-25
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 410225316  702 DNELIGQTVRISQGPYKGYIGVVKDATESTARVELHSTCQTISVDRQRLT 751
Cdd:cd06085     2 RDPLIGKTVRIRKGPYKGYIGIVKDATGTTARVELHSKNKTITVDRSRLA 51
KOW_Spt5_4 cd06084
KOW domain of Spt5, repeat 4; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
597-639 1.52e-19

KOW domain of Spt5, repeat 4; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


:

Pssm-ID: 240508  Cd Length: 43  Bit Score: 82.57  E-value: 1.52e-19
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 410225316  597 KDIVKVIDGPHSGREGEIRHLFRSFAFLHCKKLVENGGMFVCK 639
Cdd:cd06084     1 GDTVKVVDGPYKGRQGTVLHIYRGTLFLHSREVTENGGIFVVR 43
KOW_Spt5_1 cd06081
KOW domain of Spt5, repeat 1; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
276-313 1.67e-17

KOW domain of Spt5, repeat 1; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


:

Pssm-ID: 240505  Cd Length: 38  Bit Score: 76.74  E-value: 1.67e-17
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 410225316  276 KSWVRLKRGIYKDDIAQVDYVEPSQNTISLKMIPRIDY 313
Cdd:cd06081     1 GSWVRIKRGIYKGDLAQVDEVDENGNRVVVKLIPRIDY 38
Spt5_N pfam11942
Spt5 transcription elongation factor, acidic N-terminal; This is the very acidic N-terminal ...
93-172 5.24e-12

Spt5 transcription elongation factor, acidic N-terminal; This is the very acidic N-terminal region of the early transcription elongation factor Spt5. The Spt5-Spt4 complex regulates early transcription elongation by RNA polymerase II and has an imputed role in pre-mRNA processing via its physical association with mRNA capping enzymes. The actual function of this N-terminal domain is not known although it is dispensable for binding to Spt4.


:

Pssm-ID: 463406  Cd Length: 97  Bit Score: 63.05  E-value: 5.24e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316    93 EDGAEDILEKEEIEASNIDNVVLDEDRSGARRLQNLWRDQREEELGEYYMKKYAKSSvgETVYGGSDElsDDITQQQLLP 172
Cdd:pfam11942   22 EDGADDFIEDDEEDEDEEDGRRDDRRHRELDRRRELEEDEDAEEIAEYLKERYGRSS--SDAYRGDAE--EGVPQRLLLP 97
PHA03269 super family cl29788
envelope glycoprotein C; Provisional
825-970 3.97e-09

envelope glycoprotein C; Provisional


The actual alignment was detected with superfamily member PHA03269:

Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 60.51  E-value: 3.97e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  825 NPNTPSRAEEEYEYAFDDEPTPSPQayggtPNPQTPGYPDPS-SPQVNPQYNPQtpgtpamyntdqfsPYAAPSPQGSYQ 903
Cdd:PHA03269   21 NLNTNIPIPELHTSAATQKPDPAPA-----PHQAASRAPDPAvAPTSAASRKPD--------------LAQAPTPAASEK 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 410225316  904 PSPSPQSYHQV--APSPAGYQNTHSPasyhPTPSPM-----AYQASPSPSPVGYSPMTPgAPSPGGYNPHTPGS 970
Cdd:PHA03269   82 FDPAPAPHQAAsrAPDPAVAPQLAAA----PKPDAAeaftsAAQAHEAPADAGTSAASK-KPDPAAHTQHSPPP 150
 
Name Accession Description Interval E-value
NGN_Euk cd09888
Eukaryotic N-Utilization Substance G (NusG) N-terminal (NGN) domain, including plant KTF1 (KOW ...
178-265 4.33e-44

Eukaryotic N-Utilization Substance G (NusG) N-terminal (NGN) domain, including plant KTF1 (KOW domain-containing Transcription Factor 1); The N-Utilization Substance G (NusG) protein and its eukaryotic homolog, Spt5, are involved in transcription elongation and termination. NusG contains an NGN domain at its N-terminus and Kyrpides Ouzounis and Woese (KOW) repeats at its C-terminus. Spt5 forms an Spt4-Spt5 complex that is an essential RNA polymerase II elongation factor. NusG was originally discovered as an N-dependent antitermination enhancing activity in Escherichia coli, and has a variety of functions such as its involvement in RNA polymerase elongation and Rho-termination in bacteria. Orthologs of the NusG gene exist in all bacteria, but their functions and requirements are different. Spt5-like is homologous to the Spt5 proteins present in all eukaryotes, which is unique as it encodes a protein with an additional long carboxy-terminal extension that contains WG/GW motifs. Spt5-like, or KTF1 (KOW domain-containing Transcription Factor 1), is a RNA-directed DNA methylation (RdDM) pathway effector in plants.


Pssm-ID: 193577 [Multi-domain]  Cd Length: 86  Bit Score: 154.23  E-value: 4.33e-44
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  178 NLWTVKCKIGEERATAISLMRKFIAYQFTDTPLQIKSVVAPEHVKGYIYVEAYKQTHVKQAIEGVGNLRLgyWNQQMVPI 257
Cdd:cd09888     1 KLWAVKCKPGKEREIVISLMRKFLDLQRTGNPLGIKSVFARDGLKGYIYIEARKEAHVKDAIEGLRGVYL--NTIKLVPI 78

                  ....*...
gi 410225316  258 KEMTDVLK 265
Cdd:cd09888    79 KEMPDVLS 86
Spt5-NGN pfam03439
Early transcription elongation factor of RNA pol II, NGN section; Spt5p and prokaryotic NusG ...
178-264 6.66e-31

Early transcription elongation factor of RNA pol II, NGN section; Spt5p and prokaryotic NusG are shown to contain a novel 'NGN' domain. The combined NGN and KOW motif regions of Spt5 form the binding domain with Spt4. Spt5 complexes with Spt4 as a 1:1 heterodimer snf this Spt5-Spt4 complex regulates early transcription elongation by RNA polymerase II and has an imputed role in pre-mRNA processing via its physical association with mRNA capping enzymes. The Schizosaccharomyces pombe core Spt5-Spt4 complex is a heterodimer bearing a trypsin-resistant Spt4-binding domain within the Spt5 subunit.


Pssm-ID: 397481  Cd Length: 84  Bit Score: 116.53  E-value: 6.66e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   178 NLWTVKCKIGEERATAISLMRKFIAYQfTDTPLQIKSVVAPEHVKGYIYVEAYKQTHVKQAIEGVGNLRLGywNQQMVPI 257
Cdd:pfam03439    1 KIWAVKCTPGQEREVALSLMRKILALA-KTNNLGIYSVFAPDGLKGYIYVEADRQAAVKRALEGIPNVRGL--VPGLVPI 77

                   ....*..
gi 410225316   258 KEMTDVL 264
Cdd:pfam03439   78 KEMEHLL 84
CTD smart01104
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
772-889 1.91e-28

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.


Pssm-ID: 215026 [Multi-domain]  Cd Length: 121  Bit Score: 110.69  E-value: 1.91e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316    772 GSQTPMYG-SGSRTPMYGSQTP----LQDGSRTPHYGSQTPPHDG--SRTPAQSGAWdPNNPNTPSRAEEEYEYAFDDEP 844
Cdd:smart01104    1 GGRTPAWGaSGSKTPAWGSRTPgtaaGGAPTARGGSGSRTPAWGGagSRTPAWGGAG-PTGSRTPAWGGASAWGNKSSEG 79
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|....*..
gi 410225316    845 TPSPQA--YGGTPNPQTPGYpdpssPQVNPQYNPQTPGTPAMYNTDQ 889
Cdd:smart01104   80 SASSWAagPGGAYGAPTPGY-----GGTPSAYGPATPGGGAMAGSAS 121
KOW_Spt5_3 cd06083
KOW domain of Spt5, repeat 3; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
471-521 7.09e-28

KOW domain of Spt5, repeat 3; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240507  Cd Length: 51  Bit Score: 106.46  E-value: 7.09e-28
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 410225316  471 YFKMGDHVKVIAGRFEGDTGLIVRVEENFVILFSDLTMHELKVLPRDLQLC 521
Cdd:cd06083     1 HFKVGDHVKVISGRHEGETGLVVKVEDDVVTVFSDLTMRELKVFPRDLQLS 51
KOW_Spt5_6 cd06086
KOW domain of Spt5, repeat 6; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
1028-1084 2.13e-26

KOW domain of Spt5, repeat 6; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240510  Cd Length: 58  Bit Score: 102.59  E-value: 2.13e-26
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 410225316 1028 EHLEPITPTKNNKVKVILGEDREATGVLLSIDGEDGIVRMDLDEQLKILNLRFLGKL 1084
Cdd:cd06086     1 EHLEPVPPEKGDRVKVIKGEDRGSTGELISIDGADGIVKMDSDGDIKILPMNFLAKL 57
KOW_Spt5_2 cd06082
KOW domain of Spt5, repeat 2; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
420-470 1.09e-25

KOW domain of Spt5, repeat 2; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240506  Cd Length: 51  Bit Score: 100.27  E-value: 1.09e-25
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 410225316  420 FQPGDNVEVCEGELINLQGKILSVDGNKITIMPKHEDLKDMLEFPAQELRK 470
Cdd:cd06082     1 FQPGDNVEVIEGELKGLQGKVESVDGDIVTIMPKHEDLKEPLEFPAKELRK 51
KOW_Spt5_5 cd06085
KOW domain of Spt5, repeat 5; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
702-751 5.16e-25

KOW domain of Spt5, repeat 5; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240509  Cd Length: 52  Bit Score: 98.33  E-value: 5.16e-25
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 410225316  702 DNELIGQTVRISQGPYKGYIGVVKDATESTARVELHSTCQTISVDRQRLT 751
Cdd:cd06085     2 RDPLIGKTVRIRKGPYKGYIGIVKDATGTTARVELHSKNKTITVDRSRLA 51
KOW_Spt5_4 cd06084
KOW domain of Spt5, repeat 4; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
597-639 1.52e-19

KOW domain of Spt5, repeat 4; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240508  Cd Length: 43  Bit Score: 82.57  E-value: 1.52e-19
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 410225316  597 KDIVKVIDGPHSGREGEIRHLFRSFAFLHCKKLVENGGMFVCK 639
Cdd:cd06084     1 GDTVKVVDGPYKGRQGTVLHIYRGTLFLHSREVTENGGIFVVR 43
KOW_Spt5_1 cd06081
KOW domain of Spt5, repeat 1; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
276-313 1.67e-17

KOW domain of Spt5, repeat 1; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240505  Cd Length: 38  Bit Score: 76.74  E-value: 1.67e-17
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 410225316  276 KSWVRLKRGIYKDDIAQVDYVEPSQNTISLKMIPRIDY 313
Cdd:cd06081     1 GSWVRIKRGIYKGDLAQVDEVDENGNRVVVKLIPRIDY 38
CTD pfam12815
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
772-830 2.13e-14

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteriztic TPA motif.


Pssm-ID: 372327 [Multi-domain]  Cd Length: 71  Bit Score: 69.01  E-value: 2.13e-14
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   772 GSQTPMYGS--GSRTPMY---GSQTPL--QDGSRTPHY--GSQTPPHD--GSRTPAQSGAWDPnnPNTPS 830
Cdd:pfam12815    1 GSRTPAYNSagGSRTPAWgadGSRTPAygGAGGRTPAYnqGGKTPAWGgaGSRTPAYYGAWGG--SRTPA 68
Spt5_N pfam11942
Spt5 transcription elongation factor, acidic N-terminal; This is the very acidic N-terminal ...
93-172 5.24e-12

Spt5 transcription elongation factor, acidic N-terminal; This is the very acidic N-terminal region of the early transcription elongation factor Spt5. The Spt5-Spt4 complex regulates early transcription elongation by RNA polymerase II and has an imputed role in pre-mRNA processing via its physical association with mRNA capping enzymes. The actual function of this N-terminal domain is not known although it is dispensable for binding to Spt4.


Pssm-ID: 463406  Cd Length: 97  Bit Score: 63.05  E-value: 5.24e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316    93 EDGAEDILEKEEIEASNIDNVVLDEDRSGARRLQNLWRDQREEELGEYYMKKYAKSSvgETVYGGSDElsDDITQQQLLP 172
Cdd:pfam11942   22 EDGADDFIEDDEEDEDEEDGRRDDRRHRELDRRRELEEDEDAEEIAEYLKERYGRSS--SDAYRGDAE--EGVPQRLLLP 97
nusG PRK08559
transcription antitermination protein NusG; Validated
175-316 5.34e-12

transcription antitermination protein NusG; Validated


Pssm-ID: 181467 [Multi-domain]  Cd Length: 153  Bit Score: 64.89  E-value: 5.34e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  175 KDPNLWTVKCKIGEERATAISLMRKFIAYQftdtpLQIKSVVAPEHVKGYIYVEAYKQTHVKQAIEGVGNLRlGYwNQQM 254
Cdd:PRK08559    4 EMSMIFAVKTTAGQERNVALMLAMRAKKEN-----LPIYAILAPPELKGYVLVEAESKGAVEEAIRGIPHVR-GV-VPGE 76
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 410225316  255 VPIKEMTDVLKVVKEVANLKPKSWVRLKRGIYKDDIAQVDYVEPSQNTISLKM------IP---RIDYDRI 316
Cdd:PRK08559   77 ISFEEVEHFLKPKPIVEGIKEGDIVELIAGPFKGEKARVVRVDESKEEVTVELleaavpIPvtvRGDQVRV 147
NGN smart00738
In Spt5p, this domain may confer affinity for Spt4p. It possesses a RNP-like fold; In Spt5p, ...
178-266 1.46e-10

In Spt5p, this domain may confer affinity for Spt4p. It possesses a RNP-like fold; In Spt5p, this domain may confer affinity for Spt4p.Spt4p


Pssm-ID: 197850 [Multi-domain]  Cd Length: 106  Bit Score: 59.31  E-value: 1.46e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316    178 NLWTVKCKIGEERATAISLMRKFIAYQFTDtplQIKSVVAP-EHVK----------------GYIYVEAYKQTHVKQAIE 240
Cdd:smart00738    1 NWYAVRTTSGQEKRVAENLERKAEALGLED---KIVSILVPtEEVKeirrgkkkvverklfpGYIFVEADLEDEVWTAIR 77
                            90       100       110
                    ....*....|....*....|....*....|
gi 410225316    241 GV----GNLRLGYWnQQMVPIKEMTDVLKV 266
Cdd:smart00738   78 GTpgvrGFVGGGGK-PTPVPDDEIEKILKP 106
PHA03269 PHA03269
envelope glycoprotein C; Provisional
825-970 3.97e-09

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 60.51  E-value: 3.97e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  825 NPNTPSRAEEEYEYAFDDEPTPSPQayggtPNPQTPGYPDPS-SPQVNPQYNPQtpgtpamyntdqfsPYAAPSPQGSYQ 903
Cdd:PHA03269   21 NLNTNIPIPELHTSAATQKPDPAPA-----PHQAASRAPDPAvAPTSAASRKPD--------------LAQAPTPAASEK 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 410225316  904 PSPSPQSYHQV--APSPAGYQNTHSPasyhPTPSPM-----AYQASPSPSPVGYSPMTPgAPSPGGYNPHTPGS 970
Cdd:PHA03269   82 FDPAPAPHQAAsrAPDPAVAPQLAAA----PKPDAAeaftsAAQAHEAPADAGTSAASK-KPDPAAHTQHSPPP 150
PHA03247 PHA03247
large tegument protein UL36; Provisional
746-962 6.81e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 60.34  E-value: 6.81e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  746 DRQRLTTVGSR-RPGGMTSTYGRTPMYGSQTPMYGSGSRTPMYGSQTPLQDgsrtphygsqtpPHDGSRTPAQS-GAWDP 823
Cdd:PHA03247 2650 ERPRDDPAPGRvSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAD------------PPPPPPTPEPApHALVS 2717
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  824 NNPNTPSRAEEEYEY-AFDDEPTPSPQAYG----GTPN----PQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYA 894
Cdd:PHA03247 2718 ATPLPPGPAAARQASpALPAAPAPPAVPAGpatpGGPArparPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 410225316  895 APSPqgsyqPSPSPQSYHQVAPSPAgYQNTHSPASYHPTPSPMAYQASPSPSPVGYSPMTP-GAPSPGG 962
Cdd:PHA03247 2798 LPSP-----WDPADPPAAVLAPAAA-LPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLgGSVAPGG 2860
KOW_elon_Spt5 TIGR00405
transcription elongation factor Spt5; This protein contains a KOW domain, shared by bacterial ...
180-308 3.52e-08

transcription elongation factor Spt5; This protein contains a KOW domain, shared by bacterial NusG and the uL24 (previously L24p/L26e) family of ribosomal proteins. The most recent papers and crystal structures make this a transcription elongation factor rather than a ribosomal protein.


Pssm-ID: 129499 [Multi-domain]  Cd Length: 145  Bit Score: 53.74  E-value: 3.52e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   180 WTVKCKIGEERATAislmrKFIAYQFTDTPLQIKSVVAPEHVKGYIYVEAYKQTHVKQAIEGVGNLRlgywnqQMVP--- 256
Cdd:TIGR00405    1 FAVKTSVGQEKNVA-----RLMARKARKSGLEVYSILAPESLKGYILVEAETKIDMRNPIIGVPHVR------GVVEgei 69
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 410225316   257 -IKEMTDVLKVVKEVANLKPKSWVRLKRGIYKDDIAQVDYVEPSQNTISLKMI 308
Cdd:TIGR00405   70 dFEEIERFLTPKKIIESIKKGDIVEIISGPFKGERAKVIRVDESKEEVTLELI 122
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
844-970 6.07e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 57.08  E-value: 6.07e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   844 PTPSPQAYGGTPNPQTPGYPDPSSPQVN-PQYNPQTPGTP--AMYNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAG 920
Cdd:pfam03154  188 PPGTTQAATAGPTPSAPSVPPQGSPATSqPPNQTQSTAAPhtLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLP 267
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 410225316   921 YQNTHSPASYHP---------TPSPMAYQASPSPSPVGYSPMTPG----APSPGGYNPHTPGS 970
Cdd:pfam03154  268 QPSLHGQMPPMPhslqtgpshMQHPVPPQPFPLTPQSSQSQVPPGpspaAPGQSQQRIHTPPS 330
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
806-977 1.38e-06

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 52.37  E-value: 1.38e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  806 TPPHDGSRTPAQSgawdPNNPNTPSRAEEEYEYAFDD-EPTPSPQAYGGTPNP----QTPGYPDPSSPQVNPQYNPQT-- 878
Cdd:COG5180   218 TGGADHPRPEAAS----SPKVDPPSTSEARSRPATVDaQPEMRPPADAKERRRaaigDTPAAEPPGLPVLEAGSEPQSda 293
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  879 PGTPAMYNTDQFSPYAAPSPQGSYQPSPS-----PQSYHQVAPSPAGYQNTHSPASYHPTPSPMAYQASPSpspvgySPM 953
Cdd:COG5180   294 PEAETARPIDVKGVASAPPATRPVRPPGGardpgTPRPGQPTERPAGVPEAASDAGQPPSAYPPAEEAVPG------KPL 367
                         170       180
                  ....*....|....*....|....*...
gi 410225316  954 TPGAPSPG--GYN--PHTPGSGIEQNSS 977
Cdd:COG5180   368 EQGAPRPGssGGDgaPFQPPNGAPQPGL 395
KOW smart00739
KOW (Kyprides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.
471-498 5.62e-05

KOW (Kyprides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.


Pssm-ID: 128978  Cd Length: 28  Bit Score: 40.77  E-value: 5.62e-05
                            10        20
                    ....*....|....*....|....*...
gi 410225316    471 YFKMGDHVKVIAGRFEGDTGLIVRVEEN 498
Cdd:smart00739    1 KFEVGDTVRVIAGPFKGKVGKVLEVDGE 28
KOW pfam00467
KOW motif; This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, ...
706-737 9.60e-05

KOW motif; This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG.


Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 40.45  E-value: 9.60e-05
                           10        20        30
                   ....*....|....*....|....*....|..
gi 410225316   706 IGQTVRISQGPYKGYIGVVKDATESTARVELH 737
Cdd:pfam00467    1 KGDVVRVIAGPFKGKVGKVVEVDDKKNRVLVE 32
SP7_N cd22542
N-terminal domain of transcription factor Specificity Protein (SP) 7; Specificity Proteins ...
802-970 1.79e-04

N-terminal domain of transcription factor Specificity Protein (SP) 7; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP7, also called Osterix (Osx) in humans, is highly conserved among bone-forming vertebrates. It plays a major role, along with Runx2 and Dlx5 in driving the differentiation of mesenchymal precursor cells into osteoblasts and eventually osteocytes. SP7 also plays a regulatory role by inhibiting chondrocyte differentiation, maintaining the balance between differentiation of mesenchymal precursor cells into ossified bone or cartilage. Mutations of this gene have been associated with multiple dysfunctional bone phenotypes in vertebrates. SP7 is thought to play a role in diseases such as Osteogenesis imperfecta. SP7 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. This model represents the N-terminal domain of SP7.


Pssm-ID: 411691 [Multi-domain]  Cd Length: 297  Bit Score: 44.89  E-value: 1.79e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  802 YGSQTPPHDgSRTPAQSGAWDPNNPNTP--------SRAEE----EYEYAFDD-----EPTPSPQA---YGGTPNPQTPG 861
Cdd:cd22542    26 FGGSSPIRD-SATPGKPGNNPGKKPYSLgsdlssakSRSSElmgdSYTATFSSgnglmSPSGSPQAsttYGNDYNPFSHS 104
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  862 YPDPSSPQ----VNPQYNPQTPGTPAMYNT-DQFSPY-----AAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASYH 931
Cdd:cd22542   105 FPTSSGSQdpslLVSKGHPSADCLPSVYTSlDMAHPYgswykTGIHPGISSSSTNATASWWDMHSNTNWLSAQGQPDGLQ 184
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 410225316  932 PTPSPMAYQASPSPSPVGYSPMTPgaPSPGGYNPHTPGS 970
Cdd:cd22542   185 ASLQPVPAQTPLNPQLPSYTEFTT--LNPAPYPAVGISS 221
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
824-960 2.18e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 45.14  E-value: 2.18e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  824 NNPNTPSRAEEEYEYAFDD-EPTPSPQAYGGTPN-PQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQG- 900
Cdd:NF033839  249 DNVNTKVEIENTVHKIFADmDAVVTKFKKGLTQDtPKEPGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKp 328
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 410225316  901 SYQPSPSPQSYH-QVAPSPAGYQNTHSPASYHPTPSPMAYQASPSPSpvgySPMTPGAPSP 960
Cdd:NF033839  329 KPEVKPQPEKPKpEVKPQLETPKPEVKPQPEKPKPEVKPQPEKPKPE----VKPQPETPKP 385
KOW pfam00467
KOW motif; This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, ...
475-503 2.21e-04

KOW motif; This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG.


Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 39.29  E-value: 2.21e-04
                           10        20        30
                   ....*....|....*....|....*....|.
gi 410225316   475 GDHVKVIAGRFEGDTGLIVRVEE--NFVILF 503
Cdd:pfam00467    2 GDVVRVIAGPFKGKVGKVVEVDDkkNRVLVE 32
KLF1_2_4_N cd21972
N-terminal domain of Kruppel-like factor (KLF) 1, KLF2, KLF4, and similar proteins; Kruppel ...
829-968 9.51e-04

N-terminal domain of Kruppel-like factor (KLF) 1, KLF2, KLF4, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF1, KLF2, KLF4, and similar proteins.


Pssm-ID: 409230 [Multi-domain]  Cd Length: 194  Bit Score: 41.51  E-value: 9.51e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  829 PSRAEEEYEYAFDDEPTPSPQAYG----GTPNPQTPGYPDPSSPQVNPQYN-PQTPGTPAMYNTDQFSPYAAPSPQGSYQ 903
Cdd:cd21972    22 LDLEFILSNTVTSDNDNPPPPDPAypppESPESCSTVYDSDGCHPTPNAYCgPNGPGLPGHFLLAGNSPNLGPKIKTENQ 101
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 410225316  904 PS-------PSPQSYHQVAPS------PAGYQNTHSPASYHPTPSPMAYQASPSPSPVGYSPmtPGAPSPGGYNPHTP 968
Cdd:cd21972   102 EQacmpvagYSGHYGPREPQRvppappPPQYAGHFQYHGHFNMFSPPLRANHPGMSTVMLTP--LSTPPLGFLSPEEA 177
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
842-968 3.66e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 41.29  E-value: 3.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  842 DEPTPSPQAYGGTPNPQTPGYPDPSSPQVNPQynPQTPGTPAMYNTDQFSPYAAPSPQ-GSYQPSPSPQSYH-QVAPSPA 919
Cdd:NF033839  370 EKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQ--PEKPKPEVKPQPEKPKPEVKPQPEkPKPEVKPQPEKPKpEVKPQPE 447
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 410225316  920 GYQNTHSPASYHPTPSPMAYQASPSPSpVGYSPMTP----GAPSPGGYNPHTP 968
Cdd:NF033839  448 KPKPEVKPQPETPKPEVKPQPEKPKPE-VKPQPEKPkpdnSKPQADDKKPSTP 499
KOW pfam00467
KOW motif; This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, ...
596-627 7.70e-03

KOW motif; This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG.


Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 35.06  E-value: 7.70e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 410225316   596 VKDIVKVIDGPHSGREGEIRHLFRSFAFLHCK 627
Cdd:pfam00467    1 KGDVVRVIAGPFKGKVGKVVEVDDKKNRVLVE 32
 
Name Accession Description Interval E-value
NGN_Euk cd09888
Eukaryotic N-Utilization Substance G (NusG) N-terminal (NGN) domain, including plant KTF1 (KOW ...
178-265 4.33e-44

Eukaryotic N-Utilization Substance G (NusG) N-terminal (NGN) domain, including plant KTF1 (KOW domain-containing Transcription Factor 1); The N-Utilization Substance G (NusG) protein and its eukaryotic homolog, Spt5, are involved in transcription elongation and termination. NusG contains an NGN domain at its N-terminus and Kyrpides Ouzounis and Woese (KOW) repeats at its C-terminus. Spt5 forms an Spt4-Spt5 complex that is an essential RNA polymerase II elongation factor. NusG was originally discovered as an N-dependent antitermination enhancing activity in Escherichia coli, and has a variety of functions such as its involvement in RNA polymerase elongation and Rho-termination in bacteria. Orthologs of the NusG gene exist in all bacteria, but their functions and requirements are different. Spt5-like is homologous to the Spt5 proteins present in all eukaryotes, which is unique as it encodes a protein with an additional long carboxy-terminal extension that contains WG/GW motifs. Spt5-like, or KTF1 (KOW domain-containing Transcription Factor 1), is a RNA-directed DNA methylation (RdDM) pathway effector in plants.


Pssm-ID: 193577 [Multi-domain]  Cd Length: 86  Bit Score: 154.23  E-value: 4.33e-44
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  178 NLWTVKCKIGEERATAISLMRKFIAYQFTDTPLQIKSVVAPEHVKGYIYVEAYKQTHVKQAIEGVGNLRLgyWNQQMVPI 257
Cdd:cd09888     1 KLWAVKCKPGKEREIVISLMRKFLDLQRTGNPLGIKSVFARDGLKGYIYIEARKEAHVKDAIEGLRGVYL--NTIKLVPI 78

                  ....*...
gi 410225316  258 KEMTDVLK 265
Cdd:cd09888    79 KEMPDVLS 86
Spt5-NGN pfam03439
Early transcription elongation factor of RNA pol II, NGN section; Spt5p and prokaryotic NusG ...
178-264 6.66e-31

Early transcription elongation factor of RNA pol II, NGN section; Spt5p and prokaryotic NusG are shown to contain a novel 'NGN' domain. The combined NGN and KOW motif regions of Spt5 form the binding domain with Spt4. Spt5 complexes with Spt4 as a 1:1 heterodimer snf this Spt5-Spt4 complex regulates early transcription elongation by RNA polymerase II and has an imputed role in pre-mRNA processing via its physical association with mRNA capping enzymes. The Schizosaccharomyces pombe core Spt5-Spt4 complex is a heterodimer bearing a trypsin-resistant Spt4-binding domain within the Spt5 subunit.


Pssm-ID: 397481  Cd Length: 84  Bit Score: 116.53  E-value: 6.66e-31
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   178 NLWTVKCKIGEERATAISLMRKFIAYQfTDTPLQIKSVVAPEHVKGYIYVEAYKQTHVKQAIEGVGNLRLGywNQQMVPI 257
Cdd:pfam03439    1 KIWAVKCTPGQEREVALSLMRKILALA-KTNNLGIYSVFAPDGLKGYIYVEADRQAAVKRALEGIPNVRGL--VPGLVPI 77

                   ....*..
gi 410225316   258 KEMTDVL 264
Cdd:pfam03439   78 KEMEHLL 84
CTD smart01104
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
772-889 1.91e-28

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.


Pssm-ID: 215026 [Multi-domain]  Cd Length: 121  Bit Score: 110.69  E-value: 1.91e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316    772 GSQTPMYG-SGSRTPMYGSQTP----LQDGSRTPHYGSQTPPHDG--SRTPAQSGAWdPNNPNTPSRAEEEYEYAFDDEP 844
Cdd:smart01104    1 GGRTPAWGaSGSKTPAWGSRTPgtaaGGAPTARGGSGSRTPAWGGagSRTPAWGGAG-PTGSRTPAWGGASAWGNKSSEG 79
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|....*..
gi 410225316    845 TPSPQA--YGGTPNPQTPGYpdpssPQVNPQYNPQTPGTPAMYNTDQ 889
Cdd:smart01104   80 SASSWAagPGGAYGAPTPGY-----GGTPSAYGPATPGGGAMAGSAS 121
KOW_Spt5_3 cd06083
KOW domain of Spt5, repeat 3; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
471-521 7.09e-28

KOW domain of Spt5, repeat 3; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240507  Cd Length: 51  Bit Score: 106.46  E-value: 7.09e-28
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 410225316  471 YFKMGDHVKVIAGRFEGDTGLIVRVEENFVILFSDLTMHELKVLPRDLQLC 521
Cdd:cd06083     1 HFKVGDHVKVISGRHEGETGLVVKVEDDVVTVFSDLTMRELKVFPRDLQLS 51
KOW_Spt5_6 cd06086
KOW domain of Spt5, repeat 6; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
1028-1084 2.13e-26

KOW domain of Spt5, repeat 6; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240510  Cd Length: 58  Bit Score: 102.59  E-value: 2.13e-26
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 410225316 1028 EHLEPITPTKNNKVKVILGEDREATGVLLSIDGEDGIVRMDLDEQLKILNLRFLGKL 1084
Cdd:cd06086     1 EHLEPVPPEKGDRVKVIKGEDRGSTGELISIDGADGIVKMDSDGDIKILPMNFLAKL 57
KOW_Spt5_2 cd06082
KOW domain of Spt5, repeat 2; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
420-470 1.09e-25

KOW domain of Spt5, repeat 2; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240506  Cd Length: 51  Bit Score: 100.27  E-value: 1.09e-25
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 410225316  420 FQPGDNVEVCEGELINLQGKILSVDGNKITIMPKHEDLKDMLEFPAQELRK 470
Cdd:cd06082     1 FQPGDNVEVIEGELKGLQGKVESVDGDIVTIMPKHEDLKEPLEFPAKELRK 51
KOW_Spt5_5 cd06085
KOW domain of Spt5, repeat 5; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
702-751 5.16e-25

KOW domain of Spt5, repeat 5; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240509  Cd Length: 52  Bit Score: 98.33  E-value: 5.16e-25
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 410225316  702 DNELIGQTVRISQGPYKGYIGVVKDATESTARVELHSTCQTISVDRQRLT 751
Cdd:cd06085     2 RDPLIGKTVRIRKGPYKGYIGIVKDATGTTARVELHSKNKTITVDRSRLA 51
KOW_Spt5_4 cd06084
KOW domain of Spt5, repeat 4; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
597-639 1.52e-19

KOW domain of Spt5, repeat 4; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240508  Cd Length: 43  Bit Score: 82.57  E-value: 1.52e-19
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|...
gi 410225316  597 KDIVKVIDGPHSGREGEIRHLFRSFAFLHCKKLVENGGMFVCK 639
Cdd:cd06084     1 GDTVKVVDGPYKGRQGTVLHIYRGTLFLHSREVTENGGIFVVR 43
KOW_Spt5_1 cd06081
KOW domain of Spt5, repeat 1; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW ...
276-313 1.67e-17

KOW domain of Spt5, repeat 1; Spt5, an eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240505  Cd Length: 38  Bit Score: 76.74  E-value: 1.67e-17
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 410225316  276 KSWVRLKRGIYKDDIAQVDYVEPSQNTISLKMIPRIDY 313
Cdd:cd06081     1 GSWVRIKRGIYKGDLAQVDEVDENGNRVVVKLIPRIDY 38
CTD pfam12815
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
772-830 2.13e-14

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteriztic TPA motif.


Pssm-ID: 372327 [Multi-domain]  Cd Length: 71  Bit Score: 69.01  E-value: 2.13e-14
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   772 GSQTPMYGS--GSRTPMY---GSQTPL--QDGSRTPHY--GSQTPPHD--GSRTPAQSGAWDPnnPNTPS 830
Cdd:pfam12815    1 GSRTPAYNSagGSRTPAWgadGSRTPAygGAGGRTPAYnqGGKTPAWGgaGSRTPAYYGAWGG--SRTPA 68
Spt5_N pfam11942
Spt5 transcription elongation factor, acidic N-terminal; This is the very acidic N-terminal ...
93-172 5.24e-12

Spt5 transcription elongation factor, acidic N-terminal; This is the very acidic N-terminal region of the early transcription elongation factor Spt5. The Spt5-Spt4 complex regulates early transcription elongation by RNA polymerase II and has an imputed role in pre-mRNA processing via its physical association with mRNA capping enzymes. The actual function of this N-terminal domain is not known although it is dispensable for binding to Spt4.


Pssm-ID: 463406  Cd Length: 97  Bit Score: 63.05  E-value: 5.24e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316    93 EDGAEDILEKEEIEASNIDNVVLDEDRSGARRLQNLWRDQREEELGEYYMKKYAKSSvgETVYGGSDElsDDITQQQLLP 172
Cdd:pfam11942   22 EDGADDFIEDDEEDEDEEDGRRDDRRHRELDRRRELEEDEDAEEIAEYLKERYGRSS--SDAYRGDAE--EGVPQRLLLP 97
nusG PRK08559
transcription antitermination protein NusG; Validated
175-316 5.34e-12

transcription antitermination protein NusG; Validated


Pssm-ID: 181467 [Multi-domain]  Cd Length: 153  Bit Score: 64.89  E-value: 5.34e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  175 KDPNLWTVKCKIGEERATAISLMRKFIAYQftdtpLQIKSVVAPEHVKGYIYVEAYKQTHVKQAIEGVGNLRlGYwNQQM 254
Cdd:PRK08559    4 EMSMIFAVKTTAGQERNVALMLAMRAKKEN-----LPIYAILAPPELKGYVLVEAESKGAVEEAIRGIPHVR-GV-VPGE 76
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 410225316  255 VPIKEMTDVLKVVKEVANLKPKSWVRLKRGIYKDDIAQVDYVEPSQNTISLKM------IP---RIDYDRI 316
Cdd:PRK08559   77 ISFEEVEHFLKPKPIVEGIKEGDIVELIAGPFKGEKARVVRVDESKEEVTVELleaavpIPvtvRGDQVRV 147
NGN cd08000
N-Utilization Substance G (NusG) N-terminal (NGN) domain Superfamily; The N-Utilization ...
178-264 1.10e-10

N-Utilization Substance G (NusG) N-terminal (NGN) domain Superfamily; The N-Utilization Substance G (NusG) and its eukaryotic homolog Spt5 are involved in transcription elongation and termination. NusG contains an NGN domain at its N-terminus and Kyrpides Ouzounis and Woese (KOW) repeats at its C-terminus in bacteria and archaea. The eukaryotic ortholog, Spt5, is a large protein composed of an acidic N-terminus, an NGN domain, and multiple KOW motifs at its C-terminus. Spt5 forms a Spt4-Spt5 complex that is an essential RNA Polymerase II elongation factor. NusG was originally discovered as an N-dependent antitermination enhancing activity in Escherichia coli and has a variety of functions, such as being involved in RNA polymerase elongation and Rho-termination in bacteria. Orthologs of the NusG gene exist in all bacteria, but its functions and requirements are different. The diverse activities suggest that, after diverging from a common ancestor, NusG proteins became specialized in different bacteria.


Pssm-ID: 193574 [Multi-domain]  Cd Length: 99  Bit Score: 59.25  E-value: 1.10e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  178 NLWTVKCKIGEERATAISLMRKFIA---------YQFTDTPLQIKSVVAPEHVKGYIYVEAYKQTHVKQAIEGVGNLRLG 248
Cdd:cd08000     1 NWYVLFVKTGREEKVEKLLEKRFEAndieafvpkKEVPERKRGKIEEVIKPLFPGYVFVETDLSPELYELIREVPGVIGI 80
                          90
                  ....*....|....*....
gi 410225316  249 YWN---QQMVPIKEMTDVL 264
Cdd:cd08000    81 LGNgeePSPVSDEEIEMIL 99
NGN smart00738
In Spt5p, this domain may confer affinity for Spt4p. It possesses a RNP-like fold; In Spt5p, ...
178-266 1.46e-10

In Spt5p, this domain may confer affinity for Spt4p. It possesses a RNP-like fold; In Spt5p, this domain may confer affinity for Spt4p.Spt4p


Pssm-ID: 197850 [Multi-domain]  Cd Length: 106  Bit Score: 59.31  E-value: 1.46e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316    178 NLWTVKCKIGEERATAISLMRKFIAYQFTDtplQIKSVVAP-EHVK----------------GYIYVEAYKQTHVKQAIE 240
Cdd:smart00738    1 NWYAVRTTSGQEKRVAENLERKAEALGLED---KIVSILVPtEEVKeirrgkkkvverklfpGYIFVEADLEDEVWTAIR 77
                            90       100       110
                    ....*....|....*....|....*....|
gi 410225316    241 GV----GNLRLGYWnQQMVPIKEMTDVLKV 266
Cdd:smart00738   78 GTpgvrGFVGGGGK-PTPVPDDEIEKILKP 106
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
789-970 4.67e-10

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 64.02  E-value: 4.67e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   789 SQTPLQDGSRTPHYGSQTP-PHdgsrtPAQSGAWDPNNPNTPSRAEEEYEYAFDDEPtPSPQAYGGTPNPQTPGYPdPSS 867
Cdd:pfam03154  259 SQVSPQPLPQPSLHGQMPPmPH-----SLQTGPSHMQHPVPPQPFPLTPQSSQSQVP-PGPSPAAPGQSQQRIHTP-PSQ 331
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   868 PQVNPQYNP-QTPGTPAMYNTdqfsPYAAPSPQGSYQPSPSPQSY----HQVAPSPagYQ-----------------NTH 925
Cdd:pfam03154  332 SQLQSQQPPrEQPLPPAPLSM----PHIKPPPTTPIPQLPNPQSHkhppHLSGPSP--FQmnsnlppppalkplsslSTH 405
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 410225316   926 SPASYHPTPSPMAYQASPSPSPVGYSPM---TPGAPSPGGYNPHTPGS 970
Cdd:pfam03154  406 HPPSAHPPPLQLMPQSQQLPPPPAQPPVltqSQSLPPPAASHPPTSGL 453
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
727-1046 1.36e-09

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 61.90  E-value: 1.36e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   727 ATESTARVELHSTCQTISvdrqrlTTVGSRRPGGMTSTYGRT--PMYGSQTPMYGSGsrTPMYGSQTPlQDGSRTPHYGS 804
Cdd:pfam17823  170 AASPAPRTAASSTTAASS------TTAASSAPTTAASSAPATltPARGISTAATATG--HPAAGTALA-AVGNSSPAAGT 240
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   805 QTPPhDGSRTPAQSGawdpnnpnTPSRAEEEYEYAFDDEPTPSPQAYGGTPNPQTPGypDPSSPQVNPQYNPQTPGTPAM 884
Cdd:pfam17823  241 VTAA-VGTVTPAALA--------TLAAAAGTVASAAGTINMGDPHARRLSPAKHMPS--DTMARNPAAPMGAQAQGPIIQ 309
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   885 YNTDQfsPYAAPSPqgsyQPSPSPQSYHQVAPSPAGYQNTHSPASyhPTPSPMAYQASPSPSPVGYSPMTPGA------- 957
Cdd:pfam17823  310 VSTDQ--PVHNTAG----EPTPSPSNTTLEPNTPKSVASTNLAVV--TTTKAQAKEPSASPVPVLHTSMIPEVeatsptt 381
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   958 -PSPGGYNPHTPGSGIEQNSSdwvttdiQVKVRDTyLDTQVVGQT----GVIRSVTGGMCSVYLKDSEKVVSIssehlEP 1032
Cdd:pfam17823  382 qPSPLLPTQGAAGPGILLAPE-------QVATEAT-AGTASAGPTprssGDPKTLAMASCQLSTQGQYLVVTT-----DP 448
                          330
                   ....*....|....*...
gi 410225316  1033 ITPTKNNK----VKVILG 1046
Cdd:pfam17823  449 LTPALVDKmfllVVLILG 466
CTD pfam12815
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
754-819 3.52e-09

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteriztic TPA motif.


Pssm-ID: 372327 [Multi-domain]  Cd Length: 71  Bit Score: 53.99  E-value: 3.52e-09
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 410225316   754 GSRRPGGMTSTYGRTPMY-------------GSQTPMYGSGSRTPMYGsqtplQDGSRTPHYGSQTpphDGSRTPAQSG 819
Cdd:pfam12815    1 GSRTPAYNSAGGSRTPAWgadgsrtpayggaGGRTPAYNQGGKTPAWG-----GAGSRTPAYYGAW---GGSRTPAYGG 71
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
776-977 3.74e-09

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 60.94  E-value: 3.74e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   776 PMYGSGSRTPMYGSQTPLQDGSRTPHYGSQTPPHDGSRTPAQSGAWDPNNPNTPsrAEEEYEYAFDDEPTPSPQayggTP 855
Cdd:pfam03154  294 PPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPP--APLSMPHIKPPPTTPIPQ----LP 367
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   856 NPQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQgsyqPSP---SPQSyHQVAPSPAG------YQNTHS 926
Cdd:pfam03154  368 NPQSHKHPPHLSGPSPFQMNSNLPPPPALKPLSSLSTHHPPSAH----PPPlqlMPQS-QQLPPPPAQppvltqSQSLPP 442
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 410225316   927 PASYHPTPSpmAYQASPSPSPVGYSPMTPGAP----SPGGYNPHTP--GSGIEQNSS 977
Cdd:pfam03154  443 PAASHPPTS--GLHQVPSQSPFPQHPFVPGGPppitPPSGPPTSTSsaMPGIQPPSS 497
PHA03269 PHA03269
envelope glycoprotein C; Provisional
825-970 3.97e-09

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 60.51  E-value: 3.97e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  825 NPNTPSRAEEEYEYAFDDEPTPSPQayggtPNPQTPGYPDPS-SPQVNPQYNPQtpgtpamyntdqfsPYAAPSPQGSYQ 903
Cdd:PHA03269   21 NLNTNIPIPELHTSAATQKPDPAPA-----PHQAASRAPDPAvAPTSAASRKPD--------------LAQAPTPAASEK 81
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 410225316  904 PSPSPQSYHQV--APSPAGYQNTHSPasyhPTPSPM-----AYQASPSPSPVGYSPMTPgAPSPGGYNPHTPGS 970
Cdd:PHA03269   82 FDPAPAPHQAAsrAPDPAVAPQLAAA----PKPDAAeaftsAAQAHEAPADAGTSAASK-KPDPAAHTQHSPPP 150
PHA03247 PHA03247
large tegument protein UL36; Provisional
746-962 6.81e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 60.34  E-value: 6.81e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  746 DRQRLTTVGSR-RPGGMTSTYGRTPMYGSQTPMYGSGSRTPMYGSQTPLQDgsrtphygsqtpPHDGSRTPAQS-GAWDP 823
Cdd:PHA03247 2650 ERPRDDPAPGRvSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLAD------------PPPPPPTPEPApHALVS 2717
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  824 NNPNTPSRAEEEYEY-AFDDEPTPSPQAYG----GTPN----PQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYA 894
Cdd:PHA03247 2718 ATPLPPGPAAARQASpALPAAPAPPAVPAGpatpGGPArparPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRES 2797
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 410225316  895 APSPqgsyqPSPSPQSYHQVAPSPAgYQNTHSPASYHPTPSPMAYQASPSPSPVGYSPMTP-GAPSPGG 962
Cdd:PHA03247 2798 LPSP-----WDPADPPAAVLAPAAA-LPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLgGSVAPGG 2860
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
758-971 7.94e-09

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 59.64  E-value: 7.94e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   758 PGGMTSTYGRTPMYGSQTPMYGSGSRTPMYGSQ-------TPLQDGSRTPHYGSQTPPHDGSRTPAQSGAWDPNNP---- 826
Cdd:pfam09606  231 PQQMGGAPNQVAMQQQQPQQQGQQSQLGMGINQmqqmpqgVGGGAGQGGPGQPMGPPGQQPGAMPNVMSIGDQNNYqqqq 310
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   827 --NTPSRAEEEYEYAFDDEPTPS--------PQAYGGTPNPQTPGypdpssPQVNPQYNPQTPGTPAMYNTDQFSPYAAP 896
Cdd:pfam09606  311 trQQQQQQGGNHPAAHQQQMNQSvgqggqvvALGGLNHLETWNPG------NFGGLGANPMQRGQPGMMSSPSPVPGQQV 384
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 410225316   897 SPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASyHPTPSPmAYQASPSPSPVGY--SPMTPGAPSPGGyNPHTPGSG 971
Cdd:pfam09606  385 RQVTPNQFMRQSPQPSVPSPQGPGSQPPQSHPG-GMIPSP-ALIPSPSPQMSQQpaQQRTIGQDSPGG-SLNTPGQS 458
KOW_elon_Spt5 TIGR00405
transcription elongation factor Spt5; This protein contains a KOW domain, shared by bacterial ...
180-308 3.52e-08

transcription elongation factor Spt5; This protein contains a KOW domain, shared by bacterial NusG and the uL24 (previously L24p/L26e) family of ribosomal proteins. The most recent papers and crystal structures make this a transcription elongation factor rather than a ribosomal protein.


Pssm-ID: 129499 [Multi-domain]  Cd Length: 145  Bit Score: 53.74  E-value: 3.52e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   180 WTVKCKIGEERATAislmrKFIAYQFTDTPLQIKSVVAPEHVKGYIYVEAYKQTHVKQAIEGVGNLRlgywnqQMVP--- 256
Cdd:TIGR00405    1 FAVKTSVGQEKNVA-----RLMARKARKSGLEVYSILAPESLKGYILVEAETKIDMRNPIIGVPHVR------GVVEgei 69
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 410225316   257 -IKEMTDVLKVVKEVANLKPKSWVRLKRGIYKDDIAQVDYVEPSQNTISLKMI 308
Cdd:TIGR00405   70 dFEEIERFLTPKKIIESIKKGDIVEIISGPFKGERAKVIRVDESKEEVTLELI 122
PHA03377 PHA03377
EBNA-3C; Provisional
738-968 4.45e-08

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 57.37  E-value: 4.45e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  738 STCQTISVDRQRLTTVGSRRPGGMTSTygrTPMYGSQTPMYgSGSRTPMYGSQTPLQD---GSRTPHYGSQTPPHDGSRT 814
Cdd:PHA03377  686 SVFVLPSVDAGRAQPSEESHLSSMSPT---QPISHEEQPRY-EDPDDPLDLSLHPDQApppSHQAPYSGHEEPQAQQAPY 761
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  815 PaqsGAWDPNNPNTPSRAEEEyeyafddeptpsPQAYGGTPNpQTPGY--PDPSSPQvNPQY--------------NPQT 878
Cdd:PHA03377  762 P---GYWEPRPPQAPYLGYQE------------PQAQGVQVS-SYPGYagPWGLRAQ-HPRYrhswaywsqypghgHPQG 824
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  879 PGTP-AMYNTDQFSPYAAP-----SPQGSYQPSPSP----QSYHQVAPSPAGYQNTHSPASYHPTPS----PMAYQASPS 944
Cdd:PHA03377  825 PWAPrPPHLPPQWDGSAGHgqdqvSQFPHLQSETGPprlqLSQVPQLPYSQTLVSSSAPSWSSPQPRapirPIPTRFPPP 904
                         250       260
                  ....*....|....*....|....
gi 410225316  945 PSPVGYSpMTPGAPSPGGYNPHTP 968
Cdd:PHA03377  905 PMPLQDS-MAVGCDSSGTACPSMP 927
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
844-970 6.07e-08

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 57.08  E-value: 6.07e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   844 PTPSPQAYGGTPNPQTPGYPDPSSPQVN-PQYNPQTPGTP--AMYNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAG 920
Cdd:pfam03154  188 PPGTTQAATAGPTPSAPSVPPQGSPATSqPPNQTQSTAAPhtLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLP 267
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 410225316   921 YQNTHSPASYHP---------TPSPMAYQASPSPSPVGYSPMTPG----APSPGGYNPHTPGS 970
Cdd:pfam03154  268 QPSLHGQMPPMPhslqtgpshMQHPVPPQPFPLTPQSSQSQVPPGpspaAPGQSQQRIHTPPS 330
PHA03378 PHA03378
EBNA-3B; Provisional
763-962 6.27e-08

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 57.00  E-value: 6.27e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  763 STYGRTPMYGSQTPMYGSGSRTPMYGSQTPLQD-----------GSRTPHYGSQTPPHDGSRTPAQSGAWdPNNPNTPSR 831
Cdd:PHA03378  603 SQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPlrmqpitfnvlVFPTPHQPPQVEITPYKPTWTQIGHI-PYQPSPTGA 681
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  832 AEEEYEYAFDDEPTPSPQAYGGTPNPQTPgyPDPSS-PQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGSYQPSPSPQS 910
Cdd:PHA03378  682 NTMLPIQWAPGTMQPPPRAPTPMRPPAAP--PGRAQrPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAA 759
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 410225316  911 YHQVAPSPAGyqnthSPASYHPTPSPMA-------YQASPSPSP---VGYSPMTPGAPSPGG 962
Cdd:PHA03378  760 APGRARPPAA-----APGAPTPQPPPQAppapqqrPRGAPTPQPppqAGPTSMQLMPRAAPG 816
PRK10263 PRK10263
DNA translocase FtsK; Provisional
805-969 7.60e-08

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 57.02  E-value: 7.60e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  805 QTPPHDGSRTPAQSG--AWDP-NNPNTPsraeeeyeyafddEPTPSPQAYGGTPNPQtpgYPDPSSPQVNP---QYNPQT 878
Cdd:PRK10263  342 QTPPVASVDVPPAQPtvAWQPvPGPQTG-------------EPVIAPAPEGYPQQSQ---YAQPAVQYNEPlqqPVQPQQ 405
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  879 PGTPAMYNTDQFSPYAAPSP-QGSYQPSPSPQSYHQVAPSPAGYQNTHSPasYHPTPSPMAYQASPSPSPVGYSPMTPGA 957
Cdd:PRK10263  406 PYYAPAAEQPAQQPYYAPAPeQPAQQPYYAPAPEQPVAGNAWQAEEQQST--FAPQSTYQTEQTYQQPAAQEPLYQQPQP 483
                         170
                  ....*....|..
gi 410225316  958 PSPGGYNPHTPG 969
Cdd:PRK10263  484 VEQQPVVEPEPV 495
NGN_Arch cd09887
Archaeal N-Utilization Substance G (NusG) N-terminal (NGN) domain; The N-Utilization Substance ...
179-246 1.05e-07

Archaeal N-Utilization Substance G (NusG) N-terminal (NGN) domain; The N-Utilization Substance G (NusG) protein and its eukaryotic homolog, Spt5, are involved in transcription elongation and termination. Transcription in archaea has a eukaryotic-type transcription apparatus, but contains bacterial-type transcription factors. NusG is one of the few archaeal transcription factors that has orthologs in both bacteria and eukaryotes. Archaeal NusG is similar to bacterial NusG, composed of an NGN domain and a Kyrpides Ouzounis and Woese (KOW) repeat. The eukaryotic ortholog, Spt5, is a large protein composed of an acidic N-terminus, an NGN domain, and multiple KOW motifs at its C-terminus. NusG was originally discovered as a N-dependent antitermination enhancing activity in Escherichia coli and has a variety of functions, such as being involved in RNA polymerase elongation and Rho-termination in bacteria. Archaeal NusG forms a complex with DNA-directed RNA polymerase subunit E (rpoE) that is similar to the Spt5-Spt4 complex in eukaryotes.


Pssm-ID: 193576  Cd Length: 82  Bit Score: 50.23  E-value: 1.05e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 410225316  179 LWTVKCKIGEERATAISLMRKFiayqfTDTPLQIKSVVAPEHVKGYIYVEAYKQTHVKQAIEGVGNLR 246
Cdd:cd09887     2 IYAVKTTAGQERNVADLLAMRA-----EKENLDVYSILVPEELKGYVFVEAEDPDRVEELIRGIPHVR 64
PHA03247 PHA03247
large tegument protein UL36; Provisional
762-982 1.39e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.10  E-value: 1.39e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  762 TSTYGRTPMYGSQTPMygsgSRTPMYGSQTPLQDGSRTPHYGSQTPPHDGSR-TPAQSGAWDPNNPNTPsRAEEEYEYAF 840
Cdd:PHA03247 2817 ALPPAASPAGPLPPPT----SAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRrPPSRSPAAKPAAPARP-PVRRLARPAV 2891
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  841 DDEPTPSPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQtpgtpamyntdqfsPYAAPSPQGSYQPSPSPQSYHQVAPSPAG 920
Cdd:PHA03247 2892 SRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQ--------------PQPPPPPPPRPQPPLAPTTDPAGAGEPSG 2957
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 410225316  921 YQNTHSPASYHPTPSPMAYQASPSPSPvgySPMTPGAPSPGgyNPHTPGSGIeqnsSDWVTT 982
Cdd:PHA03247 2958 AVPQPWLGALVPGRVAVPRFRVPQPAP---SREAPASSTPP--LTGHSLSRV----SSWASS 3010
PHA03247 PHA03247
large tegument protein UL36; Provisional
747-974 2.16e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.71  E-value: 2.16e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  747 RQRLTTVGSRRPGGMTSTYGRTPMYGSQ--TPMYGSGSRTPmyGSQTPLQDGSRTPHYGSQTPPHDGSRTPAQSGAWDPN 824
Cdd:PHA03247 2711 APHALVSATPLPPGPAAARQASPALPAApaPPAVPAGPATP--GGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAV 2788
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  825 NPNTPSRAEEEYEYAFDDEPTPSPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQTPGTPAMYntdqFSPYAAPSPQGSYQP 904
Cdd:PHA03247 2789 ASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPS----LPLGGSVAPGGDVRR 2864
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 410225316  905 SPSPQSyhqVAPSPAGyqNTHSPASYHPTPS-PMAYQASPSPSPVGYSPMTPGAPSPGGYNPHTPGSGIEQ 974
Cdd:PHA03247 2865 RPPSRS---PAAKPAA--PARPPVRRLARPAvSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQ 2930
KOW cd00380
KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known ...
423-469 2.44e-07

KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. The KOW motif contains an invariants glycine residue and comprises alternating blocks of hydrophilic and hydrophobic residues.


Pssm-ID: 240504  Cd Length: 49  Bit Score: 48.37  E-value: 2.44e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 410225316  423 GDNVEVCEGELINLQGKILSVDG--NKITIMPKHEDLKDMLEFPAQELR 469
Cdd:cd00380     1 GDVVRVLRGPYKGREGVVVDIDPrfGIVTVKGATGSKGAELKVRFDDVD 49
PHA03378 PHA03378
EBNA-3B; Provisional
799-968 6.75e-07

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 53.53  E-value: 6.75e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  799 TPHYGSQTPPHDGSRTPAQSGAWDPNNPNTPSRAEEE---YEYAFDDEPTPSPQAYGGTPNPQTPGYPDPS-SPQVNPQ- 873
Cdd:PHA03378  582 TSQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETsapRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHqPPQVEITp 661
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  874 ------------YNPQTPG---------TPAMYNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASYHP 932
Cdd:PHA03378  662 ykptwtqighipYQPSPTGantmlpiqwAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAP 741
                         170       180       190
                  ....*....|....*....|....*....|....*...
gi 410225316  933 TPS--PMAYQASPSPSPVGYSPMTPGAPSPGGYNPHTP 968
Cdd:PHA03378  742 GRArpPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPP 779
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
738-961 1.17e-06

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 53.00  E-value: 1.17e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   738 STCQTISVdrqrlTTVGSRRPGGMTStyGRTPMYGSQTPM-YGSGSRTPMYGS-----QTPLQDG-SRTPHYGSQTP--- 807
Cdd:pfam05109  463 STGPTVST-----ADVTSPTPAGTTS--GASPVTPSPSPRdNGTESKAPDMTSptsavTTPTPNAtSPTPAVTTPTPnat 535
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   808 -PHDGSRTPAQSGAWDPNNPNTPSRAEEEyeyafddePTP----------SPQAYGGTPnpqTPGYPDPSSPQVNPQYNP 876
Cdd:pfam05109  536 sPTLGKTSPTSAVTTPTPNATSPTPAVTT--------PTPnatiptlgktSPTSAVTTP---TPNATSPTVGETSPQANT 604
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   877 Q------TPGTPAMYNTDQFSPYAAPSPQGSYQPSPS------PQSYHQ-VAPSPAGYQNTHSP--ASYHPTPSPMAYQA 941
Cdd:pfam05109  605 TnhtlggTSSTPVVTSPPKNATSAVTTGQHNITSSSTssmslrPSSISEtLSPSTSDNSTSHMPllTSAHPTGGENITQV 684
                          250       260
                   ....*....|....*....|
gi 410225316   942 SPSPSPVGYSPMTPGAPSPG 961
Cdd:pfam05109  685 TPASTSTHHVSTSSPAPRPG 704
PBP1 COG5180
PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification]; ...
806-977 1.38e-06

PAB1-binding protein, interacts with poly(A)-binding protein [RNA processing and modification];


Pssm-ID: 444064 [Multi-domain]  Cd Length: 548  Bit Score: 52.37  E-value: 1.38e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  806 TPPHDGSRTPAQSgawdPNNPNTPSRAEEEYEYAFDD-EPTPSPQAYGGTPNP----QTPGYPDPSSPQVNPQYNPQT-- 878
Cdd:COG5180   218 TGGADHPRPEAAS----SPKVDPPSTSEARSRPATVDaQPEMRPPADAKERRRaaigDTPAAEPPGLPVLEAGSEPQSda 293
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  879 PGTPAMYNTDQFSPYAAPSPQGSYQPSPS-----PQSYHQVAPSPAGYQNTHSPASYHPTPSPMAYQASPSpspvgySPM 953
Cdd:COG5180   294 PEAETARPIDVKGVASAPPATRPVRPPGGardpgTPRPGQPTERPAGVPEAASDAGQPPSAYPPAEEAVPG------KPL 367
                         170       180
                  ....*....|....*....|....*...
gi 410225316  954 TPGAPSPG--GYN--PHTPGSGIEQNSS 977
Cdd:COG5180   368 EQGAPRPGssGGDgaPFQPPNGAPQPGL 395
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
846-974 2.76e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 51.69  E-value: 2.76e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   846 PSPQAYGGTPNPQTPGYPDPSS-PQVNPqynpqTPGTPAMynTDQFSPYAAPSPQGSyQPSPSPQSYHQVAPS--PAGYQ 922
Cdd:pfam03154  172 PVLQAQSGAASPPSPPPPGTTQaATAGP-----TPSAPSV--PPQGSPATSQPPNQT-QSTAAPHTLIQQTPTlhPQRLP 243
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 410225316   923 NTHSPASYHPTPSPMAY-QASPSPSPVGYSPMTPGAPSPGGYNPHTPGSGIEQ 974
Cdd:pfam03154  244 SPHPPLQPMTQPPPPSQvSPQPLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQ 296
KOW cd00380
KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known ...
597-634 2.83e-06

KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. The KOW motif contains an invariants glycine residue and comprises alternating blocks of hydrophilic and hydrophobic residues.


Pssm-ID: 240504  Cd Length: 49  Bit Score: 45.29  E-value: 2.83e-06
                          10        20        30
                  ....*....|....*....|....*....|....*...
gi 410225316  597 KDIVKVIDGPHSGREGEIRHLFRSFAFLHCKKLVENGG 634
Cdd:cd00380     1 GDVVRVLRGPYKGREGVVVDIDPRFGIVTVKGATGSKG 38
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
791-978 3.50e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 51.33  E-value: 3.50e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  791 TPLQDGSRTPHYGSQ-----------TPPHDGSRTPAqSGAWDPNNPNTPSRAEE-EYEYAFDDEPTPSPQAYGGTPNPQ 858
Cdd:PHA03307   26 ATPGDAADDLLSGSQgqlvsdsaelaAVTVVAGAAAC-DRFEPPTGPPPGPGTEApANESRSTPTWSLSTLAPASPAREG 104
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  859 TPGYPDPSSPqvnpqynpqtPGTPAMYNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNThSPASYHPTP---- 934
Cdd:PHA03307  105 SPTPPGPSSP----------DPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPA-AVASDAASSrqaa 173
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*.
gi 410225316  935 --SPMAYQASPSPSPVGYSPMTPGAPSPGGYNPHTPGSGIEQNSSD 978
Cdd:PHA03307  174 lpLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASS 219
KOW cd00380
KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known ...
707-750 3.51e-06

KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. The KOW motif contains an invariants glycine residue and comprises alternating blocks of hydrophilic and hydrophobic residues.


Pssm-ID: 240504  Cd Length: 49  Bit Score: 44.90  E-value: 3.51e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 410225316  707 GQTVRISQGPYKGYIGVVKDATEST--ARVELH--STCQTISVDRQRL 750
Cdd:cd00380     1 GDVVRVLRGPYKGREGVVVDIDPRFgiVTVKGAtgSKGAELKVRFDDV 48
KOW cd00380
KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known ...
475-519 4.49e-06

KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. The KOW motif contains an invariants glycine residue and comprises alternating blocks of hydrophilic and hydrophobic residues.


Pssm-ID: 240504  Cd Length: 49  Bit Score: 44.52  E-value: 4.49e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 410225316  475 GDHVKVIAGRFEGDTGLIVRVEENFVIL----FSDLTMHELKVLPRDLQ 519
Cdd:cd00380     1 GDVVRVLRGPYKGREGVVVDIDPRFGIVtvkgATGSKGAELKVRFDDVD 49
PRK10263 PRK10263
DNA translocase FtsK; Provisional
865-976 1.43e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 49.31  E-value: 1.43e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  865 PSSPQVNPQYNPQTpgtpamynTDQFSPYAAPSPQGSYQPSPSPQSYHQ-VAPSPAGYQNTHSPASYHPTPSPMAYQASP 943
Cdd:PRK10263  740 PHEPLFTPIVEPVQ--------QPQQPVAPQQQYQQPQQPVAPQPQYQQpQQPVAPQPQYQQPQQPVAPQPQYQQPQQPV 811
                          90       100       110
                  ....*....|....*....|....*....|...
gi 410225316  944 SPSPVGYSPMTPGAPSPGGYNPHTPGSGIEQNS 976
Cdd:PRK10263  812 APQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDT 844
PHA03291 PHA03291
envelope glycoprotein I; Provisional
839-986 1.72e-05

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 48.41  E-value: 1.72e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  839 AFDDEPTPSPQAYGGTPnpqTPGYPDPSSPQVNPQYNPqtpgtpamynTDQFSPyAAPSPQGSYQPSPspqsyhQVAPSP 918
Cdd:PHA03291  165 AFPAEGTLAAPPLGEGS---ADGSCDPALPLSAPRLGP----------ADVFVP-ATPRPTPRTTASP------ETTPTP 224
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 410225316  919 AgyqNTHSPASyHPTPSPMAYQASPSPSPVGYSPMTPGAPSPGGYNPHTPGSGIEQNSSDWVTTDIQV 986
Cdd:PHA03291  225 S---TTTSPPS-TTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPAPEASRYELTVTQI 288
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
816-968 1.74e-05

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 46.18  E-value: 1.74e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   816 AQSGAWDPNNPNTPSR-AEEEYEYAFDDEPTPSPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQfsPYA 894
Cdd:pfam15240   16 AQSSSEDVSQEDSPSLiSEEEGQSQQGGQGPQGPPPGGFPPQPPASDDPPGPPPPGGPQQPPPQGGKQKPQGPPP--QGG 93
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 410225316   895 APSPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASYHPTPSPMAYQASPSPSPvGYSPMTPGAPSPGGyNPHTP 968
Cdd:pfam15240   94 PRPPPGKPQGPPPQGGNQQQGPPPPGKPQGPPPQGGGPPPQGGNQQGPPPPPP-GNPQGPPQRPPQPG-NPQGP 165
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
795-983 3.08e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.06  E-value: 3.08e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  795 DGSRTPHYGSQTPPHDGSR--TPAQSGAWDPNNPNTPSRAEEEYEYAFDDEPTPSPQAYGGTPNPQTPGYPDPSSPQVNP 872
Cdd:PRK07764  596 GGEGPPAPASSGPPEEAARpaAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGG 675
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  873 QYNPQTPGTPAMyntdqfSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASYHPTPSPMAYQASPSPSPVGYSP 952
Cdd:PRK07764  676 AAPAAPPPAPAP------AAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPP 749
                         170       180       190
                  ....*....|....*....|....*....|.
gi 410225316  953 MTPGAPSPGGYNPHTPGSGIEQNSSDWVTTD 983
Cdd:PRK07764  750 DPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
843-966 3.50e-05

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 48.06  E-value: 3.50e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  843 EPTPSPQAYGGTPNPQTPGyPDPSSPQVNPQYNPQTPGTPAmyntdqfsPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQ 922
Cdd:PRK07764  391 AGAPAAAAPSAAAAAPAAA-PAPAAAAPAAAAAPAPAAAPQ--------PAPAPAPAPAPPSPAGNAPAGGAPSPPPAAA 461
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....
gi 410225316  923 NTHSPASYHPTPSPMAYQASPSPSPVGYSPMTPGAPSPGGYNPH 966
Cdd:PRK07764  462 PSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAPAAPAG 505
KOW smart00739
KOW (Kyprides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.
471-498 5.62e-05

KOW (Kyprides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.


Pssm-ID: 128978  Cd Length: 28  Bit Score: 40.77  E-value: 5.62e-05
                            10        20
                    ....*....|....*....|....*...
gi 410225316    471 YFKMGDHVKVIAGRFEGDTGLIVRVEEN 498
Cdd:smart00739    1 KFEVGDTVRVIAGPFKGKVGKVLEVDGE 28
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
791-960 6.21e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.07  E-value: 6.21e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   791 TPLQDGSRTPHYGSQTPPHDGSRTPAQSGAWDPNNPNTPSRAEEEYE--YAFDDEPTPSPQAYGGTPNPQTPGYPDP--- 865
Cdd:pfam03154   75 SPLKSAKRQREKGASDTEEPERATAKKSKTQEISRPNSPSEGEGESSdgRSVNDEGSSDPKDIDQDNRSTSPSIPSPqdn 154
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   866 -----SS----------PQVNPQYNPQTPGTPAMYNTDQfSPYAAPSPQGsyqPSPSPQSYHQVAPSPAGYQNTHSPASY 930
Cdd:pfam03154  155 esdsdSSaqqqilqtqpPVLQAQSGAASPPSPPPPGTTQ-AATAGPTPSA---PSVPPQGSPATSQPPNQTQSTAAPHTL 230
                          170       180       190
                   ....*....|....*....|....*....|
gi 410225316   931 HPTPSPMAYQASPSPSPvGYSPMTPGAPSP 960
Cdd:pfam03154  231 IQQTPTLHPQRLPSPHP-PLQPMTQPPPPS 259
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
784-974 7.90e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 46.95  E-value: 7.90e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   784 TPMYGSQTPLQDGSRTPHYGSQTPPHDGSRTPAQSGawdpnnpntpsraeeeYEYAFDDEP------------TPSPQAY 851
Cdd:pfam09770  108 AARAAQSSAQPPASSLPQYQYASQQSQQPSKPVRTG----------------YEKYKEPEPipdlqvdaslwgVAPKKAA 171
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   852 GGTPNPQTPGYPDPS--------------------SPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGSYQPSP----- 906
Cdd:pfam09770  172 APAPAPQPAAQPASLpapsrkmmsleeveaamraqAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPqqqpq 251
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 410225316   907 SPQSYHQVAPSPAGYQNTHSPASYHPTPS--PMAYQASPSPSPVGYSPM----TPGAPSPGGYNPHTPGSGIEQ 974
Cdd:pfam09770  252 QPQQHPGQGHPVTILQRPQSPQPDPAQPSiqPQAQQFHQQPPPVPVQPTqilqNPNRLSAARVGYPQNPQPGVQ 325
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
844-958 9.02e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 46.60  E-value: 9.02e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  844 PTPSPQAYGGTPNPQTPGyPDPSSPQVNPQYNPQTPGTPAmyntdqfsPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQN 923
Cdd:PRK14959  387 EGPASGGAATIPTPGTQG-PQGTAPAAGMTPSSAAPATPA--------PSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMP 457
                          90       100       110
                  ....*....|....*....|....*....|....*.
gi 410225316  924 THSPASYHPTPSPMAYQASPSPS-PVGYSPMTPGAP 958
Cdd:PRK14959  458 EASPVPGAPDSVASASDAPPTLGdPSDTAEHTPSGP 493
KOW pfam00467
KOW motif; This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, ...
706-737 9.60e-05

KOW motif; This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG.


Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 40.45  E-value: 9.60e-05
                           10        20        30
                   ....*....|....*....|....*....|..
gi 410225316   706 IGQTVRISQGPYKGYIGVVKDATESTARVELH 737
Cdd:pfam00467    1 KGDVVRVIAGPFKGKVGKVVEVDDKKNRVLVE 32
PHA03247 PHA03247
large tegument protein UL36; Provisional
796-962 1.01e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.86  E-value: 1.01e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  796 GSRTPHYGSQTPPHDGSRT-PAQSGAWDPNNPNTPSRAEEEyeyafDDEPTP-SPQAYGGTP-NPQTPGYPDPSSPQVNP 872
Cdd:PHA03247 2549 GDPPPPLPPAAPPAAPDRSvPPPRPAPRPSEPAVTSRARRP-----DAPPQSaRPRAPVDDRgDPRGPAPPSPLPPDTHA 2623
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  873 QYNPQTPGTPAMYNTDQFSPYAAPSP-QGSYQPSPSPQSYHQVAPSPAGYQNTHSPA------SYHPTPSPMAYQASPSP 945
Cdd:PHA03247 2624 PDPPPPSPSPAANEPDPHPPPTVPPPeRPRDDPAPGRVSRPRRARRLGRAAQASSPPqrprrrAARPTVGSLTSLADPPP 2703
                         170       180
                  ....*....|....*....|.
gi 410225316  946 SPVGYSP----MTPGAPSPGG 962
Cdd:PHA03247 2704 PPPTPEPaphaLVSATPLPPG 2724
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
845-960 1.30e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 46.06  E-value: 1.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   845 TPSPQAYGGTPNPQTPGY--PD-----PSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPS 917
Cdd:pfam05109  422 SKAPESTTTSPTLNTTGFaaPNtttglPSSTHVPTNLTAPASTGPTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTES 501
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|...
gi 410225316   918 PAgyQNTHSPASYHPTPSPMAyqasPSPSPVGYSPmTPGAPSP 960
Cdd:pfam05109  502 KA--PDMTSPTSAVTTPTPNA----TSPTPAVTTP-TPNATSP 537
PHA03379 PHA03379
EBNA-3A; Provisional
795-990 1.71e-04

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 45.82  E-value: 1.71e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  795 DGSRTPHYGSQTPPHDGSRTPA-QSgawDPNNPNTPSRAEEEYEYAFDDEPTPSPQAYGGTPNPQTPGYPDPSSPqVNPq 873
Cdd:PHA03379  406 EKASEPTYGTPRPPVEKPRPEVpQS---LETATSHGSAQVPEPPPVHDLEPGPLHDQHSMAPCPVAQLPPGPLQD-LEP- 480
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  874 yNPQTPGTPamyNTDQFSPYAAPSPQGSY-QP-SPSPQSYHQVAPSPAgyqnthspasyhpTPSPMAYQASPSPSPVGYS 951
Cdd:PHA03379  481 -GDQLPGVV---QDGRPACAPVPAPAGPIvRPwEASLSQVPGVAFAPV-------------MPQPMPVEPVPVPTVALER 543
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*...
gi 410225316  952 PMTPGAPSPGGYNPHTPgSGIEQNSSDW---------VTTDIQVKVRD 990
Cdd:PHA03379  544 PVCPAPPLIAMQGPGET-SGIVRVRERWrpapwtpnpPRSPSQMSVRD 590
SP7_N cd22542
N-terminal domain of transcription factor Specificity Protein (SP) 7; Specificity Proteins ...
802-970 1.79e-04

N-terminal domain of transcription factor Specificity Protein (SP) 7; Specificity Proteins (SPs) are transcription factors that are involved in many cellular processes, including cell differentiation, cell growth, apoptosis, immune responses, response to DNA damage, and chromatin remodeling. SP7, also called Osterix (Osx) in humans, is highly conserved among bone-forming vertebrates. It plays a major role, along with Runx2 and Dlx5 in driving the differentiation of mesenchymal precursor cells into osteoblasts and eventually osteocytes. SP7 also plays a regulatory role by inhibiting chondrocyte differentiation, maintaining the balance between differentiation of mesenchymal precursor cells into ossified bone or cartilage. Mutations of this gene have been associated with multiple dysfunctional bone phenotypes in vertebrates. SP7 is thought to play a role in diseases such as Osteogenesis imperfecta. SP7 belongs to a family of proteins, called the SP/Kruppel or Krueppel-like Factor (KLF) family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. SP factors preferentially bind GC boxes, while KLFs bind CACCC boxes. Another characteristic hallmark of SP factors is the presence of the Buttonhead (BTD) box CXCPXC, just N-terminal to the zinc fingers. The function of the BTD box is unknown, but it is thought to play an important physiological role. Another feature of most SP factors is the presence of a conserved amino acid stretch, the so-called SP box, located close to the N-terminus. SP factors may be separated into three groups based on their domain architecture and the similarity of their N-terminal transactivation domains: SP1-4, SP5, and SP6-9. The transactivation domains between the three groups are not homologous to one another. This model represents the N-terminal domain of SP7.


Pssm-ID: 411691 [Multi-domain]  Cd Length: 297  Bit Score: 44.89  E-value: 1.79e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  802 YGSQTPPHDgSRTPAQSGAWDPNNPNTP--------SRAEE----EYEYAFDD-----EPTPSPQA---YGGTPNPQTPG 861
Cdd:cd22542    26 FGGSSPIRD-SATPGKPGNNPGKKPYSLgsdlssakSRSSElmgdSYTATFSSgnglmSPSGSPQAsttYGNDYNPFSHS 104
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  862 YPDPSSPQ----VNPQYNPQTPGTPAMYNT-DQFSPY-----AAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASYH 931
Cdd:cd22542   105 FPTSSGSQdpslLVSKGHPSADCLPSVYTSlDMAHPYgswykTGIHPGISSSSTNATASWWDMHSNTNWLSAQGQPDGLQ 184
                         170       180       190
                  ....*....|....*....|....*....|....*....
gi 410225316  932 PTPSPMAYQASPSPSPVGYSPMTPgaPSPGGYNPHTPGS 970
Cdd:cd22542   185 ASLQPVPAQTPLNPQLPSYTEFTT--LNPAPYPAVGISS 221
PHA03264 PHA03264
envelope glycoprotein D; Provisional
783-972 1.97e-04

envelope glycoprotein D; Provisional


Pssm-ID: 223029 [Multi-domain]  Cd Length: 416  Bit Score: 45.00  E-value: 1.97e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  783 RTPMYGSQTPLQDGSR---------TPHYgsQTPPHDgsrtpAQSGAWDPNNPNTPSRAeeeYEYAFDDEPTPSPQayGG 853
Cdd:PHA03264  206 RGYTFGACFPDEDYEQrkvlrltylTQYY--PQEAHK-----AIVDYWFMRHGGVVPPY---FEESKGYEPPPAPS--GG 273
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  854 TPNPqtPGYPDPsspQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGSYQPSPSPQSyhqvapspagyqnthspasyhPT 933
Cdd:PHA03264  274 SPAP--PGDDRP---EAKPEPGPVEDGAPGRETGGEGEGPEPAGRDGAAGGEPKPGP---------------------PR 327
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|.
gi 410225316  934 PSPMAYQAS--PSPSPVGYSPMTPGAPSPGGYNPHTPGSGI 972
Cdd:PHA03264  328 PAPDADRPEgwPSLEAITFPPPTPATPAVPRARPVIVGTGI 368
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
788-970 2.13e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.55  E-value: 2.13e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  788 GSQTPLQDGSRTPHYGSQTPPHDGSRTPAQSGAWDPNNPNTPSRAeeeyeyAFDDEPTPSPQAYGGTPNPQTPGYPDPSS 867
Cdd:PHA03307  216 SASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRP------APITLPTRIWEASGWNGPSSRPGPASSSS 289
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  868 P--QVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGSyQPSPSPQSyhqVAPSPAGyqnTHSPASYHPTPSPmayqASPSP 945
Cdd:PHA03307  290 SprERSPSPSPSSPGSGPAPSSPRASSSSSSSRESS-SSSTSSSS---ESSRGAA---VSPGPSPSRSPSP----SRPPP 358
                         170       180
                  ....*....|....*....|....*
gi 410225316  946 SPVGYSPMTPGAPSPGGYNPHTPGS 970
Cdd:PHA03307  359 PADPSSPRKRPRPSRAPSSPAASAG 383
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
824-960 2.18e-04

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 45.14  E-value: 2.18e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  824 NNPNTPSRAEEEYEYAFDD-EPTPSPQAYGGTPN-PQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQG- 900
Cdd:NF033839  249 DNVNTKVEIENTVHKIFADmDAVVTKFKKGLTQDtPKEPGNKKPSAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKp 328
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 410225316  901 SYQPSPSPQSYH-QVAPSPAGYQNTHSPASYHPTPSPMAYQASPSPSpvgySPMTPGAPSP 960
Cdd:NF033839  329 KPEVKPQPEKPKpEVKPQLETPKPEVKPQPEKPKPEVKPQPEKPKPE----VKPQPETPKP 385
KOW pfam00467
KOW motif; This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, ...
475-503 2.21e-04

KOW motif; This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG.


Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 39.29  E-value: 2.21e-04
                           10        20        30
                   ....*....|....*....|....*....|.
gi 410225316   475 GDHVKVIAGRFEGDTGLIVRVEE--NFVILF 503
Cdd:pfam00467    2 GDVVRVIAGPFKGKVGKVVEVDDkkNRVLVE 32
dnaA PRK14086
chromosomal replication initiator protein DnaA;
785-949 2.30e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 45.20  E-value: 2.30e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  785 PMY-GSQTPLQDGSRTPHYGSQTPPHDGSRTPAQSGAWDPNnPNTPSRAEEEYEYAFDDEPTPSPQAYGGTPNPQTPGYP 863
Cdd:PRK14086  103 RRTsEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTAR-PAYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYA 181
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  864 DPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGSY-------QPSPSPQSYHQV--APSPAGYQNTHSPASYHPTP 934
Cdd:PRK14086  182 SPASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRprrdrtdRPEPPPGAGHVHrgGPGPPERDDAPVVPIRPSAP 261
                         170
                  ....*....|....*
gi 410225316  935 SPMAYQASPSPSPVG 949
Cdd:PRK14086  262 GPLAAQPAPAPGPGE 276
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
773-934 2.46e-04

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 44.65  E-value: 2.46e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   773 SQTPMYGSGSRTPMYGSQT----PLQDG-SRTPHYGSQTPPHDgSRTPAQSGAWDP--NNPNTPSRAEEEYEYAFDDEPT 845
Cdd:pfam05539  215 STEPVGTQGTTTSSNPEPQteppPSQRGpSGSPQHPPSTTSQD-QSTTGDGQEHTQrrKTPPATSNRRSPHSTATPPPTT 293
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   846 PSPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPyaaPSPQ------GSYQPSpSPQSYHQVAPSPA 919
Cdd:pfam05539  294 KRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTTQNLVDCKELDP---PKPNsicygvGIYNEA-LPRGCDIVVPLCS 369
                          170
                   ....*....|....*
gi 410225316   920 GYqNTHSPASYHPTP 934
Cdd:pfam05539  370 TY-TIMCMDTYYSKP 383
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
815-970 2.49e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 45.36  E-value: 2.49e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  815 PAQSGAWDPNNPNTPSRAEEEYEYAfdDEPTPS---PQAYGGTPNPQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFS 891
Cdd:PRK07764  592 PGAAGGEGPPAPASSGPPEEAARPA--APAAPAapaAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGW 669
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  892 PYAAPSPQGSyQPSPSPQSYHQVAPS--PAGYQNTHSPASYHP---------TPSPMAYQASPSPSPVGYSPMTPGAPSP 960
Cdd:PRK07764  670 PAKAGGAAPA-APPPAPAPAAPAAPAgaAPAQPAPAPAATPPAgqaddpaaqPPQAAQGASAPSPAADDPVPLPPEPDDP 748
                         170
                  ....*....|
gi 410225316  961 GGYNPHTPGS 970
Cdd:PRK07764  749 PDPAGAPAQP 758
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
816-960 2.56e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 45.14  E-value: 2.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   816 AQSGAWDPNNPNTPSRAEEEYEYAFDDEPTPSPQayGGTPNPQTPGYPDPSSP-----QVNPQYNPQTpgTPAMYNTDQF 890
Cdd:pfam03154  176 AQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQ--GSPATSQPPNQTQSTAAphtliQQTPTLHPQR--LPSPHPPLQP 251
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   891 SPYAAPSPQGSYQPSPSPQSY------------------HQVAPSPAGYQNTHSPASYHPTPSPMAyqASPSPSPVGYSP 952
Cdd:pfam03154  252 MTQPPPPSQVSPQPLPQPSLHgqmppmphslqtgpshmqHPVPPQPFPLTPQSSQSQVPPGPSPAA--PGQSQQRIHTPP 329

                   ....*...
gi 410225316   953 MTPGAPSP 960
Cdd:pfam03154  330 SQSQLQSQ 337
PTZ00395 PTZ00395
Sec24-related protein; Provisional
788-966 2.74e-04

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 45.07  E-value: 2.74e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  788 GSQTPLQDGSRTPHYGSQTPP-HDGSRTPAQSGAWDPNNPNTPSRAEEEYEYAFDDEPTPSPQAYGGTP--NP--QTPGY 862
Cdd:PTZ00395  345 GSPNAASAGAPFNGLGNQADGgHINQVHPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFSNAGysNPgnSNPGY 424
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  863 PDP---SSPQVNPQY------NPQTPGTPamYNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASYHPT 933
Cdd:PTZ00395  425 NNApnsNTPYNNPPNsntpysNPPNSNPP--YSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRAANQPAANLPT 502
                         170       180       190
                  ....*....|....*....|....*....|...
gi 410225316  934 PSPMAyqASPSPSPVGYSPMTPGAPSPGGYNPH 966
Cdd:PTZ00395  503 ANQPA--ANNFHGAAGNSVGNPFASRPFGSAPY 533
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
796-968 2.99e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 2.99e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  796 GSRTPHYGSQTPPHDGSRTPAQSGAWDPnnPNTPSRAEEEYEYAFDDEPTPSPQAYGGTPNPQTPGYPDPSSPQvnPQYN 875
Cdd:PHA03307  773 ALLEPAEPQRGAGSSPPVRAEAAFRRPG--RLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGAAAR--PPPA 848
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  876 PQTPGTPAMyntDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASyHPTPSPMAyqaspsPSPVGYSPMTP 955
Cdd:PHA03307  849 RSSESSKSK---PAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAG-APAPRPRP------APRVKLGPMPP 918
                         170       180
                  ....*....|....*....|
gi 410225316  956 GAPSP-GGY------NPHTP 968
Cdd:PHA03307  919 GGPDPrGGFrrvppgDLHTP 938
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
767-960 3.43e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.78  E-value: 3.43e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  767 RTPMYGSQTPMYGSGSRTPMYGSQTPLQDGSRTPhygsQTPPHDGSRTPAQSGAwDPNNPNTPSRAEeeyeyafdDEPTP 846
Cdd:PHA03307   64 RFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAP----ASPAREGSPTPPGPSS-PDPPPPTPPPAS--------PPPSP 130
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  847 SPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTdqfSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGyqnths 926
Cdd:PHA03307  131 APDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQ---AALPLSSPEETARAPSSPPAEPPPSTPPAA------ 201
                         170       180       190
                  ....*....|....*....|....*....|....
gi 410225316  927 PASYHPTPSPMAyqASPSPSPVGYSPMTPGAPSP 960
Cdd:PHA03307  202 ASPRPPRRSSPI--SASASSPAPAPGRSAADDAG 233
PHA03378 PHA03378
EBNA-3B; Provisional
758-971 3.44e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 44.67  E-value: 3.44e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  758 PGGMTSTYGRTPMYGSQTPMYGSGSRTPMYGSQTPLQDGSRTPhygSQTPPHDGSRTPAQSGAWDPNnPNTPSRAEEEYE 837
Cdd:PHA03378  659 ITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPPPRAP---TPMRPPAAPPGRAQRPAAATG-RARPPAAAPGRA 734
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  838 YAFDDEPTPSPqayggtPNPQTPGYPDPSSPQVNPQYNPQ-TPGTPamynTDQFSPYAAPSPQGSYQPSPSPQSYHQVAP 916
Cdd:PHA03378  735 RPPAAAPGRAR------PPAAAPGRARPPAAAPGRARPPAaAPGAP----TPQPPPQAPPAPQQRPRGAPTPQPPPQAGP 804
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 410225316  917 SPAGYQNTHSPASYHPTPSPMAYQASP---SPSPVGYSPMTPGAPSPGGYNPhTPGSG 971
Cdd:PHA03378  805 TSMQLMPRAAPGQQGPTKQILRQLLTGgvkRGRPSLKKPAALERQAAAGPTP-SPGSG 861
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
791-961 3.73e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.78  E-value: 3.73e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  791 TPLQDGSRTPHYGSQTPPHDGSRTPAQSGAWDP---NNPNTPSRAEEEYEYAFDDEPTPSPQAYGGTPNPQTPGYPDPSS 867
Cdd:PHA03307  200 AAASPRPPRRSSPISASASSPAPAPGRSAADDAgasSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEASGWNGPS 279
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  868 PQVNPQYNPQTPGtpamyntdqfSPYAAPSP---QGSYQPSPSPQSYHQVAPSPAGyqnTHSPASYHPTPSPMAYQASPS 944
Cdd:PHA03307  280 SRPGPASSSSSPR----------ERSPSPSPsspGSGPAPSSPRASSSSSSSRESS---SSSTSSSSESSRGAAVSPGPS 346
                         170
                  ....*....|....*..
gi 410225316  945 PSPVGYSPMTPGAPSPG 961
Cdd:PHA03307  347 PSRSPSPSRPPPPADPS 363
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
846-977 4.41e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.48  E-value: 4.41e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  846 PSPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTH 925
Cdd:PRK12323  392 PAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAAGPRP 471
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 410225316  926 SPASYHPTPSPMAYQASPSPSPVGYSP---MTPGAPSPGGYNPHTPGSGIEQNSS 977
Cdd:PRK12323  472 VAAAAAAAPARAAPAAAPAPADDDPPPweeLPPEFASPAPAQPDAAPAGWVAESI 526
DUF1373 pfam07117
Protein of unknown function (DUF1373); This family consists of several hypothetical proteins ...
826-940 5.23e-04

Protein of unknown function (DUF1373); This family consists of several hypothetical proteins which seem to be specific to Oryzias latipes (Japanese ricefish). Members of this family are typically around 200 residues in length. The function of this family is unknown.


Pssm-ID: 462093 [Multi-domain]  Cd Length: 212  Bit Score: 42.47  E-value: 5.23e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   826 PNTPSRAEEEYEY----AFDDEPTPSPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGS 901
Cdd:pfam07117   42 PPRPEEEEGQGGGggtfPFPGSPEPEPGGGGSGPMPMSASAPEPEPAKAKPQRPAPAQGHGHGGGGDSDSSGSGSGHQGS 121
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 410225316   902 YQP---SPSPQSYHQVAPSPAGYQNTHSPasyHPTPSPMAYQ 940
Cdd:pfam07117  122 GGAgagAGAPGHQHEQEQESSSSDDDDED---EFEFTPEEDE 160
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
855-1041 5.50e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 44.03  E-value: 5.50e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  855 PNPQTPGYPDPSSPQVNPQYNPQTPGTPAMyntdqfspyAAPSPQGSYQPSPSPQSyhQVAPSPAGYQNTHSPASYHPTP 934
Cdd:PRK14950  364 PAPQPAKPTAAAPSPVRPTPAPSTRPKAAA---------AANIPPKEPVRETATPP--PVPPRPVAPPVPHTPESAPKLT 432
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  935 spmayqasPSPSPVGYSPMTPGAPSPGGYNP--HTPGSGIEQNSSDWVTTDIQVKVRDTYLdtQVVGQTGViRSVTggmc 1012
Cdd:PRK14950  433 --------RAAIPVDEKPKYTPPAPPKEEEKalIADGDVLEQLEAIWKQILRDVPPRSPAV--QALLSSGV-RPVS---- 497
                         170       180       190
                  ....*....|....*....|....*....|
gi 410225316 1013 svyLKDSEKVVSISSE-HLEPITPTKNNKV 1041
Cdd:PRK14950  498 ---VEKNTLTLSFKSKfHKDKIEEPENRKI 524
dnaA PRK14086
chromosomal replication initiator protein DnaA;
843-969 5.65e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 44.05  E-value: 5.65e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  843 EPTPSPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGSYQPSPSPQsyhqvAPSPAGYQ 922
Cdd:PRK14086   94 EPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWPR-----AADDYGWQ 168
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|....*...
gi 410225316  923 NT-HSPASYHPTPSPMAYQASPSPSPVGYSPMTPGAPSPGGyNPHTPG 969
Cdd:PRK14086  169 QQrLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRRR-DYDHPR 215
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
781-919 8.06e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.44  E-value: 8.06e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  781 GSRTPMYGSQTPLQDGSRTPHYGSQTPPHDGSRTPAQSGAWDPNNPNTPSRAEEEYEyafddeptPSPQAYGGTPNPQTP 860
Cdd:PRK07764  674 GGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASA--------PSPAADDPVPLPPEP 745
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 410225316  861 GYPDPSSPQVNPQYNPQTPGTPAmyntdqfspyAAPSPQGSYQPSPSPQSYHQVAPSPA 919
Cdd:PRK07764  746 DDPPDPAGAPAQPPPPPAPAPAA----------APAAAPPPSPPSEEEEMAEDDAPSMD 794
Med26_M pfam15694
Mediator complex subunit 26 middle domain; Med26_M is the middle domain of subunit 26 of ...
858-974 8.59e-04

Mediator complex subunit 26 middle domain; Med26_M is the middle domain of subunit 26 of Mediator. Med19 and Med26 act synergistically to mediate the interaction between REST (a Kruppel-type zinc finger transcription factor that binds to a 21-bp RE1 silencing element present in over 900 human genes) and Mediator.


Pssm-ID: 464807 [Multi-domain]  Cd Length: 255  Bit Score: 42.55  E-value: 8.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316   858 QTPGYPDPSSPQvNPQYNPQTpgTPAMYNTDQFSPYAapsPQGSY-QPSPSPQSYHQVAPSPAGYQNTHSP--------A 928
Cdd:pfam15694   81 ETGGPPQPKSPR-CSSFSPRN--SRHETFARRSSTYA---PKGSVpSPSPRSQVLDAQVPSPLPLSQPSTPpvqakrleK 154
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 410225316   929 SYHPTP-SPMAYQASPS-----PSPVGYSPMTPGAPSPGGYNPHTPGSGIEQ 974
Cdd:pfam15694  155 PPQSSPeSSQHWLEQSDseshqRHQDGSATLLSQSVSPGCKTPLHPGENSLP 206
KLF1_2_4_N cd21972
N-terminal domain of Kruppel-like factor (KLF) 1, KLF2, KLF4, and similar proteins; Kruppel ...
829-968 9.51e-04

N-terminal domain of Kruppel-like factor (KLF) 1, KLF2, KLF4, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specifity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF1, KLF2, KLF4, and similar proteins.


Pssm-ID: 409230 [Multi-domain]  Cd Length: 194  Bit Score: 41.51  E-value: 9.51e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  829 PSRAEEEYEYAFDDEPTPSPQAYG----GTPNPQTPGYPDPSSPQVNPQYN-PQTPGTPAMYNTDQFSPYAAPSPQGSYQ 903
Cdd:cd21972    22 LDLEFILSNTVTSDNDNPPPPDPAypppESPESCSTVYDSDGCHPTPNAYCgPNGPGLPGHFLLAGNSPNLGPKIKTENQ 101
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 410225316  904 PS-------PSPQSYHQVAPS------PAGYQNTHSPASYHPTPSPMAYQASPSPSPVGYSPmtPGAPSPGGYNPHTP 968
Cdd:cd21972   102 EQacmpvagYSGHYGPREPQRvppappPPQYAGHFQYHGHFNMFSPPLRANHPGMSTVMLTP--LSTPPLGFLSPEEA 177
PHA03369 PHA03369
capsid maturational protease; Provisional
814-911 1.19e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 43.06  E-value: 1.19e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  814 TPAQSGAWDPNnPNTPSRAEEE-YEYAFDDEPTPSPQAYGGTPNPQTPGYPDPSSPQVnpqynPQTPGTPAMYNTDQFSP 892
Cdd:PHA03369  353 LTAPSRVLAAA-AKVAVIAAPQtHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQF-----CGDPGLVSPYNPQSPGT 426
                          90
                  ....*....|....*....
gi 410225316  893 YAAPSPQGSYQPSPSPQSY 911
Cdd:PHA03369  427 SYGPEPVGPVPPQPTNPYV 445
KLF3_N cd21577
N-terminal domain of Kruppel-like factor 3; Kruppel-like factor 3 (KLF3; also called ...
891-966 1.40e-03

N-terminal domain of Kruppel-like factor 3; Kruppel-like factor 3 (KLF3; also called Krueppel-like factor 3 and originally called Basic Kruppel-like Factor/BKLF), was the third member of the KLF family of zinc finger transcription factors to be discovered. KLF3 possesses a wide range of biological impacts on regulating apoptosis, differentiation, and proliferation in various tissues during the entire progression process. It has been proposed as a tumor suppressor in colorectal cancer. It appears to function predominantly as a repressor of transcription, turning genes off by recruiting the C-terminal Binding Protein co-repressors CtBP1 and CtBP2. CtBP docks onto a short motif (residues 61-65) in the N-terminus of KLF3, through the Proline-X-Aspartate-Leucine-Serine (PXDLS) motif. CtBP in turn recruits histone modifying enzymes to alter chromatin and repress gene expression. KLF3 belongs to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domain of KLF3.


Pssm-ID: 410554 [Multi-domain]  Cd Length: 214  Bit Score: 41.18  E-value: 1.40e-03
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 410225316  891 SPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHsPASYHP--TPSPMAYQASPSPSPVGYSPMTpgAPSPGGYNPH 966
Cdd:cd21577    41 SSSSSSSSPSSRASPPSPYSKSSPPSPPQQRPLSP-PLSLPPpvAPPPLSPGSVPGGLPVISPVMV--QPVPVLYPPH 115
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
789-919 1.41e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.85  E-value: 1.41e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  789 SQTPLQDGSRTPHYGSQTPPHDGSRTPAQSGAWDPNNPNTPSRAEEEYEYAFDDEPTPSPQAYGGTPNPQTPGYPDPSSP 868
Cdd:PHA03307  822 RSHTPDGGSESSGPARPPGAAARPPPARSSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAP 901
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|.
gi 410225316  869 QVNPQYNPQTPGTPamyntdqfSPYAAPSPQGSYQPSPsPQSYHQVAPSPA 919
Cdd:PHA03307  902 APRPRPAPRVKLGP--------MPPGGPDPRGGFRRVP-PGDLHTPAPSAA 943
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
852-971 1.59e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.67  E-value: 1.59e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  852 GGTPNPQTPGYPDPSSPQVnPQYNPQTPGTPAMyntdqfspyAAPSPQGSYQPSPSPQSYHQVAPSPAgyqnthSPASYH 931
Cdd:PRK07764  389 GGAGAPAAAAPSAAAAAPA-AAPAPAAAAPAAA---------AAPAPAAAPQPAPAPAPAPAPPSPAG------NAPAGG 452
                          90       100       110       120
                  ....*....|....*....|....*....|....*....|
gi 410225316  932 PTPSPMAYQASPSPSPVGYSPMTPGAPSPGGYNPHTPGSG 971
Cdd:PRK07764  453 APSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAA 492
PHA03269 PHA03269
envelope glycoprotein C; Provisional
875-982 1.74e-03

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 42.41  E-value: 1.74e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  875 NPQTP-GTPAMYN---TDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQNTHSPASYHPTPSPMAYQASPSPSPVGY 950
Cdd:PHA03269   21 NLNTNiPIPELHTsaaTQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPDPAV 100
                          90       100       110
                  ....*....|....*....|....*....|..
gi 410225316  951 SPMTPGAPSPggyNPHTPGSGIEQNSSDWVTT 982
Cdd:PHA03269  101 APQLAAAPKP---DAAEAFTSAAQAHEAPADA 129
PHA03369 PHA03369
capsid maturational protease; Provisional
854-991 1.75e-03

capsid maturational protease; Provisional


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 42.29  E-value: 1.75e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  854 TPNPQTPGYPDPSSPQVNPqynPQTPGTPAM---YNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSpAGYQNTHSPASY 930
Cdd:PHA03369  353 LTAPSRVLAAAAKVAVIAA---PQTHTGPADrqrPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLV-SPYNPQSPGTSY 428
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 410225316  931 HPTPSPMAYQASPSPS--PVGYSPMT-PGAPSPGGYnpHTPGS-GIEQNSSDWVTTDIQVKVRDT 991
Cdd:PHA03369  429 GPEPVGPVPPQPTNPYvmPISMANMVyPGHPQEHGH--ERKRKrGGELKEELIETLKLVKKLKEE 491
PHA03325 PHA03325
nuclear-egress-membrane-like protein; Provisional
804-965 1.81e-03

nuclear-egress-membrane-like protein; Provisional


Pssm-ID: 223044  Cd Length: 418  Bit Score: 42.18  E-value: 1.81e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  804 SQTPPHDGSRTPAQSGAWDPNNPNTPSRAEEEYEYAFDDEPTPSPQAyggtpnpqtpgYPDPSSPQVNPQYNPQTPGTPA 883
Cdd:PHA03325  266 SSLPTSAPKRRSRRAGAMRAAAGETADLADDDGSEHSDPEPLPASLP-----------PPPVRRPRVKHPEAGKEEPDGA 334
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  884 MYNTDQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGyqnthSPASYHPTPSPMAYQASPSPSPVGYSPMTPGAPSPGGY 963
Cdd:PHA03325  335 RNAEAKEPAQPATSTSSKGSSSAQNKDSGSTGPGSSL-----AAASSFLEDDDFGSPPLDLTTSLRHMPSPSVTSAPEPP 409

                  ..
gi 410225316  964 NP 965
Cdd:PHA03325  410 SI 411
dnaA PRK14086
chromosomal replication initiator protein DnaA;
823-971 2.12e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 42.12  E-value: 2.12e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  823 PNNPNTPSRAEEeyeyafddeptPSPQAYGGTPNPQTPGYPDPSSPQVNPQYnPQTPGTPAMY--NTDQFSPYAAPSPQG 900
Cdd:PRK14086   96 APPPPHARRTSE-----------PELPRPGRRPYEGYGGPRADDRPPGLPRQ-DQLPTARPAYpaYQQRPEPGAWPRAAD 163
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  901 SYQPSPSPQSYhqvaPSPAGYQnthSPASYHPTPSPMAY----------QASPSPSPVGYSPMTPGA-------PSPGGY 963
Cdd:PRK14086  164 DYGWQQQRLGF----PPRAPYA---SPASYAPEQERDREpydagrpeydQRRRDYDHPRPDWDRPRRdrtdrpePPPGAG 236

                  ....*...
gi 410225316  964 NPHTPGSG 971
Cdd:PRK14086  237 HVHRGGPG 244
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
844-961 3.01e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.51  E-value: 3.01e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  844 PTPSPQAYGGTPNPQTPgypDPSSPQVNPQYNPQTPGTPAmyntdqfSPYAAPSPQGSYQPSPSPQSYHQVAPSPAGYQN 923
Cdd:PRK07764  404 AAPAAAPAPAAAAPAAA---AAPAPAAAPQPAPAPAPAPA-------PPSPAGNAPAGGAPSPPPAAAPSAQPAPAPAAA 473
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 410225316  924 THSPASyhPTPSPMAYQASPSPSPVgysPMTPGAPSPG 961
Cdd:PRK07764  474 PEPTAA--PAPAPPAAPAPAAAPAA---PAAPAAPAGA 506
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
754-978 3.04e-03

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 41.70  E-value: 3.04e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  754 GSRRPGGMTSTYGRTPmygSQTPMYGSGSRTPMYGSQTPLQDGSRTPHYGSQTPPHDG-SRTPAQSGAWDPNNPNTPSRA 832
Cdd:PHA03307  102 REGSPTPPGPSSPDPP---PPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGaSPAAVASDAASSRQAALPLSS 178
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  833 EEEyeyafdDEPTPSPQAYGGTPNPQTPGYPDPSSPQVNPQYNPQTPGTPAMYNTDQFSPYAAPSPQGSYQPSPS----- 907
Cdd:PHA03307  179 PEE------TARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCgwgpe 252
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 410225316  908 ---PQSYHQVAPSPAGYQNTHSPASYHPTPSPMAYQASPSPSpvgYSPMTPGAPSPGGYNPHTPGSGIEQNSSD 978
Cdd:PHA03307  253 necPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRER---SPSPSPSSPGSGPAPSSPRASSSSSSSRE 323
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
842-968 3.66e-03

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 41.29  E-value: 3.66e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  842 DEPTPSPQAYGGTPNPQTPGYPDPSSPQVNPQynPQTPGTPAMYNTDQFSPYAAPSPQ-GSYQPSPSPQSYH-QVAPSPA 919
Cdd:NF033839  370 EKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQ--PEKPKPEVKPQPEKPKPEVKPQPEkPKPEVKPQPEKPKpEVKPQPE 447
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 410225316  920 GYQNTHSPASYHPTPSPMAYQASPSPSpVGYSPMTP----GAPSPGGYNPHTP 968
Cdd:NF033839  448 KPKPEVKPQPETPKPEVKPQPEKPKPE-VKPQPEKPkpdnSKPQADDKKPSTP 499
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
803-920 5.88e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 40.74  E-value: 5.88e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 410225316  803 GSQTPPHDGSRTPAQSGAWDPNNPNTPSRAEEEYEYAFDDEPTPSPQAYGGTPnPQTPGYPDPSSPQVNPQynPQTPGTP 882
Cdd:PRK07764  396 AAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNA-PAGGAPSPPPAAAPSAQ--PAPAPAA 472
                          90       100       110
                  ....*....|....*....|....*....|....*...
gi 410225316  883 AMyntdQFSPYAAPSPQGSYQPSPSPQSYHQVAPSPAG 920
Cdd:PRK07764  473 AP----EPTAAPAPAPPAAPAPAAAPAAPAAPAAPAGA 506
KOW pfam00467
KOW motif; This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, ...
596-627 7.70e-03

KOW motif; This family has been extended to coincide with ref. The KOW (Kyprides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG.


Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 35.06  E-value: 7.70e-03
                           10        20        30
                   ....*....|....*....|....*....|..
gi 410225316   596 VKDIVKVIDGPHSGREGEIRHLFRSFAFLHCK 627
Cdd:pfam00467    1 KGDVVRVIAGPFKGKVGKVVEVDDKKNRVLVE 32
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH