Conserved domains on  [gi|2490654199|ref|XP_055082292|]

transcription elongation factor SPT5 [Periophthalmus magnuspinnatus]

List of domain hits

Name Accession Description Interval E-value
NGN_Euk cd09888
Eukaryotic N-Utilization Substance G (NusG) N-terminal (NGN) domain, including plant KTF1 (KOW ...
185-272 2.79e-43

Eukaryotic N-Utilization Substance G (NusG) N-terminal (NGN) domain, including plant KTF1 (KOW domain-containing Transcription Factor 1); The N-Utilization Substance G (NusG) protein and its eukaryotic homolog, Spt5, are involved in transcription elongation and termination. NusG contains an NGN domain at its N-terminus and Kyrpides, Ouzounis and Woese (KOW) repeats at its C-terminus. Spt5 forms an Spt4-Spt5 complex that is an essential RNA polymerase II elongation factor. NusG was originally discovered as an N-dependent antitermination-enhancing activity in Escherichia coli and has a variety of functions, including roles in RNA polymerase elongation and Rho-dependent termination in bacteria. Orthologs of the NusG gene exist in all bacteria, but their functions and requirements differ. Spt5-like is homologous to the Spt5 proteins present in all eukaryotes but is unique in encoding a protein with an additional long carboxy-terminal extension that contains WG/GW motifs. Spt5-like, or KTF1 (KOW domain-containing Transcription Factor 1), is an RNA-directed DNA methylation (RdDM) pathway effector in plants.


Pssm-ID: 193577 [Multi-domain]  Cd Length: 86  Bit Score: 151.91  E-value: 2.79e-43
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  185 NLWTVKCKIGEERATAIALMRKFIAYQFTDTPLQIKSVVAPDHVKGYIYVESYKQTHVKAAIEGIGNLRMglWNQQMVPI 264
Cdd:cd09888      1 KLWAVKCKPGKEREIVISLMRKFLDLQRTGNPLGIKSVFARDGLKGYIYIEARKEAHVKDAIEGLRGVYL--NTIKLVPI 78

                   ....*...
gi 2490654199  265 KEMTDVLK 272
Cdd:cd09888     79 KEMPDVLS 86
CTD smart01104
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
774-891 1.43e-28

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.


Pssm-ID: 215026 [Multi-domain]  Cd Length: 121  Bit Score: 111.07  E-value: 1.43e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199   774 GSQTPMYG-TGSRTPMYGSQTPIHEGNRTP----HYGSQTPLHDG--NRTPGQSGAWdPNNPNTPSRNDEEYDFGYDDEP 846
Cdd:smart01104    1 GGRTPAWGaSGSKTPAWGSRTPGTAAGGAPtargGSGSRTPAWGGagSRTPAWGGAG-PTGSRTPAWGGASAWGNKSSEG 79
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|....*..
gi 2490654199   847 SPSPQ--GYGSTPNPQTPGYPEAPSPqvnpqYNPQTPGTPAMYNTEQ 891
Cdd:smart01104   80 SASSWaaGPGGAYGAPTPGYGGTPSA-----YGPATPGGGAMAGSAS 121
KOW_Spt5_3 cd06083
KOW domain of Spt5, repeat 3; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
477-527 9.13e-28

KOW domain of Spt5, repeat 3; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is an RNA-binding motif known so far to be shared among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in the recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and are also involved in the binding of eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240507  Cd Length: 51  Bit Score: 106.46  E-value: 9.13e-28
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2490654199  477 YFRMGDHVKVIAGRYEGDTGLIVRVEENFVILFSDLTMHELKVLPRDLQLC 527
Cdd:cd06083      1 HFKVGDHVKVISGRHEGETGLVVKVEDDVVTVFSDLTMRELKVFPRDLQLS 51
KOW_Spt5_5 cd06085
KOW domain of Spt5, repeat 5; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
704-753 5.31e-25

KOW domain of Spt5, repeat 5; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is an RNA-binding motif known so far to be shared among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in the recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and are also involved in the binding of eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240509  Cd Length: 52  Bit Score: 98.33  E-value: 5.31e-25
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 2490654199  704 DNELIGQTVRISQGPYKGYIGVVKDATESTARVELHSTCQTISVDRQRLT 753
Cdd:cd06085      2 RDPLIGKTVRIRKGPYKGYIGIVKDATGTTARVELHSKNKTITVDRSRLA 51
KOW_Spt5_6 cd06086
KOW domain of Spt5, repeat 6; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
1029-1085 7.08e-25

KOW domain of Spt5, repeat 6; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is an RNA-binding motif known so far to be shared among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in the recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and are also involved in the binding of eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240510  Cd Length: 58  Bit Score: 98.36  E-value: 7.08e-25
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2490654199 1029 DHLEPVTPTKNNKVKVILGEDREATGVLLSIDGDDGIVRMELDDQLKILNLRFLGRL 1085
Cdd:cd06086      1 EHLEPVPPEKGDRVKVIKGEDRGSTGELISIDGADGIVKMDSDGDIKILPMNFLAKL 57
KOW_Spt5_2 cd06082
KOW domain of Spt5, repeat 2; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
426-476 1.04e-24

KOW domain of Spt5, repeat 2; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is an RNA-binding motif known so far to be shared among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in the recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and are also involved in the binding of eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240506  Cd Length: 51  Bit Score: 97.57  E-value: 1.04e-24
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2490654199  426 LQAGDNVEVCEGELINLQGKILSVDGNKITIMPKHEDLKDPLEFPAHELKK 476
Cdd:cd06082      1 FQPGDNVEVIEGELKGLQGKVESVDGDIVTIMPKHEDLKEPLEFPAKELRK 51
KOW_Spt5_4 cd06084
KOW domain of Spt5, repeat 4; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
603-645 2.16e-19

KOW domain of Spt5, repeat 4; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is an RNA-binding motif known so far to be shared among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in the recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and are also involved in the binding of eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240508  Cd Length: 43  Bit Score: 82.18  E-value: 2.16e-19
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 2490654199  603 KDIVKVIDGPHSGREGEIRHLFRGFAFLHCKKLVENGGMFVCK 645
Cdd:cd06084      1 GDTVKVVDGPYKGRQGTVLHIYRGTLFLHSREVTENGGIFVVR 43
KOW_Spt5_1 cd06081
KOW domain of Spt5, repeat 1; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
283-320 9.81e-17

KOW domain of Spt5, repeat 1; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is an RNA-binding motif known so far to be shared among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in the recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and are also involved in the binding of eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240505  Cd Length: 38  Bit Score: 74.43  E-value: 9.81e-17
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 2490654199  283 KSWVRLKRGLYKDDIAQVDYVEPSQNTISLKMIPRIDL 320
Cdd:cd06081      1 GSWVRIKRGIYKGDLAQVDEVDENGNRVVVKLIPRIDY 38
Spt5_N pfam11942
Spt5 transcription elongation factor, acidic N-terminal; This is the very acidic N-terminal ...
100-179 3.32e-10

Spt5 transcription elongation factor, acidic N-terminal; This is the very acidic N-terminal region of the early transcription elongation factor Spt5. The Spt5-Spt4 complex regulates early transcription elongation by RNA polymerase II and has an imputed role in pre-mRNA processing via its physical association with mRNA capping enzymes. The actual function of this N-terminal domain is not known although it is dispensable for binding to Spt4.


Pssm-ID: 463406  Cd Length: 97  Bit Score: 58.05  E-value: 3.32e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  100 GAEDILEKgsDYAEVSNLDHIPLDDDHSGSRRLQNLWRDSREEALGEYYMRKYAKSAggEHYSGGSEElsDDITQQQLLP 179
Cdd:pfam11942   24 GADDFIED--DEEDEDEEDGRRDDRRHRELDRRRELEEDEDAEEIAEYLKERYGRSS--SDAYRGDAE--EGVPQRLLLP 97
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
846-974 5.22e-09

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 60.55  E-value: 5.22e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  846 PSPSPQGYG-STPNPQTPGYPEAPSPQVNPQYNPQTPGTPAMynteqysPYAAPSPQGSYQPSPSPQSYHH---QVAPSP 921
Cdd:pfam03154  311 PGPSPAAPGqSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSM-------PHIKPPPTTPIPQLPNPQSHKHpphLSGPSP 383
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2490654199  922 ---------------VGYQNTHSPASYHPTPSPMAYQASPSPSPVGYSPM---TPGAPSPGGYNPHTPGSN 974
Cdd:pfam03154  384 fqmnsnlppppalkpLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVltqSQSLPPPAASHPPTSGLH 454
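
Each hit above carries a statistics line (Pssm-ID, Cd Length, Bit Score, E-value) followed by paired alignment rows for the query and the domain model. The following is a minimal Python sketch for reading those two pieces from a plain-text copy of this page. The function names `parse_stats_line` and `alignment_identity` are hypothetical helpers, not part of any NCBI tool, and the sketch assumes that lowercase letters and `-` in the alignment rows mark positions that are not aligned to the model, so only columns with two uppercase letters are compared.

```python
import re

# Pattern for the per-hit statistics line, for example:
# "Pssm-ID: 240507  Cd Length: 51  Bit Score: 106.46  E-value: 9.13e-28"
STATS_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"          # optional "[Multi-domain]" tag
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[0-9.]+)"
    r"\s+E-value:\s*(?P<e_value>[0-9.eE+-]+)"
)


def parse_stats_line(line):
    """Return the numeric fields of one statistics line as a dict."""
    match = STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not a statistics line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def alignment_identity(query_row, model_row):
    """Count identical residues over the aligned columns of two rows.

    Assumption: only columns where both rows hold an uppercase letter are
    aligned; gaps ('-') and lowercase residues are skipped.
    Returns (identical_columns, compared_columns).
    """
    if len(query_row) != len(model_row):
        raise ValueError("aligned rows must have equal length")
    identical = 0
    compared = 0
    for q, m in zip(query_row, model_row):
        if not (q.isupper() and m.isupper()):
            continue
        compared += 1
        if q == m:
            identical += 1
    return identical, compared


if __name__ == "__main__":
    # Rows copied from the KOW_Spt5_3 (cd06083) hit in the list above.
    query = "YFRMGDHVKVIAGRYEGDTGLIVRVEENFVILFSDLTMHELKVLPRDLQLC"
    model = "HFKVGDHVKVISGRHEGETGLVVKVEDDVVTVFSDLTMRELKVFPRDLQLS"
    stats = parse_stats_line(
        "Pssm-ID: 240507  Cd Length: 51  Bit Score: 106.46  E-value: 9.13e-28"
    )
    same, total = alignment_identity(query, model)
    print(stats["cd_length"], same, total)
```

For hits whose alignment wraps over several blocks, the query rows and the model rows would each need to be concatenated before calling `alignment_identity`.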
 
Name Accession Description Interval E-value
Spt5-NGN pfam03439
Early transcription elongation factor of RNA pol II, NGN section; Spt5p and prokaryotic NusG ...
185-271 1.35e-30

Early transcription elongation factor of RNA pol II, NGN section; Spt5p and prokaryotic NusG are shown to contain a novel 'NGN' domain. The combined NGN and KOW motif regions of Spt5 form the binding domain with Spt4. Spt5 complexes with Spt4 as a 1:1 heterodimer, and this Spt5-Spt4 complex regulates early transcription elongation by RNA polymerase II and has an imputed role in pre-mRNA processing via its physical association with mRNA capping enzymes. The Schizosaccharomyces pombe core Spt5-Spt4 complex is a heterodimer bearing a trypsin-resistant Spt4-binding domain within the Spt5 subunit.


Pssm-ID: 397481  Cd Length: 84  Bit Score: 115.37  E-value: 1.35e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  185 NLWTVKCKIGEERATAIALMRKFIAYQfTDTPLQIKSVVAPDHVKGYIYVESYKQTHVKAAIEGIGNLRMGlwNQQMVPI 264
Cdd:pfam03439    1 KIWAVKCTPGQEREVALSLMRKILALA-KTNNLGIYSVFAPDGLKGYIYVEADRQAAVKRALEGIPNVRGL--VPGLVPI 77

                   ....*..
gi 2490654199  265 KEMTDVL 271
Cdd:pfam03439   78 KEMEHLL 84
CTD pfam12815
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
774-832 1.09e-12

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.


Pssm-ID: 372327 [Multi-domain]  Cd Length: 71  Bit Score: 64.00  E-value: 1.09e-12
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  774 GSQTPMYGT--GSRTPMY---GSQTPIHE--GNRTPHY--GSQTPLHD--GNRTPGQSGAWDPnnPNTPS 832
Cdd:pfam12815    1 GSRTPAYNSagGSRTPAWgadGSRTPAYGgaGGRTPAYnqGGKTPAWGgaGSRTPAYYGAWGG--SRTPA 68
nusG PRK08559
transcription antitermination protein NusG; Validated
182-315 1.15e-12

transcription antitermination protein NusG; Validated


Pssm-ID: 181467 [Multi-domain]  Cd Length: 153  Bit Score: 66.81  E-value: 1.15e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  182 KDPNLWTVKCKIGEERATAIALMRKFIAYQftdtpLQIKSVVAPDHVKGYIYVESYKQTHVKAAIEGIGNLRMGLwnQQM 261
Cdd:PRK08559     4 EMSMIFAVKTTAGQERNVALMLAMRAKKEN-----LPIYAILAPPELKGYVLVEAESKGAVEEAIRGIPHVRGVV--PGE 76
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2490654199  262 VPIKEMTDVLKVVKEVTNLKPKSWVRLKRGLYKDDIAQVDYVEPSQNTISLKMI 315
Cdd:PRK08559    77 ISFEEVEHFLKPKPIVEGIKEGDIVELIAGPFKGEKARVVRVDESKEEVTVELL 130
NGN smart00738
In Spt5p, this domain may confer affinity for Spt4p. It possesses a RNP-like fold; In Spt5p, ...
185-273 2.43e-10

In Spt5p, this domain may confer affinity for Spt4p. It possesses an RNP-like fold; In Spt5p, this domain may confer affinity for Spt4p.


Pssm-ID: 197850 [Multi-domain]  Cd Length: 106  Bit Score: 58.54  E-value: 2.43e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199   185 NLWTVKCKIGEERATAIALMRKFIAYQFTDtplQIKSVVAP-DHVK----------------GYIYVESYKQTHVKAAIE 247
Cdd:smart00738    1 NWYAVRTTSGQEKRVAENLERKAEALGLED---KIVSILVPtEEVKeirrgkkkvverklfpGYIFVEADLEDEVWTAIR 77
                            90       100
                    ....*....|....*....|....*....
gi 2490654199   248 GIGNLRMGLWN---QQMVPIKEMTDVLKV 273
Cdd:smart00738   78 GTPGVRGFVGGggkPTPVPDDEIEKILKP 106
PHA03247 PHA03247
large tegument protein UL36; Provisional
800-979 7.72e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.87  E-value: 7.72e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  800 RTPHYGSQTPLHDGNRTPGQSGAWDPNNPNTPsrndeeydfgyddePSPS----PQGYGSTPNPQTPGYPEAPSPQVNPQ 875
Cdd:PHA03247  2711 APHALVSATPLPPGPAAARQASPALPAAPAPP--------------AVPAgpatPGGPARPARPPTTAGPPAPAPPAAPA 2776
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  876 YNPQTPGTPAMYNTEQYSPYAAPSPqgsyqPSPSPqsyhhqvAPSPVGYQNTHSPASYHPtpspmayqASPSPSPVGYSP 955
Cdd:PHA03247  2777 AGPPRRLTRPAVASLSESRESLPSP-----WDPAD-------PPAAVLAPAAALPPAASP--------AGPLPPPTSAQP 2836
                          170       180
                   ....*....|....*....|....
gi 2490654199  956 MTPGAPSPGGYNPHTPGSNIEQGG 979
Cdd:PHA03247  2837 TAPPPPPGPPPPSLPLGGSVAPGG 2860
KOW_elon_Spt5 TIGR00405
transcription elongation factor Spt5; This protein contains a KOW domain, shared by bacterial ...
187-315 1.97e-07

transcription elongation factor Spt5; This protein contains a KOW domain, shared by bacterial NusG and the uL24 (previously L24p/L26e) family of ribosomal proteins. The most recent papers and crystal structures make this a transcription elongation factor rather than a ribosomal protein.


Pssm-ID: 129499 [Multi-domain]  Cd Length: 145  Bit Score: 51.43  E-value: 1.97e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  187 WTVKCKIGEERATAialmrKFIAYQFTDTPLQIKSVVAPDHVKGYIYVESYKQTHVKAAIEGIGNLRmGLWNQQmVPIKE 266
Cdd:TIGR00405    1 FAVKTSVGQEKNVA-----RLMARKARKSGLEVYSILAPESLKGYILVEAETKIDMRNPIIGVPHVR-GVVEGE-IDFEE 73
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 2490654199  267 MTDVLKVVKEVTNLKPKSWVRLKRGLYKDDIAQVDYVEPSQNTISLKMI 315
Cdd:TIGR00405   74 IERFLTPKKIIESIKKGDIVEIISGPFKGERAKVIRVDESKEEVTLELI 122
PHA03269 PHA03269
envelope glycoprotein C; Provisional
855-973 2.23e-07

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 54.73  E-value: 2.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  855 STPNPQTPGYPEAPSPQVNPQYNPQTPGTPAMYNTEQYS----PYAAPSPQGSYQPSPSPQSYHHQV-APSPVGYQNTHS 929
Cdd:PHA03269    27 PIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASrkpdLAQAPTPAASEKFDPAPAPHQAASrAPDPAVAPQLAA 106
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 2490654199  930 PasyhPTPSPM-----AYQASPSPSPVGYSPMTPgAPSPGGYNPHTPGS 973
Cdd:PHA03269   107 A----PKPDAAeaftsAAQAHEAPADAGTSAASK-KPDPAAHTQHSPPP 150
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
792-968 4.22e-07

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 54.00  E-value: 4.22e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  792 QTPIHEGNRTPhygsQTPLHDGNRTPGQSGAWDPNNPNTPSRNDEEYDFGYDDEPSPSPQGYG-------STPNPQTPGY 864
Cdd:NF033839   282 DTPKEPGNKKP----SAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKpevkpqlETPKPEVKPQ 357
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  865 PEAPSPQVNPQynPQTPGTPAMYNTEQYSPYAAPSPQGSY-QPSPSPQSYHHQVAPSPVGYQNTHSPASYHPTPS--PMA 941
Cdd:NF033839   358 PEKPKPEVKPQ--PEKPKPEVKPQPETPKPEVKPQPEKPKpEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEvkPQP 435
                          170       180
                   ....*....|....*....|....*..
gi 2490654199  942 YQASPSPSPVGYSPMTPGAPSPGGYNP 968
Cdd:NF033839   436 EKPKPEVKPQPEKPKPEVKPQPETPKP 462
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
749-886 8.68e-06

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 49.81  E-value: 8.68e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  749 RQRLTTMGAKRQGGMTSAHSRTPMYGS--QTPMYGTGSRTpMYGSQTPIHEGNR---TPHYGSQTPLHDGNRTPGQSGAW 823
Cdd:TIGR01628  368 RAHLQDQFMQLQPRMRQLPMGSPMGGAmgQPPYYGQGPQQ-QFNGQPLGWPRMSmmpTPMGPGGPLRPNGLAPMNAVRAP 446
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2490654199  824 DPNNPNTPSRndeeydfgyddepsPSPQGYGSTPNPQT-PGYPEAPSPQVNPQYNPQTPGTPAM 886
Cdd:TIGR01628  447 SRNAQNAAQK--------------PPMQPVMYPPNYQSlPLSQDLPQPQSTASQGGQNKKLAQV 496
KOW pfam00467
KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, ...
708-739 8.13e-05

KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG.


Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 40.45  E-value: 8.13e-05
                           10        20        30
                   ....*....|....*....|....*....|..
gi 2490654199  708 IGQTVRISQGPYKGYIGVVKDATESTARVELH 739
Cdd:pfam00467    1 KGDVVRVIAGPFKGKVGKVVEVDDKKNRVLVE 32
KOW smart00739
KOW (Kyrpides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.
477-504 1.02e-04

KOW (Kyrpides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.


Pssm-ID: 128978  Cd Length: 28  Bit Score: 40.39  E-value: 1.02e-04
                            10        20
                    ....*....|....*....|....*...
gi 2490654199   477 YFRMGDHVKVIAGRYEGDTGLIVRVEEN 504
Cdd:smart00739    1 KFEVGDTVRVIAGPFKGKVGKVLEVDGE 28
KOW pfam00467
KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, ...
481-509 2.46e-04

KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG.


Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 39.29  E-value: 2.46e-04
                           10        20        30
                   ....*....|....*....|....*....|.
gi 2490654199  481 GDHVKVIAGRYEGDTGLIVRVEE--NFVILF 509
Cdd:pfam00467    2 GDVVRVIAGPFKGKVGKVVEVDDkkNRVLVE 32
KLF1_2_4_N cd21972
N-terminal domain of Kruppel-like factor (KLF) 1, KLF2, KLF4, and similar proteins; Kruppel ...
843-971 8.06e-04

N-terminal domain of Kruppel-like factor (KLF) 1, KLF2, KLF4, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF1, KLF2, KLF4, and similar proteins.


Pssm-ID: 409230 [Multi-domain]  Cd Length: 194  Bit Score: 41.89  E-value: 8.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  843 DDEPSPSPQGYG--STPNPQTPGYPEAPSPQVNPQYN-PQTPGTPAMYNTEQYSPYAAPSPQGSYQPSP------SPQSY 913
Cdd:cd21972     36 NDNPPPPDPAYPppESPESCSTVYDSDGCHPTPNAYCgPNGPGLPGHFLLAGNSPNLGPKIKTENQEQAcmpvagYSGHY 115
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2490654199  914 HHQVAPSPVG------YQNTHSPASYHPTPSPMAYQASPSPSPVGYSPmtPGAPSPGGYNPHTP 971
Cdd:cd21972    116 GPREPQRVPPappppqYAGHFQYHGHFNMFSPPLRANHPGMSTVMLTP--LSTPPLGFLSPEEA 177
 
Name Accession Description Interval E-value
NGN_Euk cd09888
Eukaryotic N-Utilization Substance G (NusG) N-terminal (NGN) domain, including plant KTF1 (KOW ...
185-272 2.79e-43

Eukaryotic N-Utilization Substance G (NusG) N-terminal (NGN) domain, including plant KTF1 (KOW domain-containing Transcription Factor 1); The N-Utilization Substance G (NusG) protein and its eukaryotic homolog, Spt5, are involved in transcription elongation and termination. NusG contains an NGN domain at its N-terminus and Kyrpides, Ouzounis, and Woese (KOW) repeats at its C-terminus. Spt5 forms an Spt4-Spt5 complex that is an essential RNA polymerase II elongation factor. NusG was originally discovered as an N-dependent antitermination enhancing activity in Escherichia coli, and has a variety of functions such as its involvement in RNA polymerase elongation and Rho-termination in bacteria. Orthologs of the NusG gene exist in all bacteria, but their functions and requirements are different. Spt5-like is homologous to the Spt5 proteins present in all eukaryotes, but is unique in that it encodes a protein with an additional long carboxy-terminal extension that contains WG/GW motifs. Spt5-like, or KTF1 (KOW domain-containing Transcription Factor 1), is an RNA-directed DNA methylation (RdDM) pathway effector in plants.


Pssm-ID: 193577 [Multi-domain]  Cd Length: 86  Bit Score: 151.91  E-value: 2.79e-43
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  185 NLWTVKCKIGEERATAIALMRKFIAYQFTDTPLQIKSVVAPDHVKGYIYVESYKQTHVKAAIEGIGNLRMglWNQQMVPI 264
Cdd:cd09888      1 KLWAVKCKPGKEREIVISLMRKFLDLQRTGNPLGIKSVFARDGLKGYIYIEARKEAHVKDAIEGLRGVYL--NTIKLVPI 78

                   ....*...
gi 2490654199  265 KEMTDVLK 272
Cdd:cd09888     79 KEMPDVLS 86
Spt5-NGN pfam03439
Early transcription elongation factor of RNA pol II, NGN section; Spt5p and prokaryotic NusG ...
185-271 1.35e-30

Early transcription elongation factor of RNA pol II, NGN section; Spt5p and prokaryotic NusG are shown to contain a novel 'NGN' domain. The combined NGN and KOW motif regions of Spt5 form the binding domain with Spt4. Spt5 complexes with Spt4 as a 1:1 heterodimer, and this Spt5-Spt4 complex regulates early transcription elongation by RNA polymerase II and has an imputed role in pre-mRNA processing via its physical association with mRNA capping enzymes. The Schizosaccharomyces pombe core Spt5-Spt4 complex is a heterodimer bearing a trypsin-resistant Spt4-binding domain within the Spt5 subunit.


Pssm-ID: 397481  Cd Length: 84  Bit Score: 115.37  E-value: 1.35e-30
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  185 NLWTVKCKIGEERATAIALMRKFIAYQfTDTPLQIKSVVAPDHVKGYIYVESYKQTHVKAAIEGIGNLRMGlwNQQMVPI 264
Cdd:pfam03439    1 KIWAVKCTPGQEREVALSLMRKILALA-KTNNLGIYSVFAPDGLKGYIYVEADRQAAVKRALEGIPNVRGL--VPGLVPI 77

                   ....*..
gi 2490654199  265 KEMTDVL 271
Cdd:pfam03439   78 KEMEHLL 84
CTD smart01104
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
774-891 1.43e-28

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.


Pssm-ID: 215026 [Multi-domain]  Cd Length: 121  Bit Score: 111.07  E-value: 1.43e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199   774 GSQTPMYG-TGSRTPMYGSQTPIHEGNRTP----HYGSQTPLHDG--NRTPGQSGAWdPNNPNTPSRNDEEYDFGYDDEP 846
Cdd:smart01104    1 GGRTPAWGaSGSKTPAWGSRTPGTAAGGAPtargGSGSRTPAWGGagSRTPAWGGAG-PTGSRTPAWGGASAWGNKSSEG 79
                            90       100       110       120
                    ....*....|....*....|....*....|....*....|....*..
gi 2490654199   847 SPSPQ--GYGSTPNPQTPGYPEAPSPqvnpqYNPQTPGTPAMYNTEQ 891
Cdd:smart01104   80 SASSWaaGPGGAYGAPTPGYGGTPSA-----YGPATPGGGAMAGSAS 121
KOW_Spt5_3 cd06083
KOW domain of Spt5, repeat 3; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
477-527 9.13e-28

KOW domain of Spt5, repeat 3; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240507  Cd Length: 51  Bit Score: 106.46  E-value: 9.13e-28
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2490654199  477 YFRMGDHVKVIAGRYEGDTGLIVRVEENFVILFSDLTMHELKVLPRDLQLC 527
Cdd:cd06083      1 HFKVGDHVKVISGRHEGETGLVVKVEDDVVTVFSDLTMRELKVFPRDLQLS 51
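As a rough illustration of how an alignment block like the one above can be summarized, the following Python sketch computes percent identity between the query row and the model (Cdd) row. The function name `percent_identity` is hypothetical, and the rule used here (count a column only when both characters are uppercase letters, so that gaps and lowercase residues are skipped) is an assumption about how aligned columns are marked on this page. CD-Search itself reports bit scores and E-values, not this number.

```python
def percent_identity(query_row: str, model_row: str) -> float:
    """Percent identity over aligned columns of two equal-length alignment rows.

    Assumption: a column counts as aligned only when both characters are
    uppercase letters; gaps ('-') and lowercase residues are skipped.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")
    aligned = 0
    identical = 0
    for q, m in zip(query_row, model_row):
        if q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    if aligned == 0:
        return 0.0
    return 100.0 * identical / aligned


# Rows copied from the KOW_Spt5_3 (cd06083) alignment above.
query = "YFRMGDHVKVIAGRYEGDTGLIVRVEENFVILFSDLTMHELKVLPRDLQLC"
model = "HFKVGDHVKVISGRHEGETGLVVKVEDDVVTVFSDLTMRELKVFPRDLQLS"
print(f"{percent_identity(query, model):.1f}% identity")
```

For alignments that span several blocks, the rows of each block would need to be concatenated before calling the function.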
KOW_Spt5_5 cd06085
KOW domain of Spt5, repeat 5; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
704-753 5.31e-25

KOW domain of Spt5, repeat 5; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240509  Cd Length: 52  Bit Score: 98.33  E-value: 5.31e-25
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|
gi 2490654199  704 DNELIGQTVRISQGPYKGYIGVVKDATESTARVELHSTCQTISVDRQRLT 753
Cdd:cd06085      2 RDPLIGKTVRIRKGPYKGYIGIVKDATGTTARVELHSKNKTITVDRSRLA 51
KOW_Spt5_6 cd06086
KOW domain of Spt5, repeat 6; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
1029-1085 7.08e-25

KOW domain of Spt5, repeat 6; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240510  Cd Length: 58  Bit Score: 98.36  E-value: 7.08e-25
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2490654199 1029 DHLEPVTPTKNNKVKVILGEDREATGVLLSIDGDDGIVRMELDDQLKILNLRFLGRL 1085
Cdd:cd06086      1 EHLEPVPPEKGDRVKVIKGEDRGSTGELISIDGADGIVKMDSDGDIKILPMNFLAKL 57
KOW_Spt5_2 cd06082
KOW domain of Spt5, repeat 2; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
426-476 1.04e-24

KOW domain of Spt5, repeat 2; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240506  Cd Length: 51  Bit Score: 97.57  E-value: 1.04e-24
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2490654199  426 LQAGDNVEVCEGELINLQGKILSVDGNKITIMPKHEDLKDPLEFPAHELKK 476
Cdd:cd06082      1 FQPGDNVEVIEGELKGLQGKVESVDGDIVTIMPKHEDLKEPLEFPAKELRK 51
KOW_Spt5_4 cd06084
KOW domain of Spt5, repeat 4; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
603-645 2.16e-19

KOW domain of Spt5, repeat 4; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240508  Cd Length: 43  Bit Score: 82.18  E-value: 2.16e-19
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 2490654199  603 KDIVKVIDGPHSGREGEIRHLFRGFAFLHCKKLVENGGMFVCK 645
Cdd:cd06084      1 GDTVKVVDGPYKGRQGTVLHIYRGTLFLHSREVTENGGIFVVR 43
KOW_Spt5_1 cd06081
KOW domain of Spt5, repeat 1; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW ...
283-320 9.81e-17

KOW domain of Spt5, repeat 1; Spt5, a eukaryotic ortholog of NusG, contains multiple KOW motifs at its C-terminus. Spt5 is involved in transcription elongation and termination. The KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. KOW_Spt5 domains play critical roles in recruitment of multiple other eukaryotic transcription elongation and RNA biogenesis factors and additionally are involved in the binding of the eukaryotic Spt5 proteins to RNA polymerases.


Pssm-ID: 240505  Cd Length: 38  Bit Score: 74.43  E-value: 9.81e-17
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 2490654199  283 KSWVRLKRGLYKDDIAQVDYVEPSQNTISLKMIPRIDL 320
Cdd:cd06081      1 GSWVRIKRGIYKGDLAQVDEVDENGNRVVVKLIPRIDY 38
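For readers who want to tabulate these hits, here is a minimal Python sketch that parses one summary line (the `Pssm-ID: ... E-value: ...` line that precedes each alignment). The regular expression and the `parse_hit_summary` helper are hypothetical and written only against the text layout seen on this page; they are not an NCBI API or an official format description.

```python
import re

# Pattern written against the summary lines as they appear on this page
# (an assumption about layout, not an official format specification).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line: str) -> dict:
    """Extract the numeric fields of one hit summary line into a dict."""
    match = SUMMARY_RE.search(line)
    if match is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        # True only when the optional bracketed flag reads "Multi-domain".
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


# Summary line copied from the KOW_Spt5_1 (cd06081) hit above.
hit = parse_hit_summary(
    "Pssm-ID: 240505  Cd Length: 38  Bit Score: 74.43  E-value: 9.81e-17"
)
print(hit)
```

Collecting one such dict per hit makes it straightforward to sort the list by E-value or to filter out the multi-domain models.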
CTD pfam12815
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
774-832 1.09e-12

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.


Pssm-ID: 372327 [Multi-domain]  Cd Length: 71  Bit Score: 64.00  E-value: 1.09e-12
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  774 GSQTPMYGT--GSRTPMY---GSQTPIHE--GNRTPHY--GSQTPLHD--GNRTPGQSGAWDPnnPNTPS 832
Cdd:pfam12815    1 GSRTPAYNSagGSRTPAWgadGSRTPAYGgaGGRTPAYnqGGKTPAWGgaGSRTPAYYGAWGG--SRTPA 68
nusG PRK08559
transcription antitermination protein NusG; Validated
182-315 1.15e-12

transcription antitermination protein NusG; Validated


Pssm-ID: 181467 [Multi-domain]  Cd Length: 153  Bit Score: 66.81  E-value: 1.15e-12
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  182 KDPNLWTVKCKIGEERATAIALMRKFIAYQftdtpLQIKSVVAPDHVKGYIYVESYKQTHVKAAIEGIGNLRMGLwnQQM 261
Cdd:PRK08559     4 EMSMIFAVKTTAGQERNVALMLAMRAKKEN-----LPIYAILAPPELKGYVLVEAESKGAVEEAIRGIPHVRGVV--PGE 76
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2490654199  262 VPIKEMTDVLKVVKEVTNLKPKSWVRLKRGLYKDDIAQVDYVEPSQNTISLKMI 315
Cdd:PRK08559    77 ISFEEVEHFLKPKPIVEGIKEGDIVELIAGPFKGEKARVVRVDESKEEVTVELL 130
NGN cd08000
N-Utilization Substance G (NusG) N-terminal (NGN) domain Superfamily; The N-Utilization ...
185-271 9.53e-11

N-Utilization Substance G (NusG) N-terminal (NGN) domain Superfamily; The N-Utilization Substance G (NusG) and its eukaryotic homolog Spt5 are involved in transcription elongation and termination. NusG contains an NGN domain at its N-terminus and Kyrpides, Ouzounis, and Woese (KOW) repeats at its C-terminus in bacteria and archaea. The eukaryotic ortholog, Spt5, is a large protein composed of an acidic N-terminus, an NGN domain, and multiple KOW motifs at its C-terminus. Spt5 forms a Spt4-Spt5 complex that is an essential RNA Polymerase II elongation factor. NusG was originally discovered as an N-dependent antitermination enhancing activity in Escherichia coli and has a variety of functions, such as being involved in RNA polymerase elongation and Rho-termination in bacteria. Orthologs of the NusG gene exist in all bacteria, but their functions and requirements are different. The diverse activities suggest that, after diverging from a common ancestor, NusG proteins became specialized in different bacteria.


Pssm-ID: 193574 [Multi-domain]  Cd Length: 99  Bit Score: 59.64  E-value: 9.53e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  185 NLWTVKCKIGEERATAIALMRKFIA---------YQFTDTPLQIKSVVAPDHVKGYIYVESYKQTHVKAAIEGIGNLRMG 255
Cdd:cd08000      1 NWYVLFVKTGREEKVEKLLEKRFEAndieafvpkKEVPERKRGKIEEVIKPLFPGYVFVETDLSPELYELIREVPGVIGI 80
                           90
                   ....*....|....*....
gi 2490654199  256 LWN---QQMVPIKEMTDVL 271
Cdd:cd08000     81 LGNgeePSPVSDEEIEMIL 99
NGN smart00738
In Spt5p, this domain may confer affinity for Spt4p. It possesses an RNP-like fold; In Spt5p, ...
185-273 2.43e-10

In Spt5p, this domain may confer affinity for Spt4p. It possesses an RNP-like fold; In Spt5p, this domain may confer affinity for Spt4p.


Pssm-ID: 197850 [Multi-domain]  Cd Length: 106  Bit Score: 58.54  E-value: 2.43e-10
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199   185 NLWTVKCKIGEERATAIALMRKFIAYQFTDtplQIKSVVAP-DHVK----------------GYIYVESYKQTHVKAAIE 247
Cdd:smart00738    1 NWYAVRTTSGQEKRVAENLERKAEALGLED---KIVSILVPtEEVKeirrgkkkvverklfpGYIFVEADLEDEVWTAIR 77
                            90       100
                    ....*....|....*....|....*....
gi 2490654199   248 GIGNLRMGLWN---QQMVPIKEMTDVLKV 273
Cdd:smart00738   78 GTPGVRGFVGGggkPTPVPDDEIEKILKP 106
Spt5_N pfam11942
Spt5 transcription elongation factor, acidic N-terminal; This is the very acidic N-terminal ...
100-179 3.32e-10

Spt5 transcription elongation factor, acidic N-terminal; This is the very acidic N-terminal region of the early transcription elongation factor Spt5. The Spt5-Spt4 complex regulates early transcription elongation by RNA polymerase II and has an imputed role in pre-mRNA processing via its physical association with mRNA capping enzymes. The actual function of this N-terminal domain is not known although it is dispensable for binding to Spt4.


Pssm-ID: 463406  Cd Length: 97  Bit Score: 58.05  E-value: 3.32e-10
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  100 GAEDILEKgsDYAEVSNLDHIPLDDDHSGSRRLQNLWRDSREEALGEYYMRKYAKSAggEHYSGGSEElsDDITQQQLLP 179
Cdd:pfam11942   24 GADDFIED--DEEDEDEEDGRRDDRRHRELDRRRELEEDEDAEEIAEYLKERYGRSS--SDAYRGDAE--EGVPQRLLLP 97
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
846-974 5.22e-09

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1 that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 60.55  E-value: 5.22e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  846 PSPSPQGYG-STPNPQTPGYPEAPSPQVNPQYNPQTPGTPAMynteqysPYAAPSPQGSYQPSPSPQSYHH---QVAPSP 921
Cdd:pfam03154  311 PGPSPAAPGqSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSM-------PHIKPPPTTPIPQLPNPQSHKHpphLSGPSP 383
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2490654199  922 ---------------VGYQNTHSPASYHPTPSPMAYQASPSPSPVGYSPM---TPGAPSPGGYNPHTPGSN 974
Cdd:pfam03154  384 fqmnsnlppppalkpLSSLSTHHPPSAHPPPLQLMPQSQQLPPPPAQPPVltqSQSLPPPAASHPPTSGLH 454
PHA03247 PHA03247
large tegument protein UL36; Provisional
800-979 7.72e-08

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 56.87  E-value: 7.72e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  800 RTPHYGSQTPLHDGNRTPGQSGAWDPNNPNTPsrndeeydfgyddePSPS----PQGYGSTPNPQTPGYPEAPSPQVNPQ 875
Cdd:PHA03247  2711 APHALVSATPLPPGPAAARQASPALPAAPAPP--------------AVPAgpatPGGPARPARPPTTAGPPAPAPPAAPA 2776
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  876 YNPQTPGTPAMYNTEQYSPYAAPSPqgsyqPSPSPqsyhhqvAPSPVGYQNTHSPASYHPtpspmayqASPSPSPVGYSP 955
Cdd:PHA03247  2777 AGPPRRLTRPAVASLSESRESLPSP-----WDPAD-------PPAAVLAPAAALPPAASP--------AGPLPPPTSAQP 2836
                          170       180
                   ....*....|....*....|....
gi 2490654199  956 MTPGAPSPGGYNPHTPGSNIEQGG 979
Cdd:PHA03247  2837 TAPPPPPGPPPPSLPLGGSVAPGG 2860
CTD pfam12815
Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription ...
753-806 1.12e-07

Spt5 C-terminal nonapeptide repeat binding Spt4; The C-terminal domain of the transcription elongation factor protein Spt5 is necessary for binding to Spt4 to form the functional complex that regulates early transcription elongation by RNA polymerase II. The complex may be involved in pre-mRNA processing through its association with mRNA capping enzymes. This CTD domain carries a regular nonapeptide repeat that can be present in up to 18 copies, as in S. pombe. The repeat has a characteristic TPA motif.


Pssm-ID: 372327 [Multi-domain]  Cd Length: 71  Bit Score: 49.75  E-value: 1.12e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2490654199  753 TTMGAKRQGGMTSAH----SRTPMYG---SQTPMYGTGSRTPMY---GSQTPIH----EGNRTPHYGS 806
Cdd:pfam12815    4 TPAYNSAGGSRTPAWgadgSRTPAYGgagGRTPAYNQGGKTPAWggaGSRTPAYygawGGSRTPAYGG 71
Pro-rich pfam15240
Proline-rich protein; This family includes several eukaryotic proline-rich proteins.
833-971 1.25e-07

Proline-rich protein; This family includes several eukaryotic proline-rich proteins.


Pssm-ID: 464580 [Multi-domain]  Cd Length: 167  Bit Score: 52.73  E-value: 1.25e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  833 RNDEEYDFGYDDEPSPSPQGyGSTPNPQTPGYPEAPSPQVNPQYNPQTPGTPamyNTEQYSPYAAPSPQGSYQPSPSPQS 912
Cdd:pfam15240   33 SEEEGQSQQGGQGPQGPPPG-GFPPQPPASDDPPGPPPPGGPQQPPPQGGKQ---KPQGPPPQGGPRPPPGKPQGPPPQG 108
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*....
gi 2490654199  913 YHHQVAPSPVGYQNTHSPASYHPTPSPMAYQASPSPSPvgYSPMTPGAPSPGGYNPHTP 971
Cdd:pfam15240  109 GNQQQGPPPPGKPQGPPPQGGGPPPQGGNQQGPPPPPP--GNPQGPPQRPPQPGNPQGP 165
KOW_elon_Spt5 TIGR00405
transcription elongation factor Spt5; This protein contains a KOW domain, shared by bacterial ...
187-315 1.97e-07

transcription elongation factor Spt5; This protein contains a KOW domain, shared by bacterial NusG and the uL24 (previously L24p/L26e) family of ribosomal proteins. The most recent papers and crystal structures make this a transcription elongation factor rather than a ribosomal protein.


Pssm-ID: 129499 [Multi-domain]  Cd Length: 145  Bit Score: 51.43  E-value: 1.97e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  187 WTVKCKIGEERATAialmrKFIAYQFTDTPLQIKSVVAPDHVKGYIYVESYKQTHVKAAIEGIGNLRmGLWNQQmVPIKE 266
Cdd:TIGR00405    1 FAVKTSVGQEKNVA-----RLMARKARKSGLEVYSILAPESLKGYILVEAETKIDMRNPIIGVPHVR-GVVEGE-IDFEE 73
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 2490654199  267 MTDVLKVVKEVTNLKPKSWVRLKRGLYKDDIAQVDYVEPSQNTISLKMI 315
Cdd:TIGR00405   74 IERFLTPKKIIESIKKGDIVEIISGPFKGERAKVIRVDESKEEVTLELI 122
PHA03269 PHA03269
envelope glycoprotein C; Provisional
855-973 2.23e-07

envelope glycoprotein C; Provisional


Pssm-ID: 165527 [Multi-domain]  Cd Length: 566  Bit Score: 54.73  E-value: 2.23e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  855 STPNPQTPGYPEAPSPQVNPQYNPQTPGTPAMYNTEQYS----PYAAPSPQGSYQPSPSPQSYHHQV-APSPVGYQNTHS 929
Cdd:PHA03269    27 PIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASrkpdLAQAPTPAASEKFDPAPAPHQAASrAPDPAVAPQLAA 106
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 2490654199  930 PasyhPTPSPM-----AYQASPSPSPVGYSPMTPgAPSPGGYNPHTPGS 973
Cdd:PHA03269   107 A----PKPDAAeaftsAAQAHEAPADAGTSAASK-KPDPAAHTQHSPPP 150
NGN_Arch cd09887
Archaeal N-Utilization Substance G (NusG) N-terminal (NGN) domain; The N-Utilization Substance ...
186-253 3.65e-07

Archaeal N-Utilization Substance G (NusG) N-terminal (NGN) domain; The N-Utilization Substance G (NusG) protein and its eukaryotic homolog, Spt5, are involved in transcription elongation and termination. Archaea have a eukaryotic-type transcription apparatus but use bacterial-type transcription factors. NusG is one of the few archaeal transcription factors that has orthologs in both bacteria and eukaryotes. Archaeal NusG is similar to bacterial NusG, composed of an NGN domain and a Kyrpides Ouzounis and Woese (KOW) repeat. The eukaryotic ortholog, Spt5, is a large protein composed of an acidic N-terminus, an NGN domain, and multiple KOW motifs at its C-terminus. NusG was originally discovered as an N-dependent antitermination enhancing activity in Escherichia coli and has a variety of functions, such as being involved in RNA polymerase elongation and Rho-termination in bacteria. Archaeal NusG forms a complex with DNA-directed RNA polymerase subunit E (rpoE) that is similar to the Spt5-Spt4 complex in eukaryotes.


Pssm-ID: 193576  Cd Length: 82  Bit Score: 48.69  E-value: 3.65e-07
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2490654199  186 LWTVKCKIGEERATAIALMRKFiayqfTDTPLQIKSVVAPDHVKGYIYVESYKQTHVKAAIEGIGNLR 253
Cdd:cd09887      2 IYAVKTTAGQERNVADLLAMRA-----EKENLDVYSILVPEELKGYVFVEAEDPDRVEELIRGIPHVR 64
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
792-968 4.22e-07

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 54.00  E-value: 4.22e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  792 QTPIHEGNRTPhygsQTPLHDGNRTPGQSGAWDPNNPNTPSRNDEEYDFGYDDEPSPSPQGYG-------STPNPQTPGY 864
Cdd:NF033839   282 DTPKEPGNKKP----SAPKPGMQPSPQPEKKEVKPEPETPKPEVKPQLEKPKPEVKPQPEKPKpevkpqlETPKPEVKPQ 357
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  865 PEAPSPQVNPQynPQTPGTPAMYNTEQYSPYAAPSPQGSY-QPSPSPQSYHHQVAPSPVGYQNTHSPASYHPTPS--PMA 941
Cdd:NF033839   358 PEKPKPEVKPQ--PEKPKPEVKPQPETPKPEVKPQPEKPKpEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEvkPQP 435
                          170       180
                   ....*....|....*....|....*..
gi 2490654199  942 YQASPSPSPVGYSPMTPGAPSPGGYNP 968
Cdd:NF033839   436 EKPKPEVKPQPEKPKPEVKPQPETPKP 462
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
846-977 6.26e-07

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 53.62  E-value: 6.26e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  846 PSPSPQGYGSTPNPQTPGYPEAPSPQVNPQYNPQ----TPGTPaMYNTEQYSPYAAPSPqGSYQPSP-SPQSYHHQVAPS 920
Cdd:pfam03154  235 PTLHPQRLPSPHPPLQPMTQPPPPSQVSPQPLPQpslhGQMPP-MPHSLQTGPSHMQHP-VPPQPFPlTPQSSQSQVPPG 312
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2490654199  921 PVGYQNTHSPASYHPTPSPmayQASPSPSPVGYSPMTPgAPSPGGYNPHTPGSNIEQ 977
Cdd:pfam03154  313 PSPAAPGQSQQRIHTPPSQ---SQLQSQQPPREQPLPP-APLSMPHIKPPPTTPIPQ 365
PRK10263 PRK10263
DNA translocase FtsK; Provisional
771-974 8.69e-07

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 53.55  E-value: 8.69e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  771 PMYGSQTPMYGTGSRTPMYGSQTPIHEGNRTPHYGSQTPLHDGNRTPGQSGAWDPNNPNTPSR-NDEEYDFGYDDEPSPS 849
Cdd:PRK10263   384 SQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAwQAEEQQSTFAPQSTYQ 463
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  850 PQGYGSTPNPQTPGY--PEAPSPQVNPQYNPQT----PGTPAMYNTEQYSPYAAPSPQ---GSYQPSPSPQSYHHQVAPS 920
Cdd:PRK10263   464 TEQTYQQPAAQEPLYqqPQPVEQQPVVEPEPVVeetkPARPPLYYFEEVEEKRAREREqlaAWYQPIPEPVKEPEPIKSS 543
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2490654199  921 -PVGYQNTHSPASYHPTPSPMAY----------QASPSPSPVgYSPMTPGAPSPG---GYNPHTPGSN 974
Cdd:PRK10263   544 lKAPSVAAVPPVEAAAAVSPLASgvkkatlatgAAATVAAPV-FSLANSGGPRPQvkeGIGPQLPRPK 610
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
778-973 1.06e-06

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 52.85  E-value: 1.06e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  778 PMYGTGSRTPMYGSQTPIHEGNRTPHYGSQTPlhdgNRTPGQSGAWDPNNPNTPSRNDEEYDFGYDDEP--SPSPQgygs 855
Cdd:pfam03154  294 PPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRI----HTPPSQSQLQSQQPPREQPLPPAPLSMPHIKPPptTPIPQ---- 365
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  856 TPNPQT---PGYPEAPSPQvnpQYNPQTPGTPAMYNTEQYSPYAAPS---------PQGSYQPSPSPQSyhhqvaPSPVG 923
Cdd:pfam03154  366 LPNPQShkhPPHLSGPSPF---QMNSNLPPPPALKPLSSLSTHHPPSahppplqlmPQSQQLPPPPAQP------PVLTQ 436
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....
gi 2490654199  924 YQNTHSPASYHPTPSpmAYQASPSPSPVGYSPMTPGAP----SPGGYNPHTPGS 973
Cdd:pfam03154  437 SQSLPPPAASHPPTS--GLHQVPSQSPFPQHPFVPGGPppitPPSGPPTSTSSA 488
KOW cd00380
KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known ...
481-525 1.63e-06

KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. The KOW motif contains an invariant glycine residue and comprises alternating blocks of hydrophilic and hydrophobic residues.


Pssm-ID: 240504  Cd Length: 49  Bit Score: 46.06  E-value: 1.63e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 2490654199  481 GDHVKVIAGRYEGDTGLIVRVEENFVIL----FSDLTMHELKVLPRDLQ 525
Cdd:cd00380      1 GDVVRVLRGPYKGREGVVVDIDPRFGIVtvkgATGSKGAELKVRFDDVD 49
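The description above mentions an invariant glycine in the KOW motif. As a rough illustration only, the sketch below lists the alignment columns where both the query row and the cd00380 model row carry a glycine. It does not determine which of these columns is the invariant residue; the function name `shared_glycine_columns` and the constants are hypothetical, and the rows are copied from the alignment above.

```python
def shared_glycine_columns(query_row: str, model_row: str) -> list[int]:
    """Return 0-based alignment columns where both rows have a glycine.

    Gap columns ('-') never match, and lowercase residues are compared
    case-insensitively.
    """
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have the same length")
    return [
        i
        for i, (q, m) in enumerate(zip(query_row, model_row))
        if q.upper() == "G" and m.upper() == "G"
    ]


# Rows copied from the alignment above (query residues 481-525, model 1-49).
KOW_QUERY_ROW = "GDHVKVIAGRYEGDTGLIVRVEENFVIL----FSDLTMHELKVLPRDLQ"
KOW_MODEL_ROW = "GDVVRVLRGPYKGREGVVVDIDPRFGIVtvkgATGSKGAELKVRFDDVD"

if __name__ == "__main__":
    print(shared_glycine_columns(KOW_QUERY_ROW, KOW_MODEL_ROW))
```

Columns are counted from zero within the alignment block, so adding 481 to a column index gives the query residue number only for columns that precede the first gap.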
PHA03378 PHA03378
EBNA-3B; Provisional
765-963 1.89e-06

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 52.38  E-value: 1.89e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  765 SAHSRTPMYGSQTPMYGTGSRTPMYGSQTPIHEGNRTPhygsQTPLHDGNRTPgqsgAWDPNNPNTPSRNDeeYDFGYDD 844
Cdd:PHA03378   603 SQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQP----ITFNVLVFPTP----HQPPQVEITPYKPT--WTQIGHI 672
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  845 EPSPSPQGYGSTPNPQ-TPGY----PEAPSPQVNPQYNPQTPGTPAMYNTEQYSPYAAPSPQGSYQPSPSPqsyhhqvAP 919
Cdd:PHA03378   673 PYQPSPTGANTMLPIQwAPGTmqppPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGR-------AR 745
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|.
gi 2490654199  920 SPVGYQNTHSPASYHPTPS--PMAYQASPSPSPVGYSPMTP-----GAPSP 963
Cdd:PHA03378   746 PPAAAPGRARPPAAAPGRArpPAAAPGAPTPQPPPQAPPAPqqrprGAPTP 796
PHA03247 PHA03247
large tegument protein UL36; Provisional
807-972 2.00e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 52.25  E-value: 2.00e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  807 QTPLHDGNRTPGQSgawdPNNPNTPsrndeeyDFGYDDEPSPSPQ-------GYGSTPNPQTPGYPEAPSP-QVNPQYNP 878
Cdd:PHA03247  2599 RAPVDDRGDPRGPA----PPSPLPP-------DTHAPDPPPPSPSpaanepdPHPPPTVPPPERPRDDPAPgRVSRPRRA 2667
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  879 QTPGTPAMYN--TEQYSPYAAPSPQGSYQPS--PSPQSYHHQVAPSPVGYQNTHSPASYHPTPSPMAYQASPSPSPVGYS 954
Cdd:PHA03247  2668 RRLGRAAQASspPQRPRRRAARPTVGSLTSLadPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAG 2747
                          170
                   ....*....|....*...
gi 2490654199  955 PMTPGAPSPGGYNPHTPG 972
Cdd:PHA03247  2748 PATPGGPARPARPPTTAG 2765
PHA03377 PHA03377
EBNA-3C; Provisional
751-978 2.22e-06

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 51.98  E-value: 2.22e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  751 RLTTMGAKRqggmTSAHSRTPMY-GSQTPMygTGSRTPMYGSQTPihegNRTPHYGSQTPLHDGNRTPGQsgaWDPNNPN 829
Cdd:PHA03377   705 HLSSMSPTQ----PISHEEQPRYeDPDDPL--DLSLHPDQAPPPS----HQAPYSGHEEPQAQQAPYPGY---WEPRPPQ 771
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  830 TPSrndeeydFGYDDEPSPSPQ-----GYGS--TPNPQTPGY--PEAPSPQvNPQY-NPQTPGTPamynteqYSPYAAPS 899
Cdd:PHA03377   772 APY-------LGYQEPQAQGVQvssypGYAGpwGLRAQHPRYrhSWAYWSQ-YPGHgHPQGPWAP-------RPPHLPPQ 836
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  900 PQGSYQPSPS--PQSYHHQVAPSPVGYQNTHSPASYHPTP------------------SPMAYQASPSPSPVGYSpMTPG 959
Cdd:PHA03377   837 WDGSAGHGQDqvSQFPHLQSETGPPRLQLSQVPQLPYSQTlvsssapswsspqprapiRPIPTRFPPPPMPLQDS-MAVG 915
                          250       260
                   ....*....|....*....|
gi 2490654199  960 APSPGGYNPHTP-GSNIEQG 978
Cdd:PHA03377   916 CDSSGTACPSMPfASDYSQG 935
PHA02682 PHA02682
ORF080 virion core protein; Provisional
846-1008 2.36e-06

ORF080 virion core protein; Provisional


Pssm-ID: 177464 [Multi-domain]  Cd Length: 280  Bit Score: 50.63  E-value: 2.36e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  846 PSPSPQGYGSTPNPQTP--GYPEAPSPQVNPQYNPQTPGTPAmyNTEQYSPyAAPSPQgsyqPSPSPqsyhhqvAPSPVG 923
Cdd:PHA02682   101 AAPAPAVTCPAPAPACPpaTAPTCPPPAVCPAPARPAPACPP--STRQCPP-APPLPT----PKPAP-------AAKPIF 166
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  924 YQNTHSPASYhPTPSPMAYQASPSPSPVgyspmtpgapspggYNPHTPGSNIEqggGDWVTTDI----LVRVKDSFMDLM 999
Cdd:PHA02682   167 LHNQLPPPDY-PAASCPTIETAPAASPV--------------LEPRIPDKIID---ADNDDKDLikkeLADIADSVRDLN 228
                          170
                   ....*....|
gi 2490654199 1000 GQ-TGVIRSI 1008
Cdd:PHA02682   229 AEsLSLTRDI 238
PHA03247 PHA03247
large tegument protein UL36; Provisional
845-970 2.56e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.86  E-value: 2.56e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  845 EPSPSPQGYGSTPNPQTPGYPEAPSPQVNPQYNPQTPGTPAmynteQYSPYAAPSPQGSYQPSPSPQSYHHQVAPSPvgy 924
Cdd:PHA03247  2700 DPPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPA-----PPAVPAGPATPGGPARPARPPTTAGPPAPAP--- 2771
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 2490654199  925 qnthsPASYHPTPSPmayqASPSPSPVGYSPMTPGAPSPGGYNPHT 970
Cdd:PHA03247  2772 -----PAAPAAGPPR----RLTRPAVASLSESRESLPSPWDPADPP 2808
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
755-972 3.24e-06

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators uses the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 51.16  E-value: 3.24e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  755 MGAKRQGGmtsAHSRTPMYGSQTPMYGTGSRTPMYGS---QTPIHEGNRTPHYGSQTPLHDGNRTPG----QSGAWDPNN 827
Cdd:pfam09606  229 MNPQQMGG---APNQVAMQQQQPQQQGQQSQLGMGINqmqQMPQGVGGGAGQGGPGQPMGPPGQQPGampnVMSIGDQNN 305
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  828 PNTPSRNDEEYDFGYDDEPSPSPQ--------------GYGSTPNPQTPGypeapsPQVNPQYNPQTPGTPAMYNTEQYS 893
Cdd:pfam09606  306 YQQQQTRQQQQQQGGNHPAAHQQQmnqsvgqggqvvalGGLNHLETWNPG------NFGGLGANPMQRGQPGMMSSPSPV 379
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  894 PYAAP---SPQGSYQPSPSPqsyhHQVAPSPVGYQNTHSPASyHPTPSPmAYQASPSPSPVGY--SPMTPGAPSPGGyNP 968
Cdd:pfam09606  380 PGQQVrqvTPNQFMRQSPQP----SVPSPQGPGSQPPQSHPG-GMIPSP-ALIPSPSPQMSQQpaQQRTIGQDSPGG-SL 452

                   ....
gi 2490654199  969 HTPG 972
Cdd:pfam09606  453 NTPG 456
KOW cd00380
KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known ...
429-475 3.24e-06

KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. The KOW motif contains an invariant glycine residue and comprises alternating blocks of hydrophilic and hydrophobic residues.


Pssm-ID: 240504  Cd Length: 49  Bit Score: 44.90  E-value: 3.24e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 2490654199  429 GDNVEVCEGELINLQGKILSVDG--NKITIMPKHEDLKDPLEFPAHELK 475
Cdd:cd00380      1 GDVVRVLRGPYKGREGVVVDIDPrfGIVTVKGATGSKGAELKVRFDDVD 49
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
770-964 3.28e-06

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 51.46  E-value: 3.28e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  770 TPMYGSQTPMYGTGSRTPMYGSQTPiheGNRTPHYGSQTPLhDGNRTPGQSGAWDPNNPNTPSRNDEEYDFGyddEPSPS 849
Cdd:pfam05109  508 SPTSAVTTPTPNATSPTPAVTTPTP---NATSPTLGKTSPT-SAVTTPTPNATSPTPAVTTPTPNATIPTLG---KTSPT 580
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  850 PQGYGSTPNPQTPGYPEApSPQVNPQYNP--QTPGTPAMYNTEQYSPYAAPSPQGSYQPSPS------PQSYHHQVAPSP 921
Cdd:pfam05109  581 SAVTTPTPNATSPTVGET-SPQANTTNHTlgGTSSTPVVTSPPKNATSAVTTGQHNITSSSTssmslrPSSISETLSPST 659
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*
gi 2490654199  922 VGYQNTHSP--ASYHPTPSPMAYQASPSPSPVGYSPMTPGAPSPG 964
Cdd:pfam05109  660 SDNSTSHMPllTSAHPTGGENITQVTPASTSTHHVSTSSPAPRPG 704
PRK10263 PRK10263
DNA translocase FtsK; Provisional
865-994 3.73e-06

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 51.24  E-value: 3.73e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  865 PEAP--SPQVNPQYNPQTPgtpamynteqysPYAAPSPQGSYQPSPSPQSYHHQVAPSPVGYQNTHSPASYHPTPSPMAY 942
Cdd:PRK10263   740 PHEPlfTPIVEPVQQPQQP------------VAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQP 807
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 2490654199  943 QASPSPSPVGYSPMTPGAPSPGGYNPHTPGSNIEQgggDWVTTDILVRVKDS 994
Cdd:PRK10263   808 QQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQ---DTLLHPLLMRNGDS 856
KOW cd00380
KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known ...
709-752 3.83e-06

KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. The KOW motif contains an invariant glycine residue and comprises alternating blocks of hydrophilic and hydrophobic residues.


Pssm-ID: 240504  Cd Length: 49  Bit Score: 44.90  E-value: 3.83e-06
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*...
gi 2490654199  709 GQTVRISQGPYKGYIGVVKDATEST--ARVELH--STCQTISVDRQRL 752
Cdd:cd00380      1 GDVVRVLRGPYKGREGVVVDIDPRFgiVTVKGAtgSKGAELKVRFDDV 48
PHA03247 PHA03247
large tegument protein UL36; Provisional
816-968 4.35e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 51.09  E-value: 4.35e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  816 TPGQSGAWDPNNPNTP--SRNDEEYDFGYDDEPSPSPQGYGSTPNPQTPGYPEAPSP-------------QVNPQYNPQT 880
Cdd:PHA03247  2795 RESLPSPWDPADPPAAvlAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPlggsvapggdvrrRPPSRSPAAK 2874
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  881 PGTPAMYNTEQYS-PYAAPSPQGSYQPSPSPQSYHHQVAPSPVGYQNThSPASYHPTPSPMAYQASPSPSPVGYSPMTPG 959
Cdd:PHA03247  2875 PAAPARPPVRRLArPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQ-PPPPPQPQPPPPPPPRPQPPLAPTTDPAGAG 2953

                   ....*....
gi 2490654199  960 APSPGGYNP 968
Cdd:PHA03247  2954 EPSGAVPQP 2962
PRK10263 PRK10263
DNA translocase FtsK; Provisional
845-960 4.64e-06

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 50.85  E-value: 4.64e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  845 EPSPSPQGYGSTPNPQTPGYPEAPSPQVNPQYNPQTPGTPAMYNTEQYSPYAAPSP-QGSYQPSPSPQ--------SYHH 915
Cdd:PRK10263   370 EPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPeQPAQQPYYAPApeqpvagnAWQA 449
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 2490654199  916 QVAPSPVGYQNTHSPASYHPTPSPMA--YQASPSPSPVGYSPMTPGA 960
Cdd:PRK10263   450 EEQQSTFAPQSTYQTEQTYQQPAAQEplYQQPQPVEQQPVVEPEPVV 496
dnaA PRK14086
chromosomal replication initiator protein DnaA;
794-952 5.70e-06

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 50.59  E-value: 5.70e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  794 PIHEGNRTPHYGSQT-PLHDGNRTPGQSGAWDPNN------PNTPSRNDEEYDFGYDDEPSPSPQGYGSTPNPQTPGYPE 866
Cdd:PRK14086   103 RRTSEPELPRPGRRPyEGYGGPRADDRPPGLPRQDqlptarPAYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYAS 182
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  867 APSPQVNPQYNPQTPGTPAMYNTEQYSPYAAPSPQGSY-------QPSPSPQSYH-HQVAPSPVGYQNTHSPASYHPTPS 938
Cdd:PRK14086   183 PASYAPEQERDREPYDAGRPEYDQRRRDYDHPRPDWDRprrdrtdRPEPPPGAGHvHRGGPGPPERDDAPVVPIRPSAPG 262
                          170
                   ....*....|....
gi 2490654199  939 PMAYQASPSPSPVG 952
Cdd:PRK14086   263 PLAAQPAPAPGPGE 276
PHA03247 PHA03247
large tegument protein UL36; Provisional
819-971 6.30e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.71  E-value: 6.30e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  819 QSGAWDPNNPNTPSR----NDEEYDFGYDDEPSPSPQgygSTPNPQTPgyPEAPSPQVNPQYNPQTPGTPAMYNTEQYS- 893
Cdd:PHA03247  2583 TSRARRPDAPPQSARprapVDDRGDPRGPAPPSPLPP---DTHAPDPP--PPSPSPAANEPDPHPPPTVPPPERPRDDPa 2657
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  894 ------PYAAPSPQGSYQPSPSPQSYHHQVAPSPVGyqnthSPASYHPTPSPMAYQASPSPSPVGYSPMTPGAPSPGGYN 967
Cdd:PHA03247  2658 pgrvsrPRRARRLGRAAQASSPPQRPRRRAARPTVG-----SLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQAS 2732

                   ....
gi 2490654199  968 PHTP 971
Cdd:PHA03247  2733 PALP 2736
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
749-886 8.68e-06

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consist of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNAs from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens: one expressed in testis, one in platelets, one broadly expressed, and one of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 49.81  E-value: 8.68e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  749 RQRLTTMGAKRQGGMTSAHSRTPMYGS--QTPMYGTGSRTpMYGSQTPIHEGNR---TPHYGSQTPLHDGNRTPGQSGAW 823
Cdd:TIGR01628  368 RAHLQDQFMQLQPRMRQLPMGSPMGGAmgQPPYYGQGPQQ-QFNGQPLGWPRMSmmpTPMGPGGPLRPNGLAPMNAVRAP 446
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2490654199  824 DPNNPNTPSRndeeydfgyddepsPSPQGYGSTPNPQT-PGYPEAPSPQVNPQYNPQTPGTPAM 886
Cdd:TIGR01628  447 SRNAQNAAQK--------------PPMQPVMYPPNYQSlPLSQDLPQPQSTASQGGQNKKLAQV 496
PHA03247 PHA03247
large tegument protein UL36; Provisional
768-954 9.36e-06

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 50.32  E-value: 9.36e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  768 SRTPMYGSQTPMygtgSRTPMYGSQTPIHEGNRTPHYGSQTPLHD-GNRTPGQSGAWDPNNPNTP----------SRNDE 836
Cdd:PHA03247  2821 AASPAGPLPPPT----SAQPTAPPPPPGPPPPSLPLGGSVAPGGDvRRRPPSRSPAAKPAAPARPpvrrlarpavSRSTE 2896
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  837 EYDFGyDDEPSPSPQGYGSTPNPQTPGYPEAPSPQVNPQYNPQTPGTPAmynteqysPYAAPSPQGSYQPS-PSPQSYHh 915
Cdd:PHA03247  2897 SFALP-PDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLA--------PTTDPAGAGEPSGAvPQPWLGA- 2966
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 2490654199  916 qVAPSPVGYQNTHSPASYHPTPSPmayqASPSPSPVGYS 954
Cdd:PHA03247  2967 -LVPGRVAVPRFRVPQPAPSREAP----ASSTPPLTGHS 3000
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
746-964 9.73e-06

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 49.78  E-value: 9.73e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  746 SVDRQRLTTMGAKRQGGMTSAHSRTPMYGSQTPMYGTGSRTPMYGSqTPIHEGNRTPHYGSQTPlhdgnrTPGQSGAWDP 825
Cdd:PHA03307   228 AADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRIWEAS-GWNGPSSRPGPASSSSS------PRERSPSPSP 300
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  826 NNP-----NTPSRNDEEYDFGYDDE-PSPSPQGYGSTPNPQTPGYPEAPSPQVNPQYNPQTPGTPAmyntEQYSPYAAPS 899
Cdd:PHA03307   301 SSPgsgpaPSSPRASSSSSSSRESSsSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPR----KRPRPSRAPS 376
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 2490654199  900 PQGSYQPSPSPQSYHHQVAPSPVGYQNTHSPASYHPTPSPMAYQASPSPSPVGYSPMTP-GAPSPG 964
Cdd:PHA03307   377 SPAASAGRPTRRRARAAVAGRARRRDATGRFPAGRPRPSPLDAGAASGAFYARYPLLTPsGEPWPG 442
dnaA PRK14086
chromosomal replication initiator protein DnaA;
845-981 1.31e-05

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 49.05  E-value: 1.31e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  845 EPSPSPQGYGSTPNPQTPGYPEAPSPQVNPQYNPQTPGTPAMYNTEQYSPYAAPSPQGSYQPSPSPQsyhhqvAPSPVGY 924
Cdd:PRK14086    94 EPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWPR------AADDYGW 167
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2490654199  925 QNT-HSPASYHPTPSPMAYQASPSPSPVGYSPMTPGAPSPGGyNPHTPGSNIEQGGGD 981
Cdd:PRK14086   168 QQQrLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRRR-DYDHPRPDWDRPRRD 224
KOW cd00380
KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known ...
603-640 1.35e-05

KOW: an acronym for the authors' surnames (Kyrpides, Ouzounis and Woese); KOW domain is known as an RNA-binding motif that is shared so far among some families of ribosomal proteins, the essential bacterial transcriptional elongation factor NusG, the eukaryotic chromatin elongation factor Spt5, the higher eukaryotic KIN17 proteins and Mtr4. The KOW motif contains an invariant glycine residue and comprises alternating blocks of hydrophilic and hydrophobic residues.


Pssm-ID: 240504  Cd Length: 49  Bit Score: 43.36  E-value: 1.35e-05
                           10        20        30
                   ....*....|....*....|....*....|....*...
gi 2490654199  603 KDIVKVIDGPHSGREGEIRHLFRGFAFLHCKKLVENGG 640
Cdd:cd00380      1 GDVVRVLRGPYKGREGVVVDIDPRFGIVTVKGATGSKG 38
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
814-979 1.49e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 49.38  E-value: 1.49e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  814 NRTPGQSGAWDPNN---------PNTPSRNDEEYDfgyDDEPSPSPQGYGSTPNPQTPGYPEAPSPqvnpqynPQTPGTP 884
Cdd:pfam03154  123 GRSVNDEGSSDPKDidqdnrstsPSIPSPQDNESD---SDSSAQQQILQTQPPVLQAQSGAASPPS-------PPPPGTT 192
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  885 AMYNTEQYSPYAAPSPQGSYQPSPSPQSYHHQVAPSPVGYQNTHSPASYHPTPSPMAYQASPSPSPVGYSPMTpgAPSPG 964
Cdd:pfam03154  193 QAATAGPTPSAPSVPPQGSPATSQPPNQTQSTAAPHTLIQQTPTLHPQRLPSPHPPLQPMTQPPPPSQVSPQP--LPQPS 270
                          170
                   ....*....|....*
gi 2490654199  965 GYNPHTPGSNIEQGG 979
Cdd:pfam03154  271 LHGQMPPMPHSLQTG 285
PHA03247 PHA03247
large tegument protein UL36; Provisional
813-950 1.84e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 49.17  E-value: 1.84e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  813 GNRTPGQSGAWDPNNPNTPSR------NDEEY----------------DFGYDD--EPSP-------SPQGYGSTPNPQT 861
Cdd:PHA03247  2494 AAPDPGGGGPPDPDAPPAPSRlapailPDEPVgepvhprmltwirgleELASDDagDPPPplppaapPAAPDRSVPPPRP 2573
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  862 PGYPEAPS-------PQVNPQYN-PQTPGTPAmynteqyspyaaPSPQGSYQPSPSPQSYHHQVAPSPVGYQNTHSPASY 933
Cdd:PHA03247  2574 APRPSEPAvtsrarrPDAPPQSArPRAPVDDR------------GDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPH 2641
                          170
                   ....*....|....*..
gi 2490654199  934 HPTPSPMAYQASPSPSP 950
Cdd:PHA03247  2642 PPPTVPPPERPRDDPAP 2658
PTZ00395 PTZ00395
Sec24-related protein; Provisional
790-969 2.53e-05

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 48.53  E-value: 2.53e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  790 GSQTPIHEGNRTPHYGSQTPL-HDGNRTPGQSGAWDPNNPNTPSRNDEEYDFGYDDEPSPSPQGYGSTP--NP--QTPGY 864
Cdd:PTZ00395   345 GSPNAASAGAPFNGLGNQADGgHINQVHPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFSNAGysNPgnSNPGY 424
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  865 PEAP---SPQVNPQY------NPQTPGTPamYNTEQYSPYAAPSPQGSYQPsPSPQSYHHQVAPSPVGYQNTHSPASYHP 935
Cdd:PTZ00395   425 NNAPnsnTPYNNPPNsntpysNPPNSNPP--YSNLPYSNTPYSNAPLSNAP-PSSAKDHHSAYHAAYQHRAANQPAANLP 501
                          170       180       190
                   ....*....|....*....|....*....|....
gi 2490654199  936 TPSPMAyqASPSPSPVGYSPMTPGAPSPGGYNPH 969
Cdd:PTZ00395   502 TANQPA--ANNFHGAAGNSVGNPFASRPFGSAPY 533
PTZ00395 PTZ00395
Sec24-related protein; Provisional
841-962 3.04e-05

Sec24-related protein; Provisional


Pssm-ID: 185594 [Multi-domain]  Cd Length: 1560  Bit Score: 48.53  E-value: 3.04e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  841 GYDDEPSPSPqGYGSTPNPQTP--GYPEAPSPQVNPQY-NPQTPGTPamYNTEQYS--PYA-APSPQGSYQPSPSPQSYH 914
Cdd:PTZ00395   413 GYSNPGNSNP-GYNNAPNSNTPynNPPNSNTPYSNPPNsNPPYSNLP--YSNTPYSnaPLSnAPPSSAKDHHSAYHAAYQ 489
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 2490654199  915 HQVAPSPVGYQNTHSPASYHP--------TPSPMAYQASPSpSPVGYSPMTPGAPS 962
Cdd:PTZ00395   490 HRAANQPAANLPTANQPAANNfhgaagnsVGNPFASRPFGS-APYGGNAATTADPN 544
PRK14959 PRK14959
DNA polymerase III subunits gamma and tau; Provisional
846-983 3.87e-05

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 184923 [Multi-domain]  Cd Length: 624  Bit Score: 47.75  E-value: 3.87e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  846 PSPSPQGYGSTPNPQTPGyPEAPSPQVNPQYNPQTPGTPAmynteqysPYAAPSPQGSYQPSPspqsyhhqvaPSPVGYQ 925
Cdd:PRK14959   387 EGPASGGAATIPTPGTQG-PQGTAPAAGMTPSSAAPATPA--------PSAAPSPRVPWDDAP----------PAPPRSG 447
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*...
gi 2490654199  926 NTHSPASYHPTPSPMAYQASPSPSPVGYSPmTPGAPSPGGynPHTPGsnieqGGGDWV 983
Cdd:PRK14959   448 IPPRPAPRMPEASPVPGAPDSVASASDAPP-TLGDPSDTA--EHTPS-----GPRTWD 497
PHA03291 PHA03291
envelope glycoprotein I; Provisional
880-974 4.31e-05

envelope glycoprotein I; Provisional


Pssm-ID: 223033 [Multi-domain]  Cd Length: 401  Bit Score: 47.26  E-value: 4.31e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  880 TPGTPAMYNTEQYS------PYAAP--SPQGSYQPSPSPQSYHHQVAPSPVGYQNTHSPASYHPTPSPMAYQASPSPSPV 951
Cdd:PHA03291   171 TLAAPPLGEGSADGscdpalPLSAPrlGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTT 250
                           90       100
                   ....*....|....*....|...
gi 2490654199  952 GYSPMTPGAPSPGGynPHTPGSN 974
Cdd:PHA03291   251 PEAEGTPAPPTPGG--GEAPPAN 271
PHA03381 PHA03381
tegument protein VP22; Provisional
796-970 6.15e-05

tegument protein VP22; Provisional


Pssm-ID: 177618 [Multi-domain]  Cd Length: 290  Bit Score: 46.16  E-value: 6.15e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  796 HEGNRTPHYGSQTPLHDGNRTPGQSGAWDPNNPNTPSR---NDEEYDFGYDDEPSPSPQGYGST------PNPQTP---- 862
Cdd:PHA03381    15 DEVEADVYYDFISPDASPARVSFEEPADRARRGAGQARgrsQAERRFHHYDEARADYPYYTGSSsederpADPRPSrrph 94
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  863 GYPEAPSPqvNPQYNPQTPGTP--AMYNTEQYSPYAAPSPQGSYQP----SPSPQSYHHQVAPSPVGYQNTHSPASyhpT 936
Cdd:PHA03381    95 AQPEASGP--GPARGARGPAGSrgRGRRAESPSPRDPPNPKGASAPrgrkSACADSAALLDAPAPAAPKRQKTPAG---L 169
                          170       180       190
                   ....*....|....*....|....*....|....
gi 2490654199  937 PSPMAYQASPSpspvgySPMTPGAPSPGGYNPHT 970
Cdd:PHA03381   170 ARKLHFSTAPT------SPTAPWTPRVAGFNKRT 197
PHA03377 PHA03377
EBNA-3C; Provisional
813-971 6.67e-05

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 46.97  E-value: 6.67e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  813 GNRTPGQSGAWDPNNPNTPSRNDEEYDFGYDDEPS-----------PSPQ----GYGSTPNPQTP--GYPEAPSPQVnPQ 875
Cdd:PHA03377   696 GRAQPSEESHLSSMSPTQPISHEEQPRYEDPDDPLdlslhpdqappPSHQapysGHEEPQAQQAPypGYWEPRPPQA-PY 774
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  876 YNPQTPGTPAMyNTEQYSPYAAPSPqgsyqPSPSPQSYHHQVAPSPV--GYQNTHSP-ASYHPTPSPmAYQASPSPSPVG 952
Cdd:PHA03377   775 LGYQEPQAQGV-QVSSYPGYAGPWG-----LRAQHPRYRHSWAYWSQypGHGHPQGPwAPRPPHLPP-QWDGSAGHGQDQ 847
                          170
                   ....*....|....*....
gi 2490654199  953 YSPMTPGAPSPGgynPHTP 971
Cdd:PHA03377   848 VSQFPHLQSETG---PPRL 863
KOW pfam00467
KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, ...
708-739 8.13e-05

KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG.


Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 40.45  E-value: 8.13e-05
                           10        20        30
                   ....*....|....*....|....*....|..
gi 2490654199  708 IGQTVRISQGPYKGYIGVVKDATESTARVELH 739
Cdd:pfam00467    1 KGDVVRVIAGPFKGKVGKVVEVDDKKNRVLVE 32
PHA03247 PHA03247
large tegument protein UL36; Provisional
828-971 8.89e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.86  E-value: 8.89e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  828 PNTPS-RNDEEYDFGYDDEPSPSPqGYGSTPNPQTPGYPEAPSPQVNPQYNPQTPGTPAMY----NTEQYSPYAAPSPQG 902
Cdd:PHA03247  2475 PGAPVyRRPAEARFPFAAGAAPDP-GGGGPPDPDAPPAPSRLAPAILPDEPVGEPVHPRMLtwirGLEELASDDAGDPPP 2553
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2490654199  903 SYQPSPSPQSYHHQVAPSpvgyqnthSPASYHPTPSPMAYQASPSPSPVGYSPMTPGAP--SPGGYNPHTP 971
Cdd:PHA03247  2554 PLPPAAPPAAPDRSVPPP--------RPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDrgDPRGPAPPSP 2616
KOW smart00739
KOW (Kyrpides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.
477-504 1.02e-04

KOW (Kyrpides, Ouzounis, Woese) motif; Motif in ribosomal proteins, NusG, Spt5p, KIN17 and T54.


Pssm-ID: 128978  Cd Length: 28  Bit Score: 40.39  E-value: 1.02e-04
                            10        20
                    ....*....|....*....|....*...
gi 2490654199   477 YFRMGDHVKVIAGRYEGDTGLIVRVEEN 504
Cdd:smart00739    1 KFEVGDTVRVIAGPFKGKVGKVLEVDGE 28
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
845-980 1.12e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 46.52  E-value: 1.12e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  845 EPSPSPQGYGSTPNPQTPGYPEAPSPQVNPQYnPQTPGTPAMYNTEQYSPYAAPSPQGSYQPSPSPQSyhhQVAPSPVGY 924
Cdd:PRK07764   598 EGPPAPASSGPPEEAARPAAPAAPAAPAAPAP-AGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASD---GGDGWPAKA 673
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2490654199  925 QnTHSPASYHPTPSPMAYQA-SPSPSPVGYSPMTPGAPSPGGYNPHTPGSNIEQGGG 980
Cdd:PRK07764   674 G-GAAPAAPPPAPAPAAPAApAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGAS 729
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
819-963 1.21e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA (OMIM:125370) is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, which is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteristic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 46.30  E-value: 1.21e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  819 QSGAWDPNNPNTPSRNDEEYDFGYDDEPSPSPQGygSTPNPQTPGYPEAPSP-----QVNPQYNPQTpgTPAMYNTEQYS 893
Cdd:pfam03154  177 QSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQG--SPATSQPPNQTQSTAAphtliQQTPTLHPQR--LPSPHPPLQPM 252
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  894 PYAAPSPQGSYQPSPSPQSY-----------------HHQVAPSPVGYQNTHSPASYHPTPSPMAyqASPSPSPVGYSPM 956
Cdd:pfam03154  253 TQPPPPSQVSPQPLPQPSLHgqmppmphslqtgpshmQHPVPPQPFPLTPQSSQSQVPPGPSPAA--PGQSQQRIHTPPS 330

                   ....*..
gi 2490654199  957 TPGAPSP 963
Cdd:pfam03154  331 QSQLQSQ 337
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
798-971 1.57e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.93  E-value: 1.57e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  798 GNRTPHYGSQTPLHDGNRTPGQSGAWDPnnPNTPSRNDEeydfgyDDEPSPSPQGyGSTPNPQTPGyPEAPSPQVNPQYN 877
Cdd:PHA03307   773 ALLEPAEPQRGAGSSPPVRAEAAFRRPG--RLRRSGPAA------DAASRTASKR-KSRSHTPDGG-SESSGPARPPGAA 842
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  878 PQTPG---TPAMYNTEQYSPYAAPSPQGSYQPSPSPQSYHHQVAPSPvgyqnthsPASYHPTPSPMAYQASPSPSPVGYS 954
Cdd:PHA03307   843 ARPPParsSESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPP--------KAAAAAPPAGAPAPRPRPAPRVKLG 914
                          170       180
                   ....*....|....*....|....
gi 2490654199  955 PMTPGAPSP-GGY------NPHTP 971
Cdd:PHA03307   915 PMPPGGPDPrGGFrrvppgDLHTP 938
PHA03378 PHA03378
EBNA-3B; Provisional
801-971 1.68e-04

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 45.83  E-value: 1.68e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  801 TPHYGSQTPLHDGNRTPGQSGAWDPNNPNTPSRNDEEYDFGYDDEPSPSPQGYGSTPNPQTPGYPEAPSPQVNPQ----- 875
Cdd:PHA03378   582 TSQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNVLVFPTPHQPPQveitp 661
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  876 ------------YNPQTPG---------TPAMYNTEQYSPYAAPSPQGSYQPSPSPQSYHHQvAPSPVGYQNTHSPASYH 934
Cdd:PHA03378   662 ykptwtqighipYQPSPTGantmlpiqwAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGR-ARPPAAAPGRARPPAAA 740
                          170       180       190
                   ....*....|....*....|....*....|....*....
gi 2490654199  935 PTPS--PMAYQASPSPSPVGYSPMTPGAPSPGGYNPHTP 971
Cdd:PHA03378   741 PGRArpPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPP 779
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
842-965 1.78e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 45.69  E-value: 1.78e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  842 YDD-----EPSPSPQGYGSTPNPQTPGYPEAP-SPQVNPQYNPQTPGTPAMYNTEQYSPYAAPSPQgsyqPSPSPQSYHH 915
Cdd:PLN03209   444 YEDlkpptSPSPTAPTGVSPSVSSTSSVPAVPdTAPATAATDAAAPPPANMRPLSPYAVYDDLKPP----TSPSPAAPVG 519
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 2490654199  916 QVAPSPVGYQNTHSPASYHPTPSPMAYQASPSPSPVgySPMT-------PGAPSPGG 965
Cdd:PLN03209   520 KVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPRPL--SPYTmyedlkpPTSPTPSP 574
PHA03247 PHA03247
large tegument protein UL36; Provisional
753-981 1.82e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.08  E-value: 1.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  753 TTMGAKRQGGMTSAHSRTPMygsqTPMYGTGSRTPmyGSQTPIHEGNRTPHYGSQTPLHDGNRTPGQSGAWDPNNPNTPS 832
Cdd:PHA03247  2721 LPPGPAAARQASPALPAAPA----PPAVPAGPATP--GGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSES 2794
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  833 RNDEEYDFGYDDEPSPSPQGYGSTPNPQTPGYPEAPSPQVNPQYNPQTPGTPAMYNTeqysPYAAPSPQG-------SYQ 905
Cdd:PHA03247  2795 RESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP----LGGSVAPGGdvrrrppSRS 2870
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  906 PSPSPQSYHH---------QVAPSPVGYQNTHSPASYHPTPSPMAYQASPSPSPVGYSPmTPGAPSPGGYNPHTPGSNIE 976
Cdd:PHA03247  2871 PAAKPAAPARppvrrlarpAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQP-QPPPPPPPRPQPPLAPTTDP 2949

                   ....*
gi 2490654199  977 QGGGD 981
Cdd:PHA03247  2950 AGAGE 2954
DUF3824 pfam12868
Domain of unknown function (DUF3824); This is a repeating domain found in fungal proteins. It ...
832-921 2.16e-04

Domain of unknown function (DUF3824); This is a repeating domain found in fungal proteins. It is proline-rich, and the function is not known.


Pssm-ID: 372351 [Multi-domain]  Cd Length: 145  Bit Score: 42.81  E-value: 2.16e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  832 SRNDEEYDFGYDDEPSPSPQGY-----------------------GSTPNPQTPGYPEAPSPQVNPQYNPQTPGTpamyn 888
Cdd:pfam12868   42 RRYEDDYRDYYEDPYSPSPYPPspagpyasqgqyypetnyfppppGSTPQPPVDPQPNAPPPPYNPADYPPPPGA----- 116
                           90       100       110
                   ....*....|....*....|....*....|...
gi 2490654199  889 teqyspyAAPSPQGSYQPSPSPQSYhhqvAPSP 921
Cdd:pfam12868  117 -------APPPQPYQYPPPPGPDPY----APRP 138
KOW pfam00467
KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, ...
481-509 2.46e-04

KOW motif; This family has been extended to coincide with ref. The KOW (Kyrpides, Ouzounis, Woese) motif is found in a variety of ribosomal proteins and NusG.


Pssm-ID: 425698 [Multi-domain]  Cd Length: 32  Bit Score: 39.29  E-value: 2.46e-04
                           10        20        30
                   ....*....|....*....|....*....|.
gi 2490654199  481 GDHVKVIAGRYEGDTGLIVRVEE--NFVILF 509
Cdd:pfam00467    2 GDVVRVIAGPFKGKVGKVVEVDDkkNRVLVE 32
DUF1373 pfam07117
Protein of unknown function (DUF1373); This family consists of several hypothetical proteins ...
844-939 2.56e-04

Protein of unknown function (DUF1373); This family consists of several hypothetical proteins which seem to be specific to Oryzias latipes (Japanese ricefish). Members of this family are typically around 200 residues in length. The function of this family is unknown.


Pssm-ID: 462093 [Multi-domain]  Cd Length: 212  Bit Score: 43.63  E-value: 2.56e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  844 DEPSPSPQGYGstpnpqTPGYPEAPSPQvnpqynPQTPGTPAMYNTEQYsPYAAPSPQGSYQPSPSPQSYHHQ---VAPS 920
Cdd:pfam07117   46 EEEEGQGGGGG------TFPFPGSPEPE------PGGGGSGPMPMSASA-PEPEPAKAKPQRPAPAQGHGHGGggdSDSS 112
                           90
                   ....*....|....*....
gi 2490654199  921 PVGYQNTHSPASYHPTPSP 939
Cdd:pfam07117  113 GSGSGHQGSGGAGAGAGAP 131
PRK10263 PRK10263
DNA translocase FtsK; Provisional
801-951 2.58e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 45.46  E-value: 2.58e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  801 TPHYGSQTPLHDGNRTPGQSGAWDPNNPNTPSRNDEEYDFGYDDEPSPSPQGYgstpnpQTPGYPEAPSPQVNpqyNPQT 880
Cdd:PRK10263   739 GPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQPVAPQPQY------QQPQQPVAPQPQYQ---QPQQ 809
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2490654199  881 PGTPAMYNTEQYSPYAAPSPQGSYQPSPSPQSYHHQVapSPVGYQNTHSPASYHP-TPSPMAYQASPSPSPV 951
Cdd:PRK10263   810 PVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLL--HPLLMRNGDSRPLHKPtTPLPSLDLLTPPPSEV 879
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
793-981 2.61e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 2.61e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  793 TPIHEGNRTPHYGSQ----TPLHDGNRTPGQSGAW------DPNNPNT---------PSRNDEEYDFGYDDEPSPSPQGY 853
Cdd:PHA03307    26 ATPGDAADDLLSGSQgqlvSDSAELAAVTVVAGAAacdrfePPTGPPPgpgteapanESRSTPTWSLSTLAPASPAREGS 105
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  854 GSTPNPQTPGYPEAPSPQVNPqynPQTPGTPAMYNteqySPYAAPSPQGSYQPSPSPQSYHHQVAPSPVGYqnthSPASY 933
Cdd:PHA03307   106 PTPPGPSSPDPPPPTPPPASP---PPSPAPDLSEM----LRPVGSPGPPPAASPPAAGASPAAVASDAASS----RQAAL 174
                          170       180       190       200
                   ....*....|....*....|....*....|....*....|....*...
gi 2490654199  934 hptPSPMAYQASPSPSPVGYSPMTPGAPSPGGYNPHTPGSNIEQGGGD 981
Cdd:PHA03307   175 ---PLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASASS 219
PRK10263 PRK10263
DNA translocase FtsK; Provisional
856-971 4.29e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 44.69  E-value: 4.29e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  856 TPNPQTPGYPEAPS-PQV--NPQYNPQTPgtpamynteqySPYAAPSPQGsYQPSP---SPQSYHHQVAPSPVGYQNTHS 929
Cdd:PRK10263   341 TQTPPVASVDVPPAqPTVawQPVPGPQTG-----------EPVIAPAPEG-YPQQSqyaQPAVQYNEPLQQPVQPQQPYY 408
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|..
gi 2490654199  930 PASYHPTPSPMAYQASPSpSPVGYSPMTPGAPSPGGYNPHTP 971
Cdd:PRK10263   409 APAAEQPAQQPYYAPAPE-QPAQQPYYAPAPEQPVAGNAWQA 449
PHA03247 PHA03247
large tegument protein UL36; Provisional
846-966 4.48e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.54  E-value: 4.48e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  846 PSPSPqgygsTPNPQTPGYPEA-----PSPQVNPQYNPQTPGTPAMYNTEQYSPYAAPSPQGSYQPSPSPqsyhhqvAPS 920
Cdd:PHA03247  2867 PSRSP-----AAKPAAPARPPVrrlarPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQP-------PPP 2934
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*.
gi 2490654199  921 PVGyqntHSPASYHPTPSPmayQASPSPSPVGYSPMTpGAPSPGGY 966
Cdd:PHA03247  2935 PPP----RPQPPLAPTTDP---AGAGEPSGAVPQPWL-GALVPGRV 2972
PLN02983 PLN02983
biotin carboxyl carrier protein of acetyl-CoA carboxylase
865-948 5.26e-04

biotin carboxyl carrier protein of acetyl-CoA carboxylase


Pssm-ID: 215533 [Multi-domain]  Cd Length: 274  Bit Score: 43.29  E-value: 5.26e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  865 PEAPSPQVNPQYNPQTPGTPAMYNTEQYSPYAAPSPqgsyqPSPSPQSyhhqvaPSPvgyqnTHSPASYHPT-PSPMA-- 941
Cdd:PLN02983   144 PPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASS-----PPPTPAS------PPP-----AKAPKSSHPPlKSPMAgt 207

                   ....*...
gi 2490654199  942 -YQaSPSP 948
Cdd:PLN02983   208 fYR-SPAP 214
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
845-969 6.40e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.82  E-value: 6.40e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  845 EPSPSPQGYGSTPNPQTPGYPEAPSPqvnpqynPQTPGTPAMYNTEQY--SPYAAPSPQGSYQPSPSPQSYHHQVAPSPV 922
Cdd:PRK07764   391 AGAPAAAAPSAAAAAPAAAPAPAAAA-------PAAAAAPAPAAAPQPapAPAPAPAPPSPAGNAPAGGAPSPPPAAAPS 463
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*..
gi 2490654199  923 GYQNTHSPASYHPTPSPmayqaSPSPSPVGYSPMTPGAPSPGGYNPH 969
Cdd:PRK07764   464 AQPAPAPAAAPEPTAAP-----APAPPAAPAPAAAPAAPAAPAAPAG 505
PRK14950 PRK14950
DNA polymerase III subunits gamma and tau; Provisional
856-992 6.93e-04

DNA polymerase III subunits gamma and tau; Provisional


Pssm-ID: 237864 [Multi-domain]  Cd Length: 585  Bit Score: 43.64  E-value: 6.93e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  856 TPNP-QTPGYPEAPSPQ-VNPQYNPQTPGTPAMynteqyspyAAPSPQGSYQPSPSPQSyhhQVAPSPVgyqnthspASY 933
Cdd:PRK14950   361 VPVPaPQPAKPTAAAPSpVRPTPAPSTRPKAAA---------AANIPPKEPVRETATPP---PVPPRPV--------APP 420
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2490654199  934 HPTPSPMAYQASPSPSPVGYSPM-TPGAPSPGGYNP-HTPGSNIEQGGGDWvtTDILVRVK 992
Cdd:PRK14950   421 VPHTPESAPKLTRAAIPVDEKPKyTPPAPPKEEEKAlIADGDVLEQLEAIW--KQILRDVP 479
PRK10263 PRK10263
DNA translocase FtsK; Provisional
881-986 7.06e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 43.92  E-value: 7.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  881 PGTPAMYNTEQYSPYAAPS-PQGSYQPSPSPQSYHHQVAPSPVGY---------QNTHSPASYHPTPSPMAYQASPSPSP 950
Cdd:PRK10263   336 PVEPVTQTPPVASVDVPPAqPTVAWQPVPGPQTGEPVIAPAPEGYpqqsqyaqpAVQYNEPLQQPVQPQQPYYAPAAEQP 415
                           90       100       110
                   ....*....|....*....|....*....|....*.
gi 2490654199  951 VGYSPMTPGAPSPGGYNPHTPGSNIEQGGGDWVTTD 986
Cdd:PRK10263   416 AQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEE 451
GATA-N pfam05349
GATA-type transcription activator, N-terminal; GATA transcription factors mediate cell ...
851-966 7.87e-04

GATA-type transcription activator, N-terminal; GATA transcription factors mediate cell differentiation in a diverse range of tissues. Mutations are often associated with certain congenital human disorders. The six classical vertebrate GATA proteins, GATA-1 to GATA-6, are highly homologous and have two tandem zinc fingers. The classical GATA transcription factors function as transcription activators. In lower metazoans, GATA proteins carry a single canonical zinc finger. This family represents the N-terminal domain of the family of GATA transcription activators.


Pssm-ID: 461628 [Multi-domain]  Cd Length: 174  Bit Score: 41.65  E-value: 7.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  851 QGYGSTPNPQTPGYPEAP-----SPQVNPQYNPqTPGTPAMYNTEQYSPYAAPSPQGS--------YQPSPSPQSYhhqv 917
Cdd:pfam05349    3 QSLALAANHGQAAYDHDSggflhSAASSPVYVP-TTRVPSMLPTLPYLQGCGSSQQSHpvsshsgwAQAGAESSSY---- 77
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....*....
gi 2490654199  918 apspvgyqnthSPASYHPTPspmAYQASPSPspvgysPMTPGAPSPGGY 966
Cdd:pfam05349   78 -----------NPGSPHPSP---RFSYSHSP------PGSNGTSRDAAY 106
KLF1_2_4_N cd21972
N-terminal domain of Kruppel-like factor (KLF) 1, KLF2, KLF4, and similar proteins; Kruppel ...
843-971 8.06e-04

N-terminal domain of Kruppel-like factor (KLF) 1, KLF2, KLF4, and similar proteins; Kruppel/Krueppel-like transcription factors (KLFs) belong to a family of proteins called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the related N-terminal domains of KLF1, KLF2, KLF4, and similar proteins.


Pssm-ID: 409230 [Multi-domain]  Cd Length: 194  Bit Score: 41.89  E-value: 8.06e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  843 DDEPSPSPQGYG--STPNPQTPGYPEAPSPQVNPQYN-PQTPGTPAMYNTEQYSPYAAPSPQGSYQPSP------SPQSY 913
Cdd:cd21972     36 NDNPPPPDPAYPppESPESCSTVYDSDGCHPTPNAYCgPNGPGLPGHFLLAGNSPNLGPKIKTENQEQAcmpvagYSGHY 115
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2490654199  914 HHQVAPSPVG------YQNTHSPASYHPTPSPMAYQASPSPSPVGYSPmtPGAPSPGGYNPHTP 971
Cdd:cd21972    116 GPREPQRVPPappppqYAGHFQYHGHFNMFSPPLRANHPGMSTVMLTP--LSTPPLGFLSPEEA 177
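The KLF1_2_4_N description writes the consensus motif NNRCRCCYY in IUPAC nucleotide codes (N is any nucleotide, R is A/G, Y is C/T). As a rough illustration of how such a degenerate motif can be scanned against a DNA string, here is a minimal Python sketch. The helper names and the test sequence are made up for the example; the sketch only scans the given strand and makes no claim about which genomic sequences KLFs actually bind.

```python
import re

# Subset of IUPAC nucleotide codes; only N, R and Y occur in NNRCRCCYY.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]",    # purine
    "Y": "[CT]",    # pyrimidine
    "N": "[ACGT]",  # any nucleotide
}

def motif_to_regex(motif):
    """Translate an IUPAC motif into a regular expression string."""
    return "".join(IUPAC[base] for base in motif.upper())

def find_motif(motif, sequence):
    """Return (0-based position, matched text) for every occurrence,
    including overlapping ones, on the given strand only."""
    # A lookahead lets the scan report overlapping matches.
    pattern = re.compile(f"(?=({motif_to_regex(motif)}))")
    return [(m.start(), m.group(1)) for m in pattern.finditer(sequence.upper())]

# Hypothetical sequence constructed so that one window fits the motif.
hits = find_motif("NNRCRCCYY", "TTAAGCGCCTTGG")
print(hits)  # [(2, 'AAGCGCCTT')]
```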
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
849-982 9.02e-04


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 43.33  E-value: 9.02e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  849 SPQGYGSTPNPqTPGYPEAPSPQVNPQY-NPQTPGTPAMYNTEQYSPYAAPSPQGSYQPS--------PSPQSYHHQVAP 919
Cdd:PRK12323   440 SARGPGGAPAP-APAPAAAPAAAARPAAaGPRPVAAAAAAAPARAAPAAAPAPADDDPPPweelppefASPAPAQPDAAP 518
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2490654199  920 SPVGYQNTHSPASYHPTPSPMAYQASPSPSPVgysPMTPGAPSPggYNPHTPGSNIEQG-----GGDW 982
Cdd:PRK12323   519 AGWVAESIPDPATADPDDAFETLAPAPAAAPA---PRAAAATEP--VVAPRPPRASASGlpdmfDGDW 581
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
816-974 1.32e-03


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 42.34  E-value: 1.32e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  816 TPGQSGAWDPN--NPNTPS-RNDEEYDFGYDDEPSPSPQGYGSTPNPQTPG--YPEAPSPQVNPQYNPQTPGTpamynte 890
Cdd:pfam05539  173 TTSKTTSWPTEvsHPTYPSqVTPQSQPATQGHQTATANQRLSSTEPVGTQGttTSSNPEPQTEPPPSQRGPSG------- 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  891 qySPYAAPSPQGSYQPSPSPQSYHHQVAPSPVGYQNTHSPASyHPTPSPM-----AYQASPSPSPVGYSPMTPGAPSPGG 965
Cdd:pfam05539  246 --SPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHS-TATPPPTtkrqeTGRPTPRPTATTQSGSSPPHSSPPG 322

                   ....*....
gi 2490654199  966 YNPHTPGSN 974
Cdd:pfam05539  323 VQANPTTQN 331
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
763-973 1.52e-03


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 42.85  E-value: 1.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  763 MTSAHSRTPmygsqtpmyGTGSRTPmyGSQTPIHEGNRTPHyGSQTPLHDGNRTPGQSGAWDPNNPNTPSRND------E 836
Cdd:PHA03307   178 SPEETARAP---------SSPPAEP--PPSTPPAAASPRPP-RRSSPISASASSPAPAPGRSAADDAGASSSDssssesS 245
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  837 EYDFGYDDE---PSPSPQ------GYGSTPNPQ----TPGYPEAPSPQVNPQYNPQTPGTPAMYNTEQYSPYAAPSPQGS 903
Cdd:PHA03307   246 GCGWGPENEcplPRPAPItlptriWEASGWNGPssrpGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESS 325
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  904 yQPSPSPQSyhhqVAPSPVGyqnTHSPASYHPTPSPmayqASPSPSPVGYSPMTPGAPSPGGYNPHTPGS 973
Cdd:PHA03307   326 -SSSTSSSS----ESSRGAA---VSPGPSPSRSPSP----SRPPPPADPSSPRKRPRPSRAPSSPAASAG 383
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
817-973 1.54e-03


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.67  E-value: 1.54e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  817 PGQSGAWDPNNPNTPSRNDEEydfgydDEPSPSPQgYGSTPNPQTPGYPEAPSPQVNPQYNPQTPGTPAMYNTEQYSPYA 896
Cdd:PRK07764   592 PGAAGGEGPPAPASSGPPEEA------ARPAAPAA-PAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASD 664
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  897 APSPQGSY-------QPSPSPQSYHHQVAPSPVGYQNTHSPASYHPT----------PSPMAYQASPSPSPVGYSPMTPG 959
Cdd:PRK07764   665 GGDGWPAKaggaapaAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAgqaddpaaqpPQAAQGASAPSPAADDPVPLPPE 744
                          170
                   ....*....|....
gi 2490654199  960 APSPGGYNPHTPGS 973
Cdd:PRK07764   745 PDDPPDPAGAPAQP 758
PRK14951 PRK14951
DNA polymerase III subunits gamma and tau; Provisional
859-972 1.70e-03


Pssm-ID: 237865 [Multi-domain]  Cd Length: 618  Bit Score: 42.39  E-value: 1.70e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  859 PQTPGYPEAPSPQVNPQYNPQTPGTPAMYNTEQYSPYAAPSPqgsyQPSPSPQSYHHQVAPSPVGyqnTHSPASYHPTPS 938
Cdd:PRK14951   378 KKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPA----AAPPAPVAAPAAAAPAAAP---AAAPAAVALAPA 450
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 2490654199  939 P----------MAYQASPSPSPVGYSPMTPGAPSPGGYNPHTPG 972
Cdd:PRK14951   451 PpaqaapetvaIPVRVAPEPAVASAAPAPAAAPAAARLTPTEEG 494
PHA03369 PHA03369
capsid maturational protease; Provisional
843-925 2.17e-03


Pssm-ID: 223061 [Multi-domain]  Cd Length: 663  Bit Score: 41.91  E-value: 2.17e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  843 DDEPSPSPQGYGSTPNPQTPGYPEAPSPQVnpqynPQTPGTPAMYNTEQYSPYAAPSPQGSYQPSPS-----PQSYHHQV 917
Cdd:PHA03369   380 DRQRPQRPDGIPYSVPARSPMTAYPPVPQF-----CGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPTnpyvmPISMANMV 454
                           90
                   ....*....|.
gi 2490654199  918 A---PSPVGYQ 925
Cdd:PHA03369   455 YpghPQEHGHE 465
KREPA2 cd23959
Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of ...
844-963 2.84e-03

Kinetoplastid RNA Editing Protein A2 (KREPA2); The KREPA2 (TbMP63) protein is a component of the parasitic protozoan's KREPA RNA editing catalytic complex (RECC). Kinetoplastid RNA editing (KRE) proteins occur as pairs or sets of related proteins in multiple complexes. The KREPA complex is composed of six components (KREPA1-6), which share a conserved C-terminal region containing an oligonucleotide-binding (OB)-fold-like domain. KREPAs are responsible for the site-specific insertion and deletion of U nucleotides in the kinetoplastid mitochondrial pre-messenger RNA. Apart from the conserved C-terminal OB-fold domain, KREPA1, KREPA2, and KREPA3 contain two conserved C2H2 zinc-finger domains. KREPA2 and kinetoplastid RNA editing ligase 1 (KREL1) are specific for ligation post-U-deletion and are paralogous to KREL2 and KREPA1, which are specific for ligation post-U-insertion. KREPA2 is critical for RECC stability and KREL1 integration into the complex.


Pssm-ID: 467780 [Multi-domain]  Cd Length: 424  Bit Score: 41.39  E-value: 2.84e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  844 DEPSPSPQGYGSTPNPQTPGYPEAPSPQvnpqynpqTPGTPAMYNTEQ-YSPYAAPSPQGSYQPSPSPQSYHHQVAPSPV 922
Cdd:cd23959    136 APPKAEPQTAPVTPFGQLPMFGQHPPPA--------KPLPAAAAAQQSsASPGEVASPFASGTVSASPFATATDTAPSSG 207
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|.
gi 2490654199  923 GYQNTHSPASyhpTPSPMAyqASPSPSPVGYSPMTPGAPSP 963
Cdd:cd23959    208 APDGFPAEAS---APSPFA--APASAASFPAAPVANGEAAT 243
KLF1_N cd21581
N-terminal domain of Kruppel-like Factor 1; Kruppel-like Factor 1 (KLF1, also known as ...
846-969 4.28e-03

N-terminal domain of Kruppel-like Factor 1; Kruppel-like Factor 1 (KLF1, also known as Krueppel-like factor 1 or Erythroid Kruppel-like Factor/EKLF) was the first Kruppel-like factor discovered. It was found to be vitally important for embryonic erythropoiesis in promoting the switch from fetal hemoglobin (Hemoglobin F) to adult hemoglobin (Hemoglobin A) gene expression by binding to highly conserved CACCC domains. EKLF ablation in mouse embryos produces a lethal anemic phenotype, causing death by embryonic day 14, and natural mutations lead to beta+ thalassemia in humans. However, expression of embryonic hemoglobin and fetal hemoglobin genes is normal in EKLF-deficient mice, suggesting other factors may be involved. KLF1 functions as a transcriptional activator. It belongs to a family of proteins, called the Specificity Protein (SP)/KLF family, characterized by a C-terminal DNA-binding domain of 81 amino acids consisting of three Kruppel-like C2H2 zinc fingers. These factors bind to a loose consensus motif, namely NNRCRCCYY (where N is any nucleotide; R is A/G, and Y is C/T), such as the recurring motifs in GC and GT boxes (5'-GGGGCGGGG-3' and 5'-GGTGTGGGG-3') that are present in promoters and more distal regulatory elements of mammalian genes. Members of the KLF family can act as activators or repressors of transcription depending on cell and promoter context. KLFs regulate various cellular functions, such as proliferation, differentiation, and apoptosis, as well as the development and homeostasis of several types of tissue. In addition to the C-terminal DNA-binding domain, each KLF also has a unique N-terminal activation/repression domain that confers specificity and allows it to bind specifically to a certain partner, leading to distinct activities in vivo. This model represents the N-terminal domain of KLF1, which is related to the N-terminal domains of KLF2 and KLF4.


Pssm-ID: 409227 [Multi-domain]  Cd Length: 278  Bit Score: 40.41  E-value: 4.28e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  846 PSPSPQGygsTPNPQTPGYPEAPSPQVNPQYNPQTPGTPAMYN-------------------TEQYSPYAAPSPQGSYQP 906
Cdd:cd21581     77 PSLNPSL---DNNTQALPQEEQPGAYYEPPKKDQPGTEGLQVGgpglmaellspeestgwapPEPHHGYPDAFVGPALFP 153
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2490654199  907 SPSPQSYH-HQVAPSPVGYQNTHSPAS------YHPT-PSPMAYQASPSPSPVGYSPMTPgAPSPGGYNPH 969
Cdd:cd21581    154 APANVDQFgFPQGGSVDRRGNLSKSGSwdfgsyYPQQhPSVVAFPDSRFGPLSGPQALTP-DPQHYGYFQL 223
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
783-963 5.00e-03


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.12  E-value: 5.00e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  783 GSRTPMYGSQTPIHEGNR---TPHYGSQTPLHDGNRTPGQSGAWDPNNPNTPSRNDEEydfGYDDEPSPSPQGYGSTPNP 859
Cdd:PRK07764   597 GEGPPAPASSGPPEEAARpaaPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHP---KHVAVPDASDGGDGWPAKA 673
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  860 QTPGyPEAPSPQvnPQYNPQTPGTPAMynteqySPYAAPSP-------QGSYQPSPSPQSYHHQVAPSPVGYQNTHSPAS 932
Cdd:PRK07764   674 GGAA-PAAPPPA--PAPAAPAAPAGAA------PAQPAPAPaatppagQADDPAAQPPQAAQGASAPSPAADDPVPLPPE 744
                          170       180       190
                   ....*....|....*....|....*....|.
gi 2490654199  933 YHPTPSPMAYQASPSPSPVGYSPMTPGAPSP 963
Cdd:PRK07764   745 PDDPPDPAGAPAQPPPPPAPAPAAAPAAAPP 775
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
845-964 5.49e-03


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 40.74  E-value: 5.49e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2490654199  845 EPSPSPQGYGsTPNPQTPGYPEAPSPQVNPQYNPQTPGTPAMYNTEQySPYAAPSPQGSYQPSPSPQSyhhqvAPSPVGy 924
Cdd:PRK07764   407 AAAPAPAAAA-PAAAAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGG-APSPPPAAAPSAQPAPAPAA-----APEPTA- 478
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 2490654199  925 qnthspasyHPTPSPMAYQASPSPSPVgysPMTPGAPSPG 964
Cdd:PRK07764   479 ---------APAPAPPAAPAPAAAPAA---PAAPAAPAGA 506
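Many of the hits listed here, including several separate hits to PRK07764, cover overlapping stretches of the query. One way to summarize the footprint of such a group is to merge the query intervals. The sketch below is an illustration only: the interval list is copied by hand from the Interval column of the last fifteen hits, coordinates are treated as inclusive, and `merge_intervals` is a hypothetical helper name.

```python
def merge_intervals(intervals):
    """Merge overlapping inclusive (start, end) intervals."""
    merged = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1]:
            # Overlaps the previous interval: extend it if needed.
            prev_start, prev_end = merged[-1]
            merged[-1] = (prev_start, max(prev_end, end))
        else:
            merged.append((start, end))
    return merged

# Query intervals of the last fifteen hits in the list, in page order.
hit_intervals = [
    (845, 969), (856, 992), (881, 986), (851, 966), (843, 971),
    (849, 982), (816, 974), (763, 973), (817, 973), (859, 972),
    (843, 925), (844, 963), (846, 969), (783, 963), (845, 964),
]

print(merge_intervals(hit_intervals))  # [(763, 992)]
```

All fifteen intervals collapse into a single region, residues 763-992 of the query, so these hits describe the same stretch of sequence rather than independent domains.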
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
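
The E-value threshold of 0.01 explains why every hit in the list reports an E-value below that cutoff. As a rough sketch of how the per-hit summary lines (`Pssm-ID: ... E-value: ...`) could be parsed and checked against a cutoff, assuming the line layout seen on this page: the function names are illustrative, and treating the cutoff as inclusive is an assumption rather than documented CD-Search behavior.

```python
import re

# Layout assumed from the summary lines on this page.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*\[(?P<kind>[^\]]*)\]\s+"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>\S+)"
)

def parse_summary(line):
    """Extract the numeric fields from one hit summary line."""
    match = SUMMARY.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "kind": match["kind"],
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }

def passes_threshold(hit, threshold=0.01):
    """Inclusive comparison; the exact rule CD-Search applies is assumed."""
    return hit["e_value"] <= threshold

# Strongest and weakest PRK07764 hits in this part of the list.
best = parse_summary(
    "Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.82  E-value: 6.40e-04"
)
weakest = parse_summary(
    "Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 40.74  E-value: 5.49e-03"
)
print(passes_threshold(best), passes_threshold(weakest))  # True True
```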
