NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1720408520|ref|XP_030109352|]
View 

eukaryotic translation initiation factor 4 gamma 3 isoform X25 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
749-977 2.06e-63

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


:

Pssm-ID: 397130  Cd Length: 203  Bit Score: 214.53  E-value: 2.06e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  749 FRKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 828
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  829 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 908
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720408520  909 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 977
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1417-1549 1.90e-48

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


:

Pssm-ID: 211397  Cd Length: 134  Bit Score: 169.00  E-value: 1.90e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520 1417 EELSQRLEKLIMEEKADDErIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADcSTFRVDTAVIKQRVPILLKYLDSD 1496
Cdd:cd11559      4 LRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEK-SLPEKEKALLEKYAPLLQKYLDDD 81
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720408520 1497 TEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1549
Cdd:cd11559     82 EQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1215-1327 9.24e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


:

Pssm-ID: 397128  Cd Length: 113  Bit Score: 128.93  E-value: 9.24e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520 1215 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1294
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1720408520 1295 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1327
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
PRK07003 super family cl35530
DNA polymerase III subunit gamma/tau;
161-324 4.66e-06

DNA polymerase III subunit gamma/tau;


The actual alignment was detected with superfamily member PRK07003:

Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 51.39  E-value: 4.66e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  161 GGGSRNPTPPIGRPASTPTPPQLP--SQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVL 238
Cdd:PRK07003   368 PGGGVPARVAGAVPAPGARAAAAVgaSAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDA 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  239 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTS-----LPPLARSSlPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVP- 312
Cdd:PRK07003   448 PVPAKANARASADSRCDERDAQPPADSGSASapasdAPPDAAFE-PAPRAAAPSAATPAAVPDARAPAAASREDAPAAAa 526
                          170
                   ....*....|..
gi 1720408520  313 SPTSCTAASGPS 324
Cdd:PRK07003   527 PPAPEARPPTPA 538
W2 super family cl17013
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1528-1574 1.15e-04

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


The actual alignment was detected with superfamily member cd11560:

Pssm-ID: 473053 [Multi-domain]  Cd Length: 194  Bit Score: 44.90  E-value: 1.15e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1720408520 1528 LYDEEVISEDAFYKWesSKDPAEQAGKGVALKSVTAFFTWLREAEEE 1574
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
13-350 7.87e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 7.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520   13 QRPQIQPPRaaIPNSSPSIRPGVQTPTAVYQANQhimmvnhlPMPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVqppgp 92
Cdd:pfam03154  233 QTPTLHPQR--LPSPHPPLQPMTQPPPPSQVSPQ--------PLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQP----- 297
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520   93 gpfypGPGPGDFANAYGTPFYPSQPVYQSA--PIIVPTQQQPPPAKREKKTIRIRDPnqggkditeeimsgggsrNPTPP 170
Cdd:pfam03154  298 -----FPLTPQSSQSQVPPGPSPAAPGQSQqrIHTPPSQSQLQSQQPPREQPLPPAP------------------LSMPH 354
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  171 IGRPASTPTPPQLPSQVPEHSPvvygtvesaHLAASTPVTAASDQkqeekpKPDPVFQSPSTVLRLVLSGEKKEQAGQMP 250
Cdd:pfam03154  355 IKPPPTTPIPQLPNPQSHKHPP---------HLSGPSPFQMNSNL------PPPPALKPLSSLSTHHPPSAHPPPLQLMP 419
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  251 ETAAGEPTP-EPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKceLPSSKEEDAPPVPSPTScTAASGPSLtdns 329
Cdd:pfam03154  420 QSQQLPPPPaQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPF--VPGGPPPITPPSGPPTS-TSSAMPGI---- 492
                          330       340
                   ....*....|....*....|.
gi 1720408520  330 dicKKPCSVAPHDSQLISSTI 350
Cdd:pfam03154  493 ---QPPSSASVSSSGPVPAAV 510
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
749-977 2.06e-63

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 214.53  E-value: 2.06e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  749 FRKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 828
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  829 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 908
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720408520  909 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 977
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
750-974 9.59e-51

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press)


Pssm-ID: 214713  Cd Length: 200  Bit Score: 178.32  E-value: 9.59e-51
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520   750 RKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVtLKVPmadkpgntvNFRK 829
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520   830 LLLNRCQKEFEKDkadddvfekkqkeleaasapeertrlhdeLEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCVV 909
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720408520   910 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 974
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1417-1549 1.90e-48

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 169.00  E-value: 1.90e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520 1417 EELSQRLEKLIMEEKADDErIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADcSTFRVDTAVIKQRVPILLKYLDSD 1496
Cdd:cd11559      4 LRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEK-SLPEKEKALLEKYAPLLQKYLDDD 81
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720408520 1497 TEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1549
Cdd:cd11559     82 EQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1215-1327 9.24e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 128.93  E-value: 9.24e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520 1215 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1294
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1720408520 1295 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1327
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1215-1327 3.28e-32

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press


Pssm-ID: 214714  Cd Length: 113  Bit Score: 121.58  E-value: 3.28e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  1215 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1294
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 1720408520  1295 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1327
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1487-1571 2.69e-28

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 109.30  E-value: 2.69e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  1487 PILLKYLDSDTEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEqaGKGVALKSVTAFFT 1566
Cdd:smart00515    1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78

                    ....*
gi 1720408520  1567 WLREA 1571
Cdd:smart00515   79 WLQEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1500-1576 1.03e-23

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 96.06  E-value: 1.03e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720408520 1500 ELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEQaGKGVALKSVTAFFTWLREAEEESE 1576
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAEK-GMKKVRKQAKPFVEWLEEAEEESD 76
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
161-324 4.66e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 51.39  E-value: 4.66e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  161 GGGSRNPTPPIGRPASTPTPPQLP--SQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVL 238
Cdd:PRK07003   368 PGGGVPARVAGAVPAPGARAAAAVgaSAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDA 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  239 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTS-----LPPLARSSlPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVP- 312
Cdd:PRK07003   448 PVPAKANARASADSRCDERDAQPPADSGSASapasdAPPDAAFE-PAPRAAAPSAATPAAVPDARAPAAASREDAPAAAa 526
                          170
                   ....*....|..
gi 1720408520  313 SPTSCTAASGPS 324
Cdd:PRK07003   527 PPAPEARPPTPA 538
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1528-1574 1.15e-04

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 44.90  E-value: 1.15e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1720408520 1528 LYDEEVISEDAFYKWesSKDPAEQAGKGVALKSVTAFFTWLREAEEE 1574
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
13-350 7.87e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 7.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520   13 QRPQIQPPRaaIPNSSPSIRPGVQTPTAVYQANQhimmvnhlPMPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVqppgp 92
Cdd:pfam03154  233 QTPTLHPQR--LPSPHPPLQPMTQPPPPSQVSPQ--------PLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQP----- 297
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520   93 gpfypGPGPGDFANAYGTPFYPSQPVYQSA--PIIVPTQQQPPPAKREKKTIRIRDPnqggkditeeimsgggsrNPTPP 170
Cdd:pfam03154  298 -----FPLTPQSSQSQVPPGPSPAAPGQSQqrIHTPPSQSQLQSQQPPREQPLPPAP------------------LSMPH 354
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  171 IGRPASTPTPPQLPSQVPEHSPvvygtvesaHLAASTPVTAASDQkqeekpKPDPVFQSPSTVLRLVLSGEKKEQAGQMP 250
Cdd:pfam03154  355 IKPPPTTPIPQLPNPQSHKHPP---------HLSGPSPFQMNSNL------PPPPALKPLSSLSTHHPPSAHPPPLQLMP 419
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  251 ETAAGEPTP-EPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKceLPSSKEEDAPPVPSPTScTAASGPSLtdns 329
Cdd:pfam03154  420 QSQQLPPPPaQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPF--VPGGPPPITPPSGPPTS-TSSAMPGI---- 492
                          330       340
                   ....*....|....*....|.
gi 1720408520  330 dicKKPCSVAPHDSQLISSTI 350
Cdd:pfam03154  493 ---QPPSSASVSSSGPVPAAV 510
PHA03247 PHA03247
large tegument protein UL36; Provisional
5-340 1.86e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 1.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520    5 PQARSPFFQRpQIQPPRAAIPNSSPSIRPGVQTPTAVYQANQhimmvnhlpmpyPVTQGHQYCIPqyRHSGPPYVGPPQQ 84
Cdd:PHA03247  2557 PAAPPAAPDR-SVPPPRPAPRPSEPAVTSRARRPDAPPQSAR------------PRAPVDDRGDP--RGPAPPSPLPPDT 2621
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520   85 YPVQPPGPGPFYPGPGPGDFANAYGTPfyPSQPVYQSAPiivPTQQQPPPAKREKKTIRIRDPNQGgkditeeimsgggs 164
Cdd:PHA03247  2622 HAPDPPPPSPSPAANEPDPHPPPTVPP--PERPRDDPAP---GRVSRPRRARRLGRAAQASSPPQR-------------- 2682
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  165 rnPTPPIGRPASTPT------PPQLPSQVPEHSPVVYGT----VESAHLAASTPVTAAsdqkqeekPKPDPVFQSPSTVL 234
Cdd:PHA03247  2683 --PRRRAARPTVGSLtsladpPPPPPTPEPAPHALVSATplppGPAAARQASPALPAA--------PAPPAVPAGPATPG 2752
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  235 RLVLSGEKKEQAGQMPETAAGEPTPEPPRTSSPTSLPPL--ARSSLPSPMSAALSSQPLfTAEDKCELPSSKEedAPPVP 312
Cdd:PHA03247  2753 GPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLseSRESLPSPWDPADPPAAV-LAPAAALPPAASP--AGPLP 2829
                          330       340
                   ....*....|....*....|....*....
gi 1720408520  313 SPTSCTAASGPSLTDNSDICKKPC-SVAP 340
Cdd:PHA03247  2830 PPTSAQPTAPPPPPGPPPPSLPLGgSVAP 2858
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
173-340 2.61e-03

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 42.08  E-value: 2.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  173 RPASTPTPPQLPSQVPEHSPVvYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPET 252
Cdd:pfam13254  165 KPKAQPSQPAQPAWMKELNKI-RQSRASVDLGRPNSFKEVTPVGLMRSPAPGGHSKSPSVSGISADSSPTKEEPSEEADT 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  253 AAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSK-----EEDAPPVPSPTSCTAASGPSLTD 327
Cdd:pfam13254  244 LSTDKEQSPAPTSASEPPPKTKELPKDSEEPAAPSKSAEASTEKKEPDTESSpetssEKSAPSLLSPVSKASIDKPLSSP 323
                          170
                   ....*....|....
gi 1720408520  328 NSD-ICKKPCSVAP 340
Cdd:pfam13254  324 DRDpLSPKPKPQSP 337
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
749-977 2.06e-63

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 214.53  E-value: 2.06e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  749 FRKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 828
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  829 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 908
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1720408520  909 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 977
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
750-974 9.59e-51

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press)


Pssm-ID: 214713  Cd Length: 200  Bit Score: 178.32  E-value: 9.59e-51
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520   750 RKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVtLKVPmadkpgntvNFRK 829
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520   830 LLLNRCQKEFEKDkadddvfekkqkeleaasapeertrlhdeLEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCVV 909
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720408520   910 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 974
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1417-1549 1.90e-48

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 169.00  E-value: 1.90e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520 1417 EELSQRLEKLIMEEKADDErIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADcSTFRVDTAVIKQRVPILLKYLDSD 1496
Cdd:cd11559      4 LRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEK-SLPEKEKALLEKYAPLLQKYLDDD 81
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1720408520 1497 TEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1549
Cdd:cd11559     82 EQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1215-1327 9.24e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 128.93  E-value: 9.24e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520 1215 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1294
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1720408520 1295 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1327
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1215-1327 3.28e-32

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press


Pssm-ID: 214714  Cd Length: 113  Bit Score: 121.58  E-value: 3.28e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  1215 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1294
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 1720408520  1295 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1327
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1487-1571 2.69e-28

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 109.30  E-value: 2.69e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  1487 PILLKYLDSDTEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEqaGKGVALKSVTAFFT 1566
Cdd:smart00515    1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78

                    ....*
gi 1720408520  1567 WLREA 1571
Cdd:smart00515   79 WLQEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1500-1576 1.03e-23

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 96.06  E-value: 1.03e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720408520 1500 ELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEQaGKGVALKSVTAFFTWLREAEEESE 1576
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAEK-GMKKVRKQAKPFVEWLEEAEEESD 76
W2 cd11473
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1417-1543 1.92e-18

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211395  Cd Length: 135  Bit Score: 83.29  E-value: 1.92e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520 1417 EELSQRLEKLIMEEKADDERIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADCSTF---RVDTAVIKQRVPILLKYL 1493
Cdd:cd11473      4 KKLRDSLLKELEEDKSSDVESVKAAKSKLDLDPISLEEVVKVLLTAVVNAVESADSISLtqkEQLVLVLKKYGPVLRELL 83
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1720408520 1494 DSDTEKELQALYALQA--SIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWE 1543
Cdd:cd11473     84 KLIKKDQLYLLLKIEKlcLQLKLSELISLLEKILDLLYDADVLSEEAILSWF 135
W2_eIF2B_epsilon cd11558
C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a ...
1457-1576 6.73e-15

C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a heteropentameric complex which functions as a guanine nucleotide exchange factor in the recycling of eIF-2 during the initiation of translation in eukaryotes. The epsilon and gamma subunits are sequence similar and both are essential in yeast. Epsilon appears to be the catalytically active subunit, with gamma enhancing its activity. The C-terminal domain of the eIF2B epsilon subunit contains bipartite motifs rich in acidic and aromatic residues, which are responsible for the interaction with eIF2. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211396  Cd Length: 169  Bit Score: 74.22  E-value: 6.73e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520 1457 RALMTAVCK-AAIIADCSTFRVDTA---VIKQRVPILLKYLDSDTEkELQALYALQASIVKLDQPANLLRMFFDCLYDEE 1532
Cdd:cd11558     47 RAVVKALLElILEVSSTSTAELLEAlkkLLSKWGPLLENYVKSQDD-QVELLLALEEFCLESEEGGPLFAKLLHALYDLD 125
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1720408520 1533 VISEDAFYKWESSKDPAEQAGKGVALKSVTAFFTWLREAEEESE 1576
Cdd:cd11558    126 ILEEEAILEWWEEPDAGADEEMKKVRELVKKFIEWLEEAEEESD 169
W2_eIF5 cd11561
C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase ...
1429-1576 1.77e-08

C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase acceleration protein (GAP), as well as a GDP dissociation inhibitor (GDI) during translational initiation in eukaryotes. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211399  Cd Length: 157  Bit Score: 55.31  E-value: 1.77e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520 1429 EEKADD--ERIFDWVEANLDESQMSSptfLRALMTAV-------CKAAIIADCSTFRVDTA-VIKQRVPILLKYLDSDte 1498
Cdd:cd11561      1 EEEEDErvDELGEFLKKNKDESGLSE---LKEILKEAerldvvkDKAVLVLAEVLFDENIVkEIKKRKALLLKLVTDE-- 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520 1499 kelQALYALQASIVKL--DQPANLLRMF---FDCLYDEEVISEDAFYKW--ESSKDPAEQAGKGVALKSVTAFFTWLREA 1571
Cdd:cd11561     76 ---KAQKALLGGIERFcgKHSPELLKKVpliLKALYDNDILEEEVILKWyeKVSKKYVSKEKSKKVRKAAEPFVEWLEEA 152

                   ....*
gi 1720408520 1572 EEESE 1576
Cdd:cd11561    153 EEEEE 157
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
161-324 4.66e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 51.39  E-value: 4.66e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  161 GGGSRNPTPPIGRPASTPTPPQLP--SQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVL 238
Cdd:PRK07003   368 PGGGVPARVAGAVPAPGARAAAAVgaSAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDA 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  239 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTS-----LPPLARSSlPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVP- 312
Cdd:PRK07003   448 PVPAKANARASADSRCDERDAQPPADSGSASapasdAPPDAAFE-PAPRAAAPSAATPAAVPDARAPAAASREDAPAAAa 526
                          170
                   ....*....|..
gi 1720408520  313 SPTSCTAASGPS 324
Cdd:PRK07003   527 PPAPEARPPTPA 538
PHA03378 PHA03378
EBNA-3B; Provisional
35-320 4.43e-05

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 48.52  E-value: 4.43e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520   35 VQTPTAVYQANQHIMMVNHLPMPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFANAYG-TPFY 113
Cdd:PHA03378   574 IQPLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITFNVLVFpTPHQ 653
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  114 PSQ---PVYQSAPIIVP-TQQQPPPA------KREKKTIRIRDPNQGGKDITEEIMSGGGSRNPTPPIGR---PASTPTP 180
Cdd:PHA03378   654 PPQveiTPYKPTWTQIGhIPYQPSPTgantmlPIQWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRarpPAAAPGR 733
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  181 PQLPSQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTvlrlvlsgekkeqAGQMPEtaaGEPTPE 260
Cdd:PHA03378   734 ARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPA-------------PQQRPR---GAPTPQ 797
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720408520  261 PPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSK-----EEDAP--PVPSPTSCTAA 320
Cdd:PHA03378   798 PPPQAGPTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKkpaalERQAAagPTPSPGSGTSD 864
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1528-1574 1.15e-04

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 44.90  E-value: 1.15e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1720408520 1528 LYDEEVISEDAFYKWesSKDPAEQAGKGVALKSVTAFFTWLREAEEE 1574
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
PRK10263 PRK10263
DNA translocase FtsK; Provisional
10-314 1.30e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 47.00  E-value: 1.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520   10 PFFQRPQIqpPRAAIPNSSPSIR----PGVQTPTAVYQanqhimmvnhlPMPYPVTQGHQYCIPQYRHSGP---PYVGPP 82
Cdd:PRK10263   339 PVTQTPPV--ASVDVPPAQPTVAwqpvPGPQTGEPVIA-----------PAPEGYPQQSQYAQPAVQYNEPlqqPVQPQQ 405
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520   83 QQYPVQPPGPgpfypgpgpgdfanaygtpfyPSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQGGKDITEEimsgg 162
Cdd:PRK10263   406 PYYAPAAEQP---------------------AQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQ----- 459
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  163 GSRNPTPPIGRPASTPTPPQLPSQVPEHSPVVYGTVESAHLAASTP------VTAASDQKQEE-----KPKPDPVfqsps 231
Cdd:PRK10263   460 STYQTEQTYQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPlyyfeeVEEKRAREREQlaawyQPIPEPV----- 534
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  232 tvlrlvlsgekKEQAGQMPETAAGEPTPEPPRTSSPTSLPPLA--RSSLPSPMSAALSSQPLFT-AEDKCELPSSKEEDA 308
Cdd:PRK10263   535 -----------KEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASgvKKATLATGAAATVAAPVFSlANSGGPRPQVKEGIG 603

                   ....*.
gi 1720408520  309 PPVPSP 314
Cdd:PRK10263   604 PQLPRP 609
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
172-327 1.59e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 46.70  E-value: 1.59e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  172 GRPASTPTPPQLPSQVPEHSPVVYGTVESAHLAASTPV---TAASDQKQEEKPKPDPVFQSPS------TVLRLVLSGEK 242
Cdd:PHA03307    19 EFFPRPPATPGDAADDLLSGSQGQLVSDSAELAAVTVVagaAACDRFEPPTGPPPGPGTEAPAnesrstPTWSLSTLAPA 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  243 KEQAGQMPETAAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKC-ELPSSKEEDAPPVPSPTSCTAAS 321
Cdd:PHA03307    99 SPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAaGASPAAVASDAASSRQAALPLSS 178

                   ....*.
gi 1720408520  322 GPSLTD 327
Cdd:PHA03307   179 PEETAR 184
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
161-330 4.09e-04

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 44.87  E-value: 4.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  161 GGGSRNPTPPIGRPASTPTPPQLPSQV--PEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRlvl 238
Cdd:PRK12323   368 SGGGAGPATAAAAPVAQPAPAAAAPAAaaPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP--- 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  239 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVPSPTSCT 318
Cdd:PRK12323   445 GGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAE 524
                          170
                   ....*....|..
gi 1720408520  319 AASGPSLTDNSD 330
Cdd:PRK12323   525 SIPDPATADPDD 536
PHA03247 PHA03247
large tegument protein UL36; Provisional
110-330 5.09e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.93  E-value: 5.09e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  110 TPFYPSQPVYQSAPIIVPTQQQPPPAKREKKTIR-----IRDPNQGGKDITEEIMSGGGSRNPTPPIGRP-ASTPTPPQL 183
Cdd:PHA03247  2762 TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESReslpsPWDPADPPAAVLAPAAALPPAASPAGPLPPPtSAQPTAPPP 2841
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  184 PSQVPEHSPVVYGTVESA------HLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPETAAGEP 257
Cdd:PHA03247  2842 PPGPPPPSLPLGGSVAPGgdvrrrPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQP 2921
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  258 TPEPP-RTSSPTSLPPLARSSLPSPMSAALSSQPLFTAED---------KCELPSSK-EEDAPPVPSPtsctAASGPSLT 326
Cdd:PHA03247  2922 QPPPPpQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQpwlgalvpgRVAVPRFRvPQPAPSREAP----ASSTPPLT 2997

                   ....
gi 1720408520  327 DNSD 330
Cdd:PHA03247  2998 GHSL 3001
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
13-350 7.87e-04

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 44.37  E-value: 7.87e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520   13 QRPQIQPPRaaIPNSSPSIRPGVQTPTAVYQANQhimmvnhlPMPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVqppgp 92
Cdd:pfam03154  233 QTPTLHPQR--LPSPHPPLQPMTQPPPPSQVSPQ--------PLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQP----- 297
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520   93 gpfypGPGPGDFANAYGTPFYPSQPVYQSA--PIIVPTQQQPPPAKREKKTIRIRDPnqggkditeeimsgggsrNPTPP 170
Cdd:pfam03154  298 -----FPLTPQSSQSQVPPGPSPAAPGQSQqrIHTPPSQSQLQSQQPPREQPLPPAP------------------LSMPH 354
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  171 IGRPASTPTPPQLPSQVPEHSPvvygtvesaHLAASTPVTAASDQkqeekpKPDPVFQSPSTVLRLVLSGEKKEQAGQMP 250
Cdd:pfam03154  355 IKPPPTTPIPQLPNPQSHKHPP---------HLSGPSPFQMNSNL------PPPPALKPLSSLSTHHPPSAHPPPLQLMP 419
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  251 ETAAGEPTP-EPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKceLPSSKEEDAPPVPSPTScTAASGPSLtdns 329
Cdd:pfam03154  420 QSQQLPPPPaQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPF--VPGGPPPITPPSGPPTS-TSSAMPGI---- 492
                          330       340
                   ....*....|....*....|.
gi 1720408520  330 dicKKPCSVAPHDSQLISSTI 350
Cdd:pfam03154  493 ---QPPSSASVSSSGPVPAAV 510
PRK11901 PRK11901
hypothetical protein; Reviewed
114-240 9.30e-04

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 43.13  E-value: 9.30e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  114 PSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDP---------NQGGKDITEEIMSGGGSRNPTPP--------IGRPAS 176
Cdd:PRK11901   113 TAPPQDISAPPISPTPTQAAPPQTPNGQQRIELPgnisdalsqQQGQVNAASQNAQGNTSTLPTAPatvapskgAKVPAT 192
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1720408520  177 TPTPPQLPSQVPEHSPVVygtveSAHLAASTPVTAASDQKQEEKPKPDPVFQS-PSTVLRLVLSG 240
Cdd:PRK11901   193 AETHPTPPQKPATKKPAV-----NHHKTATVAVPPATSGKPKSGAASARALSSaPASHYTLQLSS 252
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
160-324 9.82e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.82  E-value: 9.82e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  160 SGGGSRNPTPPIGRPASTPTPPQLPSQVPEHSPVVyGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLS 239
Cdd:PRK07764   601 PAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAA-PAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPA 679
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  240 GEKKEQAGQMPETAAGEPTPEP----------PRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSKEEDAP 309
Cdd:PRK07764   680 APPPAPAPAAPAAPAGAAPAQPapapaatppaGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPP 759
                          170
                   ....*....|....*
gi 1720408520  310 PVPSPTSCTAASGPS 324
Cdd:PRK07764   760 PPPAPAPAAAPAAAP 774
PHA03247 PHA03247
large tegument protein UL36; Provisional
5-340 1.86e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.39  E-value: 1.86e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520    5 PQARSPFFQRpQIQPPRAAIPNSSPSIRPGVQTPTAVYQANQhimmvnhlpmpyPVTQGHQYCIPqyRHSGPPYVGPPQQ 84
Cdd:PHA03247  2557 PAAPPAAPDR-SVPPPRPAPRPSEPAVTSRARRPDAPPQSAR------------PRAPVDDRGDP--RGPAPPSPLPPDT 2621
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520   85 YPVQPPGPGPFYPGPGPGDFANAYGTPfyPSQPVYQSAPiivPTQQQPPPAKREKKTIRIRDPNQGgkditeeimsgggs 164
Cdd:PHA03247  2622 HAPDPPPPSPSPAANEPDPHPPPTVPP--PERPRDDPAP---GRVSRPRRARRLGRAAQASSPPQR-------------- 2682
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  165 rnPTPPIGRPASTPT------PPQLPSQVPEHSPVVYGT----VESAHLAASTPVTAAsdqkqeekPKPDPVFQSPSTVL 234
Cdd:PHA03247  2683 --PRRRAARPTVGSLtsladpPPPPPTPEPAPHALVSATplppGPAAARQASPALPAA--------PAPPAVPAGPATPG 2752
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  235 RLVLSGEKKEQAGQMPETAAGEPTPEPPRTSSPTSLPPL--ARSSLPSPMSAALSSQPLfTAEDKCELPSSKEedAPPVP 312
Cdd:PHA03247  2753 GPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLseSRESLPSPWDPADPPAAV-LAPAAALPPAASP--AGPLP 2829
                          330       340
                   ....*....|....*....|....*....
gi 1720408520  313 SPTSCTAASGPSLTDNSDICKKPC-SVAP 340
Cdd:PHA03247  2830 PPTSAQPTAPPPPPGPPPPSLPLGgSVAP 2858
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
173-340 2.61e-03

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 42.08  E-value: 2.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  173 RPASTPTPPQLPSQVPEHSPVvYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPET 252
Cdd:pfam13254  165 KPKAQPSQPAQPAWMKELNKI-RQSRASVDLGRPNSFKEVTPVGLMRSPAPGGHSKSPSVSGISADSSPTKEEPSEEADT 243
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  253 AAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSK-----EEDAPPVPSPTSCTAASGPSLTD 327
Cdd:pfam13254  244 LSTDKEQSPAPTSASEPPPKTKELPKDSEEPAAPSKSAEASTEKKEPDTESSpetssEKSAPSLLSPVSKASIDKPLSSP 323
                          170
                   ....*....|....
gi 1720408520  328 NSD-ICKKPCSVAP 340
Cdd:pfam13254  324 DRDpLSPKPKPQSP 337
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
110-295 3.36e-03

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 41.57  E-value: 3.36e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  110 TPFYPSQpvyqsapiiVPTQQQPPPAKREKKTIRIRDPNQGgkditeeimSGGGSRNPTPPIGRPASTPTPPQL-PSQVP 188
Cdd:pfam05539  186 HPTYPSQ---------VTPQSQPATQGHQTATANQRLSSTE---------PVGTQGTTTSSNPEPQTEPPPSQRgPSGSP 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  189 EHSPvvygtvesahlaaSTP----VTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKeqagqmPETAAGEPTPEPPRT 264
Cdd:pfam05539  248 QHPP-------------STTsqdqSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTT------KRQETGRPTPRPTAT 308
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1720408520  265 SSPTSLPPlarSSLPSPMSAALSSQPLFTAE 295
Cdd:pfam05539  309 TQSGSSPP---HSSPPGVQANPTTQNLVDCK 336
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
114-262 3.47e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 42.14  E-value: 3.47e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  114 PSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQGGKDITEEIMSGGGSRNPTPPigrPASTPTPPQLPSQVPEHSPV 193
Cdd:PRK07003   467 DAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPA---AAAPPAPEARPPTPAAAAPA 543
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  194 VYGTVESAHL-----------------AASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPEtAAGE 256
Cdd:PRK07003   544 ARAGGAAAALdvlrnagmrvssdrgarAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQ-AAES 622

                   ....*.
gi 1720408520  257 PTPEPP 262
Cdd:PRK07003   623 RGAPPP 628
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
114-316 5.53e-03

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 41.45  E-value: 5.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  114 PSQPVYQSAPIiVPTQQQPPPAKREKKtiRIRDPNQGGKDITEEImsgggSRNPTPPIGRPAS--------------TPT 179
Cdd:PLN03209   343 PTKPVTPEAPS-PPIEEEPPQPKAVVP--RPLSPYTAYEDLKPPT-----SPIPTPPSSSPASsksvdavakpaepdVVP 414
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1720408520  180 PPQLPSQVPEHSPvvyGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPETAAGEPTP 259
Cdd:PLN03209   415 SPGSASNVPEVEP---AQVEAKKTRPLSPYARYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPP 491
                          170       180       190       200       210       220
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1720408520  260 EPPRTSS----------PTSLPPLARSSLPSPMSAALSSQPLFTAEDKCelPSSKEEDAPPVPSPTS 316
Cdd:PLN03209   492 ANMRPLSpyavyddlkpPTSPSPAAPVGKVAPSSTNEVVKVGNSAPPTA--LADEQHHAQPKPRPLS 556
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH