NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1046853695|ref|XP_017453671|]
View 

eukaryotic translation initiation factor 4 gamma 1 isoform X1 [Rattus norvegicus]

Protein Classification

eukaryotic translation initiation factor 4 gamma( domain architecture ID 10501430)

eukaryotic translation initiation factor 4 gamma (EIF4G) plays a key functional role in the initiation of cap-dependent translation by acting as an adapter to nucleate the assembly of eIF4F complex

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
764-992 2.59e-63

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


:

Pssm-ID: 397130  Cd Length: 203  Bit Score: 214.15  E-value: 2.59e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  764 FRRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlkvpttekpTVTVNFR 843
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNL---------RNPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  844 KLLLNRCQKEFEKdkdddevfekkqkemdeaataeergrlKEELEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCV 923
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1046853695  924 VKLLKNH-------DEESLECLCRLLTTIGKDLDFAKAKPRMDQYFNQMEKII---KEKKTSSRIRFMLQDVLDLRQSN 992
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVlskDDPKLSSRLRFMLQDLIELRKNK 203
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1440-1569 1.99e-55

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


:

Pssm-ID: 211397  Cd Length: 134  Bit Score: 189.03  E-value: 1.99e-55
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695 1440 EELRRQLEKLLKDGGSNQRVFDWIEANLNEQQIASNTLVRALMTTVCYSAIIFETPLRVDVQVLKVRARLLQKYL-SDEQ 1518
Cdd:cd11559      4 LRVQAELLKLLQEDPNPDELYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSLPEKEKALLEKYAPLLQKYLdDDEQ 83
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1046853695 1519 KELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPA 1569
Cdd:cd11559     84 LQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1241-1353 3.32e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


:

Pssm-ID: 397128  Cd Length: 113  Bit Score: 130.09  E-value: 3.32e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695 1241 VERKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRLGIESTLERSTIAREHMGRLLHQLLCAGHLSTAQYYQGLYE 1320
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1046853695 1321 TLELAEDMEIDIPHVWLYLAELITPILQEDGVP 1353
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
PHA03247 super family cl33720
large tegument protein UL36; Provisional
4-457 2.39e-09

large tegument protein UL36; Provisional


The actual alignment was detected with superfamily member PHA03247:

Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 62.65  E-value: 2.39e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695    4 APQPTGPPPARSPGLPQPAFPPGQTAPVVFSTPQATQMNTPS-----------QPRQGGF-RSLQHFYPSRA---QPPSS 68
Cdd:PHA03247  2477 APVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAilpdepvgepvHPRMLTWiRGLEELASDDAgdpPPPLP 2556
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695   69 AASRVQSAAPARPGPAPHVYPAGSQVMMIPSQISYSASQGAYYIPGQGRSTYVVPTQQYPVQPGAPGFYPGA-SPTEFGT 147
Cdd:PHA03247  2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPpSPSPAAN 2636
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  148 YAGAYYPAQSVQQFPASVAPAPVLMNQPPQIapKRERKTIRIRDPNQGGKDITEEIMSGARTASTPTPPQtGGSLEPQPN 227
Cdd:PHA03247  2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRA--RRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP-PPTPEPAPH 2713
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  228 GESPqvAVIIRPDDRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEPGSESNLGVLSIPGDTMTTGMIPISV 307
Cdd:PHA03247  2714 ALVS--ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASL 2791
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  308 EESTPISCESGEPyclSPEPTLAEPILEVEVTLSKPIPESEFSSSPLQVSTSLVPHRAETHEPNG--VIPSEDL------ 379
Cdd:PHA03247  2792 SESRESLPSPWDP---ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGgsVAPGGDVrrrpps 2868
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1046853695  380 EPEVESSTEPAPPPLSACASESLV-PIAPTAQPEELLNGAPSPPAVDlSPVSEPEEQAKEVPSAALASIVSPTPPVAPS 457
Cdd:PHA03247  2869 RSPAAKPAAPARPPVRRLARPAVSrSTESFALPPDQPERPPQPQAPP-PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT 2946
rad2 super family cl36701
DNA excision repair protein (rad2); All proteins in this family for which functions are known ...
505-610 2.12e-03

DNA excision repair protein (rad2); All proteins in this family for which functions are known are flap endonucleases that generate the 3' incision next to DNA damage as part of nucleotide excision repair. This family is related to many other flap endonuclease families including the fen1 family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


The actual alignment was detected with superfamily member TIGR00600:

Pssm-ID: 273166 [Multi-domain]  Cd Length: 1034  Bit Score: 42.96  E-value: 2.12e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  505 AAPQVAVSVPKRRRKIkELNKKEAVGDllDAFKEVDPAVPEVENQPPTGSNPSPESEGSAALPQPEEAEET------WDS 578
Cdd:TIGR00600  632 SADLLLISNPMEVEPM-ESEKEESESD--GSFIEVDSVSSTLELQVPSKSQPTDESEENAENKVASIEGEHrkeiedLLF 708
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1046853695  579 KEDKIHNAENIQPGEQKYEYKSDQWKPLNLEE 610
Cdd:TIGR00600  709 DESEEDNIVGMIEEEKDADDFKNEWQDISLEE 740
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
764-992 2.59e-63

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 214.15  E-value: 2.59e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  764 FRRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlkvpttekpTVTVNFR 843
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNL---------RNPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  844 KLLLNRCQKEFEKdkdddevfekkqkemdeaataeergrlKEELEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCV 923
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1046853695  924 VKLLKNH-------DEESLECLCRLLTTIGKDLDFAKAKPRMDQYFNQMEKII---KEKKTSSRIRFMLQDVLDLRQSN 992
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVlskDDPKLSSRLRFMLQDLIELRKNK 203
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1440-1569 1.99e-55

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 189.03  E-value: 1.99e-55
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695 1440 EELRRQLEKLLKDGGSNQRVFDWIEANLNEQQIASNTLVRALMTTVCYSAIIFETPLRVDVQVLKVRARLLQKYL-SDEQ 1518
Cdd:cd11559      4 LRVQAELLKLLQEDPNPDELYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSLPEKEKALLEKYAPLLQKYLdDDEQ 83
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1046853695 1519 KELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPA 1569
Cdd:cd11559     84 LQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
765-992 7.60e-53

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press)


Pssm-ID: 214713  Cd Length: 200  Bit Score: 184.10  E-value: 7.60e-53
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695   765 RRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlKVPttekptvtvNFRK 844
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLNA-KNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695   845 LLLNRCQKEFEKDkdddevfekkqkemdeaataeergrlkeeLEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCVV 924
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1046853695   925 KLLKNH-------DEESLECLCRLLTTIGKDLDFAKAKPRMDQYFNQMEKIIKEKKT---SSRIRFMLQDVLDLRQSN 992
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELRKNK 200
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1241-1353 3.32e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 130.09  E-value: 3.32e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695 1241 VERKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRLGIESTLERSTIAREHMGRLLHQLLCAGHLSTAQYYQGLYE 1320
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1046853695 1321 TLELAEDMEIDIPHVWLYLAELITPILQEDGVP 1353
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1241-1353 2.92e-34

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press


Pssm-ID: 214714  Cd Length: 113  Bit Score: 127.75  E-value: 2.92e-34
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  1241 VERKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRLGIESTLERSTIAREHMGRLLHQLLCAGHLSTAQYYQGLYE 1320
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 1046853695  1321 TLELAEDMEIDIPHVWLYLAELITPILQEDGVP 1353
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1509-1591 2.43e-27

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 106.60  E-value: 2.43e-27
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  1509 LLQKYLSDEQKELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqqGKGVALKSVTAFFNWL 1588
Cdd:smart00515    3 LLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVTWL 80

                    ...
gi 1046853695  1589 REA 1591
Cdd:smart00515   81 QEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1520-1596 3.93e-23

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 94.52  E-value: 3.93e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1046853695 1520 ELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqQGKGVALKSVTAFFNWLREAEDEES 1596
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAE-KGMKKVRKQAKPFVEWLEEAEEESD 76
PHA03247 PHA03247
large tegument protein UL36; Provisional
4-457 2.39e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 62.65  E-value: 2.39e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695    4 APQPTGPPPARSPGLPQPAFPPGQTAPVVFSTPQATQMNTPS-----------QPRQGGF-RSLQHFYPSRA---QPPSS 68
Cdd:PHA03247  2477 APVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAilpdepvgepvHPRMLTWiRGLEELASDDAgdpPPPLP 2556
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695   69 AASRVQSAAPARPGPAPHVYPAGSQVMMIPSQISYSASQGAYYIPGQGRSTYVVPTQQYPVQPGAPGFYPGA-SPTEFGT 147
Cdd:PHA03247  2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPpSPSPAAN 2636
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  148 YAGAYYPAQSVQQFPASVAPAPVLMNQPPQIapKRERKTIRIRDPNQGGKDITEEIMSGARTASTPTPPQtGGSLEPQPN 227
Cdd:PHA03247  2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRA--RRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP-PPTPEPAPH 2713
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  228 GESPqvAVIIRPDDRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEPGSESNLGVLSIPGDTMTTGMIPISV 307
Cdd:PHA03247  2714 ALVS--ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASL 2791
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  308 EESTPISCESGEPyclSPEPTLAEPILEVEVTLSKPIPESEFSSSPLQVSTSLVPHRAETHEPNG--VIPSEDL------ 379
Cdd:PHA03247  2792 SESRESLPSPWDP---ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGgsVAPGGDVrrrpps 2868
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1046853695  380 EPEVESSTEPAPPPLSACASESLV-PIAPTAQPEELLNGAPSPPAVDlSPVSEPEEQAKEVPSAALASIVSPTPPVAPS 457
Cdd:PHA03247  2869 RSPAAKPAAPARPPVRRLARPAVSrSTESFALPPDQPERPPQPQAPP-PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT 2946
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
3-179 8.09e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 47.34  E-value: 8.09e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695    3 KAPQPTGPPPARSPGLPQPAFPPGQTAPVVFSTPQATQMNTPSQPRQGGFRSLQHFYPSRAQPPSSAASRVQSAAPARPG 82
Cdd:pfam09770  206 QAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQA 285
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695   83 PAPHVYPAgsQVMMIPSQIsysasqgayyipgqgrstyvvpTQQYPVQPGAPGFYPGAsptefGTYAGAYYPAQSVQQFP 162
Cdd:pfam09770  286 QQFHQQPP--PVPVQPTQI----------------------LQNPNRLSAARVGYPQN-----PQPGVQPAPAHQAHRQQ 336
                          170
                   ....*....|....*..
gi 1046853695  163 ASVAPAPVLMNQPPQIA 179
Cdd:pfam09770  337 GSFGRQAPIITHPQQLA 353
rad2 TIGR00600
DNA excision repair protein (rad2); All proteins in this family for which functions are known ...
505-610 2.12e-03

DNA excision repair protein (rad2); All proteins in this family for which functions are known are flap endonucleases that generate the 3' incision next to DNA damage as part of nucleotide excision repair. This family is related to many other flap endonuclease families including the fen1 family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273166 [Multi-domain]  Cd Length: 1034  Bit Score: 42.96  E-value: 2.12e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  505 AAPQVAVSVPKRRRKIkELNKKEAVGDllDAFKEVDPAVPEVENQPPTGSNPSPESEGSAALPQPEEAEET------WDS 578
Cdd:TIGR00600  632 SADLLLISNPMEVEPM-ESEKEESESD--GSFIEVDSVSSTLELQVPSKSQPTDESEENAENKVASIEGEHrkeiedLLF 708
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1046853695  579 KEDKIHNAENIQPGEQKYEYKSDQWKPLNLEE 610
Cdd:TIGR00600  709 DESEEDNIVGMIEEEKDADDFKNEWQDISLEE 740
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
764-992 2.59e-63

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 214.15  E-value: 2.59e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  764 FRRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlkvpttekpTVTVNFR 843
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNL---------RNPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  844 KLLLNRCQKEFEKdkdddevfekkqkemdeaataeergrlKEELEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCV 923
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1046853695  924 VKLLKNH-------DEESLECLCRLLTTIGKDLDFAKAKPRMDQYFNQMEKII---KEKKTSSRIRFMLQDVLDLRQSN 992
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVlskDDPKLSSRLRFMLQDLIELRKNK 203
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1440-1569 1.99e-55

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 189.03  E-value: 1.99e-55
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695 1440 EELRRQLEKLLKDGGSNQRVFDWIEANLNEQQIASNTLVRALMTTVCYSAIIFETPLRVDVQVLKVRARLLQKYL-SDEQ 1518
Cdd:cd11559      4 LRVQAELLKLLQEDPNPDELYKWIKENVSPELYASPGFVRALMTAVLKYAIEEKSLPEKEKALLEKYAPLLQKYLdDDEQ 83
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1046853695 1519 KELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPA 1569
Cdd:cd11559     84 LQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
765-992 7.60e-53

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press)


Pssm-ID: 214713  Cd Length: 200  Bit Score: 184.10  E-value: 7.60e-53
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695   765 RRVRSILNKLTPQMFQQLMKQVTQLAIDTEERLKGVIDLIFEKAISEPNFSVAYANMCRCLMAlKVPttekptvtvNFRK 844
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLNA-KNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695   845 LLLNRCQKEFEKDkdddevfekkqkemdeaataeergrlkeeLEEARDIARRRSLGNIKFIGELFKLKMLTEAIMHDCVV 924
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1046853695   925 KLLKNH-------DEESLECLCRLLTTIGKDLDFAKAKPRMDQYFNQMEKIIKEKKT---SSRIRFMLQDVLDLRQSN 992
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELRKNK 200
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1241-1353 3.32e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 130.09  E-value: 3.32e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695 1241 VERKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRLGIESTLERSTIAREHMGRLLHQLLCAGHLSTAQYYQGLYE 1320
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1046853695 1321 TLELAEDMEIDIPHVWLYLAELITPILQEDGVP 1353
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1241-1353 2.92e-34

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press


Pssm-ID: 214714  Cd Length: 113  Bit Score: 127.75  E-value: 2.92e-34
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  1241 VERKSKAIIEEYLHLNDMKEAVQCVQELASPSLLFIFVRLGIESTLERSTIAREHMGRLLHQLLCAGHLSTAQYYQGLYE 1320
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 1046853695  1321 TLELAEDMEIDIPHVWLYLAELITPILQEDGVP 1353
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1509-1591 2.43e-27

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 106.60  E-value: 2.43e-27
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  1509 LLQKYLSDEQKELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqqGKGVALKSVTAFFNWL 1588
Cdd:smart00515    3 LLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVTWL 80

                    ...
gi 1046853695  1589 REA 1591
Cdd:smart00515   81 QEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1520-1596 3.93e-23

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 94.52  E-value: 3.93e-23
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1046853695 1520 ELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEqQGKGVALKSVTAFFNWLREAEDEES 1596
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAE-KGMKKVRKQAKPFVEWLEEAEEESD 76
W2 cd11473
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1440-1563 1.15e-19

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211395  Cd Length: 135  Bit Score: 86.76  E-value: 1.15e-19
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695 1440 EELRRQLEKLLK-DGGSNQRVFDWIEANLNEQQIASNTLVRALMTTVCYSAIIFE----TPLRVDVQVLKVRARLLQKYL 1514
Cdd:cd11473      4 KKLRDSLLKELEeDKSSDVESVKAAKSKLDLDPISLEEVVKVLLTAVVNAVESADsislTQKEQLVLVLKKYGPVLRELL 83
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1046853695 1515 SD-EQKELQALYALQALVVT--LEQPANLLRMFFDALYDEDVVKEDAFYSWE 1563
Cdd:cd11473     84 KLiKKDQLYLLLKIEKLCLQlkLSELISLLEKILDLLYDADVLSEEAILSWF 135
W2_eIF2B_epsilon cd11558
C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a ...
1509-1596 9.24e-14

C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a heteropentameric complex which functions as a guanine nucleotide exchange factor in the recycling of eIF-2 during the initiation of translation in eukaryotes. The epsilon and gamma subunits are sequence similar and both are essential in yeast. Epsilon appears to be the catalytically active subunit, with gamma enhancing its activity. The C-terminal domain of the eIF2B epsilon subunit contains bipartite motifs rich in acidic and aromatic residues, which are responsible for the interaction with eIF2. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211396  Cd Length: 169  Bit Score: 70.75  E-value: 9.24e-14
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695 1509 LLQKYLSDEQKELQALYALQALVVTLEQPANLLRMFFDALYDEDVVKEDAFYSWESSKDPAEQQGKGVALKSVTAFFNWL 1588
Cdd:cd11558     82 LLENYVKSQDDQVELLLALEEFCLESEEGGPLFAKLLHALYDLDILEEEAILEWWEEPDAGADEEMKKVRELVKKFIEWL 161

                   ....*...
gi 1046853695 1589 REAEDEES 1596
Cdd:cd11558    162 EEAEEESD 169
PHA03247 PHA03247
large tegument protein UL36; Provisional
4-457 2.39e-09

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 62.65  E-value: 2.39e-09
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695    4 APQPTGPPPARSPGLPQPAFPPGQTAPVVFSTPQATQMNTPS-----------QPRQGGF-RSLQHFYPSRA---QPPSS 68
Cdd:PHA03247  2477 APVYRRPAEARFPFAAGAAPDPGGGGPPDPDAPPAPSRLAPAilpdepvgepvHPRMLTWiRGLEELASDDAgdpPPPLP 2556
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695   69 AASRVQSAAPARPGPAPHVYPAGSQVMMIPSQISYSASQGAYYIPGQGRSTYVVPTQQYPVQPGAPGFYPGA-SPTEFGT 147
Cdd:PHA03247  2557 PAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPpSPSPAAN 2636
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  148 YAGAYYPAQSVQQFPASVAPAPVLMNQPPQIapKRERKTIRIRDPNQGGKDITEEIMSGARTASTPTPPQtGGSLEPQPN 227
Cdd:PHA03247  2637 EPDPHPPPTVPPPERPRDDPAPGRVSRPRRA--RRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP-PPTPEPAPH 2713
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  228 GESPqvAVIIRPDDRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEPGSESNLGVLSIPGDTMTTGMIPISV 307
Cdd:PHA03247  2714 ALVS--ATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASL 2791
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  308 EESTPISCESGEPyclSPEPTLAEPILEVEVTLSKPIPESEFSSSPLQVSTSLVPHRAETHEPNG--VIPSEDL------ 379
Cdd:PHA03247  2792 SESRESLPSPWDP---ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGgsVAPGGDVrrrpps 2868
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1046853695  380 EPEVESSTEPAPPPLSACASESLV-PIAPTAQPEELLNGAPSPPAVDlSPVSEPEEQAKEVPSAALASIVSPTPPVAPS 457
Cdd:PHA03247  2869 RSPAAKPAAPARPPVRRLARPAVSrSTESFALPPDQPERPPQPQAPP-PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPT 2946
W2_eIF5 cd11561
C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase ...
1438-1596 1.65e-08

C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase acceleration protein (GAP), as well as a GDP dissociation inhibitor (GDI) during translational initiation in eukaryotes. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211399  Cd Length: 157  Bit Score: 55.31  E-value: 1.65e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695 1438 AFEELRRQLEKLLKDGGSNQrvfdwIEANLNEQQIASNTLVRALmttvcysAIIFETPLRVD-VQVLKVRARLLQKYLSD 1516
Cdd:cd11561      7 RVDELGEFLKKNKDESGLSE-----LKEILKEAERLDVVKDKAV-------LVLAEVLFDENiVKEIKKRKALLLKLVTD 74
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695 1517 EQKELQALYALQALVVtlEQPANLLRMF---FDALYDEDVVKEDAFYSW---ESSKDPAEQQGKGVaLKSVTAFFNWLRE 1590
Cdd:cd11561     75 EKAQKALLGGIERFCG--KHSPELLKKVpliLKALYDNDILEEEVILKWyekVSKKYVSKEKSKKV-RKAAEPFVEWLEE 151

                   ....*.
gi 1046853695 1591 AEDEES 1596
Cdd:cd11561    152 AEEEEE 157
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1434-1594 9.73e-08

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 54.14  E-value: 9.73e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695 1434 QRALAFEELRRQLEKLLKDGGSNQRVFDWIEANLNEQQIASN--------TLVRALMTTVCYSA---IIFETPLRVdvqv 1502
Cdd:cd11560     29 YRKQASQEIKKELQQELKEMIAEEEPVKEIIAAVKEQMKKSSlpehevvgLLWTALMDAVEWSKkedQIAEQALRH---- 104
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695 1503 LKVRARLLQKYLSDEQKELQALYALQalVVTLEQpANLLRMFFD---ALYDEDVVKEDAFYSWesSKDPAEQQGKGVALK 1579
Cdd:cd11560    105 LKKYAPLLAAFCTTARAELALLNKIQ--EYCYEN-MKFMKVFQKivkLLYKADVLSEDAILKW--YKKGHSPKGKQVFLK 179
                          170
                   ....*....|....*
gi 1046853695 1580 SVTAFFNWLREAEDE 1594
Cdd:cd11560    180 QMEPFVEWLQEAEEE 194
PHA03247 PHA03247
large tegument protein UL36; Provisional
5-360 3.22e-07

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 55.71  E-value: 3.22e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695    5 PQPTGPPPARSPGLPQP--------AFPPGQTAPVVFSTPQATQmnTPSQPRQGGFRslqhfyPSRAQPPSSAASRVQSA 76
Cdd:PHA03247  2706 PTPEPAPHALVSATPLPpgpaaarqASPALPAAPAPPAVPAGPA--TPGGPARPARP------PTTAGPPAPAPPAAPAA 2777
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695   77 APARPGPAPHVYPAGSQVMMIPSQISYSASQGAYYIPGQGRSTYVVPTQQYP----VQPGAPGFYPGASPTEFgTYAGAY 152
Cdd:PHA03247  2778 GPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPpptsAQPTAPPPPPGPPPPSL-PLGGSV 2856
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  153 YPAQSVQQFPASVAPAPVL----------MNQP-------PQIAPKRERKTIRIRDPNQGGKDITEEIMSGARTASTPTP 215
Cdd:PHA03247  2857 APGGDVRRRPPSRSPAAKPaaparppvrrLARPavsrsteSFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPP 2936
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  216 PQTGGSLEPQPNGESPQVAVIIRPDDRSqGAAIGGR------------PGLPGPEHSPGTESQPSSPSPTPSPPPIL--- 280
Cdd:PHA03247  2937 PRPQPPLAPTTDPAGAGEPSGAVPQPWL-GALVPGRvavprfrvpqpaPSREAPASSTPPLTGHSLSRVSSWASSLAlhe 3015
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  281 --EPGSESNLGVLSIPGDTMTTGMIPISVEESTPISCESGEPycLSPEPTLA---EPILEVEVTLSKPIPESEFSSSPLQ 355
Cdd:PHA03247  3016 etDPPPVSLKQTLWPPDDTEDSDADSLFDSDSERSDLEALDP--LPPEPHDPfahEPDPATPEAGARESPSSQFGPPPLS 3093

                   ....*
gi 1046853695  356 VSTSL 360
Cdd:PHA03247  3094 ANAAL 3098
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
3-179 8.09e-05

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 47.34  E-value: 8.09e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695    3 KAPQPTGPPPARSPGLPQPAFPPGQTAPVVFSTPQATQMNTPSQPRQGGFRSLQHFYPSRAQPPSSAASRVQSAAPARPG 82
Cdd:pfam09770  206 QAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQQPQQQPQQPQQHPGQGHPVTILQRPQSPQPDPAQPSIQPQA 285
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695   83 PAPHVYPAgsQVMMIPSQIsysasqgayyipgqgrstyvvpTQQYPVQPGAPGFYPGAsptefGTYAGAYYPAQSVQQFP 162
Cdd:pfam09770  286 QQFHQQPP--PVPVQPTQI----------------------LQNPNRLSAARVGYPQN-----PQPGVQPAPAHQAHRQQ 336
                          170
                   ....*....|....*..
gi 1046853695  163 ASVAPAPVLMNQPPQIA 179
Cdd:pfam09770  337 GSFGRQAPIITHPQQLA 353
PHA03247 PHA03247
large tegument protein UL36; Provisional
162-460 1.17e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 1.17e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  162 PASVAPAPVLMNQPPQIAPK--------RERKTIRIRDPNQGGKDITEEIMSGARTASTPTPPQTGgSLEPQPNGESPqv 233
Cdd:PHA03247  2557 PAAPPAAPDRSVPPPRPAPRpsepavtsRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTH-APDPPPPSPSP-- 2633
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  234 aviirpddRSQGAAIGGRPGLPGPEHSPGTESQPSSPSPTPSPPPILEPGSESNLGVLSIPGDTMTTGMIPISVEESTPi 313
Cdd:PHA03247  2634 --------AANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSLTSLADPPPP- 2704
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  314 scesgEPyclSPEPTlaePILEVEVTLSKPIPESEFSSSPLQVSTSLVPHRAETHEPNGVIPSEDLEPEVESSTEPAPPP 393
Cdd:PHA03247  2705 -----PP---TPEPA---PHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPA 2773
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  394 LSACASESLVPIAPTAQPEELLNGAPSPPAVDLSPVSEPEEQAKEVPSAALASIVSPTP---PVAPSDTS 460
Cdd:PHA03247  2774 APAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTsaqPTAPPPPP 2843
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
327-458 2.06e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 42.77  E-value: 2.06e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  327 PTLAEPILEVEVTLSKPIPESEFSSSPLQV-STSLVPHRAETHEP------NGVIPSEDLEPEVESSTEPAPPPLSA--C 397
Cdd:PRK08691   380 PSAQTAEKETAAKKPQPRPEAETAQTPVQTaSAAAMPSEGKTAGPvsnqenNDVPPWEDAPDEAQTAAGTAQTSAKSiqT 459
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1046853695  398 ASESLVP--------IAPTAQPEELLNGAPSPPAVDLSPVSEP---EEQAKEVPSAALASIVSPTPPVAPSD 458
Cdd:PRK08691   460 ASEAETPpenqvsknKAADNETDAPLSEVPSENPIQATPNDEAvetETFAHEAPAEPFYGYGFPDNDCPPED 531
rad2 TIGR00600
DNA excision repair protein (rad2); All proteins in this family for which functions are known ...
505-610 2.12e-03

DNA excision repair protein (rad2); All proteins in this family for which functions are known are flap endonucleases that generate the 3' incision next to DNA damage as part of nucleotide excision repair. This family is related to many other flap endonuclease families including the fen1 family. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). [DNA metabolism, DNA replication, recombination, and repair]


Pssm-ID: 273166 [Multi-domain]  Cd Length: 1034  Bit Score: 42.96  E-value: 2.12e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  505 AAPQVAVSVPKRRRKIkELNKKEAVGDllDAFKEVDPAVPEVENQPPTGSNPSPESEGSAALPQPEEAEET------WDS 578
Cdd:TIGR00600  632 SADLLLISNPMEVEPM-ESEKEESESD--GSFIEVDSVSSTLELQVPSKSQPTDESEENAENKVASIEGEHrkeiedLLF 708
                           90       100       110
                   ....*....|....*....|....*....|..
gi 1046853695  579 KEDKIHNAENIQPGEQKYEYKSDQWKPLNLEE 610
Cdd:TIGR00600  709 DESEEDNIVGMIEEEKDADDFKNEWQDISLEE 740
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
4-208 3.43e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.28  E-value: 3.43e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695    4 APQPTGPPPARSPGLPQPAFPPGQ----------TAPVVFSTPQATQMNTPSQPRQGGFRSLQHFYPSRAQPPSSAASRV 73
Cdd:PRK07764   612 AARPAAPAAPAAPAAPAPAGAAAApaeasaapapGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPA 691
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695   74 QSAAPARPGPAPHVyPAGSQVMMIPSQISYSASQGAYYIPGQGRSTYVVPTQQYPVQPGAPGFYPGASPTEFGTYAGayy 153
Cdd:PRK07764   692 APAGAAPAQPAPAP-AATPPAGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPA--- 767
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1046853695  154 PAQSVQQFPASVAPAPVLMNQPPQIAPKRERktiriRDPNQGGKDITEEIMsGAR 208
Cdd:PRK07764   768 AAPAAAPPPSPPSEEEEMAEDDAPSMDDEDR-----RDAEEVAMELLEEEL-GAK 816
Rib_recp_KP_reg pfam05104
Ribosome receptor lysine/proline rich region; This highly conserved region is found towards ...
368-456 5.64e-03

Ribosome receptor lysine/proline rich region; This highly conserved region is found towards the C-terminus of the transmembrane domain. The function is unclear.


Pssm-ID: 461548 [Multi-domain]  Cd Length: 140  Bit Score: 38.95  E-value: 5.64e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  368 HEPNGVIPseDLEPEVESSTEPAPPPLSACASESLVPIAPTAQPEELLNGAPSPPAVDLSPVSEPEEQAKEVPSAALASI 447
Cdd:pfam05104   44 EKPNGKLP--ESEQADESEEEPREFKTPDEAPSAALEPEPVPTPVPAPVEPEPAPPSESPAPSPKEKKKKEKKSAKVEPA 121

                   ....*....
gi 1046853695  448 VSPTPPVAP 456
Cdd:pfam05104  122 ETPEAVQPK 130
PRK11633 PRK11633
cell division protein DedD; Provisional
346-459 9.24e-03

cell division protein DedD; Provisional


Pssm-ID: 236940 [Multi-domain]  Cd Length: 226  Bit Score: 39.60  E-value: 9.24e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1046853695  346 ESEFSSSPLqvstslVPHRAETHEPNGV---------IPSEDLEPEVESSTEPAPPPLSACASESLVPIAPTAQPEElln 416
Cdd:PRK11633    35 QDEFAAIPL------VPKPGDRDEPDMMpaatqalptQPPEGAAEAVRAGDAAAPSLDPATVAPPNTPVEPEPAPVE--- 105
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1046853695  417 gAPSPPavdlsPVSEPEEQAKEVPSAALASIVSPTP-PVAPSDT 459
Cdd:PRK11633   106 -PPKPK-----PVEKPKPKPKPQQKVEAPPAPKPEPkPVVEEKA 143
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH