NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1246417673|ref|NP_001342628|]
View 

eukaryotic translation initiation factor 4 gamma 3 isoform 4 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
799-1027 1.27e-63

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


:

Pssm-ID: 397130  Cd Length: 203  Bit Score: 215.30  E-value: 1.27e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  799 FRKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 878
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  879 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 958
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1246417673  959 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 1027
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1448-1580 1.32e-48

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


:

Pssm-ID: 211397  Cd Length: 134  Bit Score: 169.39  E-value: 1.32e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673 1448 EELSQRLEKLIMEEKADDErIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADcSTFRVDTAVIKQRVPILLKYLDSD 1527
Cdd:cd11559      4 LRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEK-SLPEKEKALLEKYAPLLQKYLDDD 81
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1246417673 1528 TEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1580
Cdd:cd11559     82 EQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1246-1358 7.40e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


:

Pssm-ID: 397128  Cd Length: 113  Bit Score: 129.32  E-value: 7.40e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673 1246 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1325
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1246417673 1326 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1358
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
PRK07003 super family cl35530
DNA polymerase III subunit gamma/tau;
210-374 5.19e-06

DNA polymerase III subunit gamma/tau;


The actual alignment was detected with superfamily member PRK07003:

Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 51.39  E-value: 5.19e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  210 GGGSRNPTPPIGRPASTPTPPQQLPSQV-PEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVL 288
Cdd:PRK07003   368 PGGGVPARVAGAVPAPGARAAAAVGASAvPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDA 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  289 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTS-----LPPLARSSlPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVP- 362
Cdd:PRK07003   448 PVPAKANARASADSRCDERDAQPPADSGSASapasdAPPDAAFE-PAPRAAAPSAATPAAVPDARAPAAASREDAPAAAa 526
                          170
                   ....*....|..
gi 1246417673  363 SPTSCTAASGPS 374
Cdd:PRK07003   527 PPAPEARPPTPA 538
W2 super family cl17013
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1559-1605 1.13e-04

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


The actual alignment was detected with superfamily member cd11560:

Pssm-ID: 473053 [Multi-domain]  Cd Length: 194  Bit Score: 44.90  E-value: 1.13e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1246417673 1559 LYDEEVISEDAFYKWesSKDPAEQAGKGVALKSVTAFFTWLREAEEE 1605
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
Atrophin-1 super family cl38111
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
62-400 1.52e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


The actual alignment was detected with superfamily member pfam03154:

Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 1.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673   62 QRPQIQPPRaaIPNSSPSIRPGVQTPTAVYQANQhimmvnhlPMPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVqppgp 141
Cdd:pfam03154  233 QTPTLHPQR--LPSPHPPLQPMTQPPPPSQVSPQ--------PLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQP----- 297
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  142 gpfypGPGPGDFANAYGTPF-YPSQPVY-QSAPIIVPTQQQPPPAKREKKTIRIRDPnqggkditeeimsgggsrNPTPP 219
Cdd:pfam03154  298 -----FPLTPQSSQSQVPPGpSPAAPGQsQQRIHTPPSQSQLQSQQPPREQPLPPAP------------------LSMPH 354
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  220 IGRPASTPTPPQQLPsQVPEHSPvvygtvesaHLAASTPVTAASDQkqeekpKPDPVFQSPSTVLRLVLSGEKKEQAGQM 299
Cdd:pfam03154  355 IKPPPTTPIPQLPNP-QSHKHPP---------HLSGPSPFQMNSNL------PPPPALKPLSSLSTHHPPSAHPPPLQLM 418
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  300 PETAAGEPTP-EPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKceLPSSKEEDAPPVPSPTScTAASGPSLtdn 378
Cdd:pfam03154  419 PQSQQLPPPPaQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPF--VPGGPPPITPPSGPPTS-TSSAMPGI--- 492
                          330       340
                   ....*....|....*....|..
gi 1246417673  379 sdicKKPCSVAPHDSQLISSTI 400
Cdd:pfam03154  493 ----QPPSSASVSSSGPVPAAV 510
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
799-1027 1.27e-63

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 215.30  E-value: 1.27e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  799 FRKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 878
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  879 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 958
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1246417673  959 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 1027
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
800-1024 5.59e-51

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press)


Pssm-ID: 214713  Cd Length: 200  Bit Score: 178.71  E-value: 5.59e-51
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673   800 RKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVtLKVPmadkpgntvNFRK 879
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673   880 LLLNRCQKEFEKDkadddvfekkqkeleaasapeertrlhdeLEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCVV 959
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1246417673   960 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 1024
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1448-1580 1.32e-48

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 169.39  E-value: 1.32e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673 1448 EELSQRLEKLIMEEKADDErIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADcSTFRVDTAVIKQRVPILLKYLDSD 1527
Cdd:cd11559      4 LRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEK-SLPEKEKALLEKYAPLLQKYLDDD 81
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1246417673 1528 TEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1580
Cdd:cd11559     82 EQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1246-1358 7.40e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 129.32  E-value: 7.40e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673 1246 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1325
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1246417673 1326 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1358
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1246-1358 2.71e-32

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press


Pssm-ID: 214714  Cd Length: 113  Bit Score: 121.97  E-value: 2.71e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  1246 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1325
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 1246417673  1326 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1358
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1518-1602 2.22e-28

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 109.69  E-value: 2.22e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  1518 PILLKYLDSDTEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEqaGKGVALKSVTAFFT 1597
Cdd:smart00515    1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78

                    ....*
gi 1246417673  1598 WLREA 1602
Cdd:smart00515   79 WLQEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1531-1607 8.10e-24

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 96.44  E-value: 8.10e-24
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1246417673 1531 ELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEQaGKGVALKSVTAFFTWLREAEEESE 1607
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAEK-GMKKVRKQAKPFVEWLEEAEEESD 76
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
210-374 5.19e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 51.39  E-value: 5.19e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  210 GGGSRNPTPPIGRPASTPTPPQQLPSQV-PEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVL 288
Cdd:PRK07003   368 PGGGVPARVAGAVPAPGARAAAAVGASAvPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDA 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  289 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTS-----LPPLARSSlPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVP- 362
Cdd:PRK07003   448 PVPAKANARASADSRCDERDAQPPADSGSASapasdAPPDAAFE-PAPRAAAPSAATPAAVPDARAPAAASREDAPAAAa 526
                          170
                   ....*....|..
gi 1246417673  363 SPTSCTAASGPS 374
Cdd:PRK07003   527 PPAPEARPPTPA 538
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1559-1605 1.13e-04

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 44.90  E-value: 1.13e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1246417673 1559 LYDEEVISEDAFYKWesSKDPAEQAGKGVALKSVTAFFTWLREAEEE 1605
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
159-345 1.18e-04

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 46.58  E-value: 1.18e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  159 TPFYPSQpvyqsapiiVPTQQQPPPAKREKKTIRIRDPNQGgkditeeimSGGGSRNPTPPIGRPASTPTPPQQLPSQVP 238
Cdd:pfam05539  186 HPTYPSQ---------VTPQSQPATQGHQTATANQRLSSTE---------PVGTQGTTTSSNPEPQTEPPPSQRGPSGSP 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  239 EHSPvvygtvesahlaaSTP----VTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKeqagqmPETAAGEPTPEPPRT 314
Cdd:pfam05539  248 QHPP-------------STTsqdqSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTT------KRQETGRPTPRPTAT 308
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1246417673  315 SSPTSLPPlarSSLPSPMSAALSSQPLFTAE 345
Cdd:pfam05539  309 TQSGSSPP---HSSPPGVQANPTTQNLVDCK 336
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
62-400 1.52e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 1.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673   62 QRPQIQPPRaaIPNSSPSIRPGVQTPTAVYQANQhimmvnhlPMPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVqppgp 141
Cdd:pfam03154  233 QTPTLHPQR--LPSPHPPLQPMTQPPPPSQVSPQ--------PLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQP----- 297
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  142 gpfypGPGPGDFANAYGTPF-YPSQPVY-QSAPIIVPTQQQPPPAKREKKTIRIRDPnqggkditeeimsgggsrNPTPP 219
Cdd:pfam03154  298 -----FPLTPQSSQSQVPPGpSPAAPGQsQQRIHTPPSQSQLQSQQPPREQPLPPAP------------------LSMPH 354
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  220 IGRPASTPTPPQQLPsQVPEHSPvvygtvesaHLAASTPVTAASDQkqeekpKPDPVFQSPSTVLRLVLSGEKKEQAGQM 299
Cdd:pfam03154  355 IKPPPTTPIPQLPNP-QSHKHPP---------HLSGPSPFQMNSNL------PPPPALKPLSSLSTHHPPSAHPPPLQLM 418
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  300 PETAAGEPTP-EPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKceLPSSKEEDAPPVPSPTScTAASGPSLtdn 378
Cdd:pfam03154  419 PQSQQLPPPPaQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPF--VPGGPPPITPPSGPPTS-TSSAMPGI--- 492
                          330       340
                   ....*....|....*....|..
gi 1246417673  379 sdicKKPCSVAPHDSQLISSTI 400
Cdd:pfam03154  493 ----QPPSSASVSSSGPVPAAV 510
 
Name Accession Description Interval E-value
MIF4G pfam02854
MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). ...
799-1027 1.27e-63

MIF4G domain; MIF4G is named after Middle domain of eukaryotic initiation factor 4G (eIF4G). Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA.


Pssm-ID: 397130  Cd Length: 203  Bit Score: 215.30  E-value: 1.27e-63
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  799 FRKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVTLkvpmadkpgNTVNFR 878
Cdd:pfam02854    1 LKKVKGILNKLSPENFEKLIKELLKLIMSDPELLKYLIELIFEKAVEEPNFIPAYARLCSGLNLR---------NPTDFG 71
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  879 KLLLNRCQKEFEKdkadddvfekkqkeleaasapeertrlHDELEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCV 958
Cdd:pfam02854   72 IHLLNRLQEEFEK---------------------------RFELEENEQGNRRRRLGLVRFLGELYKFGLLTEKILFECL 124
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1246417673  959 VKLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERK---TSSRIRFMLQDVIDLRLCN 1027
Cdd:pfam02854  125 KELLSSLtkedlkrDLFNLECLLTLLTTIGKLLENEKLPKLMDQFLDEIQKYVLSKDdpkLSSRLRFMLQDLIELRKNK 203
MIF4G smart00543
Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The ...
800-1024 5.59e-51

Middle domain of eukaryotic initiation factor 4G (eIF4G); Also occurs in NMD2p and CBP80. The domain is rich in alpha-helices and may contain multiple alpha-helical repeats. In eIF4G, this domain binds eIF4A, eIF3, RNA and DNA. Ponting (TiBS) "Novel eIF4G domain homologues (in press)


Pssm-ID: 214713  Cd Length: 200  Bit Score: 178.71  E-value: 5.59e-51
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673   800 RKVRSILNKLTPQMFNQLMKQVSALTVDTEERLKGVIDLVFEKAIDEPSFSVAYANMCRCLVtLKVPmadkpgntvNFRK 879
Cdd:smart00543    2 KKVKGLINKLSPSNFESIIKELLKLNNSDKNLRKYILELIFEKAVEEPNFIPAYARLCALLN-AKNP---------DFGS 71
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673   880 LLLNRCQKEFEKDkadddvfekkqkeleaasapeertrlhdeLEEAKDKARRRSIGNIKFIGELFKLKMLTEAIMHDCVV 959
Cdd:smart00543   72 LLLERLQEEFEKG-----------------------------LESEEESDKQRRLGLVRFLGELYNFQVLTSKIILELLK 122
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1246417673   960 KLLKNH-------DEESLECLCRLLTTIGKDLDFEKAKPRMDQYFNQMEKIVKERKT---SSRIRFMLQDVIDLR 1024
Cdd:smart00543  123 ELLNDLtkldpprSDFSVECLLSLLPTCGKDLEREKSPKLLDEILERLQDYLLKKDKtelSSRLRFMLELLIELR 197
W2_eIF4G1_like cd11559
C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar ...
1448-1580 1.32e-48

C-terminal W2 domain of eukaryotic translation initiation factor 4 gamma 1 and similar proteins; eIF4G1 is a component of the multi-subunit eukaryotic translation initiation factor 4F, which facilitates recruitment of the mRNA to the ribosome, a rate-limiting step during translation initiation. This C-terminal domain, whose structure resembles that of a set of concatenated HEAT repeats, has been associated with binding to/recruiting the kinase Mnk1, which phosphorylates eIF4E.


Pssm-ID: 211397  Cd Length: 134  Bit Score: 169.39  E-value: 1.32e-48
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673 1448 EELSQRLEKLIMEEKADDErIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADcSTFRVDTAVIKQRVPILLKYLDSD 1527
Cdd:cd11559      4 LRVQAELLKLLQEDPNPDE-LYKWIKENVSPELYASPGFVRALMTAVLKYAIEEK-SLPEKEKALLEKYAPLLQKYLDDD 81
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|...
gi 1246417673 1528 TEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPA 1580
Cdd:cd11559     82 EQLQLQALYALQALVHTLEFPKGLLLRFFDALYDEDVIEEEAFLKWKEDVDPA 134
MA3 pfam02847
MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain ...
1246-1358 7.40e-35

MA3 domain; Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains.


Pssm-ID: 397128  Cd Length: 113  Bit Score: 129.32  E-value: 7.40e-35
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673 1246 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1325
Cdd:pfam02847    1 LKRKIFLILEEYLSSGDYDEAARCLLKLGLPSQHHEVVKVLIECALEESKTYREFYGLLLERLCEFNLISTKQFEKGFWR 80
                           90       100       110
                   ....*....|....*....|....*....|...
gi 1246417673 1326 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1358
Cdd:pfam02847   81 VLEDLEDLELDIPNAWRNLAEFVARLISDDGLP 113
MA3 smart00544
Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and ...
1246-1358 2.71e-32

Domain in DAP-5, eIF4G, MA-3 and other proteins; Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press


Pssm-ID: 214714  Cd Length: 113  Bit Score: 121.97  E-value: 2.71e-32
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  1246 VERKSKSIIDEFLHINDFKEATQCIEELSAQGPLHVFVKVGVEFTLERSQITRDHMGHLLYQLVQSEKLSKQDFFKGFSE 1325
Cdd:smart00544    1 LKKKIFLIIEEYLSSGDTDEAVHCLLELKLPEQHHEVVKVLLTCALEEKRTYREMYSVLLSRLCQANVISTKQFEKGFWR 80
                            90       100       110
                    ....*....|....*....|....*....|...
gi 1246417673  1326 TLELADDMAIDIPHIWLYLAELVTPMLKGGGIS 1358
Cdd:smart00544   81 LLEDIEDLELDIPNAWRNLAEFVARLISDGILP 113
eIF5C smart00515
Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;
1518-1602 2.22e-28

Domain at the C-termini of GCD6, eIF-2B epsilon, eIF-4 gamma and eIF-5;


Pssm-ID: 214705  Cd Length: 83  Bit Score: 109.69  E-value: 2.22e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  1518 PILLKYLDSDTEKELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEqaGKGVALKSVTAFFT 1597
Cdd:smart00515    1 GPLLKFLAKDEEEQLELLYAIEEFCVELEKLGKLLPKILKSLYDADILEEEAILKWYEKAVSAE--GKKKVRKNAKPFVT 78

                    ....*
gi 1246417673  1598 WLREA 1602
Cdd:smart00515   79 WLQEA 83
W2 pfam02020
eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of ...
1531-1607 8.10e-24

eIF4-gamma/eIF5/eIF2-epsilon; This domain of unknown function is found at the C-terminus of several translation initiation factors.


Pssm-ID: 460415  Cd Length: 76  Bit Score: 96.44  E-value: 8.10e-24
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1246417673 1531 ELQALYALQASIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWESSKDPAEQaGKGVALKSVTAFFTWLREAEEESE 1607
Cdd:pfam02020    1 QVDLLLALQEFCAKLEELLKLLLKILKALYDLDIVEEEAILKWWEDVSSAEK-GMKKVRKQAKPFVEWLEEAEEESD 76
W2 cd11473
C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of ...
1448-1574 1.16e-18

C-terminal domain of eIF4-gamma/eIF5/eIF2b-epsilon; This domain is found at the C-terminus of several translation initiation factors, including the epsilon chain of eIF2b, where it has been found to catalyze the conversion of eIF2.GDP to its active eIF2.GTP form. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211395  Cd Length: 135  Bit Score: 83.68  E-value: 1.16e-18
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673 1448 EELSQRLEKLIMEEKADDERIFDWVEANLDESQMSSPTFLRALMTAVCKAAIIADCSTF---RVDTAVIKQRVPILLKYL 1524
Cdd:cd11473      4 KKLRDSLLKELEEDKSSDVESVKAAKSKLDLDPISLEEVVKVLLTAVVNAVESADSISLtqkEQLVLVLKKYGPVLRELL 83
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|..
gi 1246417673 1525 DSDTEKELQALYALQA--SIVKLDQPANLLRMFFDCLYDEEVISEDAFYKWE 1574
Cdd:cd11473     84 KLIKKDQLYLLLKIEKlcLQLKLSELISLLEKILDLLYDADVLSEEAILSWF 135
W2_eIF2B_epsilon cd11558
C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a ...
1488-1607 5.10e-15

C-terminal W2 domain of eukaryotic translation initiation factor 2B epsilon; eIF2B is a heteropentameric complex which functions as a guanine nucleotide exchange factor in the recycling of eIF-2 during the initiation of translation in eukaryotes. The epsilon and gamma subunits are sequence similar and both are essential in yeast. Epsilon appears to be the catalytically active subunit, with gamma enhancing its activity. The C-terminal domain of the eIF2B epsilon subunit contains bipartite motifs rich in acidic and aromatic residues, which are responsible for the interaction with eIF2. The structure of the domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211396  Cd Length: 169  Bit Score: 74.60  E-value: 5.10e-15
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673 1488 RALMTAVCK-AAIIADCSTFRVDTA---VIKQRVPILLKYLDSDTEkELQALYALQASIVKLDQPANLLRMFFDCLYDEE 1563
Cdd:cd11558     47 RAVVKALLElILEVSSTSTAELLEAlkkLLSKWGPLLENYVKSQDD-QVELLLALEEFCLESEEGGPLFAKLLHALYDLD 125
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 1246417673 1564 VISEDAFYKWESSKDPAEQAGKGVALKSVTAFFTWLREAEEESE 1607
Cdd:cd11558    126 ILEEEAILEWWEEPDAGADEEMKKVRELVKKFIEWLEEAEEESD 169
W2_eIF5 cd11561
C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase ...
1460-1607 1.20e-08

C-terminal W2 domain of eukaryotic translation initiation factor 5; eIF5 functions as a GTPase acceleration protein (GAP), as well as a GDP dissociation inhibitor (GDI) during translational initiation in eukaryotes. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211399  Cd Length: 157  Bit Score: 55.70  E-value: 1.20e-08
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673 1460 EEKADD--ERIFDWVEANLDESQMS-------SPTFLRALMTAVCkaAIIADCsTFRVDTA-VIKQRVPILLKYLDSDte 1529
Cdd:cd11561      1 EEEEDErvDELGEFLKKNKDESGLSelkeilkEAERLDVVKDKAV--LVLAEV-LFDENIVkEIKKRKALLLKLVTDE-- 75
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673 1530 kelQALYALQASIVKL--DQPANLLRMF---FDCLYDEEVISEDAFYKWeSSKDPAEQAGKGVA---LKSVTAFFTWLRE 1601
Cdd:cd11561     76 ---KAQKALLGGIERFcgKHSPELLKKVpliLKALYDNDILEEEVILKW-YEKVSKKYVSKEKSkkvRKAAEPFVEWLEE 151

                   ....*.
gi 1246417673 1602 AEEESE 1607
Cdd:cd11561    152 AEEEEE 157
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
210-374 5.19e-06

DNA polymerase III subunit gamma/tau;


Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 51.39  E-value: 5.19e-06
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  210 GGGSRNPTPPIGRPASTPTPPQQLPSQV-PEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVL 288
Cdd:PRK07003   368 PGGGVPARVAGAVPAPGARAAAAVGASAvPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDA 447
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  289 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTS-----LPPLARSSlPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVP- 362
Cdd:PRK07003   448 PVPAKANARASADSRCDERDAQPPADSGSASapasdAPPDAAFE-PAPRAAAPSAATPAAVPDARAPAAASREDAPAAAa 526
                          170
                   ....*....|..
gi 1246417673  363 SPTSCTAASGPS 374
Cdd:PRK07003   527 PPAPEARPPTPA 538
PRK12323 PRK12323
DNA polymerase III subunit gamma/tau;
210-380 9.70e-05

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237057 [Multi-domain]  Cd Length: 700  Bit Score: 47.18  E-value: 9.70e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  210 GGGSRNPTPPIGRPASTPTPPQQLPSQV-PEHSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRlvl 288
Cdd:PRK12323   368 SGGGAGPATAAAAPVAQPAPAAAAPAAAaPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGP--- 444
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  289 SGEKKEQAGQMPETAAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSKEEDAPPVPSPTSCT 368
Cdd:PRK12323   445 GGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAE 524
                          170
                   ....*....|..
gi 1246417673  369 AASGPSLTDNSD 380
Cdd:PRK12323   525 SIPDPATADPDD 536
W2_eIF5C_like cd11560
C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; ...
1559-1605 1.13e-04

C-terminal W2 domain of the eukaryotic translation initiation factor 5C and similar proteins; eIF5C appears to be essential for the initiation of protein translation; its actual function, and specifically that of the C-terminal W2 domain, are not well understood. The Drosophila ortholog, kra (krasavietz) or exba (extra bases), may be involved in translational inhibition in neural development. The structure of this C-terminal domain resembles that of a set of concatenated HEAT repeats.


Pssm-ID: 211398 [Multi-domain]  Cd Length: 194  Bit Score: 44.90  E-value: 1.13e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*..
gi 1246417673 1559 LYDEEVISEDAFYKWesSKDPAEQAGKGVALKSVTAFFTWLREAEEE 1605
Cdd:cd11560    150 LYKADVLSEDAILKW--YKKGHSPKGKQVFLKQMEPFVEWLQEAEEE 194
Pneumo_att_G pfam05539
Pneumovirinae attachment membrane glycoprotein G;
159-345 1.18e-04

Pneumovirinae attachment membrane glycoprotein G;


Pssm-ID: 114270 [Multi-domain]  Cd Length: 408  Bit Score: 46.58  E-value: 1.18e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  159 TPFYPSQpvyqsapiiVPTQQQPPPAKREKKTIRIRDPNQGgkditeeimSGGGSRNPTPPIGRPASTPTPPQQLPSQVP 238
Cdd:pfam05539  186 HPTYPSQ---------VTPQSQPATQGHQTATANQRLSSTE---------PVGTQGTTTSSNPEPQTEPPPSQRGPSGSP 247
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  239 EHSPvvygtvesahlaaSTP----VTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKeqagqmPETAAGEPTPEPPRT 314
Cdd:pfam05539  248 QHPP-------------STTsqdqSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTT------KRQETGRPTPRPTAT 308
                          170       180       190
                   ....*....|....*....|....*....|.
gi 1246417673  315 SSPTSLPPlarSSLPSPMSAALSSQPLFTAE 345
Cdd:pfam05539  309 TQSGSSPP---HSSPPGVQANPTTQNLVDCK 336
PHA03247 PHA03247
large tegument protein UL36; Provisional
57-380 2.21e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 46.47  E-value: 2.21e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673   57 PIQFFQRPqiqPPRAAIPNSSP-SIRPGVQTPTAVYQANQHIMMVNHLPMPYPVTQGHQYCIPQYRHSGPPYVGPPQQYP 135
Cdd:PHA03247  2694 SLTSLADP---PPPPPTPEPAPhALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPA 2770
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  136 VQPpgpgpfypgpgpgdfANAYGTPfyPSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQggkdiTEEIMSGGGSRN 215
Cdd:PHA03247  2771 PPA---------------APAAGPP--RRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAA-----ALPPAASPAGPL 2828
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  216 PTPPIGRPASTPTPPQQLPSQVPEHSPVVYG-TVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKE 294
Cdd:PHA03247  2829 PPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGgDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP 2908
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  295 QAGQMPETAAGEPTPEPP-RTSSPTSLPPLARSSLPSPMSAALSSQPLFTAED---------KCELPSSK-EEDAPPVPS 363
Cdd:PHA03247  2909 PQPQAPPPPQPQPQPPPPpQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQpwlgalvpgRVAVPRFRvPQPAPSREA 2988
                          330
                   ....*....|....*..
gi 1246417673  364 PtsctAASGPSLTDNSD 380
Cdd:PHA03247  2989 P----ASSTPPLTGHSL 3001
PRK10263 PRK10263
DNA translocase FtsK; Provisional
69-364 2.42e-04

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 46.23  E-value: 2.42e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673   69 PRAAIPNSSPSIR----PGVQTPTAVYQanqhimmvnhlPMPYPVTQGHQYCIPQYRHSGP---PYVGPPQQYPVQPPGP 141
Cdd:PRK10263   347 ASVDVPPAQPTVAwqpvPGPQTGEPVIA-----------PAPEGYPQQSQYAQPAVQYNEPlqqPVQPQQPYYAPAAEQP 415
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  142 gpfypgpgpgdfanaygtpfyPSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDPNQGGKDITEEimsggGSRNPTPPIG 221
Cdd:PRK10263   416 ---------------------AQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQ-----STYQTEQTYQ 469
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  222 RPAstPTPPQQLPSQVPEHSPVVYGTVESAHLAASTPVTAASDQKQEEK------------PKPDPVfqspstvlrlvls 289
Cdd:PRK10263   470 QPA--AQEPLYQQPQPVEQQPVVEPEPVVEETKPARPPLYYFEEVEEKRarereqlaawyqPIPEPV------------- 534
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1246417673  290 gekKEQAGQMPETAAGEPTPEPPRTSSPTSLPPLA--RSSLPSPMSAALSSQPLFT-AEDKCELPSSKEEDAPPVPSP 364
Cdd:PRK10263   535 ---KEPEPIKSSLKAPSVAAVPPVEAAAAVSPLASgvKKATLATGAAATVAAPVFSlANSGGPRPQVKEGIGPQLPRP 609
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
222-377 5.34e-04

transcriptional regulator ICP4; Provisional


Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 44.78  E-value: 5.34e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  222 RPASTPTPPQQLPSQvpEHSPVVygTVESAHLAASTPV---TAASDQKQEEKPKPDPVFQSPS------TVLRLVLSGEK 292
Cdd:PHA03307    23 RPPATPGDAADDLLS--GSQGQL--VSDSAELAAVTVVagaAACDRFEPPTGPPPGPGTEAPAnesrstPTWSLSTLAPA 98
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  293 KEQAGQMPETAAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKC-ELPSSKEEDAPPVPSPTSCTAAS 371
Cdd:PHA03307    99 SPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAaGASPAAVASDAASSRQAALPLSS 178

                   ....*.
gi 1246417673  372 GPSLTD 377
Cdd:PHA03307   179 PEETAR 184
PRK11901 PRK11901
hypothetical protein; Reviewed
163-290 8.19e-04

hypothetical protein; Reviewed


Pssm-ID: 237015 [Multi-domain]  Cd Length: 327  Bit Score: 43.52  E-value: 8.19e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  163 PSQPVYQSAPIIVPTQQQPPPAKREKKTIRIRDP---------NQGGKDITEEIMSGGGSRNPTPPIGRPASTPTPPQQL 233
Cdd:PRK11901   113 TAPPQDISAPPISPTPTQAAPPQTPNGQQRIELPgnisdalsqQQGQVNAASQNAQGNTSTLPTAPATVAPSKGAKVPAT 192
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  234 PSQVPEHSPVVYGT--VESAHLAASTPVTAASDQKQEEKPKPDPVFQS-PSTVLRLVLSG 290
Cdd:PRK11901   193 AETHPTPPQKPATKkpAVNHHKTATVAVPPATSGKPKSGAASARALSSaPASHYTLQLSS 252
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
209-374 1.03e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 43.82  E-value: 1.03e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  209 SGGGSRNPTPPIGRPASTPTPPQQLPSQVPehSPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVL 288
Cdd:PRK07764   601 PAPASSGPPEEAARPAAPAAPAAPAAPAPA--GAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAP 678
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  289 SGEKKEQAGQMPETAAGEPTPEP----------PRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSKEEDA 358
Cdd:PRK07764   679 AAPPPAPAPAAPAAPAGAAPAQPapapaatppaGQADDPAAQPPQAAQGASAPSPAADDPVPLPPEPDDPPDPAGAPAQP 758
                          170
                   ....*....|....*.
gi 1246417673  359 PPVPSPTSCTAASGPS 374
Cdd:PRK07764   759 PPPPAPAPAAAPAAAP 774
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
62-400 1.52e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 43.22  E-value: 1.52e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673   62 QRPQIQPPRaaIPNSSPSIRPGVQTPTAVYQANQhimmvnhlPMPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVqppgp 141
Cdd:pfam03154  233 QTPTLHPQR--LPSPHPPLQPMTQPPPPSQVSPQ--------PLPQPSLHGQMPPMPHSLQTGPSHMQHPVPPQP----- 297
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  142 gpfypGPGPGDFANAYGTPF-YPSQPVY-QSAPIIVPTQQQPPPAKREKKTIRIRDPnqggkditeeimsgggsrNPTPP 219
Cdd:pfam03154  298 -----FPLTPQSSQSQVPPGpSPAAPGQsQQRIHTPPSQSQLQSQQPPREQPLPPAP------------------LSMPH 354
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  220 IGRPASTPTPPQQLPsQVPEHSPvvygtvesaHLAASTPVTAASDQkqeekpKPDPVFQSPSTVLRLVLSGEKKEQAGQM 299
Cdd:pfam03154  355 IKPPPTTPIPQLPNP-QSHKHPP---------HLSGPSPFQMNSNL------PPPPALKPLSSLSTHHPPSAHPPPLQLM 418
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  300 PETAAGEPTP-EPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKceLPSSKEEDAPPVPSPTScTAASGPSLtdn 378
Cdd:pfam03154  419 PQSQQLPPPPaQPPVLTQSQSLPPPAASHPPTSGLHQVPSQSPFPQHPF--VPGGPPPITPPSGPPTS-TSSAMPGI--- 492
                          330       340
                   ....*....|....*....|..
gi 1246417673  379 sdicKKPCSVAPHDSQLISSTI 400
Cdd:pfam03154  493 ----QPPSSASVSSSGPVPAAV 510
PHA03247 PHA03247
large tegument protein UL36; Provisional
66-390 2.19e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.00  E-value: 2.19e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673   66 IQPPRAAIPNSSPSIRPGVQTPTAVYQANQhimmvnhlpmpyPVTQGHQYCIPqyRHSGPPYVGPPQQYPVQPPGPGPFY 145
Cdd:PHA03247  2568 VPPPRPAPRPSEPAVTSRARRPDAPPQSAR------------PRAPVDDRGDP--RGPAPPSPLPPDTHAPDPPPPSPSP 2633
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  146 PGPGPGDFANAYGTPfyPSQPVYQSAPiivPTQQQPPPAKREKKTIRIRDPNQGGKditeeimsgggSRNPTPPIGRPAS 225
Cdd:PHA03247  2634 AANEPDPHPPPTVPP--PERPRDDPAP---GRVSRPRRARRLGRAAQASSPPQRPR-----------RRAARPTVGSLTS 2697
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  226 TPTPPQQLPSQVPEHSPVVYGT----VESAHLAASTPVTAAsdqkqeekPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPE 301
Cdd:PHA03247  2698 LADPPPPPPTPEPAPHALVSATplppGPAAARQASPALPAA--------PAPPAVPAGPATPGGPARPARPPTTAGPPAP 2769
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  302 TAAGEPTPEPPRTSSPTSLPPL--ARSSLPSPMSAALSSQPLfTAEDKCELPSSKEedAPPVPSPTSCTAASGPSLTDNS 379
Cdd:PHA03247  2770 APPAAPAAGPPRRLTRPAVASLseSRESLPSPWDPADPPAAV-LAPAAALPPAASP--AGPLPPPTSAQPTAPPPPPGPP 2846
                          330
                   ....*....|..
gi 1246417673  380 DICKKPC-SVAP 390
Cdd:PHA03247  2847 PPSLPLGgSVAP 2858
PHA03378 PHA03378
EBNA-3B; Provisional
84-370 2.45e-03

EBNA-3B; Provisional


Pssm-ID: 223065 [Multi-domain]  Cd Length: 991  Bit Score: 42.75  E-value: 2.45e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673   84 VQTPTAVYQANQHIMMVNHLPMPYPVTQGHQYCIPQYRHSGPPYVGPPQQYPVQPPGPGPFYPGPGPGDFaNAYGTPfYP 163
Cdd:PHA03378   574 IQPLTSPTTSQLASSAPSYAQTPWPVPHPSQTPEPPTTQSHIPETSAPRQWPMPLRPIPMRPLRMQPITF-NVLVFP-TP 651
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  164 SQPVYQSAPIIVPTQQQPPPAKREkktirirdPNQGGKDITEEIMSGGGSRNP---TPPIGRPASTPTPPQQLPSQVPEH 240
Cdd:PHA03378   652 HQPPQVEITPYKPTWTQIGHIPYQ--------PSPTGANTMLPIQWAPGTMQPpprAPTPMRPPAAPPGRAQRPAAATGR 723
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  241 SPVVYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPETAA----GEPTPEPPRTSS 316
Cdd:PHA03378   724 ARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQPPPQAPPAPQqrprGAPTPQPPPQAG 803
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1246417673  317 PTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSK-----EEDAP--PVPSPTSCTAA 370
Cdd:PHA03378   804 PTSMQLMPRAAPGQQGPTKQILRQLLTGGVKRGRPSLKkpaalERQAAagPTPSPGSGTSD 864
PRK10263 PRK10263
DNA translocase FtsK; Provisional
234-375 2.53e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 42.76  E-value: 2.53e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  234 PSQVPEHSPVVYGTVESAHLAASTPVTAASdqkQEEKPKPDPVFQSPStvlrlVLSGEKKEQAGQMP-ETAAGEPTPEPP 312
Cdd:PRK10263   301 QPEYDEYDPLLNGAPITEPVAVAAAATTAT---QSWAAPVEPVTQTPP-----VASVDVPPAQPTVAwQPVPGPQTGEPV 372
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1246417673  313 RTSSPTSLPPLARSSLPSPMSAALSSQPlFTAEDKCELPSSKEEDAPPVPSPTSCTAASGPSL 375
Cdd:PRK10263   373 IAPAPEGYPQQSQYAQPAVQYNEPLQQP-VQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYY 434
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
176-359 2.91e-03

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 42.08  E-value: 2.91e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  176 PTQQQPPPAKREKKTIRirdPNQGGKDiteeiMSGGGSRNPTPPIGRPASTPTPPQQLPSQVPEHSPVVYGTVESAHLAA 255
Cdd:pfam13254  170 PSQPAQPAWMKELNKIR---QSRASVD-----LGRPNSFKEVTPVGLMRSPAPGGHSKSPSVSGISADSSPTKEEPSEEA 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  256 STPVTaasdqKQEEKPKPDPVFQSPSTVLRLvlsgEKKEQAGQMPETAAGEPT--PEPPRTSSPTSLPPLARSSLPSPMS 333
Cdd:pfam13254  242 DTLST-----DKEQSPAPTSASEPPPKTKEL----PKDSEEPAAPSKSAEASTekKEPDTESSPETSSEKSAPSLLSPVS 312
                          170       180
                   ....*....|....*....|....*.
gi 1246417673  334 AALSSQPLFTAEDKCELPSSKEEDAP 359
Cdd:pfam13254  313 KASIDKPLSSPDRDPLSPKPKPQSPP 338
DUF4045 pfam13254
Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. ...
222-390 9.13e-03

Domain of unknown function (DUF4045); This presumed domain is functionally uncharacterized. This domain family is found in bacteria and eukaryotes, and is typically between 384 and 430 amino acids in length.


Pssm-ID: 433066 [Multi-domain]  Cd Length: 415  Bit Score: 40.54  E-value: 9.13e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  222 RPASTPTPPQQlPSQVPEHSPVvYGTVESAHLAASTPVTAASDQKQEEKPKPDPVFQSPSTVLRLVLSGEKKEQAGQMPE 301
Cdd:pfam13254  165 KPKAQPSQPAQ-PAWMKELNKI-RQSRASVDLGRPNSFKEVTPVGLMRSPAPGGHSKSPSVSGISADSSPTKEEPSEEAD 242
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1246417673  302 TAAGEPTPEPPRTSSPTSLPPLARSSLPSPMSAALSSQPLFTAEDKCELPSSK-----EEDAPPVPSPTSCTAASGPSLT 376
Cdd:pfam13254  243 TLSTDKEQSPAPTSASEPPPKTKELPKDSEEPAAPSKSAEASTEKKEPDTESSpetssEKSAPSLLSPVSKASIDKPLSS 322
                          170
                   ....*....|....*
gi 1246417673  377 DNSD-ICKKPCSVAP 390
Cdd:pfam13254  323 PDRDpLSPKPKPQSP 337
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH