NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|109730811|gb|AAI17928|]
View 

R3hdm2 protein [Mus musculus]

Protein Classification

R3H_encore_like and Med15 domain-containing protein( domain architecture ID 11552550)

R3H_encore_like and Med15 domain-containing protein

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
R3H_encore_like cd02642
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ...
168-229 3.40e-27

R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.


:

Pssm-ID: 100071  Cd Length: 63  Bit Score: 104.99  E-value: 3.40e-27
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 109730811  168 DRMMLLKLEQEILDFINDNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTG-KAVIINKT 229
Cdd:cd02642     1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
Med15 super family cl26621
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
419-768 1.64e-11

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


The actual alignment was detected with superfamily member pfam09606:

Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 68.50  E-value: 1.64e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   419 LPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPF--GQMSLSRQGSTEAADPSS 496
Cdd:pfam09606  124 LASLGRPQMPMGGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGqaGGMNGGQQGPMGGQMPPQ 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   497 ALFQPPLISQHPQQASFIMASAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQppQQIQVSYYPPGQYPNSNQQYRPLSHPV 576
Cdd:pfam09606  204 MGVPGMPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQ--SQLGMGINQMQQMPQGVGGGAGQGGPG 281
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   577 AYSPQRGQQLPQASQQ--PGLQPMMSNQQQTAYQGMLGVQQPQNQGLLSNQRSSMGGQMQ---GLVVQYTPLPSYQVPVG 651
Cdd:pfam09606  282 QPMGPPGQQPGAMPNVmsIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVValgGLNHLETWNPGNFGGLG 361
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   652 SDSQNVVQP---SFQQPMLVPASQSVQG--GLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSEQYQMPQSPSPCSPPQ 726
Cdd:pfam09606  362 ANPMQRGQPgmmSSPSPVPGQQVRQVTPnqFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSPQMSQQPAQQR 441
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 109730811   727 MSQQYSGVSPSG-PGVVVMqlNVPNGPQAPQNPSMVQWSHCKY 768
Cdd:pfam09606  442 TIGQDSPGGSLNtPGQSAV--NSPLNPQEEQLYREKYRQLTKY 482
 
Name Accession Description Interval E-value
R3H_encore_like cd02642
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ...
168-229 3.40e-27

R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.


Pssm-ID: 100071  Cd Length: 63  Bit Score: 104.99  E-value: 3.40e-27
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 109730811  168 DRMMLLKLEQEILDFINDNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTG-KAVIINKT 229
Cdd:cd02642     1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
R3H smart00393
Putative single-stranded nucleic acids-binding domain;
152-229 1.33e-14

Putative single-stranded nucleic acids-binding domain;


Pssm-ID: 214647  Cd Length: 79  Bit Score: 69.64  E-value: 1.33e-14
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811    152 IDLHEFLVNTLKKNPRDRMMLLKLEQEILDFINdNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTGKA--VIINKT 229
Cdd:smart00393    1 ADFLPVTLDALSYRPRRREELIELELEIARFVK-STKESVELPPMNSYERKIVHELAEKYGLESESFGEGPKrrVVISKK 79
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
419-768 1.64e-11

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 68.50  E-value: 1.64e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   419 LPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPF--GQMSLSRQGSTEAADPSS 496
Cdd:pfam09606  124 LASLGRPQMPMGGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGqaGGMNGGQQGPMGGQMPPQ 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   497 ALFQPPLISQHPQQASFIMASAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQppQQIQVSYYPPGQYPNSNQQYRPLSHPV 576
Cdd:pfam09606  204 MGVPGMPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQ--SQLGMGINQMQQMPQGVGGGAGQGGPG 281
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   577 AYSPQRGQQLPQASQQ--PGLQPMMSNQQQTAYQGMLGVQQPQNQGLLSNQRSSMGGQMQ---GLVVQYTPLPSYQVPVG 651
Cdd:pfam09606  282 QPMGPPGQQPGAMPNVmsIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVValgGLNHLETWNPGNFGGLG 361
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   652 SDSQNVVQP---SFQQPMLVPASQSVQG--GLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSEQYQMPQSPSPCSPPQ 726
Cdd:pfam09606  362 ANPMQRGQPgmmSSPSPVPGQQVRQVTPnqFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSPQMSQQPAQQR 441
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 109730811   727 MSQQYSGVSPSG-PGVVVMqlNVPNGPQAPQNPSMVQWSHCKY 768
Cdd:pfam09606  442 TIGQDSPGGSLNtPGQSAV--NSPLNPQEEQLYREKYRQLTKY 482
R3H pfam01424
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most ...
170-228 1.35e-10

R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA.


Pssm-ID: 460206  Cd Length: 60  Bit Score: 57.50  E-value: 1.35e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 109730811   170 MMLLKLEQEILDFINDNNNQFKkFPQMTSYHRMLLHRVAAYFGMDHNV--DQTGKAVIINK 228
Cdd:pfam01424    1 EFLEQLAEKLAEFVKDTGKSLE-LPPMSSYERRIIHELAQKYGLESESegEEPNRRVVVYK 60
PHA03379 PHA03379
EBNA-3A; Provisional
437-816 2.73e-06

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 51.60  E-value: 2.73e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  437 PVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPFgqmSLSRQGSTEAADPSSALFQP--PLISQHPQQASFI 514
Cdd:PHA03379  447 PVHDLEPGPLHDQHSMAPCPVAQLPPGPLQDLEPGDQLPGVV---QDGRPACAPVPAPAGPIVRPweASLSQVPGVAFAP 523
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  515 MAS---AGQPLPTSNYSTSSHAPPtqqvLPPQGYMQPP------QQIQVSYYPPGQYPNSNQQYRPLshPVAYSPQRGQQ 585
Cdd:PHA03379  524 VMPqpmPVEPVPVPTVALERPVCP----APPLIAMQGPgetsgiVRVRERWRPAPWTPNPPRSPSQM--SVRDRLARLRA 597
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  586 LPQASQQP-GLQPMMSNQQQTayqgmlgvQQPQNQGLLSNQRSSMGGQMQGL--VVQYTPLPSYQVPvgSDSQNVVQPSF 662
Cdd:PHA03379  598 EAQPYQASvEVQPPQLTQVSP--------QQPMEYPLEPEQQMFPGSPFSQVadVMRAGGVPAMQPQ--YFDLPLQQPIS 667
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  663 QQPMLVPASQSVQGGLPTGGVPVYYSMIPPAQQ--NGTSPSVGFLQPPGS-------EQYQMPQSPSPCSPPQMSQQYSG 733
Cdd:PHA03379  668 QGAPLAPLRASMGPVPPVPATQPQYFDIPLTEPinQGASAAHFLPQQPMEgplvperWMFQGATLSQSVRPGVAQSQYFD 747
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  734 VSPSGPgvvvMQLNVPNGPQAPQNPSMVQWshckyysVEQRGQKPGDLYSPdGSPQANAQMGSSPVTSPTQSPAPSPVTS 813
Cdd:PHA03379  748 LPLTQP----INHGAPAAHFLHQPPMEGPW-------VPEQWMFQGAPPSQ-GTDVVQHQLDALGYVLHVLNHPGVPVSP 815

                  ...
gi 109730811  814 LSN 816
Cdd:PHA03379  816 AVN 818
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
474-643 1.62e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 45.57  E-value: 1.62e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   474 LSNPFGQMSLSRQGSTEAADPSSALFQPPLISQHPQQASfimasAGQPLptsNYstsshaPPTQQVLPPQGYMQPPQQIQ 553
Cdd:TIGR01628  371 LQDQFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQF-----NGQPL---GW------PRMSMMPTPMGPGGPLRPNG 436
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   554 VSYYPPGQYPNSNQQYRPLSHPVA---YSPQ-RGQQLPQASQQPGLQPMMSNQQQTAYQgMLGVQQPQnqgllsNQRSSM 629
Cdd:TIGR01628  437 LAPMNAVRAPSRNAQNAAQKPPMQpvmYPPNyQSLPLSQDLPQPQSTASQGGQNKKLAQ-VLASATPQ------MQKQVL 509
                          170
                   ....*....|....
gi 109730811   630 GGQMQGLVVQYTPL 643
Cdd:TIGR01628  510 GERLFPLVEAIEPA 523
Amelin smart00817
Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin ...
485-620 4.83e-03

Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin precursor (Amelin) proteins. Matrix proteins of tooth enamel consist mainly of amelogenin but also of non-amelogenin proteins, which, although their volumetric percentage is low, have an important role in enamel mineralisation. One of the non-amelogenin proteins is ameloblastin, also known as amelin and sheathlin. Ameloblastin (AMBN) is one of the enamel sheath proteins which is though to have a role in determining the prismatic structure of growing enamel crystals.


Pssm-ID: 214832 [Multi-domain]  Cd Length: 411  Bit Score: 40.64  E-value: 4.83e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811    485 RQGSTEAADPSSALFQPPLISQHPQQAsfimasagqPLPTSNYSTSSHAPPTQQVLPPQGYMQPPQQIQVSYYPPGQYPN 564
Cdd:smart00817   82 REHETQQYEYSLPVHPPPLPSQPSLQP---------QQPGLKPFLQPTALPTNQATPQKNGPQPPMHLGQPPLQQAELPM 152
                            90       100       110       120       130       140
                    ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 109730811    565 SNQQYRPLSHP-------VAYSPQRGQQLPQASQQPGLQPMMSNQQQTAYQGMLGVQQPQNQG 620
Cdd:smart00817  153 IPPQVAPSDKPpqtelplYDFADPQNPLLFQIAHLMSRGPMPQNKQQHLYPGLFYMSYGANQL 215
 
Name Accession Description Interval E-value
R3H_encore_like cd02642
R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the ...
168-229 3.40e-27

R3H domain of encore-like and DIP1-like proteins. Drosophila encore is involved in the germline exit after four mitotic divisions, by facilitating SCF-ubiquitin-proteasome-dependent proteolysis. Maize DBF1-interactor protein 1 (DIP1) containing an R3H domain is a potential regulator of DBF1 activity in stress responses. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.


Pssm-ID: 100071  Cd Length: 63  Bit Score: 104.99  E-value: 3.40e-27
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 109730811  168 DRMMLLKLEQEILDFINDNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTG-KAVIINKT 229
Cdd:cd02642     1 DRLFVLKLEKDLLAFIKDSTRQSLELPPMNSYYRLLAHRVAQYYGLDHNVDNSGgKCVIVNKT 63
R3H smart00393
Putative single-stranded nucleic acids-binding domain;
152-229 1.33e-14

Putative single-stranded nucleic acids-binding domain;


Pssm-ID: 214647  Cd Length: 79  Bit Score: 69.64  E-value: 1.33e-14
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811    152 IDLHEFLVNTLKKNPRDRMMLLKLEQEILDFINdNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTGKA--VIINKT 229
Cdd:smart00393    1 ADFLPVTLDALSYRPRRREELIELELEIARFVK-STKESVELPPMNSYERKIVHELAEKYGLESESFGEGPKrrVVISKK 79
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
419-768 1.64e-11

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 68.50  E-value: 1.64e-11
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   419 LPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPF--GQMSLSRQGSTEAADPSS 496
Cdd:pfam09606  124 LASLGRPQMPMGGAGFPSQMSRVGRMQPGGQAGGMMQPSSGQPGSGTPNQMGPNGGPGQGqaGGMNGGQQGPMGGQMPPQ 203
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   497 ALFQPPLISQHPQQASFIMASAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQppQQIQVSYYPPGQYPNSNQQYRPLSHPV 576
Cdd:pfam09606  204 MGVPGMPGPADAGAQMGQQAQANGGMNPQQMGGAPNQVAMQQQQPQQQGQQ--SQLGMGINQMQQMPQGVGGGAGQGGPG 281
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   577 AYSPQRGQQLPQASQQ--PGLQPMMSNQQQTAYQGMLGVQQPQNQGLLSNQRSSMGGQMQ---GLVVQYTPLPSYQVPVG 651
Cdd:pfam09606  282 QPMGPPGQQPGAMPNVmsIGDQNNYQQQQTRQQQQQQGGNHPAAHQQQMNQSVGQGGQVValgGLNHLETWNPGNFGGLG 361
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   652 SDSQNVVQP---SFQQPMLVPASQSVQG--GLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSEQYQMPQSPSPCSPPQ 726
Cdd:pfam09606  362 ANPMQRGQPgmmSSPSPVPGQQVRQVTPnqFMRQSPQPSVPSPQGPGSQPPQSHPGGMIPSPALIPSPSPQMSQQPAQQR 441
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|...
gi 109730811   727 MSQQYSGVSPSG-PGVVVMqlNVPNGPQAPQNPSMVQWSHCKY 768
Cdd:pfam09606  442 TIGQDSPGGSLNtPGQSAV--NSPLNPQEEQLYREKYRQLTKY 482
R3H cd02325
R3H domain. The name of the R3H domain comes from the characteristic spacing of the most ...
172-228 3.74e-11

R3H domain. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. R3H domains are found in proteins together with ATPase domains, SF1 helicase domains, SF2 DEAH helicase domains, Cys-rich repeats, ring-type zinc fingers, and KH domains. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.


Pssm-ID: 100064  Cd Length: 59  Bit Score: 59.16  E-value: 3.74e-11
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 109730811  172 LLKLEQEILDFINDNNNQFKKFPQMTSYHRMLLHRVAAYFGMDHNVDQTG--KAVIINK 228
Cdd:cd02325     1 REEREEELEAFAKDAAGKSLELPPMNSYERKLIHDLAEYYGLKSESEGEGpnRRVVITK 59
R3H pfam01424
R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most ...
170-228 1.35e-10

R3H domain; The name of the R3H domain comes from the characteriztic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to be binding ssDNA.


Pssm-ID: 460206  Cd Length: 60  Bit Score: 57.50  E-value: 1.35e-10
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 109730811   170 MMLLKLEQEILDFINDNNNQFKkFPQMTSYHRMLLHRVAAYFGMDHNV--DQTGKAVIINK 228
Cdd:pfam01424    1 EFLEQLAEKLAEFVKDTGKSLE-LPPMSSYERRIIHELAQKYGLESESegEEPNRRVVVYK 60
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
439-667 1.08e-07

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 56.20  E-value: 1.08e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   439 PALQPSPQPVQFSPSSCPQVLLPVSPPQQ-----------YNMAEDLSNPFGQMSLSRQGSTEAADPSSALFQPPLISQH 507
Cdd:pfam09770  107 PAARAAQSSAQPPASSLPQYQYASQQSQQpskpvrtgyekYKEPEPIPDLQVDASLWGVAPKKAAAPAPAPQPAAQPASL 186
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   508 PQQASFIM------------ASAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQPPQQIQvsyyPPGQYPNSNQQYRPLSHP 575
Cdd:pfam09770  187 PAPSRKMMsleeveaamraqAKKPAQQPAPAPAQPPAAPPAQQAQQQQQFPPQIQQQQ----QPQQQPQQPQQHPGQGHP 262
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   576 VAYspqrgQQLPQASQQPGLQPMMSNQQQTAYQGMLGV-QQP----QNQGLLSNQRSSMGGQMQGlVVQYTPLPSYQVPV 650
Cdd:pfam09770  263 VTI-----LQRPQSPQPDPAQPSIQPQAQQFHQQPPPVpVQPtqilQNPNRLSAARVGYPQNPQP-GVQPAPAHQAHRQQ 336
                          250
                   ....*....|....*..
gi 109730811   651 GSDSQNVVQPSFQQPML 667
Cdd:pfam09770  337 GSFGRQAPIITHPQQLA 353
PHA03379 PHA03379
EBNA-3A; Provisional
437-816 2.73e-06

EBNA-3A; Provisional


Pssm-ID: 223066 [Multi-domain]  Cd Length: 935  Bit Score: 51.60  E-value: 2.73e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  437 PVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPFgqmSLSRQGSTEAADPSSALFQP--PLISQHPQQASFI 514
Cdd:PHA03379  447 PVHDLEPGPLHDQHSMAPCPVAQLPPGPLQDLEPGDQLPGVV---QDGRPACAPVPAPAGPIVRPweASLSQVPGVAFAP 523
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  515 MAS---AGQPLPTSNYSTSSHAPPtqqvLPPQGYMQPP------QQIQVSYYPPGQYPNSNQQYRPLshPVAYSPQRGQQ 585
Cdd:PHA03379  524 VMPqpmPVEPVPVPTVALERPVCP----APPLIAMQGPgetsgiVRVRERWRPAPWTPNPPRSPSQM--SVRDRLARLRA 597
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  586 LPQASQQP-GLQPMMSNQQQTayqgmlgvQQPQNQGLLSNQRSSMGGQMQGL--VVQYTPLPSYQVPvgSDSQNVVQPSF 662
Cdd:PHA03379  598 EAQPYQASvEVQPPQLTQVSP--------QQPMEYPLEPEQQMFPGSPFSQVadVMRAGGVPAMQPQ--YFDLPLQQPIS 667
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  663 QQPMLVPASQSVQGGLPTGGVPVYYSMIPPAQQ--NGTSPSVGFLQPPGS-------EQYQMPQSPSPCSPPQMSQQYSG 733
Cdd:PHA03379  668 QGAPLAPLRASMGPVPPVPATQPQYFDIPLTEPinQGASAAHFLPQQPMEgplvperWMFQGATLSQSVRPGVAQSQYFD 747
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  734 VSPSGPgvvvMQLNVPNGPQAPQNPSMVQWshckyysVEQRGQKPGDLYSPdGSPQANAQMGSSPVTSPTQSPAPSPVTS 813
Cdd:PHA03379  748 LPLTQP----INHGAPAAHFLHQPPMEGPW-------VPEQWMFQGAPPSQ-GTDVVQHQLDALGYVLHVLNHPGVPVSP 815

                  ...
gi 109730811  814 LSN 816
Cdd:PHA03379  816 AVN 818
PRK10263 PRK10263
DNA translocase FtsK; Provisional
417-593 9.50e-06

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 49.70  E-value: 9.50e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  417 PALPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAEDLSNPFGQMSLSRQGSTEAADPSS 496
Cdd:PRK10263  336 PVEPVTQTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQP 415
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  497 AlfQPPLISQHPQQASFIMASAGQPlPTSNYSTSSHAPPTQQVLPPQGYMQPPQQIQVSYYPPGQYpnsnQQYRPLSHPV 576
Cdd:PRK10263  416 A--QQPYYAPAPEQPAQQPYYAPAP-EQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLY----QQPQPVEQQP 488
                         170
                  ....*....|....*..
gi 109730811  577 AYSPQRGQQLPQASQQP 593
Cdd:PRK10263  489 VVEPEPVVEETKPARPP 505
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
410-710 5.42e-05

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 47.45  E-value: 5.42e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   410 QQQQQQLPALPPTPQHQPPlnnhmisqPVPALQPSPQPVQFSPSSCPQVLLPVSPP--QQYNMAEDLSNPFGQMSLSRQG 487
Cdd:pfam03154  170 QPPVLQAQSGAASPPSPPP--------PGTTQAATAGPTPSAPSVPPQGSPATSQPpnQTQSTAAPHTLIQQTPTLHPQR 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   488 STEAADPSSALFQPPLISQHPQQA--SFIMASAGQPLPTSNYSTSSHAPptqQVLPPQGYMQPPQ--QIQVSYYPPGQYP 563
Cdd:pfam03154  242 LPSPHPPLQPMTQPPPPSQVSPQPlpQPSLHGQMPPMPHSLQTGPSHMQ---HPVPPQPFPLTPQssQSQVPPGPSPAAP 318
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   564 NSNQQ--YRPLSHPVAYSPQ--RGQQLPQASQQ------------PGLQPMMSNQQQTAYQGMLGVQQPQN--------- 618
Cdd:pfam03154  319 GQSQQriHTPPSQSQLQSQQppREQPLPPAPLSmphikpppttpiPQLPNPQSHKHPPHLSGPSPFQMNSNlppppalkp 398
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   619 -QGLLSNQRSSMGGQMQGLVVQYTPLPS--YQVPVGSDSQNVVQPSFQQPmlvPASQSVQGG----------LPTGGVPV 685
Cdd:pfam03154  399 lSSLSTHHPPSAHPPPLQLMPQSQQLPPppAQPPVLTQSQSLPPPAASHP---PTSGLHQVPsqspfpqhpfVPGGPPPI 475
                          330       340
                   ....*....|....*....|....*
gi 109730811   686 YYSMIPPAQqngTSPSVGFLQPPGS 710
Cdd:pfam03154  476 TPPSGPPTS---TSSAMPGIQPPSS 497
PHA03247 PHA03247
large tegument protein UL36; Provisional
325-587 6.12e-05

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 47.24  E-value: 6.12e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  325 PRPWSS----TDSDGSVRSMRPPVTKASSfsgisilTRGDSIGSSKGGSAGRLSRPGMALGAPEVC---NQVTSPQSVRG 397
Cdd:PHA03247 2712 PHALVSatplPPGPAAARQASPALPAAPA-------PPAVPAGPATPGGPARPARPPTTAGPPAPAppaAPAAGPPRRLT 2784
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  398 LLPCTAQQQQQQQQQQQQLPALPPTPQHQPPLNNHMISQPVPALQPSPQPVQFSPSSCPQvllPVSPPQQYNMAEDLSNP 477
Cdd:PHA03247 2785 RPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG---PPPPSLPLGGSVAPGGD 2861
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  478 FGQMSLSRQGSTEAADPS----SALFQPPLisqHPQQASFIMASAGQPLPTSnySTSSHAPPTQQVLPPQGYMQPPQQIQ 553
Cdd:PHA03247 2862 VRRRPPSRSPAAKPAAPArppvRRLARPAV---SRSTESFALPPDQPERPPQ--PQAPPPPQPQPQPPPPPQPQPPPPPP 2936
                         250       260       270
                  ....*....|....*....|....*....|....
gi 109730811  554 VSYYPPGQyPNSNQQYRPLSHPVAYSPQRGQQLP 587
Cdd:PHA03247 2937 PRPQPPLA-PTTDPAGAGEPSGAVPQPWLGALVP 2969
PRK10263 PRK10263
DNA translocase FtsK; Provisional
494-645 7.95e-05

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 47.00  E-value: 7.95e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  494 PSSALFQPPLI-SQHPQQasfimasagQPLPTSNYSTSSH-APPTQQVLPPQGYMQPPQQiqvsYYPPGQYPNSNQQYRP 571
Cdd:PRK10263  740 PHEPLFTPIVEpVQQPQQ---------PVAPQQQYQQPQQpVAPQPQYQQPQQPVAPQPQ----YQQPQQPVAPQPQYQQ 806
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 109730811  572 LSHPVAYSPQRGQ-QLPQASQQPGLQPmmsnQQQTAyqgmlgvqqPQNQGLLSNQRSSMGGQMQGLVVQYTPLPS 645
Cdd:PRK10263  807 PQQPVAPQPQYQQpQQPVAPQPQYQQP----QQPVA---------PQPQDTLLHPLLMRNGDSRPLHKPTTPLPS 868
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
474-643 1.62e-04

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 45.57  E-value: 1.62e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   474 LSNPFGQMSLSRQGSTEAADPSSALFQPPLISQHPQQASfimasAGQPLptsNYstsshaPPTQQVLPPQGYMQPPQQIQ 553
Cdd:TIGR01628  371 LQDQFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQF-----NGQPL---GW------PRMSMMPTPMGPGGPLRPNG 436
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   554 VSYYPPGQYPNSNQQYRPLSHPVA---YSPQ-RGQQLPQASQQPGLQPMMSNQQQTAYQgMLGVQQPQnqgllsNQRSSM 629
Cdd:TIGR01628  437 LAPMNAVRAPSRNAQNAAQKPPMQpvmYPPNyQSLPLSQDLPQPQSTASQGGQNKKLAQ-VLASATPQ------MQKQVL 509
                          170
                   ....*....|....
gi 109730811   630 GGQMQGLVVQYTPL 643
Cdd:TIGR01628  510 GERLFPLVEAIEPA 523
PHA03377 PHA03377
EBNA-3C; Provisional
417-704 7.93e-04

EBNA-3C; Provisional


Pssm-ID: 177614 [Multi-domain]  Cd Length: 1000  Bit Score: 43.50  E-value: 7.93e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  417 PALPPTPQHQPPLNNHMISQPVPALQpsPQPVQFSPSScpqvllPVSPPQQYNMAEDLS--NPFGQMSLSRQGSTEAADP 494
Cdd:PHA03377  672 PATQSTPPRPSWLPSVFVLPSVDAGR--AQPSEESHLS------SMSPTQPISHEEQPRyeDPDDPLDLSLHPDQAPPPS 743
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  495 SSALFQPPLISQHPQqasfimasagqplptSNYSTSSHAPPTQqvLPPQGYMQPP-QQIQVSYYPPGQYP-NSNQQYRPL 572
Cdd:PHA03377  744 HQAPYSGHEEPQAQQ---------------APYPGYWEPRPPQ--APYLGYQEPQaQGVQVSSYPGYAGPwGLRAQHPRY 806
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  573 SHPVAYSPQR----GQQLPQASQQPGLQPmmsnqqqtAYQGMLGVQQPQNQGLLSNQRSSMGGQMQGLVVQYTPLPsyQV 648
Cdd:PHA03377  807 RHSWAYWSQYpghgHPQGPWAPRPPHLPP--------QWDGSAGHGQDQVSQFPHLQSETGPPRLQLSQVPQLPYS--QT 876
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 109730811  649 PVGSDSQNVVQPSFQQPMlvpasQSVQGGLPTGGVPVYYSMIPPAQQNGTS-PSVGF 704
Cdd:PHA03377  877 LVSSSAPSWSSPQPRAPI-----RPIPTRFPPPPMPLQDSMAVGCDSSGTAcPSMPF 928
G_path_suppress pfam15991
G-protein pathway suppressor; This family of proteins inhibits G-protein- and ...
440-599 1.16e-03

G-protein pathway suppressor; This family of proteins inhibits G-protein- and mitogen-activated protein kinase-mediated signal transduction.


Pssm-ID: 464961 [Multi-domain]  Cd Length: 272  Bit Score: 41.83  E-value: 1.16e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   440 ALQPSPQPVQFSPSS-CPQVLLPVSPPQQYNMAEDLSNPFGQMSLSRQGSTEAADPSSALFQPPLISQHPQQASFIMASA 518
Cdd:pfam15991  103 ALQQAAAQVFLPQLSmQGQPHHQQHPGPQVGVLKRTRSPSPPVQQQAYYKQPAFSPGYAEHGQQKHDDGRRGYDVARFGS 182
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   519 GQPLPTSNYSTSSHAPPTQQVLPP-QGYMQPPQQIQVSYYPPGQYPNSNQQYRPLSHPVAY---SPQRGQQLPQA--SQQ 592
Cdd:pfam15991  183 WNKSTAQYPPSGQLFYPTHQYLPPpQTQGQADARLQTIYPQPGYALPLQQQYEHANQPSPFvssSPLKQMQSPKAgpGPQ 262

                   ....*..
gi 109730811   593 PGLQPMM 599
Cdd:pfam15991  263 PMQLSVL 269
PRK10263 PRK10263
DNA translocase FtsK; Provisional
488-687 1.63e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 42.38  E-value: 1.63e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  488 STEAADPSSALFQPPLISQHPQqasfimaSAGQPLPTSNYSTSSHAPPtqqvlpPQGYMQPPQ--QIQVSYYPPGQYPNS 565
Cdd:PRK10263  336 PVEPVTQTPPVASVDVPPAQPT-------VAWQPVPGPQTGEPVIAPA------PEGYPQQSQyaQPAVQYNEPLQQPVQ 402
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  566 NQQyrPLSHPVAYSPQRGQQLPQASQQPGLQPMMSNQQQTAYQGMLGVQQPQNQgLLSNQRSsmggqmqglvvqYTPLPS 645
Cdd:PRK10263  403 PQQ--PYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQS-TFAPQST------------YQTEQT 467
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|..
gi 109730811  646 YQVPVGSDSQNVVQPSFQQPMLVPASQSVQGGLPTGGvPVYY 687
Cdd:PRK10263  468 YQQPAAQEPLYQQPQPVEQQPVVEPEPVVEETKPARP-PLYY 508
PRK10263 PRK10263
DNA translocase FtsK; Provisional
324-711 3.05e-03

DNA translocase FtsK; Provisional


Pssm-ID: 236669 [Multi-domain]  Cd Length: 1355  Bit Score: 41.61  E-value: 3.05e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  324 EPRPWSSTDSDGSVRSMrPPVTKASSfsgISILTRGDSIGSSKGGSAGRLSRPGMAL---GAPEvcnqvtsPQSVRGLLP 400
Cdd:PRK10263  536 EPEPIKSSLKAPSVAAV-PPVEAAAA---VSPLASGVKKATLATGAAATVAAPVFSLansGGPR-------PQVKEGIGP 604
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  401 ctaqqqqqqqqqqqqlpALPPTPQHQPPLNNHMISQPV--PALQPSPQPVQFSPSSCPQVLLPVSPPQQYNMAED-LSNP 477
Cdd:PRK10263  605 -----------------QLPRPKRIRVPTRRELASYGIklPSQRAAEEKAREAQRNQYDSGDQYNDDEIDAMQQDeLARQ 667
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  478 FGQMSLSRQGST-------EAADPSSALfQPPLISQ--HPQQASFimaSAGQPLPTSNYSTSSHAPPTQQVLPPQGYMQP 548
Cdd:PRK10263  668 FAQTQQQRYGEQyqhdvpvNAEDADAAA-EAELARQfaQTQQQRY---SGEQPAGANPFSLDDFEFSPMKALLDDGPHEP 743
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  549 pqqiqvsYYPPGQYPNSNQQYRPLSHPVAYSPQRgqqlPQASQQPGLQPMMSNQQQTAYQGMLGVQQPQNQGLLSNQrss 628
Cdd:PRK10263  744 -------LFTPIVEPVQQPQQPVAPQQQYQQPQQ----PVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQ--- 809
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  629 mggqmqglvvQYTPLPSYQVPvgsDSQNVVQPSFQQPMLVPASQSVQgglptggvpvyySMIPP-AQQNGTS-------- 699
Cdd:PRK10263  810 ----------PVAPQPQYQQP---QQPVAPQPQYQQPQQPVAPQPQD------------TLLHPlLMRNGDSrplhkptt 864
                         410
                  ....*....|....
gi 109730811  700 --PSVGFLQPPGSE 711
Cdd:PRK10263  865 plPSLDLLTPPPSE 878
PABP-1234 TIGR01628
polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins ...
581-711 3.79e-03

polyadenylate binding protein, human types 1, 2, 3, 4 family; These eukaryotic proteins recognize the poly-A of mRNA and consists of four tandem RNA recognition domains at the N-terminus (rrm: pfam00076) followed by a PABP-specific domain (pfam00658) at the C-terminus. The protein is involved in the transport of mRNA's from the nucleus to the cytoplasm. There are four paralogs in Homo sapiens which are expressed in testis, platelets, broadly expressed and of unknown tissue range.


Pssm-ID: 130689 [Multi-domain]  Cd Length: 562  Bit Score: 40.95  E-value: 3.79e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   581 QRGQQLPQASQQPGLQPMMSNQQQTAYQGMLGVQQ----PQNQGLLSNQRSSMGGQMQGLVVQYTPLPSYQVPvGSDSQN 656
Cdd:TIGR01628  374 QFMQLQPRMRQLPMGSPMGGAMGQPPYYGQGPQQQfngqPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRAP-SRNAQN 452
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 109730811   657 VVQPSFQQPMlvPASQSVQGGLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSE 711
Cdd:TIGR01628  453 AAQKPPMQPV--MYPPNYQSLPLSQDLPQPQSTASQGGQNKKLAQVLASATPQMQ 505
Amelin smart00817
Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin ...
485-620 4.83e-03

Ameloblastin precursor (Amelin); This family consists of several mammalian Ameloblastin precursor (Amelin) proteins. Matrix proteins of tooth enamel consist mainly of amelogenin but also of non-amelogenin proteins, which, although their volumetric percentage is low, have an important role in enamel mineralisation. One of the non-amelogenin proteins is ameloblastin, also known as amelin and sheathlin. Ameloblastin (AMBN) is one of the enamel sheath proteins which is though to have a role in determining the prismatic structure of growing enamel crystals.


Pssm-ID: 214832 [Multi-domain]  Cd Length: 411  Bit Score: 40.64  E-value: 4.83e-03
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811    485 RQGSTEAADPSSALFQPPLISQHPQQAsfimasagqPLPTSNYSTSSHAPPTQQVLPPQGYMQPPQQIQVSYYPPGQYPN 564
Cdd:smart00817   82 REHETQQYEYSLPVHPPPLPSQPSLQP---------QQPGLKPFLQPTALPTNQATPQKNGPQPPMHLGQPPLQQAELPM 152
                            90       100       110       120       130       140
                    ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 109730811    565 SNQQYRPLSHP-------VAYSPQRGQQLPQASQQPGLQPMMSNQQQTAYQGMLGVQQPQNQG 620
Cdd:smart00817  153 IPPQVAPSDKPpqtelplYDFADPQNPLLFQIAHLMSRGPMPQNKQQHLYPGLFYMSYGANQL 215
PRK14948 PRK14948
DNA polymerase III subunit gamma/tau;
380-593 5.64e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237862 [Multi-domain]  Cd Length: 620  Bit Score: 40.72  E-value: 5.64e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  380 LGAPEVCNQVTSPQSVRGLLPCTAQQQQQQQQQQQQLPALPPTPQHQPPLNN-HMISQPVPALQPSPQPVQfspsscpqv 458
Cdd:PRK14948  360 LPSAFISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKAsPPIPVPAEPTEPSPTPPA--------- 430
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  459 lLPVSPPQQYNMAE-------DLSNPFGQMSLSRQGSTEAADPSSALFQ------------PPLIS-------QHPQQAS 512
Cdd:PRK14948  431 -NAANAPPSLNLEElwqqilaKLELPSTRMLLSQQAELVSLDSNRAVIAvspnwlgmvqsrKPLLEqafakvlGRSIKLN 509
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811  513 FIMASAGQPlptsnySTSSHAPPTQQVLPPQGYMQPPQQIQVSYYPPGQYPNSNQQYRPLSHPVAYSPQRGQQLPQASQQ 592
Cdd:PRK14948  510 LESQSGSAS------NTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEE 583

                  .
gi 109730811  593 P 593
Cdd:PRK14948  584 P 584
PAT1 pfam09770
Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate ...
480-708 6.39e-03

Topoisomerase II-associated protein PAT1; Members of this family are necessary for accurate chromosome transmission during cell division.


Pssm-ID: 401645 [Multi-domain]  Cd Length: 846  Bit Score: 40.40  E-value: 6.39e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   480 QMSLSRQGSTEAADPSSALfQPPLISQHPQQASFIMASAGQPLPTS--NYSTSSHAPPTQ--QVL---PPQGYMQPPQQI 552
Cdd:pfam09770   99 QVRFNRQQPAARAAQSSAQ-PPASSLPQYQYASQQSQQPSKPVRTGyeKYKEPEPIPDLQvdASLwgvAPKKAAAPAPAP 177
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   553 QVSYYPPGQYPNSNQ---------QYRplshpvAYSPQRGQQLPQASQQP--GLQPMMSNQQQTAYQGMLGVQQPQNQGL 621
Cdd:pfam09770  178 QPAAQPASLPAPSRKmmsleeveaAMR------AQAKKPAQQPAPAPAQPpaAPPAQQAQQQQQFPPQIQQQQQPQQQPQ 251
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   622 LSNQRSSMGGQMQGL---------VVQYTPLPSYQVPVGSDSQNVVQPS--FQQPMLVPASQSVQGGLPTGGVPVyysmi 690
Cdd:pfam09770  252 QPQQHPGQGHPVTILqrpqspqpdPAQPSIQPQAQQFHQQPPPVPVQPTqiLQNPNRLSAARVGYPQNPQPGVQP----- 326
                          250
                   ....*....|....*...
gi 109730811   691 PPAQQNGTSPSVGFLQPP 708
Cdd:pfam09770  327 APAHQAHRQQGSFGRQAP 344
Atrophin-1 pfam03154
Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian ...
516-836 8.87e-03

Atrophin-1 family; Atrophin-1 is the protein product of the dentatorubral-pallidoluysian atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive neurodegenerative disorder. It is caused by the expansion of a CAG repeat in the DRPLA gene on chromosome 12p. This results in an extended polyglutamine region in atrophin-1, that is thought to confer toxicity to the protein, possibly through altering its interactions with other proteins. The expansion of a CAG repeat is also the underlying defect in six other neurodegenerative disorders, including Huntington's disease. One interaction of expanded polyglutamine repeats that is thought to be pathogenic is that with the short glutamine repeat in the transcriptional coactivator CREB binding protein, CBP. This interaction draws CBP away from its usual nuclear location to the expanded polyglutamine repeat protein aggregates that are characteriztic of the polyglutamine neurodegenerative disorders. This interferes with CBP-mediated transcription and causes cytotoxicity.


Pssm-ID: 460830 [Multi-domain]  Cd Length: 991  Bit Score: 40.14  E-value: 8.87e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   516 ASAGQPLPTSNYSTSShAPPTQQVLPPQgymQPPQQIQVSYYPPGQYPNSNQQYRPLSHPVAYSPQRGQQLPQASQQPGL 595
Cdd:pfam03154  144 TSPSIPSPQDNESDSD-SSAQQQILQTQ---PPVLQAQSGAASPPSPPPPGTTQAATAGPTPSAPSVPPQGSPATSQPPN 219
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   596 QPMMSNQQQTAYQGMLGVQQPQnqglLSNQRSSMggqmqglvvqyTPLPSYQVPVGSDSQNVVQPSFQQPMlVPASQSVQ 675
Cdd:pfam03154  220 QTQSTAAPHTLIQQTPTLHPQR----LPSPHPPL-----------QPMTQPPPPSQVSPQPLPQPSLHGQM-PPMPHSLQ 283
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   676 GGLPTGGVPVYYSMIPPAQQNGTSPSVGFLQPPGSEQYQMPQSPSPCSPPQMSQQYSGVSPSGPGVVVM-QLNVPNGPQA 754
Cdd:pfam03154  284 TGPSHMQHPVPPQPFPLTPQSSQSQVPPGPSPAAPGQSQQRIHTPPSQSQLQSQQPPREQPLPPAPLSMpHIKPPPTTPI 363
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 109730811   755 PQNPSMVQWSHCKYYSVEQRGQKPGDLYSPDG-----------SPQAN----------AQMGSSPVTSP--TQSPAPSPV 811
Cdd:pfam03154  364 PQLPNPQSHKHPPHLSGPSPFQMNSNLPPPPAlkplsslsthhPPSAHppplqlmpqsQQLPPPPAQPPvlTQSQSLPPP 443
                          330       340
                   ....*....|....*....|....*....
gi 109730811   812 TSLSNVCTGLSPLPVLTPFPR----PGGP 836
Cdd:pfam03154  444 AASHPPTSGLHQVPSQSPFPQhpfvPGGP 472
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH