NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|148235066|ref|NP_001085635|]
View 

histone-lysine N-trimethyltransferase SMYD5 [Xenopus laevis]

Protein Classification

histone-lysine N-trimethyltransferase SMYD5( domain architecture ID 14410307)

histone-lysine N-trimethyltransferase SMYD5 specifically trimethylates 'Lys-20' of histone H4 to form trimethylated histone H4 lysine 20 (H4K20me3) which represents a specific tag for epigenetic transcriptional repression

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
SET_SMYD5 cd10521
SET domain (including iSET domain and post-SET domain) found in SET and MYND domain-containing ...
21-383 0e+00

SET domain (including iSET domain and post-SET domain) found in SET and MYND domain-containing protein 5 (SMYD5) and similar proteins; SMYD5 (also termed protein NN8-4AG, or retinoic acid-induced protein 15) functions as histone lysine methyltransferase that mediates H4K20me3 at heterochromatin regions. It plays an important role in chromosome integrity by regulating heterochromatin and repressing endogenous repetitive DNA elements during differentiation. In zebrafish embryogenesis, it plays pivotal roles in both primitive and definitive hematopoiesis.


:

Pssm-ID: 380919 [Multi-domain]  Cd Length: 282  Bit Score: 558.08  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148235066  21 TVEIRFVSSGKGKGLFAIRTIRKGETIFQEKPLVSSQFQWNALyryracdhclrsletaeenaqrlsgnahvllpypelc 100
Cdd:cd10521    1 HVEVRFIDAAKGRGLFATRDFKKGDIIFEEKPLVCAQFLWNEL------------------------------------- 43
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148235066 101 tvrnglhqqcprcqvtycsaeclkaaaeqyhqilcletsrdnpaHPLNKLEEAWRNMHYPPETASIMLMARMVGTIKQAQ 180
Cdd:cd10521   44 --------------------------------------------HPLNKLQEAWRNMHYPPETASIMLIARMIATVKQAK 79
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148235066 181 DKDWWLHLFSQFCNKTANEEEEIVHKLLGEKFKGQLDQLRRLFVDALYEERMSRWFTPEGFRSLFALVGTNGQGIGTSSL 260
Cdd:cd10521   80 DKDEWLKLFSQFCSATANEEEHIAHKLLGKQFQGQLELLRQLFTEALYEERLSQWFTPEGFRSLFALVGTNGQGIGTSSL 159
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148235066 261 SQWVHACDALELPPRDREKLDALIDQLYKDIEKVTGEFLNCEGSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDI 340
Cdd:cd10521  160 SVWVHNCDALELPEQEREELDAFIDQLYVDIEKESGEFLNCEGSGLYLLQSCCNHSCVPNAEITFPENNFTLSLKALRDI 239
                        330       340       350       360
                 ....*....|....*....|....*....|....*....|...
gi 148235066 341 QPGEEICISYLDCCQRDRSRHSRQKILRENYLFMCSCPKCLAQ 383
Cdd:cd10521  240 QEGEEICISYLDECQRERSRHSRQKILRENYLFICNCPKCEAQ 282
zf-MYND pfam01753
MYND finger;
100-135 1.91e-03

MYND finger;


:

Pssm-ID: 460312  Cd Length: 39  Bit Score: 35.86  E-value: 1.91e-03
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 148235066  100 CTVRNGLHQQCPRCQ-VTYCSAECLKAAAEqYHQILC 135
Cdd:pfam01753   4 CGKEALKLLRCSRCKsVYYCSKECQKADWP-YHKKEC 39
 
Name Accession Description Interval E-value
SET_SMYD5 cd10521
SET domain (including iSET domain and post-SET domain) found in SET and MYND domain-containing ...
21-383 0e+00

SET domain (including iSET domain and post-SET domain) found in SET and MYND domain-containing protein 5 (SMYD5) and similar proteins; SMYD5 (also termed protein NN8-4AG, or retinoic acid-induced protein 15) functions as histone lysine methyltransferase that mediates H4K20me3 at heterochromatin regions. It plays an important role in chromosome integrity by regulating heterochromatin and repressing endogenous repetitive DNA elements during differentiation. In zebrafish embryogenesis, it plays pivotal roles in both primitive and definitive hematopoiesis.


Pssm-ID: 380919 [Multi-domain]  Cd Length: 282  Bit Score: 558.08  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148235066  21 TVEIRFVSSGKGKGLFAIRTIRKGETIFQEKPLVSSQFQWNALyryracdhclrsletaeenaqrlsgnahvllpypelc 100
Cdd:cd10521    1 HVEVRFIDAAKGRGLFATRDFKKGDIIFEEKPLVCAQFLWNEL------------------------------------- 43
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148235066 101 tvrnglhqqcprcqvtycsaeclkaaaeqyhqilcletsrdnpaHPLNKLEEAWRNMHYPPETASIMLMARMVGTIKQAQ 180
Cdd:cd10521   44 --------------------------------------------HPLNKLQEAWRNMHYPPETASIMLIARMIATVKQAK 79
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148235066 181 DKDWWLHLFSQFCNKTANEEEEIVHKLLGEKFKGQLDQLRRLFVDALYEERMSRWFTPEGFRSLFALVGTNGQGIGTSSL 260
Cdd:cd10521   80 DKDEWLKLFSQFCSATANEEEHIAHKLLGKQFQGQLELLRQLFTEALYEERLSQWFTPEGFRSLFALVGTNGQGIGTSSL 159
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148235066 261 SQWVHACDALELPPRDREKLDALIDQLYKDIEKVTGEFLNCEGSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDI 340
Cdd:cd10521  160 SVWVHNCDALELPEQEREELDAFIDQLYVDIEKESGEFLNCEGSGLYLLQSCCNHSCVPNAEITFPENNFTLSLKALRDI 239
                        330       340       350       360
                 ....*....|....*....|....*....|....*....|...
gi 148235066 341 QPGEEICISYLDCCQRDRSRHSRQKILRENYLFMCSCPKCLAQ 383
Cdd:cd10521  240 QEGEEICISYLDECQRERSRHSRQKILRENYLFICNCPKCEAQ 282
SET COG2940
SET domain-containing protein (function unknown) [General function prediction only];
264-380 3.96e-11

SET domain-containing protein (function unknown) [General function prediction only];


Pssm-ID: 442183 [Multi-domain]  Cd Length: 134  Bit Score: 60.36  E-value: 3.96e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148235066 264 VHACDALELPPRDREKLDALIDQLYKDI----EKVTGEFLNCEGSGLYLlqsccNHSCVPNAEASFPDNNFILHltALED 339
Cdd:COG2940   32 IGEYPGEVITWAEAERREPHKEPLHTYLfeldDDGVIDGALGGNPARFI-----NHSCDPNCEADEEDGRIFIV--ALRD 104
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|.
gi 148235066 340 IQPGEEICISYLDccqrdrsrhsrqkiLRENYLFMCSCPKC 380
Cdd:COG2940  105 IAAGEELTYDYGL--------------DYDEEEYPCRCPNC 131
SET smart00317
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on ...
297-350 2.16e-10

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on outlier plant homologues


Pssm-ID: 214614 [Multi-domain]  Cd Length: 124  Bit Score: 58.11  E-value: 2.16e-10
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 148235066   297 EFLNCEGSGLYLLQSCCNHSCVPNAEASFP--DNNFILHLTALEDIQPGEEICISY 350
Cdd:smart00317  62 SDLCIDARRKGNLARFINHSCEPNCELLFVevNGDDRIVIFALRDIKPGEELTIDY 117
SET pfam00856
SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be ...
313-350 3.26e-09

SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.


Pssm-ID: 459965 [Multi-domain]  Cd Length: 115  Bit Score: 54.45  E-value: 3.26e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|
gi 148235066  313 CNHSCVPNAEASF--PDNNFILHLTALEDIQPGEEICISY 350
Cdd:pfam00856  75 INHSCDPNCEVRVvyVNGGPRIVIFALRDIKPGEELTIDY 114
zf-MYND pfam01753
MYND finger;
100-135 1.91e-03

MYND finger;


Pssm-ID: 460312  Cd Length: 39  Bit Score: 35.86  E-value: 1.91e-03
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 148235066  100 CTVRNGLHQQCPRCQ-VTYCSAECLKAAAEqYHQILC 135
Cdd:pfam01753   4 CGKEALKLLRCSRCKsVYYCSKECQKADWP-YHKKEC 39
 
Name Accession Description Interval E-value
SET_SMYD5 cd10521
SET domain (including iSET domain and post-SET domain) found in SET and MYND domain-containing ...
21-383 0e+00

SET domain (including iSET domain and post-SET domain) found in SET and MYND domain-containing protein 5 (SMYD5) and similar proteins; SMYD5 (also termed protein NN8-4AG, or retinoic acid-induced protein 15) functions as histone lysine methyltransferase that mediates H4K20me3 at heterochromatin regions. It plays an important role in chromosome integrity by regulating heterochromatin and repressing endogenous repetitive DNA elements during differentiation. In zebrafish embryogenesis, it plays pivotal roles in both primitive and definitive hematopoiesis.


Pssm-ID: 380919 [Multi-domain]  Cd Length: 282  Bit Score: 558.08  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148235066  21 TVEIRFVSSGKGKGLFAIRTIRKGETIFQEKPLVSSQFQWNALyryracdhclrsletaeenaqrlsgnahvllpypelc 100
Cdd:cd10521    1 HVEVRFIDAAKGRGLFATRDFKKGDIIFEEKPLVCAQFLWNEL------------------------------------- 43
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148235066 101 tvrnglhqqcprcqvtycsaeclkaaaeqyhqilcletsrdnpaHPLNKLEEAWRNMHYPPETASIMLMARMVGTIKQAQ 180
Cdd:cd10521   44 --------------------------------------------HPLNKLQEAWRNMHYPPETASIMLIARMIATVKQAK 79
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148235066 181 DKDWWLHLFSQFCNKTANEEEEIVHKLLGEKFKGQLDQLRRLFVDALYEERMSRWFTPEGFRSLFALVGTNGQGIGTSSL 260
Cdd:cd10521   80 DKDEWLKLFSQFCSATANEEEHIAHKLLGKQFQGQLELLRQLFTEALYEERLSQWFTPEGFRSLFALVGTNGQGIGTSSL 159
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148235066 261 SQWVHACDALELPPRDREKLDALIDQLYKDIEKVTGEFLNCEGSGLYLLQSCCNHSCVPNAEASFPDNNFILHLTALEDI 340
Cdd:cd10521  160 SVWVHNCDALELPEQEREELDAFIDQLYVDIEKESGEFLNCEGSGLYLLQSCCNHSCVPNAEITFPENNFTLSLKALRDI 239
                        330       340       350       360
                 ....*....|....*....|....*....|....*....|...
gi 148235066 341 QPGEEICISYLDCCQRDRSRHSRQKILRENYLFMCSCPKCLAQ 383
Cdd:cd10521  240 QEGEEICISYLDECQRERSRHSRQKILRENYLFICNCPKCEAQ 282
SET_SMYD cd20071
SET domain (including SET domain and post-SET domain) found in SET and MYND domain-containing ...
297-380 3.65e-27

SET domain (including SET domain and post-SET domain) found in SET and MYND domain-containing protein, and similar proteins; The family includes SET and MYND domain-containing proteins, SMYD1-SYMD5. SMYD1 (EC 2.1.1.43; also termed BOP) is a heart and muscle specific SET-MYND domain containing protein, which functions as a histone methyltransferase and regulates downstream gene transcription. It methylates histone H3 at 'Lys-4' (H3K4me), seems able to perform both mono-, di-, and trimethylation. SMYD2 (also termed HSKM-B, or lysine N-methyltransferase 3C (KMT3C)) functions as a histone methyltransferase that methylates both histones and non-histone proteins, including p53/TP53 and RB1. It specifically methylates histone H3 'Lys-4' (H3K4me) and dimethylates histone H3 'Lys-36' (H3K36me2). SMYD3 (also termed zinc finger MYND domain-containing protein 1) functions as a histone methyltransferase that specifically methylates 'Lys-4' of histone H3, inducing di- and tri-methylation, but not monomethylation. It also methylates 'Lys-5' of histone H4. SMYD3 plays an important role in transcriptional activation as a member of an RNA polymerase complex. SMYD4 functions as a potential tumor suppressor that plays a critical role in breast carcinogenesis at least partly through inhibiting the expression of PDGFR-alpha. SMYD5 (also termed protein NN8-4AG, or retinoic acid-induced protein 15) functions as histone lysine methyltransferase that mediates H4K20me3 at heterochromatin regions.


Pssm-ID: 380997 [Multi-domain]  Cd Length: 122  Bit Score: 104.77  E-value: 3.65e-27
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148235066 297 EFLNCEGSGLYLLQSCCNHSCVPNAEASFpDNNFILHLTALEDIQPGEEICISYLDccqRDRSRHSRQKILRENYLFMCS 376
Cdd:cd20071   43 DGLNEIGVGLFPLASLLNHSCDPNAVVVF-DGNGTLRVRALRDIKAGEELTISYID---PLLPRTERRRELLEKYGFTCS 118

                 ....
gi 148235066 377 CPKC 380
Cdd:cd20071  119 CPRC 122
SET_SMYD3 cd19203
SET domain (including post-SET domain) found in SET and MYND domain-containing protein 3 ...
303-381 3.91e-16

SET domain (including post-SET domain) found in SET and MYND domain-containing protein 3 (SMYD3) and similar proteins; SMYD3 (also termed zinc finger MYND domain-containing protein 1) functions as a histone methyltransferase that specifically methylates 'Lys-4' of histone H3, inducing di- and tri-methylation, but not monomethylation. It also methylates 'Lys-5' of histone H4. SMYD3 plays an important role in transcriptional activation as a member of an RNA polymerase complex. It is overexpressed in colorectal, breast, prostate, and hepatocellular tumors, and has been implicated as an oncogene in human malignancies. Methylation of MEKK2 by SMYD3 is important for regulation of the MEK/ERK pathway, suggesting the possibility of selectively targeting SMYD3 in RAS-driven cancers.


Pssm-ID: 380980 [Multi-domain]  Cd Length: 210  Bit Score: 76.63  E-value: 3.91e-16
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 148235066 303 GSGLYLLQSCCNHSCVPNAEASFpdNNFILHLTALEDIQPGEEICISYLDCCQrdrSRHSRQKILRENYLFMCSCPKCL 381
Cdd:cd19203  136 GVGLYPSASLLNHSCDPNCVIVF--NGPHLLLRAIREIEVGEELTISYIDMLM---PSEERRKQLRDQYCFECDCFRCQ 209
SET_SMYD4 cd10536
SET domain (including iSET domain and post-SET domain) found in SET and MYND domain-containing ...
303-380 7.27e-16

SET domain (including iSET domain and post-SET domain) found in SET and MYND domain-containing protein 4 (SMYD4) and similar proteins; SMYD4 functions as a potential tumor suppressor that plays a critical role in breast carcinogenesis at least partly through inhibiting the expression of PDGFR-alpha. In zebrafish, SMYD4 is ubiquitously expressed in early embryos and becomes enriched in the developing heart; mutants show a strong defect in cardiomyocyte proliferation, which lead to a severe cardiac malformation.


Pssm-ID: 380934 [Multi-domain]  Cd Length: 218  Bit Score: 76.18  E-value: 7.27e-16
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 148235066 303 GSGLYLLQSCCNHSCVPNAEASFPDNNfiLHLTALEDIQPGEEICISYLDCCqRDRSRHSRQKILRENYLFMCSCPKC 380
Cdd:cd10536  144 ATAIYPTLSLLNHSCDPNTIRSFYGNT--IVVRATRPIKKGEEITICYGPHF-SRMKRSERQRLLKEQYFFDCSCEAC 218
SET_SMYD1_2_3-like cd19167
SET domain (including post-SET domain) found in SET and MYND domain-containing proteins, SMYD1, ...
303-381 2.86e-11

SET domain (including post-SET domain) found in SET and MYND domain-containing proteins, SMYD1, SMYD2, SMYD3 and similar proteins; The family includes SET and MYND domain-containing proteins, SMYD1, SMYD2 and SMYD3. SMYD1 (EC 2.1.1.43; also termed BOP) is a heart and muscle specific SET-MYND domain containing protein, which functions as a histone methyltransferase and regulates downstream gene transcription. It methylates histone H3 at 'Lys-4' (H3K4me), seems able to perform both mono-, di-, and trimethylation. SMYD2 (also termed HSKM-B, or lysine N-methyltransferase 3C (KMT3C)) functions as a histone methyltransferase that methylates both histones and non-histone proteins, including p53/TP53 and RB1. It specifically methylates histone H3 'Lys-4' (H3K4me) and dimethylates histone H3 'Lys-36' (H3K36me2). SMYD3 (also termed zinc finger MYND domain-containing protein 1) functions as a histone methyltransferase that specifically methylates 'Lys-4' of histone H3, inducing di- and tri-methylation, but not monomethylation. It also methylates 'Lys-5' of histone H4. SMYD3 plays an important role in transcriptional activation as a member of an RNA polymerase complex.


Pssm-ID: 380944 [Multi-domain]  Cd Length: 205  Bit Score: 62.44  E-value: 2.86e-11
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 148235066 303 GSGLYLLQSCCNHSCVPNAEASFpdNNFILHLTALEDIQPGEEICISYLDCCQrdrSRHSRQKILRENYLFMCSCPKCL 381
Cdd:cd19167  131 GVGIYPQAALLNHSCCPNCIVTF--NGPNIEVRAVQEIEPGEEVFHSYIDLLY---PTEERRDQLRDQYFFLCQCADCQ 204
SET COG2940
SET domain-containing protein (function unknown) [General function prediction only];
264-380 3.96e-11

SET domain-containing protein (function unknown) [General function prediction only];


Pssm-ID: 442183 [Multi-domain]  Cd Length: 134  Bit Score: 60.36  E-value: 3.96e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148235066 264 VHACDALELPPRDREKLDALIDQLYKDI----EKVTGEFLNCEGSGLYLlqsccNHSCVPNAEASFPDNNFILHltALED 339
Cdd:COG2940   32 IGEYPGEVITWAEAERREPHKEPLHTYLfeldDDGVIDGALGGNPARFI-----NHSCDPNCEADEEDGRIFIV--ALRD 104
                         90       100       110       120
                 ....*....|....*....|....*....|....*....|.
gi 148235066 340 IQPGEEICISYLDccqrdrsrhsrqkiLRENYLFMCSCPKC 380
Cdd:COG2940  105 IAAGEELTYDYGL--------------DYDEEEYPCRCPNC 131
SET smart00317
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on ...
297-350 2.16e-10

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on outlier plant homologues


Pssm-ID: 214614 [Multi-domain]  Cd Length: 124  Bit Score: 58.11  E-value: 2.16e-10
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 148235066   297 EFLNCEGSGLYLLQSCCNHSCVPNAEASFP--DNNFILHLTALEDIQPGEEICISY 350
Cdd:smart00317  62 SDLCIDARRKGNLARFINHSCEPNCELLFVevNGDDRIVIFALRDIKPGEELTIDY 117
SET_SMYD1 cd10526
SET domain (including post-SET domain) found in SET and MYND domain-containing protein 1 ...
303-380 7.39e-10

SET domain (including post-SET domain) found in SET and MYND domain-containing protein 1 (SMYD1) and similar proteins; SMYD1 (EC 2.1.1.43), also termed BOP, is a heart and muscle specific SET-MYND domain containing protein, which functions as a histone methyltransferase and regulates downstream gene transcription. It methylates histone H3 at 'Lys-4' (H3K4me), seems able to perform both mono-, di-, and trimethylation. SMYD1 plays a critical role in cardiomyocyte differentiation, cardiac morphogenesis and myofibril organization, as well as in the regulation of endothelial cells (ECs). It is expressed in vascular endothelial cells, it has beenshown that knockdown of SMYD1 in endothelial cells impairs EC migration and tube formation.


Pssm-ID: 380924 [Multi-domain]  Cd Length: 210  Bit Score: 58.58  E-value: 7.39e-10
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 148235066 303 GSGLYLLQSCCNHSCVPNAEASFpdNNFILHLTALEDIQPGEEICISYLDCCQrdrSRHSRQKILRENYLFMCSCPKC 380
Cdd:cd10526  136 GVGIFPNLCLVNHDCWPNCTVIF--NNGRIELRALGKISEGDELTVSYIDFLN---TSEDRKEQLKKQYYFDCTCEHC 208
SET pfam00856
SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be ...
313-350 3.26e-09

SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probably functional importance. Additionally to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.


Pssm-ID: 459965 [Multi-domain]  Cd Length: 115  Bit Score: 54.45  E-value: 3.26e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|
gi 148235066  313 CNHSCVPNAEASF--PDNNFILHLTALEDIQPGEEICISY 350
Cdd:pfam00856  75 INHSCDPNCEVRVvyVNGGPRIVIFALRDIKPGEELTIDY 114
SET cd08161
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain superfamily; The Su(var)3-9, ...
289-350 8.38e-08

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain superfamily; The Su(var)3-9, Enhancer-of-zeste, Trithorax (SET) domain superfamily corresponds to SET domain-containing lysine methyltransferases, which catalyze site and state-specific methylation of lysine residues in histones that are fundamental in epigenetic regulation of gene activation and silencing in eukaryotic organisms. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains has been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as N-SET and C-SET. C-SET forms an unusual and conserved knot-like structure of probable functional importance. In addition to N-SET and C-SET, an insert region (I-SET) and flanking regions of high structural variability form part of the overall structure. Some family members contain a pre-SET domain, which is found in a number of histone methyltransferases (HMTase), and a post-SET domain, which harbors a zinc-binding site.


Pssm-ID: 380914 [Multi-domain]  Cd Length: 72  Bit Score: 49.17  E-value: 8.38e-08
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 148235066 289 KDIEKvtGEFLnceGSGLYllqscCNHSCVPNAEASFPDNNFILH--LTALEDIQPGEEICISY 350
Cdd:cd08161   18 RDIPK--GEVI---GLARF-----INHSCEPNCEFEEVYVGGKPRvfIVALRDIKAGEELTVDY 71
SET_SMYD2 cd19202
SET domain (including post-SET domain) found in SET and MYND domain-containing protein 2 ...
270-380 6.99e-07

SET domain (including post-SET domain) found in SET and MYND domain-containing protein 2 (SMYD2) and similar proteins; SMYD2 (also termed HSKM-B, lysine N-methyltransferase 3C (KMT3C)) functions as a histone methyltransferase that methylates both histones and non-histone proteins, including p53/TP53 and RB1. It specifically methylates histone H3 'Lys-4' (H3K4me) and dimethylates histone H3 'Lys-36' (H3K36me2). It plays a role in myofilament organization in both skeletal and cardiac muscles via Hsp90 methylation. SMYD2 overexpression is associated with tumor cell proliferation and a worse outcome in human papillomavirus-unrelated nonmultiple head and neck carcinomas. It regulates leukemia cell growth such that diminished SMYD2 expression upregulates SET7/9, thereby possibly shifting leukemia cells from growth to quiescence state associated with resistance to DNA damage associated with Acute Myeloid Leukemia (AML).


Pssm-ID: 380979 [Multi-domain]  Cd Length: 206  Bit Score: 49.82  E-value: 6.99e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 148235066 270 LELPprDREKLDALIDQLYKDIEKVTGEFLNCEGSGLYLLQSCCNHSCVPNAEASFPDNNfiLHLTALEDIQPGEEICIS 349
Cdd:cd19202  101 LEFP--DNDSLVVLFAQVNCNGFTIEDEELSHLGSAIFPDVALMNHSCCPNVIVTYKGTL--AEVRAVQEIKPGEEVFTS 176
                         90       100       110
                 ....*....|....*....|....*....|.
gi 148235066 350 YLDCCQRDRSRHSRqkiLRENYLFMCSCPKC 380
Cdd:cd19202  177 YIDLLYPTEDRNDR---LRDSYFFTCECQEC 204
SET_SpSet7-like cd10540
SET domain found in Schizossacharomyces pombe Set7 and similar proteins; Schizosaccharomyces ...
314-350 1.76e-05

SET domain found in Schizossacharomyces pombe Set7 and similar proteins; Schizosaccharomyces pombe Set7 is a novel histone-lysine N-methyltransferase. The family also includes a viral histone H3 lysine 27 methyltransferase from Paramecium bursaria Chlorella virus 1 (PBCV-1).


Pssm-ID: 380938  Cd Length: 112  Bit Score: 43.78  E-value: 1.76e-05
                         10        20        30
                 ....*....|....*....|....*....|....*..
gi 148235066 314 NHSCVPNAEASFPDNNFILHLTALEDIQPGEEICISY 350
Cdd:cd10540   70 NHSYTPNAEYEIDFENQTIVFYALRDIEAGEELTINY 106
SET_LSMT cd10527
SET domain found in Rubisco large subunit methyltransferase (LSMT) and similar proteins; ...
313-350 2.48e-05

SET domain found in Rubisco large subunit methyltransferase (LSMT) and similar proteins; Rubisco LSMT is a non-histone protein methyl transferase responsible for the trimethylation of lysine14 in the large subunit of Rubisco (ribulose-1,5-bisphosphate carboxylase/oxygenase). The family also includes SET domain-containing proteins, SETD3, SETD4 and SETD6, which belong to methyltransferase class VII that represents classical non-histone SET domain methyltransferases. Members in this family contain a SET domain and a C-terminal RubisCO LSMT substrate-binding (Rubis-subs-bind) domain.


Pssm-ID: 380925 [Multi-domain]  Cd Length: 236  Bit Score: 45.52  E-value: 2.48e-05
                         10        20        30
                 ....*....|....*....|....*....|....*....
gi 148235066 313 CNHS-CVPNAEASFPDNNFILHLTALEDIQPGEEICISY 350
Cdd:cd10527  183 LNHSpDAPNVRYEYDEDEGSFVLVATRDIAAGEEVFISY 221
SET_SMYD cd20071
SET domain (including SET domain and post-SET domain) found in SET and MYND domain-containing ...
23-58 2.65e-05

SET domain (including SET domain and post-SET domain) found in SET and MYND domain-containing protein, and similar proteins; The family includes SET and MYND domain-containing proteins, SMYD1-SYMD5. SMYD1 (EC 2.1.1.43; also termed BOP) is a heart and muscle specific SET-MYND domain containing protein, which functions as a histone methyltransferase and regulates downstream gene transcription. It methylates histone H3 at 'Lys-4' (H3K4me), seems able to perform both mono-, di-, and trimethylation. SMYD2 (also termed HSKM-B, or lysine N-methyltransferase 3C (KMT3C)) functions as a histone methyltransferase that methylates both histones and non-histone proteins, including p53/TP53 and RB1. It specifically methylates histone H3 'Lys-4' (H3K4me) and dimethylates histone H3 'Lys-36' (H3K36me2). SMYD3 (also termed zinc finger MYND domain-containing protein 1) functions as a histone methyltransferase that specifically methylates 'Lys-4' of histone H3, inducing di- and tri-methylation, but not monomethylation. It also methylates 'Lys-5' of histone H4. SMYD3 plays an important role in transcriptional activation as a member of an RNA polymerase complex. SMYD4 functions as a potential tumor suppressor that plays a critical role in breast carcinogenesis at least partly through inhibiting the expression of PDGFR-alpha. SMYD5 (also termed protein NN8-4AG, or retinoic acid-induced protein 15) functions as histone lysine methyltransferase that mediates H4K20me3 at heterochromatin regions.


Pssm-ID: 380997 [Multi-domain]  Cd Length: 122  Bit Score: 43.52  E-value: 2.65e-05
                         10        20        30
                 ....*....|....*....|....*....|....*.
gi 148235066  23 EIRFVSSGKGKGLFAIRTIRKGETIFQEKPLVSSQF 58
Cdd:cd20071    1 EVRESEGSKGRGLVATRDIEPGELILVEKPLVSVPS 36
SET_Suv4-20-like cd10524
SET domain (including post-SET domain) found in Drosophila melanogaster suppressor of ...
313-353 3.66e-04

SET domain (including post-SET domain) found in Drosophila melanogaster suppressor of variegation 4-20 (Suv4-20) and similar proteins; Suv4-20 (also termed Su(var)4-20) is a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-20' of histone H4. It acts as a dominant suppressor of position-effect variegation. The family also includes Suv4-20 homologs, lysine N-methyltransferase 5B (KMT5B) and lysine N-methyltransferase 5C (KMT5C). Both KMT5B (also termed lysine-specific methyltransferase 5B, or suppressor of variegation 4-20 homolog 1, or Su(var)4-20 homolog 1, or Suv4-20h1) and KMT5C (also termed lysine-specific methyltransferase 5C, or suppressor of variegation 4-20 homolog 2, or Su(var)4-20 homolog 2, or Suv4-20h2) are histone methyltransferases that specifically trimethylate 'Lys-20' of histone H4 (H4K20me3). They play central roles in the establishment of constitutive heterochromatin in pericentric heterochromatin regions.


Pssm-ID: 380922 [Multi-domain]  Cd Length: 141  Bit Score: 40.34  E-value: 3.66e-04
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|.
gi 148235066 313 CNHSCVPNAEASFPDNNFIlHLTALEDIQPGEEICISYLDC 353
Cdd:cd10524   80 INHDCRPNCKFVPTGKSTA-CVKVLRDIEPGEEITVYYGDN 119
SET_SETD4 cd19177
SET domain found in SET domain-containing protein 4 (SETD4) and similar proteins; SETD4 is a ...
313-375 6.99e-04

SET domain found in SET domain-containing protein 4 (SETD4) and similar proteins; SETD4 is a cytosolic and nuclear functional lysine methyltransferase that plays a crucial role in breast carcinogenesis. However, its specific substrates and modification sites remain to be disclosed.


Pssm-ID: 380954 [Multi-domain]  Cd Length: 245  Bit Score: 41.13  E-value: 6.99e-04
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 148235066 313 CNHSCVPNAEASFPDNNFILHLTALEDIQPGEEICISYldccqrdrSRHSRQKILREnYLFMC 375
Cdd:cd19177  192 LNHSPDVNVKAGFNKSGKCYEIRTGTDYKKGEEVFISY--------GPHSNDFLLLE-YGFVL 245
SET_SpSET3-like cd19183
SET domain (including post-SET domain) found in Schizosaccharomyces pombe SET ...
313-350 7.34e-04

SET domain (including post-SET domain) found in Schizosaccharomyces pombe SET domain-containing protein 3 (SETD3) and similar proteins; Schizosaccharomyces pombe SETD3 functions as a transcriptional regulator that acts via the formation of large multiprotein complexes that modify and/or remodel the chromatin. It is required for both, gene activation and repression.


Pssm-ID: 380960  Cd Length: 173  Bit Score: 40.08  E-value: 7.34e-04
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|..
gi 148235066 313 CNHSCVPNAEAS--FPDNNFILH--LTALEDIQPGEEICISY 350
Cdd:cd19183   81 IRRSCRPNAELVtvASDSGSVLKfvLYASRDISPGEEITIGW 122
zf-MYND pfam01753
MYND finger;
100-135 1.91e-03

MYND finger;


Pssm-ID: 460312  Cd Length: 39  Bit Score: 35.86  E-value: 1.91e-03
                          10        20        30
                  ....*....|....*....|....*....|....*..
gi 148235066  100 CTVRNGLHQQCPRCQ-VTYCSAECLKAAAEqYHQILC 135
Cdd:pfam01753   4 CGKEALKLLRCSRCKsVYYCSKECQKADWP-YHKKEC 39
SET_SETD5-like cd10529
SET domain found in SET domain-containing protein 5 (SETD5), inactive histone-lysine ...
315-350 2.01e-03

SET domain found in SET domain-containing protein 5 (SETD5), inactive histone-lysine N-methyltransferase 2E (KMT2E) and similar proteins; SETD5 is a probable transcriptional regulator that acts via the formation of large multiprotein complexes that modify and/or remodel the chromatin. KMT2E (also termed inactive lysine N-methyltransferase 2E or myeloid/lymphoid or mixed-lineage leukemia protein 5 (MLL5)) associates with chromatin regions downstream of transcriptional start sites of active genes and thus regulates gene transcription. The family also includes Saccharomyces cerevisiae SET domain-containing proteins, SET3 and SET4, and Schizosaccharomyces pombe SET3. Most of these family members contain a post-SET domain which harbors a zinc-binding site.


Pssm-ID: 380927  Cd Length: 127  Bit Score: 38.02  E-value: 2.01e-03
                         10        20        30
                 ....*....|....*....|....*....|....*....
gi 148235066 315 HSCVPNAEAS---FPDNNFILHLTALEDIQPGEEICISY 350
Cdd:cd10529   85 RSCRPNAELRhvvVSNGELRLFIFALKDIRKGTEITIPF 123
SET_SETD1-like cd10518
SET domain (including post-SET domain) found in SET domain-containing proteins (SETD1A/SETD1B), ...
314-350 2.63e-03

SET domain (including post-SET domain) found in SET domain-containing proteins (SETD1A/SETD1B), histone-lysine N-methyltransferases (KMT2A/KMT2B/KMT2C/KMT2D) and similar proteins; This family includes SET domain-containing protein 1A (SETD1A), 1B (SETD1B), as well as histone-lysine N-methyltransferase 2A (KMT2A), 2B (KMT2B), 2C (KMT2C), 2D (KMT2D). These proteins are histone-lysine N-methyltransferases (EC 2.1.1.43) that specifically methylate 'Lys-4' of histone H3 (H3K4me).


Pssm-ID: 380916  Cd Length: 150  Bit Score: 38.35  E-value: 2.63e-03
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|
gi 148235066 314 NHSCVPNAEA---SFPDNNFILhLTALEDIQPGEEICISY 350
Cdd:cd10518   92 NHSCDPNCYAkiiTVDGEKHIV-IFAKRDIAPGEELTYDY 130
SET_LegAS4-like cd10522
SET domain found in Legionella pneumophila type IV secretion system effector LegAS4 and ...
314-350 4.39e-03

SET domain found in Legionella pneumophila type IV secretion system effector LegAS4 and similar proteins; LegAS4 is a type IV secretion system effector of Legionella pneumophila. It contains a SET domain that is involved in the modification of Lys4 of histone H3 (H3K4) in the nucleolus of the host cell, thereby enhancing heterochromatic rDNA transcription. It also contains an ankyrin repeat domain of unknown function at its C-terminal region.


Pssm-ID: 380920 [Multi-domain]  Cd Length: 122  Bit Score: 36.94  E-value: 4.39e-03
                         10        20        30
                 ....*....|....*....|....*....|....*....
gi 148235066 314 NHSCVPNAEASFPDNNFILH--LTALEDIQPGEEICISY 350
Cdd:cd10522   78 NHSDQPNLELIVRTLKGEQHigFVAIRDIKPGEELFISY 116
SET COG2940
SET domain-containing protein (function unknown) [General function prediction only];
22-47 4.54e-03

SET domain-containing protein (function unknown) [General function prediction only];


Pssm-ID: 442183 [Multi-domain]  Cd Length: 134  Bit Score: 37.25  E-value: 4.54e-03
                         10        20
                 ....*....|....*....|....*.
gi 148235066  22 VEIRfVSSGKGKGLFAIRTIRKGETI 47
Cdd:COG2940    8 IEVR-PSPIHGRGVFATRDIPKGTLI 32
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH