Conserved domains on  [gi|3334182|sp|Q92800|]

RecName: Full=Histone-lysine N-methyltransferase EZH1; AltName: Full=ENX-2; AltName: Full=Enhancer of zeste homolog 1

List of domain hits

Name Accession Description Interval E-value
SET_EZH1 cd19217
SET domain found in enhancer of zeste homolog 1 (EZH1) and similar proteins; EZH1 (EC 2.1.1.43) ...
608-743 3.16e-94

SET domain found in enhancer of zeste homolog 1 (EZH1) and similar proteins; EZH1 (EC 2.1.1.43), also termed ENX-2, or histone-lysine N-methyltransferase EZH1, is a catalytic subunit of the PRC2/EED-EZH1 complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target gene. It can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively.



Pssm-ID: 380994  Cd Length: 136  Bit Score: 288.89  E-value: 3.16e-94
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  608 QRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNDFVVDATRKGNKIRF 687
Cdd:cd19217   1 QRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNDFVVDATRKGNKIRF 80
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|....*.
gi 3334182  688 ANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQADALKYVGIERE 743
Cdd:cd19217  81 ANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQQGEELFFDYRYSQADALKYVGIERE 136
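The paired rows above (query on top, domain model below) can be compared column by column. The following is a minimal sketch, not part of the CD-Search output: the function name `percent_identity` and the choice to count gap columns as mismatches are assumptions made for illustration.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percentage of aligned columns in which both rows hold the same residue.

    Assumptions of this sketch: comparison is case-insensitive, and any
    column containing a gap ('-') counts as a mismatch.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    if not query:
        raise ValueError("aligned rows must not be empty")
    matches = sum(
        1
        for q, s in zip(query, subject)
        if q != "-" and s != "-" and q.upper() == s.upper()
    )
    return 100.0 * matches / len(query)


# Second alignment block of the SET_EZH1 hit (query residues 688-743),
# copied from the rows above.
query_row = "ANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQADALKYVGIERE"
model_row = "ANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQQGEELFFDYRYSQADALKYVGIERE"

print(f"{percent_identity(query_row, model_row):.1f}% identical columns")
```

The two rows differ in a single column (A in the query, Q in the model), so the value is just below 100.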
PRC2_HTH_1 pfam18118
Polycomb repressive complex 2 tri-helical domain; This domain can be found in the Polycomb ...
159-262 4.11e-37

Polycomb repressive complex 2 tri-helical domain; This domain can be found in the Polycomb repressive complex 2 (PRC2) present in Homo sapiens. Polycomb complexes maintain repressive chromatin states by silencing gene expression. PRC2 does this by methylating lysine 27 of histone H3. This domain makes up part of the N-lobe which is involved in regulation.



Pssm-ID: 436286  Cd Length: 101  Bit Score: 134.05  E-value: 4.11e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182    159 HGEEEmipgSVLISDAVFLELVDALNQYSDEEEEGHNDTSdGKQDDSKEDLPVTRKRKRH--AIEGNKKSSKKQFPNDMI 236
Cdd:pfam18118   1 HGDRE----GGFINDDIFVELVNALMQYYDDDDESEPESS-EKMSQAKKDERKTEEDERTekKDGDEKSESKKPFPSDII 75
                          90       100
                  ....*....|....*....|....*.
gi 3334182    237 FSAIASMFPENGVPDDMKERYRELTE 262
Cdd:pfam18118  76 FQAISSMFPDKGTPEELKEKYKELTE 101
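Each alignment row above carries a start coordinate, the aligned sequence, and an end coordinate. A short sketch of how such a row could be parsed and sanity-checked follows; the names `AlignedRow`, `parse_row` and `residues_consistent` are hypothetical, and the row layout is inferred only from the lines shown on this page.

```python
import re
from typing import NamedTuple


class AlignedRow(NamedTuple):
    label: str      # e.g. "gi 3334182" or "Cdd:pfam18118"
    start: int      # coordinate of the first residue in the row
    sequence: str   # aligned residues; '-' marks a gap
    end: int        # coordinate of the last residue in the row


# Layout assumed from the rows on this page: label, start, sequence, end.
_ROW = re.compile(r"^(gi \d+|Cdd:\S+)\s+(\d+)\s+(\S+)\s+(\d+)$")


def parse_row(line: str) -> AlignedRow:
    """Split one alignment row into its label, coordinates and sequence."""
    m = _ROW.match(line.strip())
    if m is None:
        raise ValueError(f"not an alignment row: {line!r}")
    return AlignedRow(m.group(1), int(m.group(2)), m.group(3), int(m.group(4)))


def residues_consistent(row: AlignedRow) -> bool:
    """True if the number of non-gap characters equals end - start + 1."""
    residues = sum(1 for ch in row.sequence if ch != "-")
    return residues == row.end - row.start + 1


row = parse_row(
    "gi 3334182    159 HGEEEmipgSVLISDAVFLELVDALNQYSDEEEEGHNDTSdGKQDDSKEDLPVTRKRKRH"
    "--AIEGNKKSSKKQFPNDMI 236"
)
print(row.label, row.start, row.end, residues_consistent(row))
```

Lowercase letters in the query row are counted as residues here, since they occupy query positions even though they are not aligned to the model.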
EZH2_WD-Binding pfam11616
WD repeat binding protein EZH2; This family of proteins represents Enhancer of zeste homolog 2, ...
39-68 2.45e-12

WD repeat binding protein EZH2; This family of proteins represents Enhancer of zeste homolog 2 (EZH2), a 30-residue peptide that binds to a WD-repeat domain of EED through residues 39-68. EED is a component of the PRC2 complex, which is involved in gene expression. This interaction is required for the HMTase activity of PRC2.



Pssm-ID: 463308  Cd Length: 30  Bit Score: 61.31  E-value: 2.45e-12
                          10        20        30
                  ....*....|....*....|....*....|
gi 3334182     39 KALYVANFAKVQEKTQILNEEWKKLRVQPV 68
Cdd:pfam11616   1 KSLFVSNRQKIQERTELLNEEWKKLRIQPI 30
preSET_CXC pfam18264
CXC domain; This domain is found N-terminal to the SET domain in the EZH2 protein. It ...
560-591 5.00e-10

CXC domain; This domain is found N-terminal to the SET domain in the EZH2 protein. It is a zinc-binding domain.



Pssm-ID: 408079  Cd Length: 32  Bit Score: 54.84  E-value: 5.00e-10
                          10        20        30
                  ....*....|....*....|....*....|..
gi 3334182    560 GCRCKTQCNTKQCPCYLAVRECDPDLCLTCGA 591
Cdd:pfam18264   1 GCSCRATCYTKACLCYRANRECDPDLCNMCGA 32
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
433-474 9.60e-05

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. Binding is sequence-dependent for repeats that contain the G/C-rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes, where it also binds DNA.



Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 40.25  E-value: 9.60e-05
                        10        20        30        40
                ....*....|....*....|....*....|....*....|...
gi 3334182  433 EWTGAEESLFRVFHGTY-FNNFCSIARLLGTKTCKQVFQFAVK 474
Cdd:cd00167   1 PWTEEEDELLLEAVKKYgKNNWEKIAKELPGRTPKQCRERWRN 43
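Every hit in the list above has a one-line statistics summary (Pssm-ID, Cd Length, Bit Score, E-value, and an optional [Multi-domain] flag). A minimal sketch of reading that line into a typed record follows; the names `HitStats` and `parse_stats` are hypothetical, and the pattern is written against the lines shown here rather than against any documented output format.

```python
import re
from typing import NamedTuple


class HitStats(NamedTuple):
    pssm_id: int
    multi_domain: bool
    cd_length: int
    bit_score: float
    e_value: float


# Pattern assumed from the summary lines on this page.
_STATS = re.compile(
    r"Pssm-ID:\s*(\d+)\s*(\[Multi-domain\])?\s*"
    r"Cd Length:\s*(\d+)\s+Bit Score:\s*([\d.]+)\s+E-value:\s*(\S+)"
)


def parse_stats(line: str) -> HitStats:
    """Parse one 'Pssm-ID: ... E-value: ...' summary line."""
    m = _STATS.search(line)
    if m is None:
        raise ValueError(f"not a statistics line: {line!r}")
    return HitStats(
        pssm_id=int(m.group(1)),
        multi_domain=m.group(2) is not None,
        cd_length=int(m.group(3)),
        bit_score=float(m.group(4)),
        e_value=float(m.group(5)),
    )


lines = [
    "Pssm-ID: 380994  Cd Length: 136  Bit Score: 288.89  E-value: 3.16e-94",
    "Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 40.25  E-value: 9.60e-05",
]
hits = [parse_stats(line) for line in lines]

# Hits ordered from most to least significant (smallest E-value first).
for hit in sorted(hits, key=lambda h: h.e_value):
    print(hit)
```

Keeping the E-value as a float makes it straightforward to sort hits or apply a significance cutoff of the reader's choosing.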
 
Name Accession Description Interval E-value
SET_EZH1 cd19217
SET domain found in enhancer of zeste homolog 1 (EZH1) and similar proteins; EZH1 (EC 2.1.1.43) ...
608-743 3.16e-94

SET domain found in enhancer of zeste homolog 1 (EZH1) and similar proteins; EZH1 (EC 2.1.1.43), also termed ENX-2, or histone-lysine N-methyltransferase EZH1, is a catalytic subunit of the PRC2/EED-EZH1 complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target gene. It can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively.


Pssm-ID: 380994  Cd Length: 136  Bit Score: 288.89  E-value: 3.16e-94
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  608 QRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNDFVVDATRKGNKIRF 687
Cdd:cd19217   1 QRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNDFVVDATRKGNKIRF 80
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|....*.
gi 3334182  688 ANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQADALKYVGIERE 743
Cdd:cd19217  81 ANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQQGEELFFDYRYSQADALKYVGIERE 136
SET smart00317
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on ...
613-734 1.57e-41

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on outlier plant homologues


Pssm-ID: 214614 [Multi-domain]  Cd Length: 124  Bit Score: 147.48  E-value: 1.57e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182     613 KHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFNLNNDFVVDATRKGNKIRFANH 690
Cdd:smart00317   1 NKLEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERPKAYDTDGakAFYLFDIDSDLCIDARRKGNLARFINH 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 3334182     691 SVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQADA 734
Cdd:smart00317  81 SCEPNCELLFVEVNGDDRIVIFALRDIKPGEELTIDYGSDYANE 124
PRC2_HTH_1 pfam18118
Polycomb repressive complex 2 tri-helical domain; This domain can be found in the Polycomb ...
159-262 4.11e-37

Polycomb repressive complex 2 tri-helical domain; This domain can be found in the Polycomb repressive complex 2 (PRC2) present in Homo sapiens. Polycomb complexes maintain repressive chromatin states by silencing gene expression. PRC2 does this by methylating lysine 27 of histone H3. This domain makes up part of the N-lobe which is involved in regulation.


Pssm-ID: 436286  Cd Length: 101  Bit Score: 134.05  E-value: 4.11e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182    159 HGEEEmipgSVLISDAVFLELVDALNQYSDEEEEGHNDTSdGKQDDSKEDLPVTRKRKRH--AIEGNKKSSKKQFPNDMI 236
Cdd:pfam18118   1 HGDRE----GGFINDDIFVELVNALMQYYDDDDESEPESS-EKMSQAKKDERKTEEDERTekKDGDEKSESKKPFPSDII 75
                          90       100
                  ....*....|....*....|....*.
gi 3334182    237 FSAIASMFPENGVPDDMKERYRELTE 262
Cdd:pfam18118  76 FQAISSMFPDKGTPEELKEKYKELTE 101
SET COG2940
SET domain-containing protein (function unknown) [General function prediction only];
611-734 4.55e-29

SET domain-containing protein (function unknown) [General function prediction only];


Pssm-ID: 442183 [Multi-domain]  Cd Length: 134  Bit Score: 112.36  E-value: 4.55e-29
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  611 LKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRR---GKVYDKYmssfLFNLNNDFVVDATRKGNKIRF 687
Cdd:COG2940   4 LHPRIEVRPSPIHGRGVFATRDIPKGTLIGEYPGEVITWAEAERRephKEPLHTY----LFELDDDGVIDGALGGNPARF 79
                        90       100       110       120
                ....*....|....*....|....*....|....*....|....*..
gi 3334182  688 ANHSVNPNCYAkvvmVNGDHRIGIFAKRAIQAGEELFFDYRYSQADA 734
Cdd:COG2940  80 INHSCDPNCEA----DEEDGRIFIVALRDIAAGEELTYDYGLDYDEE 122
SET pfam00856
SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be ...
624-727 1.55e-26

SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure that is probably of functional importance. In addition to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.


Pssm-ID: 459965 [Multi-domain]  Cd Length: 115  Bit Score: 104.53  E-value: 1.55e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182    624 GWGTFIKESVQKNEFISEYCGE-LISQDEADRRGKVYDKYM-----SSFLFNLNND--FVVDAT--RKGNKIRFANHSVN 693
Cdd:pfam00856   1 GRGLFATEDIPKGEFIGEYVEVlLITKEEADKRELLYYDKLelrlwGPYLFTLDEDseYCIDARalYYGNWARFINHSCD 80
                          90       100       110
                  ....*....|....*....|....*....|....
gi 3334182    694 PNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDY 727
Cdd:pfam00856  81 PNCEVRVVYVNGGPRIVIFALRDIKPGEELTIDY 114
EZH2_WD-Binding pfam11616
WD repeat binding protein EZH2; This family of proteins represents Enhancer of zeste homolog 2, ...
39-68 2.45e-12

WD repeat binding protein EZH2; This family of proteins represents Enhancer of zeste homolog 2 (EZH2), a 30-residue peptide that binds to a WD-repeat domain of EED through residues 39-68. EED is a component of the PRC2 complex, which is involved in gene expression. This interaction is required for the HMTase activity of PRC2.


Pssm-ID: 463308  Cd Length: 30  Bit Score: 61.31  E-value: 2.45e-12
                          10        20        30
                  ....*....|....*....|....*....|
gi 3334182     39 KALYVANFAKVQEKTQILNEEWKKLRVQPV 68
Cdd:pfam11616   1 KSLFVSNRQKIQERTELLNEEWKKLRIQPI 30
preSET_CXC pfam18264
CXC domain; This domain is found N-terminal to the SET domain in the EZH2 protein. It ...
560-591 5.00e-10

CXC domain; This domain is found N-terminal to the SET domain in the EZH2 protein. It is a zinc-binding domain.


Pssm-ID: 408079  Cd Length: 32  Bit Score: 54.84  E-value: 5.00e-10
                          10        20        30
                  ....*....|....*....|....*....|..
gi 3334182    560 GCRCKTQCNTKQCPCYLAVRECDPDLCLTCGA 591
Cdd:pfam18264   1 GCSCRATCYTKACLCYRANRECDPDLCNMCGA 32
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
433-474 9.60e-05

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. Binding is sequence-dependent for repeats that contain the G/C-rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes, where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 40.25  E-value: 9.60e-05
                        10        20        30        40
                ....*....|....*....|....*....|....*....|...
gi 3334182  433 EWTGAEESLFRVFHGTY-FNNFCSIARLLGTKTCKQVFQFAVK 474
Cdd:cd00167   1 PWTEEEDELLLEAVKKYgKNNWEKIAKELPGRTPKQCRERWRN 43
SANT smart00717
SANT 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains;
433-474 6.44e-04

SANT 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 37.97  E-value: 6.44e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 3334182     433 EWTGAEESLFRVFHGTY-FNNFCSIARLLGTKTCKQVFQFAVK 474
Cdd:smart00717   3 EWTEEEDELLIELVKKYgKNNWEKIAKELPGRTAEQCRERWRN 45
 
Name Accession Description Interval E-value
SET_EZH1 cd19217
SET domain found in enhancer of zeste homolog 1 (EZH1) and similar proteins; EZH1 (EC 2.1.1.43) ...
608-743 3.16e-94

SET domain found in enhancer of zeste homolog 1 (EZH1) and similar proteins; EZH1 (EC 2.1.1.43), also termed ENX-2, or histone-lysine N-methyltransferase EZH1, is a catalytic subunit of the PRC2/EED-EZH1 complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target gene. It can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively.


Pssm-ID: 380994  Cd Length: 136  Bit Score: 288.89  E-value: 3.16e-94
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  608 QRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNDFVVDATRKGNKIRF 687
Cdd:cd19217   1 QRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNDFVVDATRKGNKIRF 80
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|....*.
gi 3334182  688 ANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQADALKYVGIERE 743
Cdd:cd19217  81 ANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQQGEELFFDYRYSQADALKYVGIERE 136
SET_EZH2 cd19218
SET domain found in enhancer of zeste homolog 2 (EZH2) and similar proteins; EZH2 (EC 2.1.1.43) ...
610-729 1.76e-86

SET domain found in enhancer of zeste homolog 2 (EZH2) and similar proteins; EZH2 (EC 2.1.1.43), also termed lysine N-methyltransferase 6, or ENX-1, or histone-lysine N-methyltransferase EZH2, is a catalytic subunit of the polycomb repressive complex 2 (PRC2)/EED-EZH2 complex, which methylates 'Lys-9' (H3K9me) and 'Lys-27' (H3K27me) of histone H3, leading to transcriptional repression of the affected target gene. It can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively. PRC2 is involved in several cancers; EZH2 is overexpressed in breast, liver and prostate cancer, while point mutations in EZH2 alter the substrate preference and product specificity of PRC2 in Non-Hodgkin lymphomas (NHLs). Thus, PRC2 is a popular target for cancer therapeutics.


Pssm-ID: 380995  Cd Length: 120  Bit Score: 268.32  E-value: 1.76e-86
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  610 GLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNDFVVDATRKGNKIRFAN 689
Cdd:cd19218   1 GSKKHLLLAPSDVAGWGIFIKDPVQKNEFISEYCGEIISQDEADRRGKVYDKYMCSFLFNLNNDFVVDATRKGNKIRFAN 80
                        90       100       110       120
                ....*....|....*....|....*....|....*....|
gi 3334182  690 HSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729
Cdd:cd19218  81 HSVNPNCYAKVMMVNGDHRIGIFAKRAIQTGEELFFDYRY 120
SET_EZH cd10519
SET domain found in enhancer of zeste homolog 1 (EZH1), zeste homolog 2 (EZH2) and similar ...
613-729 1.16e-83

SET domain found in enhancer of zeste homolog 1 (EZH1), zeste homolog 2 (EZH2) and similar proteins; The family includes EZH1 and EZH2. EZH1 (EC 2.1.1.43; also termed ENX-2, or histone-lysine N-methyltransferase EZH1) is a catalytic subunit of the PRC2/EED-EZH1 complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target gene. EZH2 (EC 2.1.1.43; also termed lysine N-methyltransferase 6, ENX-1, or histone-lysine N-methyltransferase EZH2) is a catalytic subunit of the PRC2/EED-EZH2 complex, which methylates 'Lys-9' (H3K9me) and 'Lys-27' (H3K27me) of histone H3, leading to transcriptional repression of the affected target gene. Both EZH1 and EZH2 can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively.


Pssm-ID: 380917  Cd Length: 117  Bit Score: 260.64  E-value: 1.16e-83
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  613 KHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNDFVVDATRKGNKIRFANHSV 692
Cdd:cd10519   1 KRLLLGKSDVAGWGLFLKEPIKKDEFIGEYTGELISQDEADRRGKIYDKYNSSYLFNLNDQFVVDATRKGNKIRFANHSS 80
                        90       100       110
                ....*....|....*....|....*....|....*..
gi 3334182  693 NPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729
Cdd:cd10519  81 NPNCYAKVMMVNGDHRIGIFAKRDIEAGEELFFDYGY 117
SET_SETD1-like cd10518
SET domain (including post-SET domain) found in SET domain-containing proteins (SETD1A/SETD1B), ...
603-736 6.51e-43

SET domain (including post-SET domain) found in SET domain-containing proteins (SETD1A/SETD1B), histone-lysine N-methyltransferases (KMT2A/KMT2B/KMT2C/KMT2D) and similar proteins; This family includes SET domain-containing protein 1A (SETD1A), 1B (SETD1B), as well as histone-lysine N-methyltransferase 2A (KMT2A), 2B (KMT2B), 2C (KMT2C), 2D (KMT2D). These proteins are histone-lysine N-methyltransferases (EC 2.1.1.43) that specifically methylate 'Lys-4' of histone H3 (H3K4me).


Pssm-ID: 380916  Cd Length: 150  Bit Score: 152.37  E-value: 6.51e-43
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  603 KNCSIQRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDK--YMSSFLFNLNNDFVVDATR 680
Cdd:cd10518   4 RFRQLRSRLKERLRVGKSGIHGWGLFAKRPIAAGEMVIEYVGEVIRPIVADKREKRYDEegGGGTYMFRIDEDLVIDATK 83
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|....*.
gi 3334182  681 KGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQADALK 736
Cdd:cd10518  84 KGNIARFINHSCDPNCYAKIITVDGEKHIVIFAKRDIAPGEELTYDYKFPIEDEEK 139
SET smart00317
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on ...
613-734 1.57e-41

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain; Putative methyl transferase, based on outlier plant homologues


Pssm-ID: 214614 [Multi-domain]  Cd Length: 124  Bit Score: 147.48  E-value: 1.57e-41
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182     613 KHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYM--SSFLFNLNNDFVVDATRKGNKIRFANH 690
Cdd:smart00317   1 NKLEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERPKAYDTDGakAFYLFDIDSDLCIDARRKGNLARFINH 80
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|....
gi 3334182     691 SVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQADA 734
Cdd:smart00317  81 SCEPNCELLFVEVNGDDRIVIFALRDIKPGEELTIDYGSDYANE 124
PRC2_HTH_1 pfam18118
Polycomb repressive complex 2 tri-helical domain; This domain can be found in the Polycomb ...
159-262 4.11e-37

Polycomb repressive complex 2 tri-helical domain; This domain can be found in the Polycomb repressive complex 2 (PRC2) present in Homo sapiens. Polycomb complexes maintain repressive chromatin states by silencing gene expression. PRC2 does this by methylating lysine 27 of histone H3. This domain makes up part of the N-lobe which is involved in regulation.


Pssm-ID: 436286  Cd Length: 101  Bit Score: 134.05  E-value: 4.11e-37
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182    159 HGEEEmipgSVLISDAVFLELVDALNQYSDEEEEGHNDTSdGKQDDSKEDLPVTRKRKRH--AIEGNKKSSKKQFPNDMI 236
Cdd:pfam18118   1 HGDRE----GGFINDDIFVELVNALMQYYDDDDESEPESS-EKMSQAKKDERKTEEDERTekKDGDEKSESKKPFPSDII 75
                          90       100
                  ....*....|....*....|....*.
gi 3334182    237 FSAIASMFPENGVPDDMKERYRELTE 262
Cdd:pfam18118  76 FQAISSMFPDKGTPEELKEKYKELTE 101
SET_SETD2-like cd10531
SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2), ...
624-731 1.60e-36

SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2), nuclear SETD2 (NSD2), ASH1-like protein (ASH1L) and similar proteins; This family includes SET domain-containing protein 2 (SETD2), nuclear SETD2 (NSD2) and ASH1-like protein (ASH1L), which function as histone-lysine N-methyltransferases. SETD2 specifically trimethylates 'Lys-36' of histone H3 (H3K36me3) using dimethylated 'Lys-36' (H3K36me2) as substrate. NSD2 shows histone H3 'Lys-27' (H3K27me) methyltransferase activity. ASH1L specifically methylates 'Lys-36' of histone H3 (H3K36me). The family also includes Arabidopsis thaliana ASH1-related protein 3 (ASHR3) and similar proteins.


Pssm-ID: 380929  Cd Length: 136  Bit Score: 133.92  E-value: 1.60e-36
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  624 GWGTFIKESVQKNEFISEYCGELISQDEADRRGKVY--DKYMSSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVV 701
Cdd:cd10531  11 GWGVKAKEDIQKGEFIIEYVGEVIDKKEFKERLDEYeeLGKSNFYILSLSDDVVIDATRKGNLSRFINHSCEPNCETQKW 90
                        90       100       110
                ....*....|....*....|....*....|
gi 3334182  702 MVNGDHRIGIFAKRAIQAGEELFFDYRYSQ 731
Cdd:cd10531  91 IVNGEYRIGIFALRDIPAGEELTFDYNFVN 120
SET_SETD1 cd19169
SET domain (including post-SET domain) found in SET domain-containing protein 1 (SETD1) and ...
612-733 4.00e-35

SET domain (including post-SET domain) found in SET domain-containing protein 1 (SETD1) and similar proteins; This family includes SET domain-containing protein 1A (SETD1A) and SET domain-containing protein 1B (SETD1B). These proteins are histone-lysine N-methyltransferases that specifically methylate 'Lys-4' of histone H3 (H3K4me) when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated.


Pssm-ID: 380946  Cd Length: 148  Bit Score: 130.15  E-value: 4.00e-35
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDK--YMSSFLFNLNNDFVVDATRKGNKIRFAN 689
Cdd:cd19169  12 KKQLKFAKSRIHDWGLFALEPIAADEMVIEYVGQVIRQSVADEREKRYEAigIGSSYLFRVDDDTIIDATKCGNLARFIN 91
                        90       100       110       120
                ....*....|....*....|....*....|....*....|....
gi 3334182  690 HSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQAD 733
Cdd:cd19169  92 HSCNPNCYAKIITVESQKKIVIYSKRPIAVNEEITYDYKFPIED 135
SET_SET1 cd20072
SET domain (including post-SET domain) found in catalytic component of the Saccharomyces ...
612-733 1.98e-34

SET domain (including post-SET domain) found in catalytic component of the Saccharomyces cerevisiae COMPASS complex and similar proteins; The family contains mostly fungal SET domains, including SET1 found in the catalytic component of the Saccharomyces cerevisiae COMPASS (complex of proteins associated with Set1). SET1 is a histone-lysine N-methyltransferase that specifically methylates 'Lys-4' of histone H3 (H3K4me), when part of the SET1 histone methyltransferase (HMT) complex. The activity of this catalytic domain is established through forming a complex with a set of core proteins; it is extensively contacted by Cps60 (Bre2), Cps50 (Swd1), and Cps30 (Swd3).


Pssm-ID: 380998  Cd Length: 148  Bit Score: 128.31  E-value: 1.98e-34
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDK--YMSSFLFNLNNDFVVDATRKGNKIRFAN 689
Cdd:cd20072  12 KKQLKFARSAIHNWGLYAMENISAKDMVIEYVGEVIRQQVADEREKRYLRqgIGSSYLFRIDDDTVVDATKKGNIARFIN 91
                        90       100       110       120
                ....*....|....*....|....*....|....*....|....
gi 3334182  690 HSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQAD 733
Cdd:cd20072  92 HCCDPNCTAKIIKVEGEKRIVIYAKRDIAAGEELTYDYKFPREE 135
SET_SETD2 cd19172
SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2) and ...
624-729 9.49e-34

SET domain (including post-SET domain) found in SET domain-containing protein 2 (SETD2) and similar proteins; SETD2 (also termed HIF-1, huntingtin yeast partner B, huntingtin-interacting protein 1 (HIP-1), huntingtin-interacting protein B, lysine N-methyltransferase 3A or protein-lysine N-methyltransferase SETD2) acts as histone-lysine N-methyltransferase that specifically trimethylates 'Lys-36' of histone H3 (H3K36me3) using dimethylated 'Lys-36' (H3K36me2) as substrate. It has been shown that methylation is a posttranslational modification of dynamic microtubules and that SETD2 methylates alpha-tubulin at lysine 40, the same lysine that is marked by acetylation on microtubules. Methylation of microtubules occurs during mitosis and cytokinesis and can be ablated by SETD2 deletion, which causes mitotic spindle and cytokinesis defects, micronuclei, and polyploidy.


Pssm-ID: 380949 [Multi-domain]  Cd Length: 142  Bit Score: 126.16  E-value: 9.49e-34
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  624 GWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDK-YMSSFLF-NLNNDFVVDATRKGNKIRFANHSVNPNCYAKVV 701
Cdd:cd19172  13 GWGLRAAEDLPKGTFVIEYVGEVLDEKEFKRRMKEYAReGNRHYYFmALKSDEIIDATKKGNLSRFINHSCEPNCETQKW 92
                        90       100
                ....*....|....*....|....*...
gi 3334182  702 MVNGDHRIGIFAKRAIQAGEELFFDYRY 729
Cdd:cd19172  93 TVNGELRVGFFAKRDIPAGEELTFDYQF 120
SET_SETDB-like cd10538
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) ...
557-727 1.44e-32

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) and 2 (SETDB2), suppressor of variegation 3-9 homologs, SUV39H1 and SUV39H2, euchromatic histone-lysine N-methyltransferase EHMT1 and EHMT2, and similar proteins; The family includes SET domain bifurcated 1 (SETDB1) and 2 (SETDB2), suppressor of variegation 3-9 homologs, SUV39H1 and SUV39H2, euchromatic histone-lysine N-methyltransferase EHMT1 and EHMT2. SETDB1 (EC 2.1.1.43; also termed ERG-associated protein with SET domain (ESET), histone H3-K9 methyltransferase 4, H3-K9-HMTase 4, or lysine N-methyltransferase 1E (KMT1E)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It mainly functions in euchromatin regions, thereby playing a central role in the silencing of euchromatic genes. SETDB2 (EC 2.1.1.43; also termed chronic lymphocytic leukemia deletion region gene 8 protein (CLLD8), or lysine N-methyltransferase 1F (KMT1F)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It is involved in left-right axis specification in early development and mitosis. SUV39H1 (also termed histone H3-K9 methyltransferase 1, H3-K9-HMTase 1, lysine N-methyltransferase 1A, KMT1A, position-effect variegation 3-9 homolog, SUV39H, or Su(var)3-9 homolog 1) and SUV39H2 (also termed histone H3-K9 methyltransferase 2, H3-K9-HMTase 2, lysine N-methyltransferase 1B, KMT1B, or Su(var)3-9 homolog 2), both act as histone-lysine N-methyltransferases that specifically trimethylate 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. They mainly function in heterochromatin regions, thereby playing central roles in the establishment of constitutive heterochromatin at pericentric and telomere regions. 
EHMT1 (also termed Eu-HMTase1, G9a-like protein 1, GLP, GLP1, histone H3-K9 methyltransferase 5, H3-K9-HMTase 5, lysine N-methyltransferase 1D, or KMT1D) and EHMT2 (also termed Eu-HMTase2, HLA-B-associated transcript 8, histone H3-K9 methyltransferase 3, H3-K9-HMTase 3, lysine N-methyltransferase 1C, KMT1C, or protein G9a), both act as histone-lysine N-methyltransferases that specifically mono- and dimethylate 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin. This family also includes the pre-SET domain, which is found in a number of histone methyltransferases (HMTase), N-terminal to the SET domain. Pre-SET domain is a zinc binding motif which contains 9 conserved cysteines that coordinate three zinc ions. It is thought that this region plays a structural role in stabilizing SET domains. Most family members, except for Arabidopsis thaliana SUVH9, contain a post-SET domain which harbors a zinc-binding site.


Pssm-ID: 380936 [Multi-domain]  Cd Length: 217  Bit Score: 125.56  E-value: 1.44e-32
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  557 RFPGCRCKTQCNTKQCPCY---------------------LAVRECDPdlclTCGASEHwdckvvsCKNCSIQRGLKKHL 615
Cdd:cd10538  23 DSVGCKCKDDCLDSKCACAaesdgifaytkngllrlnnspPPIFECNS----KCSCDDD-------CKNRVVQRGLQARL 91
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  616 LLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNND---------FVVDATRKGNKIR 686
Cdd:cd10538  92 QVFRTSKKGWGVRSLEFIPKGSFVCEYVGEVITTSEADRRGKIYDKSGGSYLFDLDEFsdsdgdgeeLCVDATFCGNVSR 171
                       170       180       190       200
                ....*....|....*....|....*....|....*....|....*
gi 3334182  687 FANHSVNPNCYA-KVVMVNGD---HRIGIFAKRAIQAGEELFFDY 727
Cdd:cd10538 172 FINHSCDPNLFPfNVVIDHDDlryPRIALFATRDILPGEELTFDY 216
SET_SUV39H cd10542
SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 ...
547-746 6.16e-32

SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 homologs, SUV39H1, SUV39H2 and similar proteins; This family includes SUV39H1 (also termed histone H3-K9 methyltransferase 1, H3-K9-HMTase 1, lysine N-methyltransferase 1A, KMT1A, position-effect variegation 3-9 homolog, SUV39H, or Su(var)3-9 homolog 1) and SUV39H2 (also termed histone H3-K9 methyltransferase 2, H3-K9-HMTase 2, lysine N-methyltransferase 1B, KMT1B, or Su(var)3-9 homolog 2), both of which act as histone-lysine N-methyltransferases that specifically trimethylate 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. They mainly function in heterochromatin regions, thereby playing central roles in the establishment of constitutive heterochromatin at pericentric and telomere regions. Also included are Schizosaccharomyces pombe H3K9 methyltransferase Clr4 (SUV39H homolog) and Neurospora crassa DIM-5, both of which also methylate 'Lys-9' of histone H3.


Pssm-ID: 380940 [Multi-domain]  Cd Length: 245  Bit Score: 124.71  E-value: 6.16e-32
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  547 FCQCNPDCQNRFPGCrCKTQCNTK---------QCPCYLAVRECDPdlclTCGASEhwdckvvSCKNCSIQRGLKKHL-L 616
Cdd:cd10542  24 GCECTEDCHNNNPTC-CPAESGVKfaydkqgrlRLPPGTPIYECNS----RCKCGP-------DCPNRVVQRGRKVPLcI 91
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  617 LAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLN-----NDFVVDATRKGNKIRFANHS 691
Cdd:cd10542  92 FRTSNGRGWGVKTLEDIKKGTFVMEYVGEIITSEEAERRGKIYDANGRTYLFDLDyndddCEYTVDAAYYGNISHFINHS 171
                       170       180       190       200       210       220
                ....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  692 VNPN--CYAkVVMVNGD---HRIGIFAKRAIQAGEELFFDYRYSQADALKYVGIERETDV 746
Cdd:cd10542 172 CDPNlaVYA-VWINHLDprlPRIAFFAKRDIKAGEELTFDYLMTGTGGSSESTIPKPKDV 230
SET_ASH1L cd19174
SET domain (including post-SET domain) found in ASH1-like protein (ASH1L) and similar proteins; ...
614-730 7.38e-32

SET domain (including post-SET domain) found in ASH1-like protein (ASH1L) and similar proteins; ASH1L (EC 2.1.1.43; also termed absent small and homeotic disks protein 1 homolog, KMT2H, or lysine N-methyltransferase 2H) acts as a histone-lysine N-methyltransferase that specifically methylates 'Lys-36' of histone H3 (H3K36me). It plays important roles in development; heterozygous mutation of ASH1L is associated with severe intellectual disability (ID) and multiple congenital anomalies (MCA).


Pssm-ID: 380951 [Multi-domain]  Cd Length: 141  Bit Score: 120.86  E-value: 7.38e-32
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  614 HLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGK-VYDKYMSSFLFNLNNDFVVDATRKGNKIRFANHSV 692
Cdd:cd19174   1 GLERFRTEDKGWGVRTKEPIKAGQFIIEYVGEVVSEQEFRRRMIeQYHNHSHHYCLNLDSGMVIDGYRMGNEARFVNHSC 80
                        90       100       110
                ....*....|....*....|....*....|....*...
gi 3334182  693 NPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYS 730
Cdd:cd19174  81 DPNCEMQKWSVNGVYRIGLFALKDIPAGEELTYDYNFH 118
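Each alignment block pairs a query row (`gi ...`) with a domain-model row (`Cdd:...`), so a rough identity figure can be computed directly from the text. The sketch below is a minimal Python example under stated assumptions: it treats `-` as a gap and lowercase letters as unaligned residues (which is how these blocks appear to be laid out), and it counts only columns where both rows carry an uppercase residue. The helper names `row_sequence` and `aligned_identity` are hypothetical.

```python
def row_sequence(line):
    """Extract the residue string from one alignment row.

    Assumes the layout used in this listing: label, start coordinate,
    residues, end coordinate, separated by whitespace.
    """
    return line.split()[-2]


def aligned_identity(query, model):
    """Return (identical, aligned) column counts for two equal-length rows.

    Columns containing a gap ('-') or a lowercase (assumed unaligned)
    residue in either row are skipped.
    """
    if len(query) != len(model):
        raise ValueError("rows must have the same number of columns")
    identical = aligned = 0
    for q, m in zip(query, model):
        if q.isupper() and m.isupper():
            aligned += 1
            if q == m:
                identical += 1
    return identical, aligned


# Usage: last row pair of the cd19174 (SET_ASH1L) block above.
q = row_sequence("gi 3334182  693 NPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYS 730")
m = row_sequence("Cdd:cd19174  81 DPNCEMQKWSVNGVYRIGLFALKDIPAGEELTYDYNFH 118")
identical, aligned = aligned_identity(q, m)
print(f"{identical}/{aligned} aligned columns identical")
```

Summing the counts over all row pairs of a hit gives a whole-hit identity; this is a plain column count and is not the score that CD-Search itself reports.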
SET_NSD cd19173
SET domain (including post-SET domain) found in nuclear SET domain-containing proteins, NSD1, ...
624-727 2.89e-30

SET domain (including post-SET domain) found in nuclear SET domain-containing proteins, NSD1, NSD2, NSD3 and similar proteins; The nuclear receptor-binding SET Domain (NSD) family of histone H3 lysine 36 methyltransferases comprises NSD1, NSD2, and NSD3, which are primarily known to be involved in chromatin integrity and gene expression through mono-, di-, or trimethylation of lysine 36 of histone H3 (H3K36). NSD1 (EC 2.1.1.43; also termed histone-lysine N-methyltransferase H3 lysine-36 and H4 lysine-20 specific, androgen receptor coactivator 267 kDa protein (ARA267), androgen receptor-associated protein of 267 kDa, H3-K36-HMTase, H4-K20-HMTase, lysine N-methyltransferase 3B (KMT3B) or NR-binding SET domain-containing protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-36' of histone H3 and 'Lys-20' of histone H4. NSD2 (EC 2.1.1.43; also termed multiple myeloma SET domain-containing protein (MMSET), protein trithorax-5 (TRX5), or Wolf-Hirschhorn syndrome candidate 1 protein (WHSC1)) acts as a histone-lysine N-methyltransferase with histone H3 'Lys-27' (H3K27me) methyltransferase activity. NSD3 (EC 2.1.1.43; also termed protein whistle, WHSC1-like 1 isoform 9 with methyltransferase activity to lysine, Wolf-Hirschhorn syndrome candidate 1-like protein 1 (WHSC1L1), or WHSC1-like protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-4' and 'Lys-27' of histone H3.


Pssm-ID: 380950 [Multi-domain]  Cd Length: 142  Bit Score: 116.26  E-value: 2.89e-30
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  624 GWGTFIKESVQKNEFISEYCGELISQDEADRR-GKVYDKYMSSFLF-NLNNDFVVDATRKGNKIRFANHSVNPNCYAKVV 701
Cdd:cd19173  13 GWGLRTKRDIKKGDFVIEYVGELIDEEECRRRlKKAHENNITNFYMlTLDKDRIIDAGPKGNLSRFMNHSCQPNCETQKW 92
                        90       100
                ....*....|....*....|....*.
gi 3334182  702 MVNGDHRIGIFAKRAIQAGEELFFDY 727
Cdd:cd19173  93 TVNGDTRVGLFAVRDIPAGEELTFNY 118
SET_EZH-like cd19168
SET domain found in enhancer of zeste homolog 1 (EZH1) and zeste homolog 2 (EZH2) of polycomb ...
613-727 3.02e-30

SET domain found in enhancer of zeste homolog 1 (EZH1) and zeste homolog 2 (EZH2) of polycomb repressive complex 2 (PRC2), and similar proteins; The family includes EZH1 and EZH2. EZH1 (EC 2.1.1.43; also termed ENX-2, or histone-lysine N-methyltransferase EZH1) is a catalytic subunit of the PRC2/EED-EZH1 complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target gene. EZH2 (EC 2.1.1.43; also termed lysine N-methyltransferase 6, ENX-1, or histone-lysine N-methyltransferase EZH2) is a catalytic subunit of the PRC2/EED-EZH2 complex, which methylates 'Lys-9' (H3K9me) and 'Lys-27' (H3K27me) of histone H3, leading to transcriptional repression of the affected target gene. Both EZH1 and EZH2 can mono-, di- and trimethylate 'Lys-27' of histone H3 to form H3K27me1, H3K27me2 and H3K27me3, respectively. PRC2 is involved in several cancers; EZH2 is overexpressed in breast, liver and prostate cancer, while point mutations in EZH2 alter the substrate preference and product specificity of PRC2 in Non-Hodgkin lymphomas (NHLs). Thus, PRC2 is a popular target for cancer therapeutics.


Pssm-ID: 380945  Cd Length: 124  Bit Score: 115.36  E-value: 3.02e-30
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  613 KHLLLAPSDV-AGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNDFVVDATRKGNKIRFANHS 691
Cdd:cd19168   1 KAVVLGKSQLeCGLGLFAAEDIKEGEFVIEYTGELISHDEGVRREHRRGDVSYLYLFEEQEGIWVDAAIYGNLSRYINHA 80
                        90       100       110       120
                ....*....|....*....|....*....|....*....|
gi 3334182  692 VNP----NCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDY 727
Cdd:cd19168  81 TDKvktgNCMPKIMYVNHEWRIKFTAIKDIKIGEELFFNY 120
SET_KMT2C_2D cd19171
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C), ...
612-729 1.56e-29

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C), 2D (KMT2D) and similar proteins; This family includes KMT2C and KMT2D. Both KMT2C (also termed HALR or MLL3) and KMT2D (also termed ALR or MLL2) act as histone methyltransferases that methylate 'Lys-4' of histone H3 (H3K4me). They are subunits of the MLL2/3 complex, a coactivator complex of nuclear receptors, involved in transcriptional coactivation.


Pssm-ID: 380948 [Multi-domain]  Cd Length: 153  Bit Score: 114.45  E-value: 1.56e-29
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDK-----YMssflFNLNNDFVVDATRKGNKIR 686
Cdd:cd19171  13 RSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGEIIRNEVANRREKIYESqnrgiYM----FRIDNDWVIDATMTGGPAR 88
                        90       100       110       120
                ....*....|....*....|....*....|....*....|...
gi 3334182  687 FANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729
Cdd:cd19171  89 YINHSCNPNCVAEVVTFDKEKKIIIISNRRIAKGEELTYDYKF 131
SET_ASHR3-like cd19175
SET domain (including post-SET domain) found in Arabidopsis thaliana ASH1-related protein 3 ...
617-731 1.74e-29

SET domain (including post-SET domain) found in Arabidopsis thaliana ASH1-related protein 3 (ASHR3) and similar proteins; This family includes Arabidopsis thaliana ASH1-related protein 3 (ASHR3, also termed protein SET DOMAIN GROUP 4 or protein stamen loss), ASH1 homolog 3 (ASHH3, also termed protein SET DOMAIN GROUP 7) and homolog 4 (ASHH4, also termed protein SET DOMAIN GROUP 24). They all function as histone-lysine N-methyltransferases (EC 2.1.1.43).


Pssm-ID: 380952 [Multi-domain]  Cd Length: 139  Bit Score: 114.05  E-value: 1.74e-29
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  617 LAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVY-DKYMSSF-LFNLNNDFVVDATRKGNKIRFANHSVNP 694
Cdd:cd19175   4 LVKTEKCGWGLVADEDINAGEFIIEYVGEVIDDKTCEERLWDMkHKGEKNFyMCEIDKDMVIDATFKGNLSRFINHSCDP 83
                        90       100       110
                ....*....|....*....|....*....|....*..
gi 3334182  695 NCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQ 731
Cdd:cd19175  84 NCELQKWQVDGETRIGVFAIRDIKKGEELTYDYQFVQ 120
SET COG2940
SET domain-containing protein (function unknown) [General function prediction only];
611-734 4.55e-29

SET domain-containing protein (function unknown) [General function prediction only];


Pssm-ID: 442183 [Multi-domain]  Cd Length: 134  Bit Score: 112.36  E-value: 4.55e-29
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  611 LKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRR---GKVYDKYmssfLFNLNNDFVVDATRKGNKIRF 687
Cdd:COG2940   4 LHPRIEVRPSPIHGRGVFATRDIPKGTLIGEYPGEVITWAEAERRephKEPLHTY----LFELDDDGVIDGALGGNPARF 79
                        90       100       110       120
                ....*....|....*....|....*....|....*....|....*..
gi 3334182  688 ANHSVNPNCYAkvvmVNGDHRIGIFAKRAIQAGEELFFDYRYSQADA 734
Cdd:COG2940  80 INHSCDPNCEA----DEEDGRIFIVALRDIAAGEELTYDYGLDYDEE 122
SET_LegAS4-like cd10522
SET domain found in Legionella pneumophila type IV secretion system effector LegAS4 and ...
623-727 6.54e-29

SET domain found in Legionella pneumophila type IV secretion system effector LegAS4 and similar proteins; LegAS4 is a type IV secretion system effector of Legionella pneumophila. It contains a SET domain that is involved in the modification of Lys4 of histone H3 (H3K4) in the nucleolus of the host cell, thereby enhancing heterochromatic rDNA transcription. It also contains an ankyrin repeat domain of unknown function at its C-terminal region.


Pssm-ID: 380920 [Multi-domain]  Cd Length: 122  Bit Score: 111.66  E-value: 6.54e-29
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  623 AGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNdFVVDATRKGNKIRFANHSVNPNCYAKVVM 702
Cdd:cd10522  13 NGLGLFAAETIAKGEFVGEYTGEVLDRWEEDRDSVYHYDPLYPFDLNGDI-LVIDAGKKGNLTRFINHSDQPNLELIVRT 91
                        90       100
                ....*....|....*....|....*
gi 3334182  703 VNGDHRIGIFAKRAIQAGEELFFDY 727
Cdd:cd10522  92 LKGEQHIGFVAIRDIKPGEELFISY 116
SET_KMT2A_2B cd19170
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A), ...
620-729 8.66e-29

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A), 2B (KMT2B) and similar proteins; This family includes KMT2A and KMT2B. Both KMT2A (also termed ALL-1 or CXXC7 or MLL or MLL1 or TRX1 or HRX) and KMT2B (also termed MLL4 or TRX2) act as histone methyltransferases that methylate 'Lys-4' of histone H3 (H3K4me).


Pssm-ID: 380947 [Multi-domain]  Cd Length: 152  Bit Score: 112.49  E-value: 8.66e-29
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  620 SDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYD-KYMSSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYA 698
Cdd:cd19170  21 SPIHGRGLFCKRNIDAGEMVIEYAGEVIRSVLTDKREKYYEsKGIGCYMFRIDDDEVVDATMHGNAARFINHSCEPNCYS 100
                        90       100       110
                ....*....|....*....|....*....|.
gi 3334182  699 KVVMVNGDHRIGIFAKRAIQAGEELFFDYRY 729
Cdd:cd19170 101 RVVNIDGKKHIVIFALRRILRGEELTYDYKF 131
SET_SETMAR cd10544
SET domain (including pre-SET and post-SET domains) found in SET domain and mariner ...
558-727 1.18e-26

SET domain (including pre-SET and post-SET domains) found in SET domain and mariner transposase fusion protein (SETMAR) and similar proteins; SETMAR (also termed metnase) is a DNA-binding protein that is indirectly recruited to sites of DNA damage through protein-protein interactions. It has a sequence-specific DNA-binding activity recognizing the 19-mer core of the 5'-terminal inverted repeats (TIRs) of the Hsmar1 element and displays a DNA nicking and end joining activity. SETMAR also acts as a histone-lysine N-methyltransferase that methylates 'Lys-4' and 'Lys-36' of histone H3. It specifically mediates dimethylation of H3 'Lys-36' at sites of DNA double-strand break and may recruit proteins required for efficient DSB repair through non-homologous end-joining.


Pssm-ID: 380942 [Multi-domain]  Cd Length: 254  Bit Score: 109.70  E-value: 1.18e-26
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  558 FPGCRCKTQ-CNTKQCPC-------YLA--------------VRECDpDLClTCGASehwdckvvsCKNCSIQRGLKKHL 615
Cdd:cd10544  24 FPGCDCKTSsCEPETCSClrkygpnYDDdgclldfdgkysgpVFECN-SMC-KCSES---------CQNRVVQNGLQFKL 92
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  616 LLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLN----NDFV----VDATRKGNKIRF 687
Cdd:cd10544  93 QVFKTPKKGWGLRTLEFIPKGRFVCEYAGEVIGFEEARRRTKSQTKGDMNYIIVLRehlsSGKVletfVDPTYIGNIGRF 172
                       170       180       190       200
                ....*....|....*....|....*....|....*....|.
gi 3334182  688 ANHSVNPNCYAKVVMVNGD-HRIGIFAKRAIQAGEELFFDY 727
Cdd:cd10544 173 LNHSCEPNLFMVPVRVDSMvPKLALFAARDIVAGEELSFDY 213
SET pfam00856
SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be ...
624-727 1.55e-26

SET domain; SET domains are protein lysine methyltransferase enzymes. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains have been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as SET-N and SET-C. SET-C forms an unusual and conserved knot-like structure of probable functional importance. In addition to SET-N and SET-C, an insert region (SET-I) and flanking regions of high structural variability form part of the overall structure.


Pssm-ID: 459965 [Multi-domain]  Cd Length: 115  Bit Score: 104.53  E-value: 1.55e-26
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182    624 GWGTFIKESVQKNEFISEYCGE-LISQDEADRRGKVYDKYM-----SSFLFNLNND--FVVDAT--RKGNKIRFANHSVN 693
Cdd:pfam00856   1 GRGLFATEDIPKGEFIGEYVEVlLITKEEADKRELLYYDKLelrlwGPYLFTLDEDseYCIDARalYYGNWARFINHSCD 80
                          90       100       110
                  ....*....|....*....|....*....|....
gi 3334182    694 PNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDY 727
Cdd:pfam00856  81 PNCEVRVVYVNGGPRIVIFALRDIKPGEELTIDY 114
SET_SUV39H1 cd10525
SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 ...
576-728 8.61e-26

SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 homolog 1 (SUV39H1) and similar proteins; SUV39H1 (EC 2.1.1.43; also termed histone H3-K9 methyltransferase 1, H3-K9-HMTase 1, lysine N-methyltransferase 1A (KMT1A), position-effect variegation 3-9 homolog (SUV39H), or Su(var)3-9 homolog 1) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. It mainly functions in heterochromatin regions, thereby playing a central role in the establishment of constitutive heterochromatin at pericentric and telomere regions.


Pssm-ID: 380923 [Multi-domain]  Cd Length: 255  Bit Score: 107.29  E-value: 8.61e-26
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  576 LAVRECDPdlCLTCGASehwdckvvsCKNCSIQRGLKKHL-LLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADR 654
Cdd:cd10525  60 LPIYECNS--RCRCGPD---------CPNRVVQKGIQYDLcIFRTDNGRGWGVRTLEKIRKNSFVMEYVGEIITSEEAER 128
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  655 RGKVYDKYMSSFLFNLN---NDFVVDATRKGNKIRFANHSVNPNCYAKVVMV-NGDH---RIGIFAKRAIQAGEELFFDY 727
Cdd:cd10525 129 RGQIYDRQGATYLFDLDyveDVYTVDAAYYGNISHFVNHSCDPNLQVYNVFIdNLDErlpRIALFATRTIRAGEELTFDY 208

                .
gi 3334182  728 R 728
Cdd:cd10525 209 N 209
SET_EHMT cd10543
SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine ...
560-727 2.55e-24

SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine N-methyltransferase EHMT1, EHMT2 and similar proteins; This family includes EHMT1 (also termed Eu-HMTase1, G9a-like protein 1, GLP, GLP1, histone H3-K9 methyltransferase 5, H3-K9-HMTase 5, lysine N-methyltransferase 1D, or KMT1D) and EHMT2 (also termed Eu-HMTase2, HLA-B-associated transcript 8, histone H3-K9 methyltransferase 3, H3-K9-HMTase 3, lysine N-methyltransferase 1C, KMT1C, or protein G9a), both of which act as histone-lysine N-methyltransferases that specifically mono- and dimethylate 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin.


Pssm-ID: 380941 [Multi-domain]  Cd Length: 231  Bit Score: 102.03  E-value: 2.55e-24
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  560 GCRCKTQCNTKQCPCYLAVREC--DPDLCLTcGASEHWDCKVV-----------SCKNCSIQRGLKKHLLLAPSDVAGWG 626
Cdd:cd10543  26 TCSCRDDCSSDNCVCGRLSVRCwyDKEGRLL-PDFNKLDPPLIfecnracscwrNCRNRVVQNGIRYRLQLFRTRGMGWG 104
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  627 TFIKESVQKNEFISEYCGELISQDEADRRGKvydkymSSFLFNLNND----FVVDATRKGNKIRFANHSVNPNCYAKVVM 702
Cdd:cd10543 105 VRALQDIPKGTFVCEYIGELISDSEADSRED------DSYLFDLDNKdgetYCIDARRYGNISRFINHLCEPNLIPVRVF 178
                       170       180       190
                ....*....|....*....|....*....|.
gi 3334182  703 VngDH------RIGIFAKRAIQAGEELFFDY 727
Cdd:cd10543 179 V--EHqdlrfpRIAFFASRDIKAGEELGFDY 207
SET_KMT2A cd19206
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A) ...
620-734 2.98e-24

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2A (KMT2A) and similar proteins; KMT2A (EC 2.1.1.43; also termed lysine N-methyltransferase 2A, ALL-1, CXXC-type zinc finger protein 7 (CXXC7), myeloid/lymphoid or mixed-lineage leukemia (MLL), myeloid/lymphoid or mixed-lineage leukemia protein 1 (MLL1), trithorax-like protein (TRX1), or zinc finger protein HRX) acts as a histone methyltransferase that plays an essential role in early development and hematopoiesis. It is a catalytic subunit of the MLL1/MLL complex, a multiprotein complex that mediates both methylation of 'Lys-4' of histone H3 (H3K4me) and acetylation of 'Lys-16' of histone H4 (H4K16ac).


Pssm-ID: 380983 [Multi-domain]  Cd Length: 154  Bit Score: 99.33  E-value: 2.98e-24
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  620 SDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYD-KYMSSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYA 698
Cdd:cd19206  21 SPIHGRGLFCKRNIDAGEMVIEYSGNVIRSILTDKREKYYDsKGIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYS 100
                        90       100       110
                ....*....|....*....|....*....|....*.
gi 3334182  699 KVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQADA 734
Cdd:cd19206 101 RVINIDGQKHIVIFAMRKIYRGEELTYDYKFPIEDA 136
SET_SETD1A cd19204
SET domain (including post-SET domain) found in SET domain-containing protein 1A (SETD1A) and ...
612-733 3.22e-24

SET domain (including post-SET domain) found in SET domain-containing protein 1A (SETD1A) and similar proteins; SETD1A (EC 2.1.1.43), also termed lysine N-methyltransferase 2F, or Set1/Ash2 histone methyltransferase complex subunit SET1, is a histone-lysine N-methyltransferase that specifically methylates 'Lys-4' of histone H3 (H3K4me), when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated. Human SET domain containing protein 1A (hSETD1A) expression occurs at a high rate in hepatocellular carcinoma patients and controls tumor metastasis in breast cancer by activating MMP expression.


Pssm-ID: 380981 [Multi-domain]  Cd Length: 153  Bit Score: 99.33  E-value: 3.22e-24
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDK--YMSSFLFNLNNDFVVDATRKGNKIRFAN 689
Cdd:cd19204  13 KKKLRFGRSRIHEWGLFAMEPIAADEMVIEYVGQNIRQVVADMREKRYVQegIGSSYLFRVDHDTIIDATKCGNLARFIN 92
                        90       100       110       120
                ....*....|....*....|....*....|....*....|....
gi 3334182  690 HSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQAD 733
Cdd:cd19204  93 HCCTPNCYAKVITIESQKKIVIYSKQPIGVNEEITYDYKFPIED 136
SET_SUV39H2 cd10532
SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 ...
602-728 3.70e-24

SET domain (including pre-SET and post-SET domains) found in suppressor of variegation 3-9 homolog 2 (SUV39H2) and similar proteins; SUV39H2 (EC 2.1.1.43; also termed histone H3-K9 methyltransferase 2, H3-K9-HMTase 2, lysine N-methyltransferase 1B (KMT1B), or Su(var)3-9 homolog 2) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3) using monomethylated H3 'Lys-9' as substrate. It mainly functions in heterochromatin regions, thereby playing a central role in the establishment of constitutive heterochromatin at pericentric and telomere regions.


Pssm-ID: 380930 [Multi-domain]  Cd Length: 243  Bit Score: 101.89  E-value: 3.70e-24
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  602 CKNCSIQRGLKKHL-LLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLN---NDFVVD 677
Cdd:cd10532  73 CPNRVVQKGTQYSLcIFRTSNGRGWGVKTLQKIKKNSFVMEYVGEVITSEEAERRGQFYDSKGITYLFDLDyesDEFTVD 152
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|....*
gi 3334182  678 ATRKGNKIRFANHSVNPNCYA-KVVMVNGD---HRIGIFAKRAIQAGEELFFDYR 728
Cdd:cd10532 153 AARYGNVSHFVNHSCDPNLQVfNVFIDNLDtrlPRIALFSTRTIKAGEELTFDYQ 207
SET_SETD8 cd10528
SET domain found in SET domain-containing protein 8 (SETD8) and similar proteins; SETD8 (EC 2. ...
607-727 9.51e-24

SET domain found in SET domain-containing protein 8 (SETD8) and similar proteins; SETD8 (EC 2.1.1.43; also termed N-lysine methyltransferase KMT5A, H4-K20-HMTase KMT5A, lysine N-methyltransferase 5A, lysine-specific methylase 5A, PR/SET domain-containing protein 07, PR-Set7 or PR/SET07) is a nucleosomal histone-lysine N-methyltransferase that specifically monomethylates 'Lys-20' of histone H4 (H4K20me1). It plays a central role in the silencing of euchromatic genes.


Pssm-ID: 380926 [Multi-domain]  Cd Length: 141  Bit Score: 97.65  E-value: 9.51e-24
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  607 IQRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVY------DKYMSSFLFNlNNDFVVDAT- 679
Cdd:cd10528  11 ILSGKEEGLKVIEIDGKGRGVIATRPFEKGDFVVEYHGDLITITEAKKREALYakdpstGCYMYYFQYK-GKTYCVDATk 89
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|..
gi 3334182  680 ---RKGNKIrfaNHSV-NPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDY 727
Cdd:cd10528  90 esgRLGRLI---NHSKkKPNLKTKLLVIDGVPHLILVAKRDIKPGEELLYDY 138
SET_NSD2 cd19211
SET domain (including post-SET domain) found in nuclear SET domain-containing protein 2 (NSD2) ...
624-727 1.31e-23

SET domain (including post-SET domain) found in nuclear SET domain-containing protein 2 (NSD2) and similar proteins; NSD2 (EC 2.1.1.43; also termed multiple myeloma SET domain-containing protein (MMSET), protein trithorax-5 (TRX5), or Wolf-Hirschhorn syndrome candidate 1 protein (WHSC1)) acts as a histone-lysine N-methyltransferase with histone H3 'Lys-36' (H3K36me) methyltransferase activity. NSD2 has been shown to mediate di- and trimethylation of H3K36 and dimethylation of H4K20 in different systems, and has been characterized as a transcriptional repressor interacting with histone deacetylase HDAC1 and histone demethylase LSD1. NSD2 mediates constitutive NF-kappaB signaling for cancer cell proliferation, survival and tumor growth. It is highly overexpressed in several types of human cancers, including small-cell lung cancers, neuroblastoma, carcinomas of stomach and colon, and bladder cancers, and its overexpression tends to be associated with tumor aggressiveness. WHSC1 is frequently deleted in Wolf-Hirschhorn syndrome (WHS).


Pssm-ID: 380988 [Multi-domain]  Cd Length: 142  Bit Score: 97.37  E-value: 1.31e-23
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  624 GWGTFIKESVQKNEFISEYCGELISQDEA-DRRGKVYDKYMSSF-LFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVV 701
Cdd:cd19211  13 GWGLIAKRDIKKGEFVNEYVGELIDEEECmARIKHAHENDITHFyMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETQKW 92
                        90       100
                ....*....|....*....|....*.
gi 3334182  702 MVNGDHRIGIFAKRAIQAGEELFFDY 727
Cdd:cd19211  93 TVNGDTRVGLFAVCDIPAGTELTFNY 118
SET_NSD3 cd19212
SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing ...
624-727 1.53e-23

SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing protein 3 (NSD3) and similar proteins; NSD3 (EC 2.1.1.43; also termed protein whistle, WHSC1-like 1 isoform 9 with methyltransferase activity to lysine, Wolf-Hirschhorn syndrome candidate 1-like protein 1 (WHSC1L1), or WHSC1-like protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-4' and 'Lys-27' of histone H3. NSD3 is amplified and overexpressed in multiple cancer types, including acute myeloid leukemia (AML), breast, lung, pancreatic and bladder cancers, as well as squamous cell carcinoma of the head and neck (SCCHN). NSD3 contributes to tumorigenesis by interacting with bromodomain-containing protein 4 (BRD4), a bromodomain and extraterminal (BET) protein, which is a potential therapeutic target in AML. NSD3 is amplified in primary tumors and cell lines from breast carcinoma, and can promote the cell viability of small-cell lung cancer and pancreatic ductal adenocarcinoma. High NSD3 expression is implicated in poor grade and heavy smoking history in SCCHN. Thus, NSD3 may serve as a potential druggable target for selective cancer therapy.


Pssm-ID: 380989 [Multi-domain]  Cd Length: 142  Bit Score: 96.92  E-value: 1.53e-23
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  624 GWGTFIKESVQKNEFISEYCGELISQDEADRRGK-VYDKYMSSF-LFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVV 701
Cdd:cd19212  13 GWGLRTKRSIKKGEFVNEYVGELIDEEECRLRIKrAHENSVTNFyMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKW 92
                        90       100
                ....*....|....*....|....*.
gi 3334182  702 MVNGDHRIGIFAKRAIQAGEELFFDY 727
Cdd:cd19212  93 TVNGDVRVGLFALCDIPAGMELTFNY 118
SET_SETD1B cd19205
SET domain (including post-SET domain) found in SET domain-containing protein 1B (SETD1B) and ...
612-733 6.83e-23

SET domain (including post-SET domain) found in SET domain-containing protein 1B (SETD1B) and similar proteins; SETD1B (EC 2.1.1.43), also termed lysine N-methyltransferase 2G, is a histone-lysine N-methyltransferase that specifically methylates 'Lys-4' of histone H3 (H3K4me) when part of the SET1 histone methyltransferase (HMT) complex, but not if the neighboring 'Lys-9' residue is already methylated. Loss of SETD1B occurs in up to half of gastric and colorectal cancers, most commonly via SETD1B mutations, while de novo variants in SETD1B are associated with intellectual disability, epilepsy and autism.


Pssm-ID: 380982 [Multi-domain]  Cd Length: 153  Bit Score: 95.51  E-value: 6.83e-23
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDK--YMSSFLFNLNNDFVVDATRKGNKIRFAN 689
Cdd:cd19205  13 KKKLKFCKSHIHDWGLFAMEPIAADEMVIEYVGQNIRQVIADMREKRYEDegIGSSYMFRVDHDTIIDATKCGNFARFIN 92
                        90       100       110       120
                ....*....|....*....|....*....|....*....|....
gi 3334182  690 HSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQAD 733
Cdd:cd19205  93 HSCNPNCYAKVITVESQKKIVIYSKQHINVNEEITYDYKFPIED 136
SET_NSD1 cd19210
SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing ...
624-727 3.47e-22

SET domain (including post-SET domain) found in nuclear receptor-binding SET domain-containing protein 1 (NSD1) and similar proteins; NSD1 (EC 2.1.1.43; also termed Histone-lysine N-methyltransferase H3 lysine-36 and H4 lysine-20 specific, androgen receptor coactivator 267 kDa protein (ARA267), androgen receptor-associated protein of 267 kDa, H3-K36-HMTase, H4-K20-HMTase, lysine N-methyltransferase 3B (KMT3B), or NR-binding SET domain-containing protein 1) functions as a histone-lysine N-methyltransferase that preferentially methylates 'Lys-36' of histone H3 and 'Lys-20' of histone H4. NSD1 is altered in approximately 10% of head and neck cancer patients, with a 55% decrease in risk of death in NSD1-mutated versus non-mutated patients; its disruption promotes favorable chemotherapeutic responses linked to hypomethylation.


Pssm-ID: 380987 [Multi-domain]  Cd Length: 142  Bit Score: 93.07  E-value: 3.47e-22
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  624 GWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKY-MSSF-LFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVV 701
Cdd:cd19210  13 GWGLRCKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHdITNFyMLTLDKDRIIDAGPKGNYARFMNHCCQPNCETQKW 92
                        90       100
                ....*....|....*....|....*.
gi 3334182  702 MVNGDHRIGIFAKRAIQAGEELFFDY 727
Cdd:cd19210  93 TVNGDTRVGLFALCDIKAGTELTFNY 118
SET_KMT2C cd19208
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C) ...
612-733 1.07e-21

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2C (KMT2C) and similar proteins; KMT2C (EC 2.1.1.43; also termed lysine N-methyltransferase 2C, homologous to ALR protein (HALR), or myeloid/lymphoid or mixed-lineage leukemia protein 3 (MLL3)) acts as a histone methyltransferase that methylates 'Lys-4' of histone H3 (H3K4me) and may be involved in leukemogenesis and developmental disorders. KMT2C is a catalytic subunit of the MLL2/3 complex, a coactivator complex of nuclear receptors, involved in transcriptional coactivation. Overexpression of KMT2C is associated with estrogen receptor-positive breast cancer; KMT2C mediates the estrogen dependence of breast cancer through regulation of estrogen receptor alpha (ERalpha) enhancer function. KMT2C is frequently mutated in certain populations with diffuse-type gastric adenocarcinomas (DGA); its loss promotes epithelial-to-mesenchymal transition (EMT) and is associated with worse overall survival.


Pssm-ID: 380985 [Multi-domain]  Cd Length: 154  Bit Score: 92.00  E-value: 1.07e-21
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYD-KYMSSFLFNLNNDFVVDATRKGNKIRFANH 690
Cdd:cd19208  14 KSNVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYEsQNRGVYMFRIDNDHVIDATLTGGPARYINH 93
                        90       100       110       120
                ....*....|....*....|....*....|....*....|...
gi 3334182  691 SVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQAD 733
Cdd:cd19208  94 SCAPNCVAEVVTFEKGHKIIISSSRRIQKGEELCYDYKFDFED 136
SET_KMT2B cd19207
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2B (KMT2B) ...
620-734 8.59e-21

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2B (KMT2B) and similar proteins; KMT2B (EC 2.1.1.43; also termed lysine N-methyltransferase 2B, myeloid/lymphoid or mixed-lineage leukemia protein 4 (MLL2/MLL4), trithorax homolog 2 (TRX2), or WW domain-binding protein 7 (WBP-7)) acts as a histone methyltransferase that methylates 'Lys-4' of histone H3 (H3K4me). It is required during the transcriptionally active period of oocyte growth for the establishment and/or maintenance of bulk H3K4 trimethylation (H3K4me3), global transcriptional silencing that precedes resumption of meiosis, oocyte survival and normal zygotic genome activation.


Pssm-ID: 380984 [Multi-domain]  Cd Length: 154  Bit Score: 89.70  E-value: 8.59e-21
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  620 SDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYD-KYMSSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYA 698
Cdd:cd19207  21 SAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDsKGIGCYMFRIDDFDVVDATMHGNAARFINHSCEPNCYS 100
                        90       100       110
                ....*....|....*....|....*....|....*.
gi 3334182  699 KVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQADA 734
Cdd:cd19207 101 RVIHVEGQKHIVIFALRKIYRGEELTYDYKFPIEDA 136
SET_KMT2D cd19209
SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2D (KMT2D) ...
612-733 1.32e-20

SET domain (including post-SET domain) found in histone-lysine N-methyltransferase 2D (KMT2D) and similar proteins; KMT2D (EC 2.1.1.43; also termed lysine N-methyltransferase 2D, ALL1-related protein (ALR), or myeloid/lymphoid or mixed-lineage leukemia protein 2 (MLL2)) acts as a histone methyltransferase that methylates 'Lys-4' of histone H3 (H3K4me). It is a coactivator for estrogen receptor: it is recruited by ESR1 and thereby activates transcription. KMT2D is a subunit of the MLL2/3 complex, a coactivator complex of nuclear receptors, involved in transcriptional coactivation.


Pssm-ID: 380986 [Multi-domain]  Cd Length: 155  Bit Score: 88.98  E-value: 1.32e-20
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  612 KKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSS-FLFNLNNDFVVDATRKGNKIRFANH 690
Cdd:cd19209  15 KNNVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGiYMFRINNEHVIDATLTGGPARYINH 94
                        90       100       110       120
                ....*....|....*....|....*....|....*....|...
gi 3334182  691 SVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQAD 733
Cdd:cd19209  95 SCAPNCVAEVVTFDKEDKIIIISSRRIPKGEELTYDYQFDFED 137
SET_EHMT2 cd10533
SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine ...
561-727 1.89e-20

SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine N-methyltransferase 2 (EHMT2) and similar proteins; EHMT2 (also termed Eu-HMTase2, HLA-B-associated transcript 8, histone H3-K9 methyltransferase 3, H3-K9-HMTase 3, lysine N-methyltransferase 1C (KMT1C), or protein G9a) acts as a histone-lysine N-methyltransferase that specifically mono- and dimethylates 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin.


Pssm-ID: 380931 [Multi-domain]  Cd Length: 239  Bit Score: 91.23  E-value: 1.89e-20
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  561 CRCKTQCNTKQCPC-YLAVR--------------ECDPDLCLTCG-ASEHWDckvvSCKNCSIQRGLKKHLLLAPSDVAG 624
Cdd:cd10533  27 CTCVDDCSSSNCLCgQLSIRcwydkdgrllqefnKIEPPLIFECNqACSCWR----NCKNRVVQSGIKVRLQLYRTAKMG 102
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  625 WGTFIKESVQKNEFISEYCGELISQDEADRRGKvydkymSSFLFNLNND----FVVDATRKGNKIRFANHSVNPNCY-AK 699
Cdd:cd10533 103 WGVRALQTIPQGTFICEYVGELISDAEADVRED------DSYLFDLDNKdgevYCIDARYYGNISRFINHLCDPNIIpVR 176
                       170       180       190
                ....*....|....*....|....*....|.
gi 3334182  700 VVMVNGD---HRIGIFAKRAIQAGEELFFDY 727
Cdd:cd10533 177 VFMLHQDlrfPRIAFFSSRDIRTGEELGFDY 207
SET_EHMT1 cd10535
SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine ...
546-727 4.59e-20

SET domain (including pre-SET and post-SET domains) found in euchromatic histone-lysine N-methyltransferase 1 (EHMT1) and similar proteins; EHMT1 (also termed Eu-HMTase1, G9a-like protein 1, GLP, GLP1, histone H3-K9 methyltransferase 5, H3-K9-HMTase 5, or lysine N-methyltransferase 1D (KMT1D)) acts as a histone-lysine N-methyltransferase that specifically mono- and dimethylates 'Lys-9' of histone H3 (H3K9me1 and H3K9me2, respectively) in euchromatin.


Pssm-ID: 380933 [Multi-domain]  Cd Length: 231  Bit Score: 89.99  E-value: 4.59e-20
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  546 KFCQCNPDCQNRFPGC-----RCKTQCNTKQCPCYLAVrecDPDLCLTCG-ASEHWDckvvSCKNCSIQRGLKKHLLLAP 619
Cdd:cd10535  25 QYCVCIDDCSSSNCMCgqlsmRCWYDKDGRLLPEFNMA---EPPLIFECNhACSCWR----NCRNRVVQNGLRARLQLYR 97
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  620 SDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKvydkymSSFLFNLNND----FVVDATRKGNKIRFANHSVNPN 695
Cdd:cd10535  98 TRDMGWGVRSLQDIPPGTFVCEYVGELISDSEADVREE------DSYLFDLDNKdgevYCIDARFYGNVSRFINHHCEPN 171
                       170       180       190
                ....*....|....*....|....*....|....*.
gi 3334182  696 CY-AKVVMVNGD---HRIGIFAKRAIQAGEELFFDY 727
Cdd:cd10535 172 LVpVRVFMAHQDlrfPRIAFFSTRLIEAGEQLGFDY 207
SET_SUV39H_Clr4-like cd20073
SET domain (including pre-SET and post-SET domains) found in Schizosaccharomyces pombe H3K9 ...
592-727 1.28e-19

SET domain (including pre-SET and post-SET domains) found in Schizosaccharomyces pombe H3K9 methyltransferase Clr4, and similar proteins; This subfamily contains fission yeast Schizosaccharomyces pombe H3K9 methyltransferase Clr4 (also known as Suv39h), the sole homolog of the mammalian SUV39H1 and SUV39H2 enzymes, that has a critical role in preventing aberrant heterochromatin formation. It is known to di- and tri-methylate Lys-9 of histone H3, a central heterochromatic histone modification, with its specificity profile most similar to that of the human SUV39H2 homolog.


Pssm-ID: 380999 [Multi-domain]  Cd Length: 259  Bit Score: 89.17  E-value: 1.28e-19
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  592 SEHWDCKVvSCKNCSIQRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLN 671
Cdd:cd20073  73 NENCDCGI-NCPNRVVQRGRKLPLEIFKTKHKGWGLRCPRFIKAGTFIGVYLGEVITQSEAEIRGKKYDNVGVTYLFDLD 151
                        90       100       110       120       130       140
                ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 3334182  672 -------NDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHR----IGIFAKRAIQAGEELFFDY 727
Cdd:cd20073 152 lfedqvdEYYTVDAQYCGDVTRFINHSCDPNLAIYSVLRDKSDSkiydLAFFAIKDIPALEELTFDY 218
SET_SETDB1 cd10517
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) ...
601-731 3.48e-18

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1) and similar proteins; SETDB1 (EC 2.1.1.43; also termed ERG-associated protein with SET domain (ESET), histone H3-K9 methyltransferase 4, H3-K9-HMTase 4, or lysine N-methyltransferase 1E (KMT1E)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It mainly functions in euchromatin regions, thereby playing a central role in the silencing of euchromatic genes.


Pssm-ID: 380915 [Multi-domain]  Cd Length: 288  Bit Score: 85.80  E-value: 3.48e-18
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  601 SCKNCSIQRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVY-DKYmssfLFNLN-------- 671
Cdd:cd10517 117 RCYNRVVQNGLQVRLQVFKTEKKGWGIRCLDDIPKGSFVCIYAGQILTEDEANEEGLQYgDEY----FAELDyievvekl 192
                        90       100       110       120       130       140       150
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 3334182  672 ----------NDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNG-DHR---IGIFAKRAIQAGEELFFDYRYSQ 731
Cdd:cd10517 193 kegyesdveeHCYIIDAKSEGNLGRYLNHSCSPNLFVQNVFVDThDLRfpwVAFFASRYIRAGTELTWDYNYEV 266
SET_SETD5-like cd10529
SET domain found in SET domain-containing protein 5 (SETD5), inactive histone-lysine ...
626-729 5.07e-17

SET domain found in SET domain-containing protein 5 (SETD5), inactive histone-lysine N-methyltransferase 2E (KMT2E) and similar proteins; SETD5 is a probable transcriptional regulator that acts via the formation of large multiprotein complexes that modify and/or remodel the chromatin. KMT2E (also termed inactive lysine N-methyltransferase 2E or myeloid/lymphoid or mixed-lineage leukemia protein 5 (MLL5)) associates with chromatin regions downstream of transcriptional start sites of active genes and thus regulates gene transcription. The family also includes Saccharomyces cerevisiae SET domain-containing proteins, SET3 and SET4, and Schizosaccharomyces pombe SET3. Most of these family members contain a post-SET domain which harbors a zinc-binding site.


Pssm-ID: 380927  Cd Length: 127  Bit Score: 77.70  E-value: 5.07e-17
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  626 GTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKyMSSFLFNL----NNDFVVDATRKGNKIRFANHSVNPNCYAKVV 701
Cdd:cd10529  18 GLVATEDISPGEPILEYKGEVSLRSEFKEDNGFFKR-PSPFVFFYdgfeGLPLCVDARKYGNEARFIRRSCRPNAELRHV 96
                        90       100       110
                ....*....|....*....|....*....|.
gi 3334182  702 MV-NGDHRIGIFAKRAIQAGEELF--FDYRY 729
Cdd:cd10529  97 VVsNGELRLFIFALKDIRKGTEITipFDYDY 127
SET_SETDB cd10541
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1), ...
553-729 9.71e-17

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 1 (SETDB1), SET domain bifurcated 2 (SETDB2), and similar proteins; SETDB1 (EC 2.1.1.43; also termed ERG-associated protein with SET domain (ESET), histone H3-K9 methyltransferase 4, H3-K9-HMTase 4, or lysine N-methyltransferase 1E (KMT1E)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It mainly functions in euchromatin regions, thereby playing a central role in the silencing of euchromatic genes. SETDB2 (EC 2.1.1.43; also termed chronic lymphocytic leukemia deletion region gene 8 protein (CLLD8), or lysine N-methyltransferase 1F (KMT1F)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It is involved in left-right axis specification in early development and mitosis.


Pssm-ID: 380939 [Multi-domain]  Cd Length: 236  Bit Score: 80.28  E-value: 9.71e-17
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  553 DCQNrfpGCRCKTQC--------NTKQCPC----------YLAVRECDPDLCLTCgaSEHWDCKVVSCKNCSIQRGLKKH 614
Cdd:cd10541  19 DCTD---GCRDKSKCachqltiqATACTPGgqdnptagyqYKRLEECLPTGVYEC--NKLCKCDPNMCQNRLVQHGLQVR 93
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  615 LLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVY-DKYMSSFLFNLNNDFVVDATRKGNKIRFANHSVN 693
Cdd:cd10541  94 LQLFKTQNKGWGIRCLDDIAKGTFVCIYAGKILTDDFADKEGLEMgDEYFANLDHIEESCYIIDAKLEGNLGRYLNHSCS 173
                       170       180       190       200
                ....*....|....*....|....*....|....*....|
gi 3334182  694 PNCYAKVVMVNGDHR----IGIFAKRAIQAGEELFFDYRY 729
Cdd:cd10541 174 PNLFVQNVFVDTHDLrfpwVAFFASKRIKAGTELTWDYNY 213
SET_AtSUVH-like cd10545
SET domain found in Arabidopsis thaliana histone H3-K9 methyltransferases (SUVHs) and similar ...
560-729 1.96e-16

SET domain found in Arabidopsis thaliana histone H3-K9 methyltransferases (SUVHs) and similar proteins; Arabidopsis thaliana SUVH protein (also termed suppressor of variegation 3-9 homolog protein) is a histone-lysine N-methyltransferase that methylates 'Lys-9' of histone H3. H3 'Lys-9' methylation represents a specific tag for epigenetic transcriptional repression. Some family members contain a post-SET domain which binds a Zn2+ ion. Most family members, except for Arabidopsis thaliana SUVH9, contain a post-SET domain which harbors a zinc-binding site.


Pssm-ID: 380943 [Multi-domain]  Cd Length: 232  Bit Score: 79.37  E-value: 1.96e-16
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  560 GCRCKTQC--NTKQCPC--------------YLAVR-----ECDPdLClTCGASehwdckvvsCKNCSIQRGLKKHLLLA 618
Cdd:cd10545  23 GCDCKNRCtdGASDCACvkknggeipynfngRLIRAkpaiyECGP-LC-KCPPS---------CYNRVTQKGLRYRLEVF 91
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  619 PSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKvYDKYmssfLFNLNN-------------------------- 672
Cdd:cd10545  92 KTAERGWGVRSWDSIPAGSFICEYVGELLDTSEADTRSG-NDDY----LFDIDNrqtnrgwdggqrldvgmsdgerssae 166
                       170       180       190       200       210       220
                ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 3334182  673 -----DFVVDATRKGNKIRFANHSVNPNCYAKVVMVngDH------RIGIFAKRAIQAGEELFFDYRY 729
Cdd:cd10545 167 deessEFTIDAGSFGNVARFINHSCSPNLFVQCVLY--DHndlrlpRVMLFAADNIPPLQELTYDYGY 232
SET cd08161
SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain superfamily; The Su(var)3-9, ...
686-728 1.70e-15

SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain superfamily; The Su(var)3-9, Enhancer-of-zeste, Trithorax (SET) domain superfamily corresponds to SET domain-containing lysine methyltransferases, which catalyze site and state-specific methylation of lysine residues in histones that are fundamental in epigenetic regulation of gene activation and silencing in eukaryotic organisms. SET domains appear to be protein-protein interaction domains. It has been demonstrated that SET domains mediate interactions with a family of proteins that display similarity with dual-specificity phosphatases (dsPTPases). A subset of SET domains has been called PR domains. These domains are divergent in sequence from other SET domains, but also appear to mediate protein-protein interaction. The SET domain consists of two regions known as N-SET and C-SET. C-SET forms an unusual and conserved knot-like structure of probable functional importance. In addition to N-SET and C-SET, an insert region (I-SET) and flanking regions of high structural variability form part of the overall structure. Some family members contain a pre-SET domain, which is found in a number of histone methyltransferases (HMTase), and a post-SET domain, which harbors a zinc-binding site.


Pssm-ID: 380914 [Multi-domain]  Cd Length: 72  Bit Score: 71.51  E-value: 1.70e-15
                        10        20        30        40
                ....*....|....*....|....*....|....*....|...
gi 3334182  686 RFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYR 728
Cdd:cd08161  30 RFINHSCEPNCEFEEVYVGGKPRVFIVALRDIKAGEELTVDYG 72
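Each hit in this list carries a summary line of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal Python sketch, not part of the CD-Search output, showing one way such a line could be parsed and how the fraction of the model covered by an alignment could be derived from it. The names SUMMARY, parse_summary and model_coverage are hypothetical, and the regular expression assumes the field layout seen on this page.

```python
import re

# Hypothetical pattern for the per-hit summary line as it appears on this page.
# The "[Multi-domain]" flag is optional.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<flag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line: str) -> dict:
    """Parse one summary line into typed fields."""
    match = SUMMARY.search(line)
    if match is None:
        raise ValueError(f"not a summary line: {line!r}")
    return {
        "pssm_id": int(match["pssm_id"]),
        "multi_domain": match["flag"] == "Multi-domain",
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


def model_coverage(model_start: int, model_end: int, cd_length: int) -> float:
    """Fraction of the domain model spanned by the aligned model positions."""
    return (model_end - model_start + 1) / cd_length


hit = parse_summary(
    "Pssm-ID: 380914 [Multi-domain]  Cd Length: 72  Bit Score: 71.51  E-value: 1.70e-15"
)
# The cd08161 alignment above spans model positions 30 to 72.
print(hit, round(model_coverage(30, 72, hit["cd_length"]), 3))
```

For the cd08161 hit, the alignment spans only the C-terminal part of the 72-position model, which is consistent with the short query interval (686-728) reported for it.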
EZH2_WD-Binding pfam11616
WD repeat binding protein EZH2; This family of proteins represents Enhancer of zeste homolog 2, ...
39-68 2.45e-12

WD repeat binding protein EZH2; This family of proteins represents Enhancer of zeste homolog 2 (EZH2), a 30-residue peptide that binds to a WD-repeat domain of EED via residues 39-68. EED is a component of the PRC2 complex, which is involved in gene expression. This interaction is required for the HMTase activity of PRC2.


Pssm-ID: 463308  Cd Length: 30  Bit Score: 61.31  E-value: 2.45e-12
                          10        20        30
                  ....*....|....*....|....*....|
gi 3334182     39 KALYVANFAKVQEKTQILNEEWKKLRVQPV 68
Cdd:pfam11616   1 KSLFVSNRQKIQERTELLNEEWKKLRIQPI 30
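The paired rows in each alignment (query "gi 3334182" above, domain model "Cdd:" below) can be compared column by column. The following is a minimal Python sketch, not part of the CD-Search output, that counts identical residues between two aligned rows. The function name percent_identity is hypothetical, and the sketch assumes that "-" marks a gap and that gapped columns are excluded from the aligned length.

```python
def percent_identity(query: str, model: str, gap: str = "-") -> tuple[int, int, float]:
    """Return (identical, aligned, percent) for two equal-length aligned rows."""
    if len(query) != len(model):
        raise ValueError("aligned rows must have the same length")
    # Only columns where both rows carry a residue count as aligned.
    pairs = [(q, m) for q, m in zip(query, model) if q != gap and m != gap]
    identical = sum(1 for q, m in pairs if q == m)
    aligned = len(pairs)
    percent = 100.0 * identical / aligned if aligned else 0.0
    return identical, aligned, percent


# Rows copied from the pfam11616 alignment above (query residues 39-68).
query_row = "KALYVANFAKVQEKTQILNEEWKKLRVQPV"
model_row = "KSLFVSNRQKIQERTELLNEEWKKLRIQPI"

identical, aligned, percent = percent_identity(query_row, model_row)
print(f"{identical}/{aligned} identical ({percent:.1f}%)")
```

Identity computed this way is only a rough descriptor. The bit score and E-value reported for each hit come from the position-specific scoring matrix, not from residue identity.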
SET_SUV39H_DIM5-like cd19473
SET domain (including pre-SET domain) found in Neurospora crassa (DIM-5) and similar proteins; ...
601-743 1.78e-11

SET domain (including pre-SET domain) found in Neurospora crassa (DIM-5) and similar proteins; This subfamily contains Neurospora crassa DIM-5 (also termed H3-K9-HMTase dim-5, or HKMT), which functions as a histone-lysine N-methyltransferase that specifically trimethylates histone H3 to form H3K9me3.


Pssm-ID: 380996 [Multi-domain]  Cd Length: 274  Bit Score: 65.42  E-value: 1.78e-11
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  601 SCKNCSIQRGLKKHLLLAP-SDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGK---------VY----DKY--MS 664
Cdd:cd19473  93 DCPNRVVERGRKVPLQIFRtSDGRGWGVRSTVDIKRGQFVDCYVGEIITPEEAQRRRDaatiaqrkdVYlfalDKFsdPD 172
                        90       100       110       120       130       140       150       160
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  665 SFLFNLNND-FVVDATRKGNKIRFANHSVNPN--CYAKVvmvnGD------HRIGIFAKRAIQAGEELFFDYRYSQADAL 735
Cdd:cd19473 173 SLDPRLRGDpYEIDGEFMSGPTRFINHSCDPNlrIFARV----GDhadkhiHDLAFFAIKDIPRGTELTFDYVDGVTGLD 248

                ....*...
gi 3334182  736 KYVGIERE 743
Cdd:cd19473 249 DDAGDEEK 256
SET_SETD7 cd10530
SET domain found in SET domain-containing protein 7 (SETD7) and similar proteins; SETD7 (EC 2. ...
613-729 5.35e-11

SET domain found in SET domain-containing protein 7 (SETD7) and similar proteins; SETD7 (EC 2.1.1.43; also termed histone H3-K4 methyltransferase SETD7, H3-K4-HMTase SETD7, lysine N-methyltransferase 7 (KMT7) or SET7/9) is a histone-lysine N-methyltransferase that specifically monomethylates 'Lys-4' of histone H3. It plays a central role in the transcriptional activation of genes such as collagenase or insulin. Set7/9 also methylates non-histone proteins, including estrogen receptor alpha (ERa), suggesting it has a role in diverse biological processes. ERa methylation by Set7/9 stabilizes ERa and activates its transcriptional activities, which are involved in the carcinogenesis of breast cancer. In a high-throughput screen, treatment of human breast cancer cells (MCF7 cells) with cyproheptadine, a Set7/9 inhibitor, decreased the expression and transcriptional activity of ERa, thereby inhibiting estrogen-dependent cell growth.


Pssm-ID: 380928  Cd Length: 130  Bit Score: 60.78  E-value: 5.35e-11
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  613 KHLLLAPSDV--AGWGTFIKESVQKNEFISEYCGELISQDEADRRgkvyDKYMSSFLFNLNNDFVVDATRKGNKIRF--- 687
Cdd:cd10530   7 ERVYVAESLIpsAGEGLFAKVAVGPNTVMSFYNGVRITHQEVDSR----DWSLNGNTISLDEETVIDVPEPYNSVSKyca 82
                        90       100       110       120       130
                ....*....|....*....|....*....|....*....|....*....|...
gi 3334182  688 -----ANHSVNPNC-YAKVVmvngdH-RIG----IFAKRAIQAGEELFFDYRY 729
Cdd:cd10530  83 slghkANHSFTPNCiYDPFV-----HpRFGpikcIRTLRAVEAGEELTVAYGY 130
SET_SETDB2 cd10523
SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 2 (SETDB2) ...
597-729 2.52e-10

SET domain (including pre-SET and post-SET domains) found in SET domain bifurcated 2 (SETDB2) and similar proteins; SETDB2 (EC 2.1.1.43; also termed chronic lymphocytic leukemia deletion region gene 8 protein (CLLD8), or lysine N-methyltransferase 1F (KMT1F)) acts as a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-9' of histone H3 (H3K9me3). It is involved in left-right axis specification in early development and mitosis.


Pssm-ID: 380921 [Multi-domain]  Cd Length: 266  Bit Score: 61.77  E-value: 2.52e-10
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  597 CKVVSCKNCSIQRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQ--------------DEADRRGKVYDKY 662
Cdd:cd10523  92 CNRMLCQNRVVQHGLQVRLQVFKTEKKGWGVRCLDDIDKGTFVCIYAGRVLSRarspteplppklelPSENEVEVVTSWL 171
                        90       100       110       120       130       140       150
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 3334182  663 MSSFLFNLN-NDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHR----IGIFAKRAIQAGEELFFDYRY 729
Cdd:cd10523 172 ILSKKRKLReNVCFLDASKEGNVGRFLNHSCCPNLFVQNVFVDTHDKnfpwVAFFTNRVVKAGTELTWDYSY 243
preSET_CXC pfam18264
CXC domain; This domain is found N-terminal to the SET domain in the EZH2 protein. It ...
560-591 5.00e-10

CXC domain; This domain is found N-terminal to the SET domain in the EZH2 protein. It is a zinc-binding domain.


Pssm-ID: 408079  Cd Length: 32  Bit Score: 54.84  E-value: 5.00e-10
                          10        20        30
                  ....*....|....*....|....*....|..
gi 3334182    560 GCRCKTQCNTKQCPCYLAVRECDPDLCLTCGA 591
Cdd:pfam18264   1 GCSCRATCYTKACLCYRANRECDPDLCNMCGA 32
SET_SpSet7-like cd10540
SET domain found in Schizosaccharomyces pombe Set7 and similar proteins; Schizosaccharomyces ...
614-727 1.87e-06

SET domain found in Schizosaccharomyces pombe Set7 and similar proteins; Schizosaccharomyces pombe Set7 is a novel histone-lysine N-methyltransferase. The family also includes a viral histone H3 lysine 27 methyltransferase from Paramecium bursaria Chlorella virus 1 (PBCV-1).


Pssm-ID: 380938  Cd Length: 112  Bit Score: 47.25  E-value: 1.87e-06
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  614 HLLLAPSDVAGWGTFIKESVQKNEFIsEYCGELISQDEAdrrgkvYDKYMSSFLFNLNndFVVDATRKGNKIRF---ANH 690
Cdd:cd10540   1 RLEVKPSTLKGRGVFATRPIKKGEVI-EEAPVIVLPKEE------YQHLCKTVLDHYV--FSWGDGCLALALGYgsmFNH 71
                        90       100       110
                ....*....|....*....|....*....|....*..
gi 3334182  691 SVNPNCYakVVMVNGDHRIGIFAKRAIQAGEELFFDY 727
Cdd:cd10540  72 SYTPNAE--YEIDFENQTIVFYALRDIEAGEELTINY 106
SET_SpSET3-like cd19183
SET domain (including post-SET domain) found in Schizosaccharomyces pombe SET ...
626-727 2.27e-06

SET domain (including post-SET domain) found in Schizosaccharomyces pombe SET domain-containing protein 3 (SETD3) and similar proteins; Schizosaccharomyces pombe SETD3 functions as a transcriptional regulator that acts via the formation of large multiprotein complexes that modify and/or remodel the chromatin. It is required for both gene activation and repression.


Pssm-ID: 380960  Cd Length: 173  Bit Score: 48.55  E-value: 2.27e-06
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  626 GTFIKESVQKNEFISEYCGELISQDE--ADRRgkvyDKYMSSF------LFNLNNDFVVDATRKGNKIRFANHSVNPNCy 697
Cdd:cd19183  15 GLFADRPIPAGDPIQELLGEIGLQSEyiADPE----NQYQILGapkphvFFHPQSPLYIDTRRSGSVARFIRRSCRPNA- 89
                        90       100       110
                ....*....|....*....|....*....|....
gi 3334182  698 aKVVMV----NGDHRIGIFAKRAIQAGEELFFDY 727
Cdd:cd19183  90 -ELVTVasdsGSVLKFVLYASRDISPGEEITIGW 122
SET_SMYD cd20071
SET domain (including SET domain and post-SET domain) found in SET and MYND domain-containing ...
687-727 8.39e-06

SET domain (including SET domain and post-SET domain) found in SET and MYND domain-containing proteins, and similar proteins; The family includes SET and MYND domain-containing proteins, SMYD1-SMYD5. SMYD1 (EC 2.1.1.43; also termed BOP) is a heart and muscle specific SET-MYND domain containing protein, which functions as a histone methyltransferase and regulates downstream gene transcription. It methylates histone H3 at 'Lys-4' (H3K4me), and seems able to perform mono-, di-, and trimethylation. SMYD2 (also termed HSKM-B, or lysine N-methyltransferase 3C (KMT3C)) functions as a histone methyltransferase that methylates both histones and non-histone proteins, including p53/TP53 and RB1. It specifically methylates histone H3 'Lys-4' (H3K4me) and dimethylates histone H3 'Lys-36' (H3K36me2). SMYD3 (also termed zinc finger MYND domain-containing protein 1) functions as a histone methyltransferase that specifically methylates 'Lys-4' of histone H3, inducing di- and tri-methylation, but not monomethylation. It also methylates 'Lys-5' of histone H4. SMYD3 plays an important role in transcriptional activation as a member of an RNA polymerase complex. SMYD4 functions as a potential tumor suppressor that plays a critical role in breast carcinogenesis at least partly through inhibiting the expression of PDGFR-alpha. SMYD5 (also termed protein NN8-4AG, or retinoic acid-induced protein 15) functions as a histone lysine methyltransferase that mediates H4K20me3 at heterochromatin regions.


Pssm-ID: 380997 [Multi-domain]  Cd Length: 122  Bit Score: 45.45  E-value: 8.39e-06
                        10        20        30        40
                ....*....|....*....|....*....|....*....|.
gi 3334182  687 FANHSVNPNCyakVVMVNGDHRIGIFAKRAIQAGEELFFDY 727
Cdd:cd20071  58 LLNHSCDPNA---VVVFDGNGTLRVRALRDIKAGEELTISY 95
SET_Suv4-20-like cd10524
SET domain (including post-SET domain) found in Drosophila melanogaster suppressor of ...
630-727 1.04e-05

SET domain (including post-SET domain) found in Drosophila melanogaster suppressor of variegation 4-20 (Suv4-20) and similar proteins; Suv4-20 (also termed Su(var)4-20) is a histone-lysine N-methyltransferase that specifically trimethylates 'Lys-20' of histone H4. It acts as a dominant suppressor of position-effect variegation. The family also includes Suv4-20 homologs, lysine N-methyltransferase 5B (KMT5B) and lysine N-methyltransferase 5C (KMT5C). Both KMT5B (also termed lysine-specific methyltransferase 5B, or suppressor of variegation 4-20 homolog 1, or Su(var)4-20 homolog 1, or Suv4-20h1) and KMT5C (also termed lysine-specific methyltransferase 5C, or suppressor of variegation 4-20 homolog 2, or Su(var)4-20 homolog 2, or Suv4-20h2) are histone methyltransferases that specifically trimethylate 'Lys-20' of histone H4 (H4K20me3). They play central roles in the establishment of constitutive heterochromatin in pericentric heterochromatin regions.


Pssm-ID: 380922 [Multi-domain]  Cd Length: 141  Bit Score: 45.73  E-value: 1.04e-05
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  630 KESVQKNEFISEYCGELISQDEADRRgkvydkymssFLFNLNNDF-VVDATRKGNK------IRFANHSVNPNCyakVVM 702
Cdd:cd10524  25 TKPIKKGEKIHELCGCIAELSEEEEA----------LLRPGGNDFsVMYSSRKKCSqlwlgpAAFINHDCRPNC---KFV 91
                        90       100
                ....*....|....*....|....*
gi 3334182  703 VNGDHRIGIFAKRAIQAGEELFFDY 727
Cdd:cd10524  92 PTGKSTACVKVLRDIEPGEEITVYY 116
SANT cd00167
'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric ...
433-474 9.60e-05

'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeats as part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.


Pssm-ID: 238096 [Multi-domain]  Cd Length: 45  Bit Score: 40.25  E-value: 9.60e-05
                        10        20        30        40
                ....*....|....*....|....*....|....*....|...
gi 3334182  433 EWTGAEESLFRVFHGTY-FNNFCSIARLLGTKTCKQVFQFAVK 474
Cdd:cd00167   1 PWTEEEDELLLEAVKKYgKNNWEKIAKELPGRTPKQCRERWRN 43
SET_ATXR5_6-like cd10539
SET domain found in fungal protein lysine methyltransferase SET5 and similar protein; The ...
638-727 2.96e-04

SET domain found in fungal protein lysine methyltransferase SET5 and similar protein; The family includes Arabidopsis thaliana ATXR5 and ATXR6. Both ATXR5 (also termed protein SET DOMAIN GROUP 15, or TRX-related protein 5) and ATXR6 (also termed protein SET DOMAIN GROUP 34, or TRX-related protein 6) function as histone methyltransferases that specifically monomethylate 'Lys-27' of histone H3 (H3K27me1). They are required for chromatin structure and gene silencing.


Pssm-ID: 380937  Cd Length: 138  Bit Score: 41.63  E-value: 2.96e-04
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  638 FISEYCGEL--ISQDEADRRgkvyDKYMSsFLFNLNND--FVVDATRKGNKIRFA----NHSVN----PNCYAKVVMVNG 705
Cdd:cd10539  29 IIAEYTGDVdyIRNREFDDN----DSIMT-LLLAGDPSksLVICPDKRGNIARFIsginNHTKDgkkkQNCKCVRYSING 103
                        90       100
                ....*....|....*....|..
gi 3334182  706 DHRIGIFAKRAIQAGEELFFDY 727
Cdd:cd10539 104 EARVLLVATRDIAKGERLYYDY 125
SANT smart00717
SANT 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains;
433-474 6.44e-04

SANT 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains;


Pssm-ID: 197842 [Multi-domain]  Cd Length: 49  Bit Score: 37.97  E-value: 6.44e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|...
gi 3334182     433 EWTGAEESLFRVFHGTY-FNNFCSIARLLGTKTCKQVFQFAVK 474
Cdd:smart00717   3 EWTEEEDELLIELVKKYgKNNWEKIAKELPGRTAEQCRERWRN 45
SET_SETD5 cd19181
SET domain (including post-SET domain) found in SET domain-containing protein 5 (SETD5) and ...
636-730 1.26e-03

SET domain (including post-SET domain) found in SET domain-containing protein 5 (SETD5) and similar proteins; SETD5 is a probable transcriptional regulator that acts via the formation of large multiprotein complexes that modify and/or remodel the chromatin. SETD5 loss-of-function mutations are a likely cause of a familial syndromic intellectual disability with variable phenotypic expression.


Pssm-ID: 380958  Cd Length: 150  Bit Score: 39.99  E-value: 1.26e-03
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 3334182  636 NEFISEYCGELISQDEADRRGKVYDKYMSSFLF--NLNN-DFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIF 712
Cdd:cd19181  30 DTLIIEYRGKVMLRQQFEVNGHFFKRPYPFVLFysKFNGvEMCVDARTFGNDARFIRRSCTPNAEVRHMIADGMIHLCIY 109
                        90       100
                ....*....|....*....|
gi 3334182  713 AKRAIQAGEE--LFFDYRYS 730
Cdd:cd19181 110 AVAAIAKDAEvtIAFDYEYS 129
SET_LSMT cd10527
SET domain found in Rubisco large subunit methyltransferase (LSMT) and similar proteins; ...
687-727 4.78e-03

SET domain found in Rubisco large subunit methyltransferase (LSMT) and similar proteins; Rubisco LSMT is a non-histone protein methyltransferase responsible for the trimethylation of lysine 14 in the large subunit of Rubisco (ribulose-1,5-bisphosphate carboxylase/oxygenase). The family also includes the SET domain-containing proteins SETD3, SETD4 and SETD6, which belong to methyltransferase class VII, the class of classical non-histone SET domain methyltransferases. Members of this family contain a SET domain and a C-terminal RubisCO LSMT substrate-binding (Rubis-subs-bind) domain.


Pssm-ID: 380925 [Multi-domain]  Cd Length: 236  Bit Score: 39.35  E-value: 4.78e-03
                        10        20        30        40
                ....*....|....*....|....*....|....*....|..
gi 3334182  687 FANHSVN-PNCyaKVVMVNGDHRIGIFAKRAIQAGEELFFDY 727
Cdd:cd10527 182 MLNHSPDaPNV--RYEYDEDEGSFVLVATRDIAAGEEVFISY 221
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
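
The E-value threshold listed in the search parameters determines which hits are reported. As a rough illustration only, the sketch below parses the per-hit summary lines that appear on this page ("Pssm-ID: ... E-value: ...") and keeps hits at or below a cutoff. The helper names are hypothetical, the regular expression is an assumption based on the line layout seen here, and treating the threshold as inclusive is also an assumption; this is not how CD-Search itself applies the filter.

```python
import re

# Pattern inferred from the summary lines on this page; the bracketed
# "[Multi-domain]" tag is optional.
SUMMARY = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<tag>[^\]]+)\])?"
    r"\s+Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>\d+(?:\.\d+)?)"
    r"\s+E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line):
    """Turn one hit summary line into a dict of typed fields."""
    m = SUMMARY.search(line)
    if m is None:
        raise ValueError(f"not a hit summary line: {line!r}")
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["tag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is kept when its E-value is <= the threshold."""
    return hit["e_value"] <= threshold


# Three summary lines copied verbatim from the hits listed above.
lines = [
    "Pssm-ID: 380997 [Multi-domain]  Cd Length: 122  Bit Score: 45.45  E-value: 8.39e-06",
    "Pssm-ID: 380958  Cd Length: 150  Bit Score: 39.99  E-value: 1.26e-03",
    "Pssm-ID: 380925 [Multi-domain]  Cd Length: 236  Bit Score: 39.35  E-value: 4.78e-03",
]
hits = [parse_summary(line) for line in lines]

# A stricter, purely illustrative cutoff than the page's 0.01.
strict = [h["pssm_id"] for h in hits if passes_threshold(h, threshold=1e-3)]
print(strict)
```

Passing the cutoff as a parameter keeps the page's preset value as the default while allowing a stricter filter for downstream analysis.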
