NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|570726569|gb|AHE97877|]
View 

endonuclease [Thioalkalivibrio paradoxus ARh 1]

Protein Classification

GIY-YIG nuclease family protein( domain architecture ID 10179944)

GIY-YIG nuclease family protein

CATH:  3.40.1440.10
Gene Ontology:  GO:0004518
PubMed:  16646971
SCOP:  3000597

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
GIY-YIG_unchar_3 cd10448
GIY-YIG domain of uncharacterized hypothetical protein found in bacteria; The family includes ...
4-90 1.61e-50

GIY-YIG domain of uncharacterized hypothetical protein found in bacteria; The family includes a group of uncharacterized bacterial proteins with a GIY-YIG domain that shows statistically significant similarity to the N-terminal catalytic domains of GIY-YIG family of intron-encoded homing endonuclease I-TevI and catalytic GIY-YIG domain of nucleotide excision repair endonuclease UvrC.


:

Pssm-ID: 198395  Cd Length: 87  Bit Score: 153.80  E-value: 1.61e-50
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 570726569  4 PCVYMMTNRRNGTLYVGVTSNLVQRVWQHKSGVVEGFTKQYSLHRLAWFEVHETMESAISREKQLKAGNRKRKLRLIEAL 83
Cdd:cd10448   1 YYVYILANKRNGTLYIGVTSDLIRRIYEHKEGLGSGFTSKYNVTRLVYYEEFEDIEEAIAREKQLKKWRRAWKINLIEKM 80

                ....*..
gi 570726569 84 NPDWRDL 90
Cdd:cd10448  81 NPDWKDL 87
 
Name Accession Description Interval E-value
GIY-YIG_unchar_3 cd10448
GIY-YIG domain of uncharacterized hypothetical protein found in bacteria; The family includes ...
4-90 1.61e-50

GIY-YIG domain of uncharacterized hypothetical protein found in bacteria; The family includes a group of uncharacterized bacterial proteins with a GIY-YIG domain that shows statistically significant similarity to the N-terminal catalytic domains of GIY-YIG family of intron-encoded homing endonuclease I-TevI and catalytic GIY-YIG domain of nucleotide excision repair endonuclease UvrC.


Pssm-ID: 198395  Cd Length: 87  Bit Score: 153.80  E-value: 1.61e-50
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 570726569  4 PCVYMMTNRRNGTLYVGVTSNLVQRVWQHKSGVVEGFTKQYSLHRLAWFEVHETMESAISREKQLKAGNRKRKLRLIEAL 83
Cdd:cd10448   1 YYVYILANKRNGTLYIGVTSDLIRRIYEHKEGLGSGFTSKYNVTRLVYYEEFEDIEEAIAREKQLKKWRRAWKINLIEKM 80

                ....*..
gi 570726569 84 NPDWRDL 90
Cdd:cd10448  81 NPDWKDL 87
YhbQ COG2827
Predicted endonuclease, GIY-YIG superfamily [Replication, recombination and repair];
2-81 1.39e-37

Predicted endonuclease, GIY-YIG superfamily [Replication, recombination and repair];


Pssm-ID: 442075 [Multi-domain]  Cd Length: 82  Bit Score: 121.00  E-value: 1.39e-37
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 570726569  2 KQPCVYMMTNRRNGTLYVGVTSNLVQRVWQHKSGVVEGFTKQYSLHRLAWFEVHETMESAISREKQLKAGNRKRKLRLIE 81
Cdd:COG2827   1 MMYYVYILRCADNGTLYTGVTNDLERRLAEHNSGKGAKFTRKRRPVKLVYYEEFEDRSEALKREKQIKKWSRAKKEALIE 80
GIY-YIG pfam01541
GIY-YIG catalytic domain; This domain called GIY-YIG is found in the amino terminal region of ...
3-78 5.67e-14

GIY-YIG catalytic domain; This domain called GIY-YIG is found in the amino terminal region of excinuclease abc subunit c (uvrC), bacteriophage T4 endonucleases segA, segB, segC, segD and segE; it is also found in putative endonucleases encoded by group I introns of fungi and phage. The structure of I-TevI a GIY-YIG endonuclease, reveals a novel alpha/beta-fold with a central three-stranded antiparallel beta-sheet flanked by three helices. The most conserved and putative catalytic residues are located on a shallow, concave surface and include a metal coordination site.


Pssm-ID: 426314 [Multi-domain]  Cd Length: 78  Bit Score: 60.82  E-value: 5.67e-14
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 570726569   3 QPCVYMMTNRRNGTLYVGVTSNLVQRVWQHKSGVVEGFTKQYSLH--RLAWFEVHETMESAISREKQLKAGNRKRKLR 78
Cdd:pfam01541  1 KGGIYIIRNKDNKLLYVGSTKNLERRLNQHNAGKGAKYTRGKGVEpfKLIYLEEFPTKSEALELEKYLIKLYRPNKYN 78
GIYc smart00465
GIY-YIG type nucleases (URI domain);
6-69 5.05e-08

GIY-YIG type nucleases (URI domain);


Pssm-ID: 214677 [Multi-domain]  Cd Length: 84  Bit Score: 45.88  E-value: 5.05e-08
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 570726569    6 VYMMTNRRNGTLYVGVTSNLVQRVWQHKSGVVEGFTKQYSL----HRLAWFEVHETMESAISREKQLK 69
Cdd:smart00465  4 VYYITNKKNGKLYVGKAKNLRNRLKRHFSGSRKGRLLIDALlkygGNFEFIILESFDESALELEKYLI 71
PRK00329 PRK00329
GIY-YIG nuclease superfamily protein; Validated
1-82 1.95e-07

GIY-YIG nuclease superfamily protein; Validated


Pssm-ID: 178979  Cd Length: 86  Bit Score: 44.53  E-value: 1.95e-07
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 570726569  1 MKQPCVYMMTNRrNGTLYVGVTSNLVQRVWQHKSGVVEGFTKQYSLHRLAWFEVHETMESAISREKQLKAGNRKRKLRLI 80
Cdd:PRK00329  4 MKPWFLYLLRCA-DGSLYTGITTDVERRFAQHQSGKGAKYTRGRPPLTLVFVEPVGDRSEALRAEYRFKQLTKKQKERLV 82

                ..
gi 570726569 81 EA 82
Cdd:PRK00329 83 AE 84
 
Name Accession Description Interval E-value
GIY-YIG_unchar_3 cd10448
GIY-YIG domain of uncharacterized hypothetical protein found in bacteria; The family includes ...
4-90 1.61e-50

GIY-YIG domain of uncharacterized hypothetical protein found in bacteria; The family includes a group of uncharacterized bacterial proteins with a GIY-YIG domain that shows statistically significant similarity to the N-terminal catalytic domains of GIY-YIG family of intron-encoded homing endonuclease I-TevI and catalytic GIY-YIG domain of nucleotide excision repair endonuclease UvrC.


Pssm-ID: 198395  Cd Length: 87  Bit Score: 153.80  E-value: 1.61e-50
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 570726569  4 PCVYMMTNRRNGTLYVGVTSNLVQRVWQHKSGVVEGFTKQYSLHRLAWFEVHETMESAISREKQLKAGNRKRKLRLIEAL 83
Cdd:cd10448   1 YYVYILANKRNGTLYIGVTSDLIRRIYEHKEGLGSGFTSKYNVTRLVYYEEFEDIEEAIAREKQLKKWRRAWKINLIEKM 80

                ....*..
gi 570726569 84 NPDWRDL 90
Cdd:cd10448  81 NPDWKDL 87
YhbQ COG2827
Predicted endonuclease, GIY-YIG superfamily [Replication, recombination and repair];
2-81 1.39e-37

Predicted endonuclease, GIY-YIG superfamily [Replication, recombination and repair];


Pssm-ID: 442075 [Multi-domain]  Cd Length: 82  Bit Score: 121.00  E-value: 1.39e-37
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 570726569  2 KQPCVYMMTNRRNGTLYVGVTSNLVQRVWQHKSGVVEGFTKQYSLHRLAWFEVHETMESAISREKQLKAGNRKRKLRLIE 81
Cdd:COG2827   1 MMYYVYILRCADNGTLYTGVTNDLERRLAEHNSGKGAKFTRKRRPVKLVYYEEFEDRSEALKREKQIKKWSRAKKEALIE 80
GIY-YIG pfam01541
GIY-YIG catalytic domain; This domain called GIY-YIG is found in the amino terminal region of ...
3-78 5.67e-14

GIY-YIG catalytic domain; This domain called GIY-YIG is found in the amino terminal region of excinuclease abc subunit c (uvrC), bacteriophage T4 endonucleases segA, segB, segC, segD and segE; it is also found in putative endonucleases encoded by group I introns of fungi and phage. The structure of I-TevI a GIY-YIG endonuclease, reveals a novel alpha/beta-fold with a central three-stranded antiparallel beta-sheet flanked by three helices. The most conserved and putative catalytic residues are located on a shallow, concave surface and include a metal coordination site.


Pssm-ID: 426314 [Multi-domain]  Cd Length: 78  Bit Score: 60.82  E-value: 5.67e-14
                         10        20        30        40        50        60        70
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 570726569   3 QPCVYMMTNRRNGTLYVGVTSNLVQRVWQHKSGVVEGFTKQYSLH--RLAWFEVHETMESAISREKQLKAGNRKRKLR 78
Cdd:pfam01541  1 KGGIYIIRNKDNKLLYVGSTKNLERRLNQHNAGKGAKYTRGKGVEpfKLIYLEEFPTKSEALELEKYLIKLYRPNKYN 78
GIY-YIG_UPF0213 cd10456
The GIY-YIG domain of uncharacterized protein family UPF0213 related to structure-specific ...
5-69 3.45e-10

The GIY-YIG domain of uncharacterized protein family UPF0213 related to structure-specific endonuclease SLX1; This family contains a group of uncharacterized proteins found mainly in bacteria and several in dsDNA viruses. Although their function roles have not been recognized, these proteins show significant sequence similarities with the N-terminal GIY-YIG endonuclease domain of structure-specific endonuclease subunit SLX1, which binds another structure-specific endonuclease subunit SLX4 to form an active heterodimeric SLX1-SLX4 complex. This complex functions as a 5' flap endonuclease in yeast, and has also been identified as a Holliday junction resolvase in human.


Pssm-ID: 198403  Cd Length: 68  Bit Score: 50.87  E-value: 3.45e-10
                        10        20        30        40        50        60
                ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 570726569  5 CVYMMTNRrNGTLYVGVTSNLVQRVWQHKSGVVEGFTKQYSLHRLAWFEVHETMESAISREKQLK 69
Cdd:cd10456   2 YVYILRCA-DGSLYTGITTDLERRLAEHNSGKGAKYTRGRRPVKLVYSEEFDDRSEALKREYRIK 65
GIY-YIG_SLX1_like cd10449
Catalytic GIY-YIG domain of yeast structure-specific endonuclease subunit SLX1 and its ...
6-71 4.95e-09

Catalytic GIY-YIG domain of yeast structure-specific endonuclease subunit SLX1 and its homologs; Structure-specific endonuclease subunit SLX1 is a highly conserved protein from yeast to human, with an N-terminal GIY-YIG endonuclease domain and a C-terminal PHD-type zinc finger postulated to mediate protein-protein or protein-DNA interaction. SLX1 forms active heterodimeric complexes with its SLX4 partner, which has additional roles in the DNA damage response that are distinct from the function of the heterodimeric SLX1-SLX4 nuclease. In yeast, the SLX1-SLX4 complex functions as a 5' flap endonuclease that maintains ribosomal DNA copy number, where SLX1 and SLX4 are shown to be catalytic and regulatory subunits, respectively. This endonuclease introduces single-strand cuts in duplex DNA on the 3' side of junctions with single-strand DNA. In addition to 5' flap endonuclease activity, human SLX1-SLX4 complex has been identified as a Holliday junction resolvase that promotes symmetrical cleavage of static and migrating Holliday junctions. SLX1 also associates with MUS81, EME1, C20orf94, PLK1, and ERCC1. Some eukaryotic SLX1 homologs lack the zinc finger domain, but possess intrinsically unstructured extensions of unknown function. These unstructured segments might be involved in interactions with other proteins.


Pssm-ID: 198396  Cd Length: 67  Bit Score: 47.97  E-value: 4.95e-09
                        10        20        30        40        50        60
                ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 570726569  6 VYMMTNRRNGTLYVGVTSNLVQRVWQHKSGVVeGFTKQYSLHRLAWFEVHETMESAISREKQLKAG 71
Cdd:cd10449   2 VYILYSEKLDRYYIGYTSDLERRLEQHNSGKS-KFTSKYRPWELVYSEAFESKSEALKREKYLKSG 66
GIYc smart00465
GIY-YIG type nucleases (URI domain);
6-69 5.05e-08

GIY-YIG type nucleases (URI domain);


Pssm-ID: 214677 [Multi-domain]  Cd Length: 84  Bit Score: 45.88  E-value: 5.05e-08
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 570726569    6 VYMMTNRRNGTLYVGVTSNLVQRVWQHKSGVVEGFTKQYSL----HRLAWFEVHETMESAISREKQLK 69
Cdd:smart00465  4 VYYITNKKNGKLYVGKAKNLRNRLKRHFSGSRKGRLLIDALlkygGNFEFIILESFDESALELEKYLI 71
GIY-YIG_SF cd00719
GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large ...
6-69 8.47e-08

GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large and diverse group of proteins involved in many cellular processes, such as class I homing GIY-YIG family endonucleases, prokaryotic nucleotide excision repair proteins UvrC and Cho, type II restriction enzymes, the endonuclease/reverse transcriptase of eukaryotic retrotransposable elements, and a family of eukaryotic enzymes that repair stalled replication forks. All of these members contain a conserved GIY-YIG nuclease domain that may serve as a scaffold for the coordination of a divalent metal ion required for catalysis of the phosphodiester bond cleavage. By combining with different specificity, targeting, or other domains, the GIY-YIG nucleases may perform different functions.


Pssm-ID: 198380 [Multi-domain]  Cd Length: 69  Bit Score: 45.05  E-value: 8.47e-08
                        10        20        30        40        50        60
                ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 570726569  6 VYMMTNRRNGTLYVGVTSNLVQRVWQHKSGVVEGFTKQYSLHRLAWFEVHETMESAI-SREKQLK 69
Cdd:cd00719   2 VYVLYDEDNGLIYVGQTKNLRNRIKEHLRKQRSDWTKGLKPFEILYLEVAPEAESELlDLEAALI 66
PRK00329 PRK00329
GIY-YIG nuclease superfamily protein; Validated
1-82 1.95e-07

GIY-YIG nuclease superfamily protein; Validated


Pssm-ID: 178979  Cd Length: 86  Bit Score: 44.53  E-value: 1.95e-07
                        10        20        30        40        50        60        70        80
                ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 570726569  1 MKQPCVYMMTNRrNGTLYVGVTSNLVQRVWQHKSGVVEGFTKQYSLHRLAWFEVHETMESAISREKQLKAGNRKRKLRLI 80
Cdd:PRK00329  4 MKPWFLYLLRCA-DGSLYTGITTDVERRFAQHQSGKGAKYTRGRPPLTLVFVEPVGDRSEALRAEYRFKQLTKKQKERLV 82

                ..
gi 570726569 81 EA 82
Cdd:PRK00329 83 AE 84
GIY-YIG_HE_I-TevI_like cd10437
N-terminal catalytic domain of GIY-YIG intron endonuclease I-TevI, I-BmoI, I-BanI, I-BthII and ...
6-34 2.90e-03

N-terminal catalytic domain of GIY-YIG intron endonuclease I-TevI, I-BmoI, I-BanI, I-BthII and similar proteins; I-TevI is a site-specific GIY-YIG homing endonuclease encoded within the group I intron of the thymidylate synthase gene (td) from Escherichia coli phage T4. It functions as an endonuclease that catalyzes the first step in intron homing by generating a double-strand break in the intronless td allele within a sequence designated the homing site. I-TevI recognizes its extensive 37 base pair DNA target in a site-specific, but sequence-tolerant manner. The cleavage site is located at 23 (upper strand) and 25 (lower strand) nucleotides upstream of the intron insertion site. A divalent cation, such as Mg2+, is required for the catalysis. I-TevI also acts as a repressor of its own transcription. It binds an operator that is located upstream of the I-TevI coding sequence and overlaps the T4 late promoter, which drives I-TevI expression from within the td intron. I-TevI binds the homing sites and the operator with the same affinity, but cleaves the homing site more efficiently than the operator. I-TevI consists of an N-terminal catalytic domain, containing the GIY-YIG motif, and a C-terminal DNA-binding domain that binds DNA as a monomer, joined by a flexible linker. The C-terminal domain includes three subdomains: a zinc finger, a minor-groove binding alpha-helix (NUMOD3, nuclease-associated modular domain 3), and a helix-turn-helix domain (HTH). The last two are responsible for DNA-binding. The zinc finger is part of the linker and not required for DNA-binding. It is implicated as a distance sensor to constrain the catalytic domain to cleave the homing site at a fixed position. None of other GIY-YIG endonucleases have been found to have the zinc finger motif. This family also includes a reduced activity isoschizomer of I-TevI, I-BmoI, which is encoded within the group I intron of the thymidylate synthase (TS) gene (thyA) from Bacillus mojavensis. I-BmoI catalyzes the first step in intron homing by generating a double-strand break in the intronless td allele within a sequence designated the homing site in the presence of a divalent cation cofactor, such as Mg2+. In the absence of Mg2+, I-Bmol only nicks one of the strands. Both I-BmoI and I-TevI bind a homologous stretch of TS-encoding DNA as monomers, but use different strategies to distinguish intronless from intron-containing substrates. I-TevI recognizes substrates at the level of DNA-binding. However, I-BmoI binds both intron-containing and intronless TS-encoding substrates, but efficiently cleaves only intronless substrate. Afterwards they cleave their respective intronless substrates in the same positions, and both require a critical G-C base pair adjacent to the top strand site for efficient cleavage. The C-terminal domain of I-BmoI has nuclease-associated modular DNA-binding domains (NUMODs), but lacks the zinc finger, which is different from that of I-TevI. Although the zinc finger implicated as a distance determination in I-TevI is absent, I-BmoI still possesses some cleavage distance discrimination. Besides I-TevI and I-BmoI, this family contains a putative GIY-YIG homing endonuclease, I-BanI, encoded within the self-splicing group I intron of nrdE gene from Bacillus anthracis. It contains two major domains, the N-terminal GIY-YIG domain and the C-terminal DNA-binding domain that consists of a minor-groove DNA binding alpha-helix motif and a helix-turn-helix (HTH) motif. I-BanI generates a double-strand break (DSB) in the intronless nrdE gene. The cleavage site is located at 5 and 7 nucleotides upstream of the intron insertion site, with 2-nucleotide 3' extensions. The recognition site is 35 to 40 base pairs and covers the cleavage site with a bias toward the downstream region including the (intervening sequence) IVS insertion site. Moreover, this family contains another putative GIY-YIG homing endonuclease, I-BthII, encoded within the self-splicing group I intron of nrdF gene from Bacillus thuringiensis ssp. pakistani. It contains a GIY-YIG motif that generates a double-strand break (DSB) in the intronless nrdF gene. The cleavage site is located at 7 and 9 nucleotides upstream of the intron insertion site, leaving 2-nucleotide 3' extensions. The recognition site is 27 to 29 base pairs with the DSB cleavage site at the 5'-end of the top strand, and with the intervening sequence (IVS) insertion site approximately in the middle of the recognition site.


Pssm-ID: 198384  Cd Length: 90  Bit Score: 33.79  E-value: 2.90e-03
                        10        20
                ....*....|....*....|....*....
gi 570726569  6 VYMMTNRRNGTLYVGVTSNLVQRVWQHKS 34
Cdd:cd10437   3 IYKITNLENGKIYIGSSKNILKRLSQHKR 31
GIY-YIG_MSH cd10438
Catalytic GIY-YIG domain of eukaryotic DNA mismatch repair protein MutS homologs; This family ...
5-38 7.07e-03

Catalytic GIY-YIG domain of eukaryotic DNA mismatch repair protein MutS homologs; This family represents a putative GIY-YIG nuclease domain C-terminally fused to the DNA-repair ATPase on a small group of eukaryotic DNA mismatch repair protein mutS homologs (MSH). The MSH proteins in this family do not have the zinc finger domain, but have a predicted mitochondrial localization. They might play roles in the recognition and repair of errors made during the replication of DNA. The prototype of this family is the protein encoded by the chloroplast mutator (CHM) locus from Arabidopsis thaliana. It is suggested that this protein could be involved in the maintenance of mitochondrial genome stability.


Pssm-ID: 198385  Cd Length: 72  Bit Score: 32.45  E-value: 7.07e-03
                        10        20        30
                ....*....|....*....|....*....|....
gi 570726569  5 CVYMMtNRRNGTLYVGVTSNLVQRVWQHKSGVVE 38
Cdd:cd10438   2 CVYIL-RRGDGEYYVGETDNLSGRLEQHRASLGE 34
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH