NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1867200878|gb|QLF81821|]
View 

hypothetical protein F1_0237 [Escherichia phage vB_EcoM_F1]

Protein Classification

GIY-YIG nuclease family protein( domain architecture ID 61901)

GIY-YIG nuclease family protein similar to Escherichia virus T4 endonuclease segA that may be involved in the movement of the endonuclease-encoding DNA

CATH:  3.40.1440.10
Gene Ontology:  GO:0004518
PubMed:  16646971
SCOP:  3000597

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
GIY-YIG_SF super family cl15257
GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large ...
3-33 5.03e-11

GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large and diverse group of proteins involved in many cellular processes, such as class I homing GIY-YIG family endonucleases, prokaryotic nucleotide excision repair proteins UvrC and Cho, type II restriction enzymes, the endonuclease/reverse transcriptase of eukaryotic retrotransposable elements, and a family of eukaryotic enzymes that repair stalled replication forks. All of these members contain a conserved GIY-YIG nuclease domain that may serve as a scaffold for the coordination of a divalent metal ion required for catalysis of the phosphodiester bond cleavage. By combining with different specificity, targeting, or other domains, the GIY-YIG nucleases may perform different functions.


The actual alignment was detected with superfamily member cd10437:

Pssm-ID: 472790  Cd Length: 90  Bit Score: 52.27  E-value: 5.03e-11
                         10        20        30
                 ....*....|....*....|....*....|.
gi 1867200878  3 SGIYQIKNTLNNKVYVGSAKDFEKRWKRHFK 33
Cdd:cd10437    1 SGIYKITNLENGKIYIGSSKNILKRLSQHKR 31
 
Name Accession Description Interval E-value
GIY-YIG_HE_I-TevI_like cd10437
N-terminal catalytic domain of GIY-YIG intron endonuclease I-TevI, I-BmoI, I-BanI, I-BthII and ...
3-33 5.03e-11

N-terminal catalytic domain of GIY-YIG intron endonuclease I-TevI, I-BmoI, I-BanI, I-BthII and similar proteins; I-TevI is a site-specific GIY-YIG homing endonuclease encoded within the group I intron of the thymidylate synthase gene (td) from Escherichia coli phage T4. It functions as an endonuclease that catalyzes the first step in intron homing by generating a double-strand break in the intronless td allele within a sequence designated the homing site. I-TevI recognizes its extensive 37 base pair DNA target in a site-specific, but sequence-tolerant manner. The cleavage site is located at 23 (upper strand) and 25 (lower strand) nucleotides upstream of the intron insertion site. A divalent cation, such as Mg2+, is required for the catalysis. I-TevI also acts as a repressor of its own transcription. It binds an operator that is located upstream of the I-TevI coding sequence and overlaps the T4 late promoter, which drives I-TevI expression from within the td intron. I-TevI binds the homing sites and the operator with the same affinity, but cleaves the homing site more efficiently than the operator. I-TevI consists of an N-terminal catalytic domain, containing the GIY-YIG motif, and a C-terminal DNA-binding domain that binds DNA as a monomer, joined by a flexible linker. The C-terminal domain includes three subdomains: a zinc finger, a minor-groove binding alpha-helix (NUMOD3, nuclease-associated modular domain 3), and a helix-turn-helix domain (HTH). The last two are responsible for DNA-binding. The zinc finger is part of the linker and not required for DNA-binding. It is implicated as a distance sensor to constrain the catalytic domain to cleave the homing site at a fixed position. None of other GIY-YIG endonucleases have been found to have the zinc finger motif. This family also includes a reduced activity isoschizomer of I-TevI, I-BmoI, which is encoded within the group I intron of the thymidylate synthase (TS) gene (thyA) from Bacillus mojavensis. I-BmoI catalyzes the first step in intron homing by generating a double-strand break in the intronless td allele within a sequence designated the homing site in the presence of a divalent cation cofactor, such as Mg2+. In the absence of Mg2+, I-Bmol only nicks one of the strands. Both I-BmoI and I-TevI bind a homologous stretch of TS-encoding DNA as monomers, but use different strategies to distinguish intronless from intron-containing substrates. I-TevI recognizes substrates at the level of DNA-binding. However, I-BmoI binds both intron-containing and intronless TS-encoding substrates, but efficiently cleaves only intronless substrate. Afterwards they cleave their respective intronless substrates in the same positions, and both require a critical G-C base pair adjacent to the top strand site for efficient cleavage. The C-terminal domain of I-BmoI has nuclease-associated modular DNA-binding domains (NUMODs), but lacks the zinc finger, which is different from that of I-TevI. Although the zinc finger implicated as a distance determination in I-TevI is absent, I-BmoI still possesses some cleavage distance discrimination. Besides I-TevI and I-BmoI, this family contains a putative GIY-YIG homing endonuclease, I-BanI, encoded within the self-splicing group I intron of nrdE gene from Bacillus anthracis. It contains two major domains, the N-terminal GIY-YIG domain and the C-terminal DNA-binding domain that consists of a minor-groove DNA binding alpha-helix motif and a helix-turn-helix (HTH) motif. I-BanI generates a double-strand break (DSB) in the intronless nrdE gene. The cleavage site is located at 5 and 7 nucleotides upstream of the intron insertion site, with 2-nucleotide 3' extensions. The recognition site is 35 to 40 base pairs and covers the cleavage site with a bias toward the downstream region including the (intervening sequence) IVS insertion site. Moreover, this family contains another putative GIY-YIG homing endonuclease, I-BthII, encoded within the self-splicing group I intron of nrdF gene from Bacillus thuringiensis ssp. pakistani. It contains a GIY-YIG motif that generates a double-strand break (DSB) in the intronless nrdF gene. The cleavage site is located at 7 and 9 nucleotides upstream of the intron insertion site, leaving 2-nucleotide 3' extensions. The recognition site is 27 to 29 base pairs with the DSB cleavage site at the 5'-end of the top strand, and with the intervening sequence (IVS) insertion site approximately in the middle of the recognition site.


Pssm-ID: 198384  Cd Length: 90  Bit Score: 52.27  E-value: 5.03e-11
                         10        20        30
                 ....*....|....*....|....*....|.
gi 1867200878  3 SGIYQIKNTLNNKVYVGSAKDFEKRWKRHFK 33
Cdd:cd10437    1 SGIYKITNLENGKIYIGSSKNILKRLSQHKR 31
grpIintron_endo TIGR01453
group I intron endonuclease; This model represents one subfamily of endonucleases containing ...
2-36 6.58e-10

group I intron endonuclease; This model represents one subfamily of endonucleases containing the endo/excinuclease amino terminal domain, pfam01541 at its amino end. A distinct subfamily includes excinuclease abc subunit c (uvrC). Members of pfam01541 are often termed GIY-YIG endonucleases after conserved motifs near the amino end. This subfamily in this model is found in open reading frames of group I introns in both phage and mitochondria. The closely related endonucleases of phage T4: segA, segB, segC, segD and segE, score below the trusted cutoff for the family.


Pssm-ID: 273636 [Multi-domain]  Cd Length: 214  Bit Score: 51.62  E-value: 6.58e-10
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1867200878   2 KSGIYQIKNTLNNKVYVGSAKDFEKRWKRHFKISS 36
Cdd:TIGR01453   1 KSGIYKITNNINGKIYVGSSVNLEKRLKEHLKLLK 35
GIYc smart00465
GIY-YIG type nucleases (URI domain);
2-37 1.29e-07

GIY-YIG type nucleases (URI domain);


Pssm-ID: 214677 [Multi-domain]  Cd Length: 84  Bit Score: 43.18  E-value: 1.29e-07
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 1867200878   2 KSGIYQIKNTLNNKVYVGSAKDFEKRWKRHFKISSG 37
Cdd:smart00465  1 KPGVYYITNKKNGKLYVGKAKNLRNRLKRHFSGSRK 36
GIY-YIG pfam01541
GIY-YIG catalytic domain; This domain called GIY-YIG is found in the amino terminal region of ...
2-33 2.50e-06

GIY-YIG catalytic domain; This domain called GIY-YIG is found in the amino terminal region of excinuclease abc subunit c (uvrC), bacteriophage T4 endonucleases segA, segB, segC, segD and segE; it is also found in putative endonucleases encoded by group I introns of fungi and phage. The structure of I-TevI a GIY-YIG endonuclease, reveals a novel alpha/beta-fold with a central three-stranded antiparallel beta-sheet flanked by three helices. The most conserved and putative catalytic residues are located on a shallow, concave surface and include a metal coordination site.


Pssm-ID: 426314 [Multi-domain]  Cd Length: 78  Bit Score: 40.02  E-value: 2.50e-06
                         10        20        30
                 ....*....|....*....|....*....|..
gi 1867200878  2 KSGIYQIKNTLNNKVYVGSAKDFEKRWKRHFK 33
Cdd:pfam01541  1 KGGIYIIRNKDNKLLYVGSTKNLERRLNQHNA 32
 
Name Accession Description Interval E-value
GIY-YIG_HE_I-TevI_like cd10437
N-terminal catalytic domain of GIY-YIG intron endonuclease I-TevI, I-BmoI, I-BanI, I-BthII and ...
3-33 5.03e-11

N-terminal catalytic domain of GIY-YIG intron endonuclease I-TevI, I-BmoI, I-BanI, I-BthII and similar proteins; I-TevI is a site-specific GIY-YIG homing endonuclease encoded within the group I intron of the thymidylate synthase gene (td) from Escherichia coli phage T4. It functions as an endonuclease that catalyzes the first step in intron homing by generating a double-strand break in the intronless td allele within a sequence designated the homing site. I-TevI recognizes its extensive 37 base pair DNA target in a site-specific, but sequence-tolerant manner. The cleavage site is located at 23 (upper strand) and 25 (lower strand) nucleotides upstream of the intron insertion site. A divalent cation, such as Mg2+, is required for the catalysis. I-TevI also acts as a repressor of its own transcription. It binds an operator that is located upstream of the I-TevI coding sequence and overlaps the T4 late promoter, which drives I-TevI expression from within the td intron. I-TevI binds the homing sites and the operator with the same affinity, but cleaves the homing site more efficiently than the operator. I-TevI consists of an N-terminal catalytic domain, containing the GIY-YIG motif, and a C-terminal DNA-binding domain that binds DNA as a monomer, joined by a flexible linker. The C-terminal domain includes three subdomains: a zinc finger, a minor-groove binding alpha-helix (NUMOD3, nuclease-associated modular domain 3), and a helix-turn-helix domain (HTH). The last two are responsible for DNA-binding. The zinc finger is part of the linker and not required for DNA-binding. It is implicated as a distance sensor to constrain the catalytic domain to cleave the homing site at a fixed position. None of other GIY-YIG endonucleases have been found to have the zinc finger motif. This family also includes a reduced activity isoschizomer of I-TevI, I-BmoI, which is encoded within the group I intron of the thymidylate synthase (TS) gene (thyA) from Bacillus mojavensis. I-BmoI catalyzes the first step in intron homing by generating a double-strand break in the intronless td allele within a sequence designated the homing site in the presence of a divalent cation cofactor, such as Mg2+. In the absence of Mg2+, I-Bmol only nicks one of the strands. Both I-BmoI and I-TevI bind a homologous stretch of TS-encoding DNA as monomers, but use different strategies to distinguish intronless from intron-containing substrates. I-TevI recognizes substrates at the level of DNA-binding. However, I-BmoI binds both intron-containing and intronless TS-encoding substrates, but efficiently cleaves only intronless substrate. Afterwards they cleave their respective intronless substrates in the same positions, and both require a critical G-C base pair adjacent to the top strand site for efficient cleavage. The C-terminal domain of I-BmoI has nuclease-associated modular DNA-binding domains (NUMODs), but lacks the zinc finger, which is different from that of I-TevI. Although the zinc finger implicated as a distance determination in I-TevI is absent, I-BmoI still possesses some cleavage distance discrimination. Besides I-TevI and I-BmoI, this family contains a putative GIY-YIG homing endonuclease, I-BanI, encoded within the self-splicing group I intron of nrdE gene from Bacillus anthracis. It contains two major domains, the N-terminal GIY-YIG domain and the C-terminal DNA-binding domain that consists of a minor-groove DNA binding alpha-helix motif and a helix-turn-helix (HTH) motif. I-BanI generates a double-strand break (DSB) in the intronless nrdE gene. The cleavage site is located at 5 and 7 nucleotides upstream of the intron insertion site, with 2-nucleotide 3' extensions. The recognition site is 35 to 40 base pairs and covers the cleavage site with a bias toward the downstream region including the (intervening sequence) IVS insertion site. Moreover, this family contains another putative GIY-YIG homing endonuclease, I-BthII, encoded within the self-splicing group I intron of nrdF gene from Bacillus thuringiensis ssp. pakistani. It contains a GIY-YIG motif that generates a double-strand break (DSB) in the intronless nrdF gene. The cleavage site is located at 7 and 9 nucleotides upstream of the intron insertion site, leaving 2-nucleotide 3' extensions. The recognition site is 27 to 29 base pairs with the DSB cleavage site at the 5'-end of the top strand, and with the intervening sequence (IVS) insertion site approximately in the middle of the recognition site.


Pssm-ID: 198384  Cd Length: 90  Bit Score: 52.27  E-value: 5.03e-11
                         10        20        30
                 ....*....|....*....|....*....|.
gi 1867200878  3 SGIYQIKNTLNNKVYVGSAKDFEKRWKRHFK 33
Cdd:cd10437    1 SGIYKITNLENGKIYIGSSKNILKRLSQHKR 31
grpIintron_endo TIGR01453
group I intron endonuclease; This model represents one subfamily of endonucleases containing ...
2-36 6.58e-10

group I intron endonuclease; This model represents one subfamily of endonucleases containing the endo/excinuclease amino terminal domain, pfam01541 at its amino end. A distinct subfamily includes excinuclease abc subunit c (uvrC). Members of pfam01541 are often termed GIY-YIG endonucleases after conserved motifs near the amino end. This subfamily in this model is found in open reading frames of group I introns in both phage and mitochondria. The closely related endonucleases of phage T4: segA, segB, segC, segD and segE, score below the trusted cutoff for the family.


Pssm-ID: 273636 [Multi-domain]  Cd Length: 214  Bit Score: 51.62  E-value: 6.58e-10
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1867200878   2 KSGIYQIKNTLNNKVYVGSAKDFEKRWKRHFKISS 36
Cdd:TIGR01453   1 KSGIYKITNNINGKIYVGSSVNLEKRLKEHLKLLK 35
GIY-YIG_SF cd00719
GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large ...
4-47 5.86e-08

GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large and diverse group of proteins involved in many cellular processes, such as class I homing GIY-YIG family endonucleases, prokaryotic nucleotide excision repair proteins UvrC and Cho, type II restriction enzymes, the endonuclease/reverse transcriptase of eukaryotic retrotransposable elements, and a family of eukaryotic enzymes that repair stalled replication forks. All of these members contain a conserved GIY-YIG nuclease domain that may serve as a scaffold for the coordination of a divalent metal ion required for catalysis of the phosphodiester bond cleavage. By combining with different specificity, targeting, or other domains, the GIY-YIG nucleases may perform different functions.


Pssm-ID: 198380 [Multi-domain]  Cd Length: 69  Bit Score: 43.89  E-value: 5.86e-08
                         10        20        30        40
                 ....*....|....*....|....*....|....*....|....
gi 1867200878  4 GIYQIKNTLNNKVYVGSAKDFEKRWKRHFKISSGLVTYRVKSDK 47
Cdd:cd00719    1 GVYVLYDEDNGLIYVGQTKNLRNRIKEHLRKQRSDWTKGLKPFE 44
GIYc smart00465
GIY-YIG type nucleases (URI domain);
2-37 1.29e-07

GIY-YIG type nucleases (URI domain);


Pssm-ID: 214677 [Multi-domain]  Cd Length: 84  Bit Score: 43.18  E-value: 1.29e-07
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 1867200878   2 KSGIYQIKNTLNNKVYVGSAKDFEKRWKRHFKISSG 37
Cdd:smart00465  1 KPGVYYITNKKNGKLYVGKAKNLRNRLKRHFSGSRK 36
GIY-YIG_LuxR_like cd10451
GIY-YIG domain of LuxR and ArsR family transcriptional regulators, and uncharacterized ...
4-37 4.47e-07

GIY-YIG domain of LuxR and ArsR family transcriptional regulators, and uncharacterized hypothetical proteins found in bacteria; The family includes some bacterial LuxR and ArsR family transcriptional regulators. The a C-terminal conserved domain shows sequence similarity to the N-terminal catalytic GIY-YIG domains of intron-encoded homing endonucleases. Besides, they have an N-terminally fused transcriptional regulators module, comprising the winged helix-turn-helix (wHTH) domain and uncharacterized domain DUF2087. At this point, they are distinct from GIY-YIG homing endonucleases, which typically contain a variety of C-terminally fused nuclease-associated modular DNA-binding domains (NUMODs). Moreover, some key residues relevant to catalysis in GIY-YIG endonucleases are mutanted or absent in this family, which suggests that members in this family might lose the catalytic function that GIY-YIG endonucleases possess. This family also includes many uncharacterized hypothetical proteins that consist of a standalone GIY-YIG like domain.


Pssm-ID: 198398  Cd Length: 101  Bit Score: 42.16  E-value: 4.47e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1867200878   4 GIYQIKNTLNNKVYVGSAKDFEKRWKRH-FKISSG 37
Cdd:cd10451    15 GVYAIRNTATGKVFIGSSPNLKGTLNRLrFQLNTG 49
GIY-YIG pfam01541
GIY-YIG catalytic domain; This domain called GIY-YIG is found in the amino terminal region of ...
2-33 2.50e-06

GIY-YIG catalytic domain; This domain called GIY-YIG is found in the amino terminal region of excinuclease abc subunit c (uvrC), bacteriophage T4 endonucleases segA, segB, segC, segD and segE; it is also found in putative endonucleases encoded by group I introns of fungi and phage. The structure of I-TevI a GIY-YIG endonuclease, reveals a novel alpha/beta-fold with a central three-stranded antiparallel beta-sheet flanked by three helices. The most conserved and putative catalytic residues are located on a shallow, concave surface and include a metal coordination site.


Pssm-ID: 426314 [Multi-domain]  Cd Length: 78  Bit Score: 40.02  E-value: 2.50e-06
                         10        20        30
                 ....*....|....*....|....*....|..
gi 1867200878  2 KSGIYQIKNTLNNKVYVGSAKDFEKRWKRHFK 33
Cdd:pfam01541  1 KGGIYIIRNKDNKLLYVGSTKNLERRLNQHNA 32
GIY-YIG_bI1_like cd10445
Catalytic GIY-YIG domain of putative intron-encoded endonuclease bI1 and similar proteins; The ...
3-37 1.02e-05

Catalytic GIY-YIG domain of putative intron-encoded endonuclease bI1 and similar proteins; The prototype of this family is a putative intron-encoded mitochondrial DNA endonuclease bI1 found in mitochondrion Ustilago maydis. This protein may arise from proteolytic cleavage of an in-frame translation of COB exon 1 plus intron 1, containing the bI1 open reading frame. It contains an N-terminal truncated non-functional cytochrome b region and a C-terminal intron-encoded endonuclease bI1 region. The bI1 region shows high sequence similarity to endonucleases of group I introns of fungi and phage and might be involved in intron homing. Many uncharacterized bI1 homologs existing in fungi and chlorophyta in this family do not contain the cytochrome b region, but have a standalone bI1-like region, which contains a GIY-YIG domain and a minor-groove binding alpha-helix nuclease-associated modular domain (NUMOD). This family also includes a Yarrowia lipolytica mobile group-II intron COX1-i1, also called intron alpha, encoding protein with reverse transcriptase activity. The group-II intron COX1-i1 may be involv ed both in the generation of the circular multimeric DNA molecules (senDNA alpha) which amplify during the senescence syndrome and in the generation of the site-specific deletion which accumulates in the premature-death syndrome.


Pssm-ID: 198392 [Multi-domain]  Cd Length: 88  Bit Score: 38.75  E-value: 1.02e-05
                         10        20        30
                 ....*....|....*....|....*....|....*
gi 1867200878  3 SGIYQIKNTLNNKVYVGSAKDFEKRWKRHFKISSG 37
Cdd:cd10445    1 SGIYIWINKINGKIYVGSSINLYKRLRSYLNPSYL 35
GIY-YIG_HE_Tlr8p_PBC-V_like cd10443
GIY-YIG domain of uncharacterized hypothetical protein found in phycodnavirus PBCV-1 DNA virus, ...
5-41 2.26e-05

GIY-YIG domain of uncharacterized hypothetical protein found in phycodnavirus PBCV-1 DNA virus, T. thermophila Tlr element eoncoding protein Tlr8p, and similar proteins found in bacteria; The family includes a group of diverse uncharacterized hypothetical proteins with a GIY-YIG domain that shows statistically significant similarity to the N-terminal catalytic domains of GIY-YIG family of intron-encoded homing endonuclease I-TevI. Similar to I-TevI, family members from phycodnavirus PBCV-1 DNA virus have nuclease-associated modular DNA-binding domains (NUMODs) and a helix-turn-helix (HTH) domain C-terminally fused to the GIY-YIG domain, which suggests that these PBCV-1 acquired the I-TevI-like homing endonucleases from phages by horizontal gene transfer. This family also includes proteins that appear to connect homing endonucleases with Penelope elements, such as Tetrahymena thermophila Tlr element encoding protein Tlr8p that possess additional N-terminal and central structural regions, followed by a putative superfamily 1 helicase domain and I-TevI-like GIY-YIG domain, but lacks the NUMOD domains and HTH domain. It is suggested that the Tlr8p element could have acquired its GIY-YIG domain w ithin the nucleus of the ciliate cell infected by the Phycodnavirus. Some family members only contain a standalone GIY-YIG domain and their biological functions are unclear.


Pssm-ID: 198390  Cd Length: 90  Bit Score: 37.66  E-value: 2.26e-05
                         10        20        30
                 ....*....|....*....|....*....|....*...
gi 1867200878  5 IYQIKNTLNNKVYVG-SAKDFEKRWKRHFKISSGLVTY 41
Cdd:cd10443    3 IYKITNPINGKVYIGqTTRTLEERLKQHKKAASAGPKK 40
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH