Conserved domains on [gi|1102618193|ref|YP_009321933|]

hypothetical protein BOW81_gp64 [Flavobacterium phage Fpv20]

Protein Classification

GIY-YIG nuclease family protein (domain architecture ID 10333112)

GIY-YIG nuclease family protein


List of domain hits

Name                     Accession   Interval   E-value
GIY-YIG_SF superfamily   cl15257     2-64       1.11e-04

GIY-YIG nuclease domain superfamily; The GIY-YIG nuclease domain superfamily includes a large and diverse group of proteins involved in many cellular processes, such as class I homing GIY-YIG family endonucleases, prokaryotic nucleotide excision repair proteins UvrC and Cho, type II restriction enzymes, the endonuclease/reverse transcriptase of eukaryotic retrotransposable elements, and a family of eukaryotic enzymes that repair stalled replication forks. All of these members contain a conserved GIY-YIG nuclease domain that may serve as a scaffold for the coordination of a divalent metal ion required for catalysis of the phosphodiester bond cleavage. By combining with different specificity, targeting, or other domains, the GIY-YIG nucleases may perform different functions.


The actual alignment was detected with superfamily member cd10440:

Pssm-ID: 472790 [Multi-domain]  Cd Length: 94  Bit Score: 39.67  E-value: 1.11e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1102618193   2 YYLYRHIRLDKNEVFYIGIGTvpktnynissiktyYNRAFEKVKSRNKYWKNITNITDYLVEI 64
Cdd:cd10440     1 YYVYALIDPRTGEVFYVGKGK--------------GNRVFSHVKEALGEYENIKEKLSAKLQR 49
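The alignment above can be summarized numerically. The following is a minimal sketch, not part of the CD-Search output: it counts aligned columns, gap columns, and identical residues for the two rows shown. The function name `alignment_stats` is hypothetical, and the sketch assumes that lowercase query residues opposite `-` are unaligned insertions, so any column with a gap on either row is excluded from the identity count.

```python
def alignment_stats(query: str, subject: str) -> dict:
    """Count aligned, gapped, and identical columns of a pairwise alignment.

    Both strings must be the same length (one character per column).
    A column counts as aligned only when neither row has a gap ('-').
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have equal length")

    aligned = identical = gaps = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-":
            gaps += 1
            continue
        aligned += 1
        # Case-insensitive comparison, since unaligned residues are lowercase.
        if q.upper() == s.upper():
            identical += 1

    return {"aligned": aligned, "gaps": gaps, "identical": identical}


# Rows copied from the alignment of the query against cd10440.
QUERY = "YYLYRHIRLDKNEVFYIGIGTvpktnynissiktyYNRAFEKVKSRNKYWKNITNITDYLVEI"
SUBJECT = "YYVYALIDPRTGEVFYVGKGK--------------GNRVFSHVKEALGEYENIKEKLSAKLQR"

stats = alignment_stats(QUERY, SUBJECT)
percent_identity = 100.0 * stats["identical"] / stats["aligned"]
print(stats, f"{percent_identity:.1f}% identity over aligned columns")
```

The identity here is computed over aligned columns only; dividing by the full query interval (2-64) instead would give a lower figure.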
Name    Accession    Interval   E-value
IENR1   smart00497   135-183    2.79e-03

Intron encoded nuclease repeat motif; Repeat of unknown function, but possibly DNA-binding via helix-turn-helix motif (Ponting, unpublished).



Pssm-ID: 197761  Cd Length: 53  Bit Score: 34.85  E-value: 2.79e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 1102618193  135 IKIIMY--NDSFNKEFDSISEAAMYIGVNNiGNISSCLKGKRSSAYGYKFK 183
Cdd:smart00497   2 KPVYVYdlDGNLIGEFSSIREAAKYLGISH-SSISKYLNTGKKYKGGYYFK 51
 
Name              Accession   Interval   E-value
GIY-YIG_COG3680   cd10440     2-64       1.11e-04

GIY-YIG domain of uncharacterized proteins from bacteria and their eukaryotic homologs; This family includes a group of functionally uncharacterized proteins from bacteria and their eukaryotic homologs, which are present only in metazoa. These proteins might have nuclease activity and may be engaged in DNA repair or recombination, since they share sequence homology with the catalytic GIY-YIG domain of bacterial UvrC DNA repair proteins. Unlike their prokaryotic relatives, the eukaryotic homologs contain an N-terminal extension that includes a region of approximately 3-4 ankyrin repeats, motifs that mediate protein-protein interactions. Some of the eukaryotic homologs have an additional LEM domain located between the ankyrin repeat region and the GIY-YIG domain. The LEM domain, found in inner nuclear membrane proteins, may be involved in protein or DNA binding. The different domain composition of the eukaryotic homologs suggests that they might interact with multiple partners and implies an important cellular function.


Pssm-ID: 198387 [Multi-domain]  Cd Length: 94  Bit Score: 39.67  E-value: 1.11e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1102618193   2 YYLYRHIRLDKNEVFYIGIGTvpktnynissiktyYNRAFEKVKSRNKYWKNITNITDYLVEI 64
Cdd:cd10440     1 YYVYALIDPRTGEVFYVGKGK--------------GNRVFSHVKEALGEYENIKEKLSAKLQR 49
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
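
As an illustration of how the E-value threshold relates to the hits listed on this page, here is a minimal sketch that filters the reported hits against the 0.01 cutoff. The `hits` list and the variable names are hypothetical scaffolding built from the values shown above; this is not how CD-Search itself applies the threshold internally.

```python
# E-value threshold from the preset options on this page.
E_VALUE_THRESHOLD = 0.01

# (name, accession, interval, E-value) for the hits reported above.
hits = [
    ("GIY-YIG_SF", "cl15257", (2, 64), 1.11e-04),
    ("IENR1", "smart00497", (135, 183), 2.79e-03),
]

# Keep only hits whose E-value does not exceed the threshold.
passing = [hit for hit in hits if hit[3] <= E_VALUE_THRESHOLD]

for name, accession, (start, end), e_value in passing:
    length = end - start + 1  # the interval is inclusive on both ends
    print(f"{name} ({accession}): residues {start}-{end}, "
          f"{length} aa, E-value {e_value:.2e}")
```

Both reported hits fall below the threshold, which is consistent with them appearing in the hit list.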
