Conserved domains on  [gi|1336595443|gb|AUW23198|]
DNA repair protein [Streptococcus suis]

Protein Classification

endonuclease III domain-containing protein (domain architecture ID 11454908)

endonuclease III domain-containing protein similar to N-glycosylase/DNA lyase, which specifically removes the oxidatively damaged form of guanine (7,8-dihydro-8-oxoguanine, or 8-oxoG) from DNA

List of domain hits

Name | Accession | Description | Interval | E-value
HP0602 | COG2231 | 3-Methyladenine DNA glycosylase, HhH-GPD/Endo3 superfamily [Replication, recombination and repair] | 2-204 | 2.61e-74

Pssm-ID: 441832 [Multi-domain]  Cd Length: 220  Bit Score: 223.57  E-value: 2.61e-74
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1336595443   2 EQLTVLYQNLKRRYGEFHWWNDENPIKDLVSMILIQQTTEANAKRALEQL--EGRLTIHSLLEMPVEDLQECIRPAGFFK 79
Cdd:COG2231     5 EDLLEIYERLLEHYGPQHWWPAETPFEVIVGAILTQNTSWKNVEKAIANLkeAGLLDPEALAALDPEELAELIRPSGFYN 84
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1336595443  80 QKSLYIRSVVEW-ANQFDGDFSRLDRVETAVLRKELLSLKGVGNETADVILLYLCRRSVFVADQYALRLFNRLGL-SQSQ 157
Cdd:COG2231    85 QKAKRLKNLARWlVERYGGGLEKLKALPTEELREELLSLKGIGPETADSILLYAFNRPVFVVDAYTRRIFSRLGLiEEDA 164
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....*..
gi 1336595443 158 DYLSLRQEFTEQIKDwSVKDAQELHALIDEHGKQFRLTKGQLDESWL 204
Cdd:COG2231   165 SYDELQRLFEENLPP-DVALYNEFHALIVEHGKEYCKKKPKCEECPL 210
 
ENDO3c | cd00056 | endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (AlkA family) and other DNA glycosidases | 30-189 | 7.29e-30

Pssm-ID: 238013 [Multi-domain]  Cd Length: 158  Bit Score: 108.10  E-value: 7.29e-30
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1336595443  30 LVSMILIQQTTEANAKRALEQLEGRL--TIHSLLEMPVEDLQECIRPAGFfKQKSLYIRSVVEW-ANQFDGDFSRLDRve 106
Cdd:cd00056     4 LVSEILSQQTTDKAVNKAYERLFERYgpTPEALAAADEEELRELIRSLGY-RRKAKYLKELARAiVEGFGGLVLDDPD-- 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1336595443 107 tavLRKELLSLKGVGNETADVILLYLCRRSVFVADQYALRLFNRLGLSQSQ-DYLSLRQEFTEqikDWSVKDAQELHALI 185
Cdd:cd00056    81 ---AREELLALPGVGRKTANVVLLFALGPDAFPVDTHVRRVLKRLGLIPKKkTPEELEELLEE---LLPKPYWGEANQAL 154

                  ....
gi 1336595443 186 DEHG 189
Cdd:cd00056   155 MDLG 158
ENDO3c | smart00478 | endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (AlkA family) and other DNA glycosidases | 35-190 | 2.57e-26

Pssm-ID: 214684 [Multi-domain]  Cd Length: 149  Bit Score: 98.49  E-value: 2.57e-26
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1336595443   35 LIQQTTEANAKRALEQLEGR-LTIHSLLEMPVEDLQECIRPAGFFKQKSLYIRSVVEW-ANQFDGDFSRLdrvetavlRK 112
Cdd:smart00478   1 LSQQTTDERVNKATERLFEKfPTPEDLAAADEEELEELIRGLGFYRRKARYLIELARIlVEEYGGEVPDD--------RE 72
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1336595443  113 ELLSLKGVGNETADVILLYLCRRSVFVADQYALRLFNRLGLSQSQD-YLSLRQEFtEQIKDWsvKDAQELHALIDEHGK 190
Cdd:smart00478  73 ELLKLPGVGRKTANAVLSFALGKPFIPVDTHVLRIAKRLGLVDKKStPEEVEKLL-EKLLPE--EDWRELNLLLIDFGR 148
HhH-GPD | pfam00730 | HhH-GPD superfamily base excision DNA repair protein | 31-154 | 3.22e-20

This family contains a diverse range of structurally related DNA repair proteins. The superfamily is called the HhH-GPD family after its hallmark helix-hairpin-helix and Gly/Pro-rich loop followed by a conserved aspartate. It includes endonuclease III (EC:4.2.99.18) and MutY, an A/G-specific adenine glycosylase, both of which have a C-terminal 4Fe-4S cluster. The family also includes 8-oxoguanine DNA glycosylases. The methyl-CpG binding protein MBD4 also contains a related domain that is a thymine DNA glycosylase. The family also includes DNA-3-methyladenine glycosylase II (EC:3.2.2.21) and other members of the AlkA family.

Pssm-ID: 425841 [Multi-domain]  Cd Length: 141  Bit Score: 82.33  E-value: 3.22e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1336595443  31 VSMILIQQTTEANAKRALEQLEGR--LTIHSLLEMPVEDLQECIRPAGFFKQKSLYIRSVVE-WANQFDGDFSRldrvet 107
Cdd:pfam00730   1 VSAILSQQTSDKAVNKITERLFEKffPTPEDLADADEEELRELIRGLGFYRRKAKYLKELARiLVEGYGGEVPL------ 74
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 1336595443 108 avLRKELLS-LKGVGNETADVILLYLCRRS--VFVADQYALRLFNRLGLS 154
Cdd:pfam00730  75 --DEEELEAlLKGVGRWTAEAVLIFALGRPdpLPVVDTHVRRVLKRLGLI 122
PRK13913 | PRK13913 | 3-methyladenine DNA glycosylase; Provisional | 20-200 | 7.52e-20

Pssm-ID: 184390  Cd Length: 218  Bit Score: 83.36  E-value: 7.52e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1336595443  20 WWNDENPIKDLVSMILIQQTTEANAKRALEQLEGRLTIHSLLEMPVED--------LQECIRPAGFFKQKSLYIRSVVEw 91
Cdd:PRK13913   24 WWPNALKFEALLGAVLTQNTKFEAVEKSLENLKNAFILENDDEINLKKiayiefskLAECVRPSGFYNQKAKRLIDLSE- 102
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1336595443  92 anQFDGDFSRLDRVETAVLRKELLSLKGVGNETADVILLYLCRRSVFVADQYALRLFNRLGLsQSQDYlslrqeftEQIK 171
Cdd:PRK13913  103 --NILKDFGSFENFKQEVTREWLLDQKGIGKESADAILCYVCAKEVMVVDKYSYLFLKKLGI-EIEDY--------DELQ 171
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 1336595443 172 DWSVKDAQELHALIDE---------------HGKQFRLTKGQLD 200
Cdd:PRK13913  172 HFFEKGVQENLNSALAlyentislaqlyarfHGKIVEFSKQKLE 215
 
Nth | COG0177 | Endonuclease III [Replication, recombination and repair] | 11-192 | 2.87e-31

Pssm-ID: 439947 [Multi-domain]  Cd Length: 198  Bit Score: 112.88  E-value: 2.87e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1336595443  11 LKRRYGEFHWW-NDENPIKDLVSMILIQQTTEANAKRALEQLEGRL-TIHSLLEMPVEDLQECIRPAGFFKQKSLYIRSV 88
Cdd:COG0177     4 LKELYPDAKTElDYRDPFELLVATILSAQTTDERVNKATPRLFARYpTPEALAAADLEELEELIRPIGLYRNKAKNIIAL 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1336595443  89 VEW-ANQFDGdfsrldRVETAvlRKELLSLKGVGNETADVILLYLCRRSVFVADQYALRLFNRLGLSQSQDYLSLRQEFT 167
Cdd:COG0177    84 ARIlVEKYGG------EVPET--REELESLPGVGRKTANVVLNFAFGKPAIAVDTHVHRVSNRLGLVPGKDPEEVEKDLM 155
                         170       180
                  ....*....|....*....|....*..
gi 1336595443 168 EQI--KDWSvkdaqELHALIDEHGKQF 192
Cdd:COG0177   156 KLIpkEYWG-----DLHHLLILHGRYI 177
AlkA | COG0122 | 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase [Replication, recombination and repair] | 13-146 | 7.81e-15

Pssm-ID: 439892 [Multi-domain]  Cd Length: 255  Bit Score: 70.68  E-value: 7.81e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1336595443  13 RRYGEFHWWNDENPIKDLVSMILIQQTTEANAKRALEQLEGRL---------------TIHSLLEMPVEDLQECirpaGF 77
Cdd:COG0122    71 ERYPGLRLPRRPDPFEALVRAILGQQVSVAAARTIWRRLVALFgepiegpggglyafpTPEALAAASEEELRAC----GL 146
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1336595443  78 FKQKSLYIRSVVEWANQFDGDFSRLDRVETAVLRKELLSLKGVGNETADVILLY-LCRRSVFVADQYALR 146
Cdd:COG0122   147 SRRKARYLRALARAVADGELDLEALAGLDDEEAIARLTALPGIGPWTAEMVLLFaLGRPDAFPAGDLGLR 216
HHH | pfam00633 | Helix-hairpin-helix motif | 101-131 | 2.06e-03

The helix-hairpin-helix DNA-binding motif is found to be duplicated in the central domain of RuvA. The HhH domain of DisA, a bacterial checkpoint control protein, is a DNA-binding domain.

Pssm-ID: 425789 [Multi-domain]  Cd Length: 30  Bit Score: 34.70  E-value: 2.06e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1336595443 101 RLDRVETAvLRKELLSLKGVGNETADVILLY 131
Cdd:pfam00633   1 SLEGLIPA-SVEELLALPGVGPKTAEAILSY 30
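
As a rough way to read the pairwise alignments above, the following Python sketch counts identical residues over the aligned columns of the pfam00633 (HHH) alignment. The function name `aligned_identity` is hypothetical, and the sketch assumes that `-` marks a gap and that lowercase letters mark residues not aligned to the domain model, so both are skipped. It is an illustration only, not how CD-Search itself scores a hit.

```python
def aligned_identity(query: str, subject: str) -> tuple[int, int]:
    """Count (identical, aligned) columns in a pairwise alignment.

    Assumption: '-' is a gap and lowercase residues are unaligned,
    so columns containing either are ignored.
    """
    if len(query) != len(subject):
        raise ValueError("alignment rows must have the same number of columns")
    identical = 0
    aligned = 0
    for q, s in zip(query, subject):
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue  # skip gap columns and unaligned residues
        aligned += 1
        if q == s:
            identical += 1
    return identical, aligned


# Rows copied from the pfam00633 alignment (query residues 101-131).
QUERY = "RLDRVETAvLRKELLSLKGVGNETADVILLY"
SUBJECT = "SLEGLIPA-SVEELLALPGVGPKTAEAILSY"

identical, aligned = aligned_identity(QUERY, SUBJECT)
print(f"{identical}/{aligned} aligned columns identical "
      f"({100 * identical / aligned:.1f}%)")
```

The same function can be applied to the longer alignments by concatenating the sequence portions of each block for the query and for the domain model before calling it.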
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
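
To illustrate how the E-value threshold relates to the hit list, here is a minimal Python sketch that applies a cutoff to the hits reported on this page and ranks them. The tuple layout and the function name `significant_hits` are hypothetical, and treating the cutoff as inclusive (`<=`) is an assumption; the values themselves are copied from the hit list.

```python
# Domain hits from this page: (name, accession, start, end, e_value).
HITS = [
    ("HP0602", "COG2231", 2, 204, 2.61e-74),
    ("Nth", "COG0177", 11, 192, 2.87e-31),
    ("ENDO3c", "cd00056", 30, 189, 7.29e-30),
    ("ENDO3c", "smart00478", 35, 190, 2.57e-26),
    ("HhH-GPD", "pfam00730", 31, 154, 3.22e-20),
    ("PRK13913", "PRK13913", 20, 200, 7.52e-20),
    ("AlkA", "COG0122", 13, 146, 7.81e-15),
    ("HHH", "pfam00633", 101, 131, 2.06e-03),
]

E_VALUE_THRESHOLD = 0.01  # the preset threshold listed in the search parameters


def significant_hits(hits, threshold=E_VALUE_THRESHOLD):
    """Return hits whose E-value is at or below the threshold, best first.

    Assumption: the cutoff is inclusive; smaller E-values rank higher.
    """
    kept = [hit for hit in hits if hit[4] <= threshold]
    return sorted(kept, key=lambda hit: hit[4])


for name, accession, start, end, e_value in significant_hits(HITS):
    print(f"{name:9s} {accession:11s} {start:>3d}-{end:<3d} {e_value:.2e}")
```

All eight hits pass the 0.01 cutoff; the HHH motif (2.06e-03) is the only one that a stricter cutoff such as 1e-10 would exclude.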
