|
Name |
Accession |
Description |
Interval |
E-value |
| MutY |
COG1194 |
Adenine-specific DNA glycosylase, acts on AG and A-oxoG pairs [Replication, recombination and ... |
33-438 |
1.89e-156 |
|
Adenine-specific DNA glycosylase, acts on AG and A-oxoG pairs [Replication, recombination and repair];
Pssm-ID: 440807 [Multi-domain] Cd Length: 350 Bit Score: 446.89 E-value: 1.89e-156
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 33 ADVTAFRSNLLSWYDQEKRDLPWRNlakeeansDRRAYAVWVSEVMLQQTQVATVIDYYTRWMQKWPKLQDLASASLEEV 112
Cdd:COG1194 1 MDMASFAKRLLAWYDRHGRDLPWRQ--------TRDPYRVWLSEIMLQQTQVATVIPYYERFLERFPTVEALAAAPEDEV 72
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 113 NQLWSGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQLlPGVGRYTAGAIASIAFDQVTGVVDGNVLRVLCRVRAIG 192
Cdd:COG1194 73 LKLWEGLGYYSRARNLHKAAQQVVEEHGGVFPDTYEELLAL-PGIGPYTAAAIASIAFGEPAPIVDGNVKRVLSRLFAIE 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 193 ADPTSTLVSHHLWNLAQQLVDPARPGDFNQAAMELGATVCTPQRPLCSHCPVQSLCRAYQRvqrgqlsalpGRPdieeca 272
Cdd:COG1194 152 GPIGSPAAKKELWALAEELLPPERPGDFNQALMDLGATVCTPKKPKCLLCPLQDDCAAFAE----------GRQ------ 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 273 lntrqcqlcltssspwdpsmgvANFPRKASRRPPREEYSATCVVEQPGAiggplVLLVQRPDSGLLAGLWEFPSVtlEPS 352
Cdd:COG1194 216 ----------------------EELPVKKPKKKKPERYGAALVIRDDGR-----VLLEKRPPKGLWGGLWEFPEF--EWE 266
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 353 EQHQHKALLQELQRWCGpLPAIRLQHLGEVIHIFSHIKLTYQVYSLALDQAPAStAPPGARWLTWEEFCNAAVSTAMKKV 432
Cdd:COG1194 267 EAEDPEALERWLREELG-LEVEWLEPLGTVRHVFTHFRLHLTVYLARVPAGPPA-EPDGGRWVPLEELAALPLPAPMRKL 344
|
....*.
gi 2060810902 433 FRMYED 438
Cdd:COG1194 345 LKALLK 350
|
|
| mutY |
TIGR01084 |
A/G-specific adenine glycosylase; This equivalog model identifies mutY members of the ... |
37-345 |
2.48e-100 |
|
A/G-specific adenine glycosylase; This equivalog model identifies mutY members of the pfam00730 superfamily (HhH-GPD: Helix-hairpin-helix and Gly/Pro rich loop followed by a conserved aspartate). The major members of the superfamily are nth and mutY. [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 130156 Cd Length: 275 Bit Score: 301.25 E-value: 2.48e-100
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 37 AFRSNLLSWYDQEKR-DLPWRNlakeeansDRRAYAVWVSEVMLQQTQVATVIDYYTRWMQKWPKLQDLASASLEEVNQL 115
Cdd:TIGR01084 1 QFSEDLLSWYDKYGRkTLPWRQ--------NKTPYRVWLSEVMLQQTQVATVIPYFERFLERFPTVQALANAPQDEVLKL 72
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 116 WSGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQlLPGVGRYTAGAIASIAFDQVTGVVDGNVLRVLCRVRAIGADP 195
Cdd:TIGR01084 73 WEGLGYYARARNLHKAAQEVVEEFGGEFPQDFEDLAA-LPGVGRYTAGAILSFALNKPYPILDGNVKRVLSRLFAVEGWP 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 196 TSTLVSHHLWNLAQQLVDPARPGDFNQAAMELGATVCTPQRPLCSHCPVQSLCRAYqrvQRGQLSALPGrpdieecalnt 275
Cdd:TIGR01084 152 GKKKVENRLWTLAESLLPKADPEAFNQALMDLGAMICTRKKPKCDLCPLQDFCLAY---QQGTWEEYPV----------- 217
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 276 rqcqlcltssspwdpsmgvanfpRKASRRPPREEYSATCVVEQPGAiggplVLLVQRPDSGLLAGLWEFP 345
Cdd:TIGR01084 218 -----------------------KKPKAAPPERTTYFLVLQNYDGE-----VLLEQRPEKGLWGGLYCFP 259
|
|
| PRK10880 |
PRK10880 |
adenine DNA glycosylase; |
38-391 |
3.29e-69 |
|
adenine DNA glycosylase;
Pssm-ID: 182805 [Multi-domain] Cd Length: 350 Bit Score: 223.82 E-value: 3.29e-69
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 38 FRSNLLSWYDQEKRD-LPWRnlakeeanSDRRAYAVWVSEVMLQQTQVATVIDYYTRWMQKWPKLQDLASASLEEVNQLW 116
Cdd:PRK10880 6 FSAQVLDWYDKYGRKtLPWQ--------IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLW 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 117 SGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQlLPGVGRYTAGAIASIAFDQVTGVVDGNVLRVLCRVRAIGADPT 196
Cdd:PRK10880 78 TGLGYYARARNLHKAAQQVATLHGGEFPETFEEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPG 156
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 197 STLVSHHLWNLAQQlVDPARP-GDFNQAAMELGATVCTPQRPLCSHCPVQSLCRAYqrvqrgqlsalpgrpdieecalnt 275
Cdd:PRK10880 157 KKEVENRLWQLSEQ-VTPAVGvERFNQAMMDLGAMVCTRSKPKCELCPLQNGCIAY------------------------ 211
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 276 rqcqlcltSSSPWdpsmgvANFPRKASRRPPREEYSATCVVEQpgaigGPLVLLVQRPDSGLLAGLWEFPSVTLEpseqh 355
Cdd:PRK10880 212 --------ANHSW------ALYPGKKPKQTLPERTGYFLLLQH-----GDEVWLEQRPPSGLWGGLFCFPQFADE----- 267
|
330 340 350
....*....|....*....|....*....|....*...
gi 2060810902 356 qhkallQELQRWCGP--LPAIRLQHLGEVIHIFSHIKL 391
Cdd:PRK10880 268 ------EELRQWLAQrgIAADNLTQLTAFRHTFSHFHL 299
|
|
| ENDO3c |
cd00056 |
endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), ... |
70-228 |
5.14e-48 |
|
endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases
Pssm-ID: 238013 [Multi-domain] Cd Length: 158 Bit Score: 162.03 E-value: 5.14e-48
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 70 YAVWVSEVMLQQTQVATVIDYYTRWMQKW-PKLQDLASASLEEVNQLWSGLGYYSRGRRLQEGARKVVEELGG---HMPR 145
Cdd:cd00056 1 FEVLVSEILSQQTTDKAVNKAYERLFERYgPTPEALAAADEEELRELIRSLGYRRKAKYLKELARAIVEGFGGlvlDDPD 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 146 TAETLQQLlPGVGRYTAGAIASIAFDQVTGVVDGNVLRVLCRVRAIGADPTstlvSHHLWNLAQQLVDPARPGDFNQAAM 225
Cdd:cd00056 81 AREELLAL-PGVGRKTANVVLLFALGPDAFPVDTHVRRVLKRLGLIPKKKT----PEELEELLEELLPKPYWGEANQALM 155
|
...
gi 2060810902 226 ELG 228
Cdd:cd00056 156 DLG 158
|
|
| ENDO3c |
smart00478 |
endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), ... |
78-230 |
3.41e-43 |
|
endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases
Pssm-ID: 214684 [Multi-domain] Cd Length: 149 Bit Score: 148.95 E-value: 3.41e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 78 MLQQTQVATVIDYYTRWMQKWPKLQDLASASLEEVNQLWSGLG-YYSRGRRLQEGARKVVEELGGHMPRTAETLQQLlPG 156
Cdd:smart00478 1 LSQQTTDERVNKATERLFEKFPTPEDLAAADEEELEELIRGLGfYRRKARYLIELARILVEEYGGEVPDDREELLKL-PG 79
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2060810902 157 VGRYTAGAIASIAFDQVTGVVDGNVLRVLCRVRAIGADPTstlvSHHLWNLAQQLVDPARPGDFNQAAMELGAT 230
Cdd:smart00478 80 VGRKTANAVLSFALGKPFIPVDTHVLRIAKRLGLVDKKST----PEEVEKLLEKLLPEEDWRELNLLLIDFGRT 149
|
|
| HhH-GPD |
pfam00730 |
HhH-GPD superfamily base excision DNA repair protein; This family contains a diverse range of ... |
74-210 |
1.96e-41 |
|
HhH-GPD superfamily base excision DNA repair protein; This family contains a diverse range of structurally related DNA repair proteins. The superfamily is called the HhH-GPD family after its hallmark Helix-hairpin-helix and Gly/Pro rich loop followed by a conserved aspartate. This includes endonuclease III, EC:4.2.99.18 and MutY an A/G-specific adenine glycosylase, both have a C terminal 4Fe-4S cluster. The family also includes 8-oxoguanine DNA glycosylases. The methyl-CPG binding protein MBD4 also contains a related domain that is a thymine DNA glycosylase. The family also includes DNA-3-methyladenine glycosylase II EC:3.2.2.21 and other members of the AlkA family.
Pssm-ID: 425841 [Multi-domain] Cd Length: 141 Bit Score: 144.35 E-value: 1.96e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 74 VSEVMLQQTQVATVIDYYTRWMQKW-PKLQDLASASLEEVNQLWSGLGYY-SRGRRLQEGARKVVEELGGHMPRTAETLQ 151
Cdd:pfam00730 1 VSAILSQQTSDKAVNKITERLFEKFfPTPEDLADADEEELRELIRGLGFYrRKAKYLKELARILVEGYGGEVPLDEEELE 80
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2060810902 152 QLLPGVGRYTAGAIASIAF--DQVTGVVDGNVLRVLCRVRAIGADPTSTLVSHHLWNLAQQ 210
Cdd:pfam00730 81 ALLKGVGRWTAEAVLIFALgrPDPLPVVDTHVRRVLKRLGLIKEKPTPKEVERELEELWPP 141
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| MutY |
COG1194 |
Adenine-specific DNA glycosylase, acts on AG and A-oxoG pairs [Replication, recombination and ... |
33-438 |
1.89e-156 |
|
Adenine-specific DNA glycosylase, acts on AG and A-oxoG pairs [Replication, recombination and repair];
Pssm-ID: 440807 [Multi-domain] Cd Length: 350 Bit Score: 446.89 E-value: 1.89e-156
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 33 ADVTAFRSNLLSWYDQEKRDLPWRNlakeeansDRRAYAVWVSEVMLQQTQVATVIDYYTRWMQKWPKLQDLASASLEEV 112
Cdd:COG1194 1 MDMASFAKRLLAWYDRHGRDLPWRQ--------TRDPYRVWLSEIMLQQTQVATVIPYYERFLERFPTVEALAAAPEDEV 72
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 113 NQLWSGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQLlPGVGRYTAGAIASIAFDQVTGVVDGNVLRVLCRVRAIG 192
Cdd:COG1194 73 LKLWEGLGYYSRARNLHKAAQQVVEEHGGVFPDTYEELLAL-PGIGPYTAAAIASIAFGEPAPIVDGNVKRVLSRLFAIE 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 193 ADPTSTLVSHHLWNLAQQLVDPARPGDFNQAAMELGATVCTPQRPLCSHCPVQSLCRAYQRvqrgqlsalpGRPdieeca 272
Cdd:COG1194 152 GPIGSPAAKKELWALAEELLPPERPGDFNQALMDLGATVCTPKKPKCLLCPLQDDCAAFAE----------GRQ------ 215
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 273 lntrqcqlcltssspwdpsmgvANFPRKASRRPPREEYSATCVVEQPGAiggplVLLVQRPDSGLLAGLWEFPSVtlEPS 352
Cdd:COG1194 216 ----------------------EELPVKKPKKKKPERYGAALVIRDDGR-----VLLEKRPPKGLWGGLWEFPEF--EWE 266
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 353 EQHQHKALLQELQRWCGpLPAIRLQHLGEVIHIFSHIKLTYQVYSLALDQAPAStAPPGARWLTWEEFCNAAVSTAMKKV 432
Cdd:COG1194 267 EAEDPEALERWLREELG-LEVEWLEPLGTVRHVFTHFRLHLTVYLARVPAGPPA-EPDGGRWVPLEELAALPLPAPMRKL 344
|
....*.
gi 2060810902 433 FRMYED 438
Cdd:COG1194 345 LKALLK 350
|
|
| mutY |
TIGR01084 |
A/G-specific adenine glycosylase; This equivalog model identifies mutY members of the ... |
37-345 |
2.48e-100 |
|
A/G-specific adenine glycosylase; This equivalog model identifies mutY members of the pfam00730 superfamily (HhH-GPD: Helix-hairpin-helix and Gly/Pro rich loop followed by a conserved aspartate). The major members of the superfamily are nth and mutY. [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 130156 Cd Length: 275 Bit Score: 301.25 E-value: 2.48e-100
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 37 AFRSNLLSWYDQEKR-DLPWRNlakeeansDRRAYAVWVSEVMLQQTQVATVIDYYTRWMQKWPKLQDLASASLEEVNQL 115
Cdd:TIGR01084 1 QFSEDLLSWYDKYGRkTLPWRQ--------NKTPYRVWLSEVMLQQTQVATVIPYFERFLERFPTVQALANAPQDEVLKL 72
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 116 WSGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQlLPGVGRYTAGAIASIAFDQVTGVVDGNVLRVLCRVRAIGADP 195
Cdd:TIGR01084 73 WEGLGYYARARNLHKAAQEVVEEFGGEFPQDFEDLAA-LPGVGRYTAGAILSFALNKPYPILDGNVKRVLSRLFAVEGWP 151
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 196 TSTLVSHHLWNLAQQLVDPARPGDFNQAAMELGATVCTPQRPLCSHCPVQSLCRAYqrvQRGQLSALPGrpdieecalnt 275
Cdd:TIGR01084 152 GKKKVENRLWTLAESLLPKADPEAFNQALMDLGAMICTRKKPKCDLCPLQDFCLAY---QQGTWEEYPV----------- 217
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 276 rqcqlcltssspwdpsmgvanfpRKASRRPPREEYSATCVVEQPGAiggplVLLVQRPDSGLLAGLWEFP 345
Cdd:TIGR01084 218 -----------------------KKPKAAPPERTTYFLVLQNYDGE-----VLLEQRPEKGLWGGLYCFP 259
|
|
| PRK10880 |
PRK10880 |
adenine DNA glycosylase; |
38-391 |
3.29e-69 |
|
adenine DNA glycosylase;
Pssm-ID: 182805 [Multi-domain] Cd Length: 350 Bit Score: 223.82 E-value: 3.29e-69
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 38 FRSNLLSWYDQEKRD-LPWRnlakeeanSDRRAYAVWVSEVMLQQTQVATVIDYYTRWMQKWPKLQDLASASLEEVNQLW 116
Cdd:PRK10880 6 FSAQVLDWYDKYGRKtLPWQ--------IDKTPYKVWLSEVMLQQTQVATVIPYFERFMARFPTVTDLANAPLDEVLHLW 77
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 117 SGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQlLPGVGRYTAGAIASIAFDQVTGVVDGNVLRVLCRVRAIGADPT 196
Cdd:PRK10880 78 TGLGYYARARNLHKAAQQVATLHGGEFPETFEEVAA-LPGVGRSTAGAILSLSLGKHFPILDGNVKRVLARCYAVSGWPG 156
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 197 STLVSHHLWNLAQQlVDPARP-GDFNQAAMELGATVCTPQRPLCSHCPVQSLCRAYqrvqrgqlsalpgrpdieecalnt 275
Cdd:PRK10880 157 KKEVENRLWQLSEQ-VTPAVGvERFNQAMMDLGAMVCTRSKPKCELCPLQNGCIAY------------------------ 211
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 276 rqcqlcltSSSPWdpsmgvANFPRKASRRPPREEYSATCVVEQpgaigGPLVLLVQRPDSGLLAGLWEFPSVTLEpseqh 355
Cdd:PRK10880 212 --------ANHSW------ALYPGKKPKQTLPERTGYFLLLQH-----GDEVWLEQRPPSGLWGGLFCFPQFADE----- 267
|
330 340 350
....*....|....*....|....*....|....*...
gi 2060810902 356 qhkallQELQRWCGP--LPAIRLQHLGEVIHIFSHIKL 391
Cdd:PRK10880 268 ------EELRQWLAQrgIAADNLTQLTAFRHTFSHFHL 299
|
|
| ENDO3c |
cd00056 |
endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), ... |
70-228 |
5.14e-48 |
|
endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases
Pssm-ID: 238013 [Multi-domain] Cd Length: 158 Bit Score: 162.03 E-value: 5.14e-48
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 70 YAVWVSEVMLQQTQVATVIDYYTRWMQKW-PKLQDLASASLEEVNQLWSGLGYYSRGRRLQEGARKVVEELGG---HMPR 145
Cdd:cd00056 1 FEVLVSEILSQQTTDKAVNKAYERLFERYgPTPEALAAADEEELRELIRSLGYRRKAKYLKELARAIVEGFGGlvlDDPD 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 146 TAETLQQLlPGVGRYTAGAIASIAFDQVTGVVDGNVLRVLCRVRAIGADPTstlvSHHLWNLAQQLVDPARPGDFNQAAM 225
Cdd:cd00056 81 AREELLAL-PGVGRKTANVVLLFALGPDAFPVDTHVRRVLKRLGLIPKKKT----PEELEELLEELLPKPYWGEANQALM 155
|
...
gi 2060810902 226 ELG 228
Cdd:cd00056 156 DLG 158
|
|
| ENDO3c |
smart00478 |
endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), ... |
78-230 |
3.41e-43 |
|
endonuclease III; includes endonuclease III (DNA-(apurinic or apyrimidinic site) lyase), alkylbase DNA glycosidases (Alka-family) and other DNA glycosidases
Pssm-ID: 214684 [Multi-domain] Cd Length: 149 Bit Score: 148.95 E-value: 3.41e-43
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 78 MLQQTQVATVIDYYTRWMQKWPKLQDLASASLEEVNQLWSGLG-YYSRGRRLQEGARKVVEELGGHMPRTAETLQQLlPG 156
Cdd:smart00478 1 LSQQTTDERVNKATERLFEKFPTPEDLAAADEEELEELIRGLGfYRRKARYLIELARILVEEYGGEVPDDREELLKL-PG 79
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2060810902 157 VGRYTAGAIASIAFDQVTGVVDGNVLRVLCRVRAIGADPTstlvSHHLWNLAQQLVDPARPGDFNQAAMELGAT 230
Cdd:smart00478 80 VGRKTANAVLSFALGKPFIPVDTHVLRIAKRLGLVDKKST----PEEVEKLLEKLLPEEDWRELNLLLIDFGRT 149
|
|
| HhH-GPD |
pfam00730 |
HhH-GPD superfamily base excision DNA repair protein; This family contains a diverse range of ... |
74-210 |
1.96e-41 |
|
HhH-GPD superfamily base excision DNA repair protein; This family contains a diverse range of structurally related DNA repair proteins. The superfamily is called the HhH-GPD family after its hallmark Helix-hairpin-helix and Gly/Pro rich loop followed by a conserved aspartate. This includes endonuclease III, EC:4.2.99.18 and MutY an A/G-specific adenine glycosylase, both have a C terminal 4Fe-4S cluster. The family also includes 8-oxoguanine DNA glycosylases. The methyl-CPG binding protein MBD4 also contains a related domain that is a thymine DNA glycosylase. The family also includes DNA-3-methyladenine glycosylase II EC:3.2.2.21 and other members of the AlkA family.
Pssm-ID: 425841 [Multi-domain] Cd Length: 141 Bit Score: 144.35 E-value: 1.96e-41
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 74 VSEVMLQQTQVATVIDYYTRWMQKW-PKLQDLASASLEEVNQLWSGLGYY-SRGRRLQEGARKVVEELGGHMPRTAETLQ 151
Cdd:pfam00730 1 VSAILSQQTSDKAVNKITERLFEKFfPTPEDLADADEEELRELIRGLGFYrRKAKYLKELARILVEGYGGEVPLDEEELE 80
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|.
gi 2060810902 152 QLLPGVGRYTAGAIASIAF--DQVTGVVDGNVLRVLCRVRAIGADPTSTLVSHHLWNLAQQ 210
Cdd:pfam00730 81 ALLKGVGRWTAEAVLIFALgrPDPLPVVDTHVRRVLKRLGLIKEKPTPKEVERELEELWPP 141
|
|
| PRK13910 |
PRK13910 |
DNA glycosylase MutY; Provisional |
78-259 |
1.47e-32 |
|
DNA glycosylase MutY; Provisional
Pssm-ID: 172427 [Multi-domain] Cd Length: 289 Bit Score: 125.13 E-value: 1.47e-32
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 78 MLQQTQVATVID-YYTRWMQKWPKLQDLASASLEEVNQLWSGLGYYSRGRRLQEGARKVVEELGGHMPRTAETLQQlLPG 156
Cdd:PRK13910 1 MSQQTQINTVVErFYSPFLEAFPTLKDLANAPLEEVLLLWRGLGYYSRAKNLKKSAEICVKEHHSQLPNDYQSLLK-LPG 79
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 157 VGRYTAGAIASIAFDQVTGVVDGNVLRVLCRVraIGADPTSTlvSHHLWNLAQQLVDPARPGDFNQAAMELGATVCTPqR 236
Cdd:PRK13910 80 IGAYTANAILCFGFREKSACVDANIKRVLLRL--FGLDPNIH--AKDLQIKANDFLNLNESFNHNQALIDLGALICSP-K 154
|
170 180
....*....|....*....|...
gi 2060810902 237 PLCSHCPVQSLCRAYQRVQRGQL 259
Cdd:PRK13910 155 PKCAICPLNPYCLGKNNPEKHTL 177
|
|
| NUDIX_DNA_Glycosylase_C-MutY |
cd03431 |
C-terminal domain of DNA glycosylase; DNA glycosylase (MutY in bacteria and hMYH in humans) is ... |
306-433 |
5.45e-30 |
|
C-terminal domain of DNA glycosylase; DNA glycosylase (MutY in bacteria and hMYH in humans) is responsible for repairing misread A*oxoG residues to C*G by removing the inappropriately paired adenine base from the DNA backbone. It belongs to the NUDIX hydrolase superfamily and is important for the repair of various genotoxic lesions. Enzymes belonging to this superfamily requires a divalent cation, such as Mg2+ or Mn2+ for their activity. They are also recognized by a highly conserved 23-residue NUDIX motif (GX5EX7REUXEEXGU, where U = I, L or V). However, DNA glycosylase does not seem to contain this signature motif. DNA glycosylase consists of 2 domains: the N-terminal domain contains the catalytic properties of the enzyme and the C-terminal domain affects substrate (oxoG) binding and enzymatic turnover. The C-terminal domain is highly similar to MutT, based on secondary structure and topology, despite low sequence identity. MutT sanitizes the nucleotide precursor pool by hydrolyzing oxo-dGTP to oxo-dGMO and inorganic pyrophosphate. The similarity strongly suggests that the two proteins share a common evolutionary origin.
Pssm-ID: 467537 [Multi-domain] Cd Length: 118 Bit Score: 112.78 E-value: 5.45e-30
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 306 PREEYSATCVVEQPGAiggplVLLVQRPDSGLLAGLWEFPSVTLEPSEQhqhkaLLQELQRWCGPLPAIRLQHLGEVIHI 385
Cdd:cd03431 1 VPERYFTVLVLRDGGR-----VLLEKRPEKGLLAGLWEFPLVETEEEEE-----EAEALLGLLAEELLLILEPLGEVKHV 70
|
90 100 110 120
....*....|....*....|....*....|....*....|....*...
gi 2060810902 386 FSHIKLTYQVYSLALDQAPAsTAPPGARWLTWEEFCNAAVSTAMKKVF 433
Cdd:cd03431 71 FSHFRLHITVYLVELPEAPP-AAPDEGRWVDLEELDEYALPAPMRKLL 117
|
|
| Nth |
COG0177 |
Endonuclease III [Replication, recombination and repair]; |
96-252 |
9.74e-28 |
|
Endonuclease III [Replication, recombination and repair];
Pssm-ID: 439947 [Multi-domain] Cd Length: 198 Bit Score: 109.03 E-value: 9.74e-28
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 96 QKWPKLQDLASASLEEVNQLWSGLGYY-SRGRRLQEGARKVVEELGGHMPRTAETLQQLlPGVGRYTAGAIASIAFDQVT 174
Cdd:COG0177 47 ARYPTPEALAAADLEELEELIRPIGLYrNKAKNIIALARILVEKYGGEVPETREELESL-PGVGRKTANVVLNFAFGKPA 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 175 GVVDGNVLRVLCRvraIG-ADPTSTL-VSHHLwnlaQQLVDPARPGDFNQAAMELGATVCTPQRPLCSHCPVQSLCRAYQ 252
Cdd:COG0177 126 IAVDTHVHRVSNR---LGlVPGKDPEeVEKDL----MKLIPKEYWGDLHHLLILHGRYICKARKPKCEECPLADLCPYYG 198
|
|
| NUDIX_4 |
pfam14815 |
NUDIX domain; |
327-435 |
3.86e-22 |
|
NUDIX domain;
Pssm-ID: 464330 [Multi-domain] Cd Length: 114 Bit Score: 91.22 E-value: 3.86e-22
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 327 VLLVQRPDSGLLAGLWEFPSVTLEPSEqhqhkALLQELQRWCGPLPAIRLQHLGEVIHIFSHIKLTYQVYsLALDQAPAS 406
Cdd:pfam14815 12 VLLRKRPEKGLLGGLWEFPGGKVEPGE-----TLEEALARLEELGIEVEVLEPGTVKHVFTHFRLTLHVY-LVREVEGEE 85
|
90 100
....*....|....*....|....*....
gi 2060810902 407 TAPPGARWLTWEEFCNAAVSTAMKKVFRM 435
Cdd:pfam14815 86 EPQQELRWVTPEELDKYALPAAVRKILEA 114
|
|
| nth |
TIGR01083 |
endonuclease III; This equivalog model identifes nth members of the pfam00730 superfamily ... |
96-239 |
2.95e-20 |
|
endonuclease III; This equivalog model identifes nth members of the pfam00730 superfamily (HhH-GPD: Helix-hairpin-helix and Gly/Pro rich loop followed by a conserved aspartate). The major members of the superfamily are nth and mutY. [DNA metabolism, DNA replication, recombination, and repair]
Pssm-ID: 273434 [Multi-domain] Cd Length: 192 Bit Score: 88.21 E-value: 2.95e-20
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 96 QKWPKLQDLASASLEEVNQLWSGLGYY-SRGRRLQEGARKVVEELGGHMPRTAETLQQLlPGVGRYTAGAIASIAFDQVT 174
Cdd:TIGR01083 54 EVYPTPQALAQAGLEELEEYIKSIGLYrNKAKNIIELCRKLVERYGGEVPEDREELVKL-PGVGRKTANVVLNVAFGIPA 132
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 2060810902 175 GVVDGNVLRVLCRVR-AIGADPTST------LVSHHLWnlaqqlvdparpGDFNQAAMELGATVCTPQRPLC 239
Cdd:TIGR01083 133 IAVDTHVFRVSNRLGlSKGKDPIKVeedlmkLVPREFW------------VKLHHWLILHGRYTCKARKPLC 192
|
|
| HP0602 |
COG2231 |
3-Methyladenine DNA glycosylase, HhH-GPD/Endo3 superfamily [Replication, recombination and ... |
101-254 |
4.46e-07 |
|
3-Methyladenine DNA glycosylase, HhH-GPD/Endo3 superfamily [Replication, recombination and repair];
Pssm-ID: 441832 [Multi-domain] Cd Length: 220 Bit Score: 50.61 E-value: 4.46e-07
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 101 LQDLASASLEEVNQLWSGLGYYSR-GRRLQEGARKVVEELGGHMPR-----TAETLQQLL--PGVGRYTAGAIASIAFDQ 172
Cdd:COG2231 62 PEALAALDPEELAELIRPSGFYNQkAKRLKNLARWLVERYGGGLEKlkalpTEELREELLslKGIGPETADSILLYAFNR 141
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 173 VTGVVDGNVLRVLCRVrAIGADPTStlvshhlWNLAQQLVDPARPGDFNQA----AM--ELGATVCTPqRPLCSHCPVQS 246
Cdd:COG2231 142 PVFVVDAYTRRIFSRL-GLIEEDAS-------YDELQRLFEENLPPDVALYnefhALivEHGKEYCKK-KPKCEECPLRD 212
|
....*...
gi 2060810902 247 LCRAYQRV 254
Cdd:COG2231 213 LCPYGGQE 220
|
|
| PRK10702 |
PRK10702 |
endonuclease III; Provisional |
121-254 |
5.85e-06 |
|
endonuclease III; Provisional
Pssm-ID: 182661 [Multi-domain] Cd Length: 211 Bit Score: 47.32 E-value: 5.85e-06
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 121 YYSRGRRLQEGARKVVEELGGHMPRTAETLQQLlPGVGRYTAGAIASIAFDQVTGVVDGNVLRVLCRVR-AIGAD----- 194
Cdd:PRK10702 82 YNSKAENVIKTCRILLEQHNGEVPEDRAALEAL-PGVGRKTANVVLNTAFGWPTIAVDTHIFRVCNRTQfAPGKNveqve 160
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2060810902 195 -------PTSTLVSHHLWNLAQqlvdparpgdfnqaamelGATVCTPQRPLCSHCPVQSLCRAYQRV 254
Cdd:PRK10702 161 ekllkvvPAEFKVDCHHWLILH------------------GRYTCIARKPRCGSCIIEDLCEYKEKV 209
|
|
| PRK10546 |
PRK10546 |
pyrimidine (deoxy)nucleoside triphosphate diphosphatase; |
315-365 |
4.08e-05 |
|
pyrimidine (deoxy)nucleoside triphosphate diphosphatase;
Pssm-ID: 182536 [Multi-domain] Cd Length: 135 Bit Score: 43.19 E-value: 4.08e-05
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 2060810902 315 VVEQPGAIggplvLLVQRPDSGLLAGLWEFPSVTLEPSEQhQHKALLQELQ 365
Cdd:PRK10546 10 IIERDGKI-----LLAQRPAHSDQAGLWEFAGGKVEPGES-QPQALIRELR 54
|
|
| HHH |
pfam00633 |
Helix-hairpin-helix motif; The helix-hairpin-helix DNA-binding motif is found to be duplicated ... |
138-168 |
8.75e-05 |
|
Helix-hairpin-helix motif; The helix-hairpin-helix DNA-binding motif is found to be duplicated in the central domain of RuvA. The HhH domain of DisA, a bacterial checkpoint control protein, is a DNA-binding domain.
Pssm-ID: 425789 [Multi-domain] Cd Length: 30 Bit Score: 39.32 E-value: 8.75e-05
10 20 30
....*....|....*....|....*....|.
gi 2060810902 138 ELGGHMPRTAETLQQLlPGVGRYTAGAIASI 168
Cdd:pfam00633 1 SLEGLIPASVEELLAL-PGVGPKTAEAILSY 30
|
|
| FES |
smart00525 |
iron-sulpphur binding domain in DNA-(apurinic or apyrimidinic site) lyase (subfamily of ENDO3); ... |
231-251 |
1.18e-04 |
|
iron-sulpphur binding domain in DNA-(apurinic or apyrimidinic site) lyase (subfamily of ENDO3);
Pssm-ID: 197771 [Multi-domain] Cd Length: 21 Bit Score: 39.07 E-value: 1.18e-04
|
| NUDIX_MutT_NudA_like |
cd03425 |
MutT pyrophosphohydrolase; The MutT pyrophosphohydrolase is a prototypical NUDIX hydrolase ... |
327-419 |
5.79e-04 |
|
MutT pyrophosphohydrolase; The MutT pyrophosphohydrolase is a prototypical NUDIX hydrolase that catalyzes the hydrolysis of nucleoside and deoxynucleoside triphosphates (NTPs and dNTPs) by substitution at a beta-phosphorus to yield a nucleotide monophosphate (NMP) and inorganic pyrophosphate (PPi). This enzyme requires two divalent cations for activity; one coordinates the phosphoryl groups of the NTP/dNTP substrate, and the other coordinates to the enzyme. It also contains the NUDIX motif, a highly conserved 23-residue block (GX5EX7REUXEEXGU, where U = I, L or V), that functions as metal binding and catalytic site. MutT pyrophosphohydrolase is important in preventing errors in DNA replication by hydrolyzing mutagenic nucleotides such as 8-oxo-dGTP (a product of oxidative damage), which can mispair with template adenine during DNA replication, to guanine nucleotides.
Pssm-ID: 467531 [Multi-domain] Cd Length: 123 Bit Score: 39.74 E-value: 5.79e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 327 VLLVQRPDSGLLAGLWEFPSVTLEPSEQHQHkALLQELQRWCGpLPAIRLQHLGEVIHIFSHIKLTYQVYS-LALDQAPA 405
Cdd:cd03425 14 VLIAQRPEGKHLAGLWEFPGGKVEPGETPEQ-ALVRELREELG-IEVEVGEPLGTVEHDYPDFHVRLHVYLcTLWSGEPQ 91
|
90
....*....|....
gi 2060810902 406 STAPPGARWLTWEE 419
Cdd:cd03425 92 LLEHQELRWVTPEE 105
|
|
| EndIII_4Fe-2S |
pfam10576 |
Iron-sulfur binding domain of endonuclease III; Escherichia coli endonuclease III (EC 4.2.99. ... |
232-248 |
1.99e-03 |
|
Iron-sulfur binding domain of endonuclease III; Escherichia coli endonuclease III (EC 4.2.99.18) is a DNA repair enzyme that acts both as a DNA N-glycosylase, removing oxidized pyrimidines from DNA, and as an apurinic/apyrimidinic (AP) endonuclease, introducing a single-strand nick at the site from which the damaged base was removed. Endonuclease III is an iron-sulfur protein that binds a single 4Fe-4S cluster. The 4Fe-4S cluster does not seem to be important for catalytic activity, but is probably involved in the proper positioning of the enzyme along the DNA strand. The 4Fe-4S cluster is bound by four cysteines which are all located in a 17 amino acid region at the C-terminal end of endonuclease III. A similar region is also present in the central section of mutY and in the C-terminus of ORF-10 and of the Micro-coccus UV endonuclease.
Pssm-ID: 463153 [Multi-domain] Cd Length: 17 Bit Score: 35.44 E-value: 1.99e-03
|
| PRK10776 |
PRK10776 |
8-oxo-dGTP diphosphatase MutT; |
331-391 |
3.66e-03 |
|
8-oxo-dGTP diphosphatase MutT;
Pssm-ID: 182721 [Multi-domain] Cd Length: 129 Bit Score: 37.66 E-value: 3.66e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2060810902 331 QRPDSGLLAGLWEFPSVTLEPSEQHQhKALLQELQRWCGpLPAIRLQHLGEVIHIFS--HIKL 391
Cdd:PRK10776 22 RRAADAHMAGKWEFPGGKIEAGETPE-QALIRELQEEVG-ITVQHATLFEKLEYEFPdrHITL 82
|
|
| MutT |
COG0494 |
8-oxo-dGTP pyrophosphatase MutT and related house-cleaning NTP pyrophosphohydrolases, NUDIX ... |
327-432 |
7.00e-03 |
|
8-oxo-dGTP pyrophosphatase MutT and related house-cleaning NTP pyrophosphohydrolases, NUDIX family [Defense mechanisms];
Pssm-ID: 440260 [Multi-domain] Cd Length: 143 Bit Score: 36.93 E-value: 7.00e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2060810902 327 VLLVQRPDSGLLAGLWEFPSVTLEPSEQHQHkALLQELQ----------RWCGPLPAIRlqHLGEVIHIFsHIKLTYQVY 396
Cdd:COG0494 27 VLLVRRYRYGVGPGLWEFPGGKIEPGESPEE-AALRELReetgltaedlELLGELPSPG--YTDEKVHVF-LARGLGPGE 102
|
90 100 110
....*....|....*....|....*....|....*.
gi 2060810902 397 SLALDQAPASTAppgARWLTWEEFCNAAVSTAMKKV 432
Cdd:COG0494 103 EVGLDDEDEFIE---VRWVPLDEALALVTAGEIAKT 135
|
|
|