NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1907114104|ref|XP_036015468|]
View 

deoxyribonuclease TATDN1 isoform X6 [Mus musculus]

Protein Classification

amidohydrolase family protein( domain architecture ID 330)

metal-dependent amidohydrolase family protein having a conserved metal binding site, usually involving four histidines and one aspartic acid residue

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
metallo-dependent_hydrolases super family cl00281
Superfamily of metallo-dependent hydrolases (also called amidohydrolase superfamily) is a ...
3-158 3.19e-29

Superfamily of metallo-dependent hydrolases (also called amidohydrolase superfamily) is a large group of proteins that show conservation in their 3-dimensional fold (TIM barrel) and in details of their active site. The vast majority of the members have a conserved metal binding site, involving four histidines and one aspartic acid residue. In the common reaction mechanism, the metal ion (or ions) deprotonate a water molecule for a nucleophilic attack on the substrate. The family includes urease alpha, adenosine deaminase, phosphotriesterase dihydroorotases, allantoinases, hydantoinases, AMP-, adenine and cytosine deaminases, imidazolonepropionase, aryldialkylphosphatase, chlorohydrolases, formylmethanofuran dehydrogenases and others.


The actual alignment was detected with superfamily member cd01310:

Pssm-ID: 469705 [Multi-domain]  Cd Length: 251  Bit Score: 107.27  E-value: 3.19e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104   3 FSTVGCHPTRCDEFEkgspDQYLAGLLSLAENNKgkVVAIGECGLDFDRLQFcPKDTQL--------------------- 61
Cdd:cd01310    56 YAAVGLHPHDADEHV----DEDLDLLELLAANPK--VVAIGEIGLDYYRDKS-PREVQKevfraqlelakelnlpvvihs 128
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104  62 ----------------------------------------------NSLKTEAN--LEVLKSIPSEKLMIETDAPWCGVK 93
Cdd:cd01310   129 rdahedvleilkeygppkrgvfhcfsgsaeeakelldlgfyisisgIVTFKNANelREVVKEIPLERLLLETDSPYLAPV 208
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907114104  94 STHagskyintsfptkkkwenghclKDRNEPCHIIQILEIMSAVREEDPLELANTLYNNTIKVFF 158
Cdd:cd01310   209 PFR----------------------GKRNEPAYVKHVAEKIAELKGISVEEVAEVTTENAKRLFG 251
 
Name Accession Description Interval E-value
TatD_DNAse cd01310
TatD like proteins; E.coli TatD is a cytoplasmic protein, shown to have magnesium dependent ...
3-158 3.19e-29

TatD like proteins; E.coli TatD is a cytoplasmic protein, shown to have magnesium dependent DNase activity.


Pssm-ID: 238635 [Multi-domain]  Cd Length: 251  Bit Score: 107.27  E-value: 3.19e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104   3 FSTVGCHPTRCDEFEkgspDQYLAGLLSLAENNKgkVVAIGECGLDFDRLQFcPKDTQL--------------------- 61
Cdd:cd01310    56 YAAVGLHPHDADEHV----DEDLDLLELLAANPK--VVAIGEIGLDYYRDKS-PREVQKevfraqlelakelnlpvvihs 128
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104  62 ----------------------------------------------NSLKTEAN--LEVLKSIPSEKLMIETDAPWCGVK 93
Cdd:cd01310   129 rdahedvleilkeygppkrgvfhcfsgsaeeakelldlgfyisisgIVTFKNANelREVVKEIPLERLLLETDSPYLAPV 208
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907114104  94 STHagskyintsfptkkkwenghclKDRNEPCHIIQILEIMSAVREEDPLELANTLYNNTIKVFF 158
Cdd:cd01310   209 PFR----------------------GKRNEPAYVKHVAEKIAELKGISVEEVAEVTTENAKRLFG 251
TatD_DNase pfam01026
TatD related DNase; This family of proteins are related to a large superfamily of ...
3-157 1.77e-22

TatD related DNase; This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme.


Pssm-ID: 425997 [Multi-domain]  Cd Length: 253  Bit Score: 89.63  E-value: 1.77e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104   3 FSTVGCHPTRCDEFEkgspDQYLAGLLSLAENnkGKVVAIGECGLDFDRLQFCPKDTQ---------------------- 60
Cdd:pfam01026  56 YAAVGVHPHEADEAS----EDDLEALEKLAEH--PKVVAIGEIGLDYYYVDESPKEAQeevfrrqlelakelglpvviht 129
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104  61 -------LNSLKTEANL-----------------------------------------EVLKSIPSEKLMIETDAPWCgv 92
Cdd:pfam01026 130 rdaeedlLEILKEAGAPgargvlhcftgsveearkfldlgfyisisgivtfknakklrEVAAAIPLDRLLVETDAPYL-- 207
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907114104  93 ksthagskyinTSFPTKKKwenghclkdRNEPCHIIQILEIMSAVREEDPLELANTLYNNTIKVF 157
Cdd:pfam01026 208 -----------APVPYRGK---------RNEPAYVPYVVEKLAELKGISPEEVAEITTENAERLF 252
TatD COG0084
3'->5' ssDNA/RNA exonuclease TatD [Cell motility];
3-157 1.62e-20

3'->5' ssDNA/RNA exonuclease TatD [Cell motility];


Pssm-ID: 439854 [Multi-domain]  Cd Length: 253  Bit Score: 84.33  E-value: 1.62e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104   3 FSTVGCHPTRCDEFekgsPDQYLAGLLSLAENNKgkVVAIGECGLDFDRlQFCPKDTQLNSLKT------EANL------ 70
Cdd:COG0084    56 YAAVGLHPHDAKEH----DEEDLAELEELAAHPK--VVAIGEIGLDYYR-DKSPREVQEEAFRAqlalakELGLpviihs 128
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104  71 ----------------------------------------------------------EVLKSIPSEKLMIETDAPwcgv 92
Cdd:COG0084   129 rdahddtleilkeegapalggvfhcfsgsleqakraldlgfyisfggivtfknakklrEVAAAIPLDRLLLETDAP---- 204
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907114104  93 ksthagskYInTSFPTKKKwenghclkdRNEPCHIIQILEIMSAVREEDPLELANTLYNNTIKVF 157
Cdd:COG0084   205 --------YL-APVPFRGK---------RNEPAYVPHVAEKLAELRGISLEELAEATTANARRLF 251
TIGR00010 TIGR00010
hydrolase, TatD family; PSI-BLAST, starting with a urease alpha subunit, finds a large ...
1-157 6.66e-10

hydrolase, TatD family; PSI-BLAST, starting with a urease alpha subunit, finds a large superfamily of proteins, including a number of different enzymes that act as hydrolases at C-N bonds other than peptide bonds (EC 3.5.-.-), many uncharacterized proteins, and the members of this family. Several genomes have multiple paralogs related to this family. However, a set of 17 proteins can be found, one each from 17 of the first 20 genomes, such that each member forms a bidirectional best hit across genomes with all other members of the set. This core set (and one other near-perfect member), but not the other paralogs, form the seed for this model. Additionally, members of the seed alignment and all trusted hits, but not all paralogs, have a conserved motif DxHxH near the amino end. The member from E. coli was recently shown to have DNase activity. [Unknown function, Enzymes of unknown specificity]


Pssm-ID: 272852 [Multi-domain]  Cd Length: 252  Bit Score: 55.73  E-value: 6.66e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104   1 MFFSTVGCHPTRCDEFEKGSPDQylagLLSLAenNKGKVVAIGECGLDF------DRLQ---FCpkdTQLNsLKTEANL- 70
Cdd:TIGR00010  54 NVYAAVGVHPLDVDDDTKEDIKE----LERLA--AHPKVVAIGETGLDYykadeyKRRQeevFR---AQLQ-LAEELNLp 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104  71 --------------------------------------------------------------EVLKSIPSEKLMIETDAP 88
Cdd:TIGR00010 124 viihardaeedvldilreekpkvggvlhcftgdaelakklldlgfyisisgivtfknakslrEVVRKIPLERLLVETDSP 203
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907114104  89 WCgvksthagskyinTSFPTKKKwenghclkdRNEPCHIIQILEIMSAVREEDPLELANTLYNNTIKVF 157
Cdd:TIGR00010 204 YL-------------APVPYRGK---------RNEPAFVRYTVEAIAEIKGIDVEELAQITTKNAKRLF 250
PRK10425 PRK10425
3'-5' ssDNA/RNA exonuclease TatD;
4-157 3.60e-09

3'-5' ssDNA/RNA exonuclease TatD;


Pssm-ID: 182449  Cd Length: 258  Bit Score: 53.90  E-value: 3.60e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104   4 STVGCHPTRCDEFEKGSPDQylagLLSLAEnnKGKVVAIGECGLDFDRlQFCPKDTQ----------------------- 60
Cdd:PRK10425   57 STAGVHPHDSSQWQAATEEA----IIELAA--QPEVVAIGECGLDFNR-NFSTPEEQerafvaqlaiaaelnmpvfmhcr 129
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104  61 -------------LNSLK---------TEANL--------------------------EVLKSIPSEKLMIETDAPWCGV 92
Cdd:PRK10425  130 daherfmallepwLDKLPgavlhcftgTREEMqaclarglyigitgwvcderrglelrELLPLIPAERLLLETDAPYLLP 209
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907114104  93 KSTHAGskyintsfPTKKkwenghclkdRNEPCHIIQILEIMSAVREEDPLELANTLYNNTIKVF 157
Cdd:PRK10425  210 RDLTPK--------PASR----------RNEPAFLPHILQRIAHWRGEDAAWLAATTDANARTLF 256
 
Name Accession Description Interval E-value
TatD_DNAse cd01310
TatD like proteins; E.coli TatD is a cytoplasmic protein, shown to have magnesium dependent ...
3-158 3.19e-29

TatD like proteins; E.coli TatD is a cytoplasmic protein, shown to have magnesium dependent DNase activity.


Pssm-ID: 238635 [Multi-domain]  Cd Length: 251  Bit Score: 107.27  E-value: 3.19e-29
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104   3 FSTVGCHPTRCDEFEkgspDQYLAGLLSLAENNKgkVVAIGECGLDFDRLQFcPKDTQL--------------------- 61
Cdd:cd01310    56 YAAVGLHPHDADEHV----DEDLDLLELLAANPK--VVAIGEIGLDYYRDKS-PREVQKevfraqlelakelnlpvvihs 128
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104  62 ----------------------------------------------NSLKTEAN--LEVLKSIPSEKLMIETDAPWCGVK 93
Cdd:cd01310   129 rdahedvleilkeygppkrgvfhcfsgsaeeakelldlgfyisisgIVTFKNANelREVVKEIPLERLLLETDSPYLAPV 208
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907114104  94 STHagskyintsfptkkkwenghclKDRNEPCHIIQILEIMSAVREEDPLELANTLYNNTIKVFF 158
Cdd:cd01310   209 PFR----------------------GKRNEPAYVKHVAEKIAELKGISVEEVAEVTTENAKRLFG 251
TatD_DNase pfam01026
TatD related DNase; This family of proteins are related to a large superfamily of ...
3-157 1.77e-22

TatD related DNase; This family of proteins are related to a large superfamily of metalloenzymes. TatD, a member of this family has been shown experimentally to be a DNase enzyme.


Pssm-ID: 425997 [Multi-domain]  Cd Length: 253  Bit Score: 89.63  E-value: 1.77e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104   3 FSTVGCHPTRCDEFEkgspDQYLAGLLSLAENnkGKVVAIGECGLDFDRLQFCPKDTQ---------------------- 60
Cdd:pfam01026  56 YAAVGVHPHEADEAS----EDDLEALEKLAEH--PKVVAIGEIGLDYYYVDESPKEAQeevfrrqlelakelglpvviht 129
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104  61 -------LNSLKTEANL-----------------------------------------EVLKSIPSEKLMIETDAPWCgv 92
Cdd:pfam01026 130 rdaeedlLEILKEAGAPgargvlhcftgsveearkfldlgfyisisgivtfknakklrEVAAAIPLDRLLVETDAPYL-- 207
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907114104  93 ksthagskyinTSFPTKKKwenghclkdRNEPCHIIQILEIMSAVREEDPLELANTLYNNTIKVF 157
Cdd:pfam01026 208 -----------APVPYRGK---------RNEPAYVPYVVEKLAELKGISPEEVAEITTENAERLF 252
TatD COG0084
3'->5' ssDNA/RNA exonuclease TatD [Cell motility];
3-157 1.62e-20

3'->5' ssDNA/RNA exonuclease TatD [Cell motility];


Pssm-ID: 439854 [Multi-domain]  Cd Length: 253  Bit Score: 84.33  E-value: 1.62e-20
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104   3 FSTVGCHPTRCDEFekgsPDQYLAGLLSLAENNKgkVVAIGECGLDFDRlQFCPKDTQLNSLKT------EANL------ 70
Cdd:COG0084    56 YAAVGLHPHDAKEH----DEEDLAELEELAAHPK--VVAIGEIGLDYYR-DKSPREVQEEAFRAqlalakELGLpviihs 128
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104  71 ----------------------------------------------------------EVLKSIPSEKLMIETDAPwcgv 92
Cdd:COG0084   129 rdahddtleilkeegapalggvfhcfsgsleqakraldlgfyisfggivtfknakklrEVAAAIPLDRLLLETDAP---- 204
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907114104  93 ksthagskYInTSFPTKKKwenghclkdRNEPCHIIQILEIMSAVREEDPLELANTLYNNTIKVF 157
Cdd:COG0084   205 --------YL-APVPFRGK---------RNEPAYVPHVAEKLAELRGISLEELAEATTANARRLF 251
TIGR00010 TIGR00010
hydrolase, TatD family; PSI-BLAST, starting with a urease alpha subunit, finds a large ...
1-157 6.66e-10

hydrolase, TatD family; PSI-BLAST, starting with a urease alpha subunit, finds a large superfamily of proteins, including a number of different enzymes that act as hydrolases at C-N bonds other than peptide bonds (EC 3.5.-.-), many uncharacterized proteins, and the members of this family. Several genomes have multiple paralogs related to this family. However, a set of 17 proteins can be found, one each from 17 of the first 20 genomes, such that each member forms a bidirectional best hit across genomes with all other members of the set. This core set (and one other near-perfect member), but not the other paralogs, form the seed for this model. Additionally, members of the seed alignment and all trusted hits, but not all paralogs, have a conserved motif DxHxH near the amino end. The member from E. coli was recently shown to have DNase activity. [Unknown function, Enzymes of unknown specificity]


Pssm-ID: 272852 [Multi-domain]  Cd Length: 252  Bit Score: 55.73  E-value: 6.66e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104   1 MFFSTVGCHPTRCDEFEKGSPDQylagLLSLAenNKGKVVAIGECGLDF------DRLQ---FCpkdTQLNsLKTEANL- 70
Cdd:TIGR00010  54 NVYAAVGVHPLDVDDDTKEDIKE----LERLA--AHPKVVAIGETGLDYykadeyKRRQeevFR---AQLQ-LAEELNLp 123
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104  71 --------------------------------------------------------------EVLKSIPSEKLMIETDAP 88
Cdd:TIGR00010 124 viihardaeedvldilreekpkvggvlhcftgdaelakklldlgfyisisgivtfknakslrEVVRKIPLERLLVETDSP 203
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1907114104  89 WCgvksthagskyinTSFPTKKKwenghclkdRNEPCHIIQILEIMSAVREEDPLELANTLYNNTIKVF 157
Cdd:TIGR00010 204 YL-------------APVPYRGK---------RNEPAFVRYTVEAIAEIKGIDVEELAQITTKNAKRLF 250
PRK10425 PRK10425
3'-5' ssDNA/RNA exonuclease TatD;
4-157 3.60e-09

3'-5' ssDNA/RNA exonuclease TatD;


Pssm-ID: 182449  Cd Length: 258  Bit Score: 53.90  E-value: 3.60e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104   4 STVGCHPTRCDEFEKGSPDQylagLLSLAEnnKGKVVAIGECGLDFDRlQFCPKDTQ----------------------- 60
Cdd:PRK10425   57 STAGVHPHDSSQWQAATEEA----IIELAA--QPEVVAIGECGLDFNR-NFSTPEEQerafvaqlaiaaelnmpvfmhcr 129
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1907114104  61 -------------LNSLK---------TEANL--------------------------EVLKSIPSEKLMIETDAPWCGV 92
Cdd:PRK10425  130 daherfmallepwLDKLPgavlhcftgTREEMqaclarglyigitgwvcderrglelrELLPLIPAERLLLETDAPYLLP 209
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1907114104  93 KSTHAGskyintsfPTKKkwenghclkdRNEPCHIIQILEIMSAVREEDPLELANTLYNNTIKVF 157
Cdd:PRK10425  210 RDLTPK--------PASR----------RNEPAFLPHILQRIAHWRGEDAAWLAATTDANARTLF 256
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH