NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1611156425|ref|WP_135559896|]
View 

MULTISPECIES: IS3 family transposase [Escherichia]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PRK09409 super family cl32366
IS2 transposase TnpB; Reviewed
110-399 1.29e-130

IS2 transposase TnpB; Reviewed


The actual alignment was detected with superfamily member PRK09409:

Pssm-ID: 181829 [Multi-domain]  Cd Length: 301  Bit Score: 377.52  E-value: 1.29e-130
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 110 MDSARALVAKGRGIALVSRTMGVSRAQLSLRVNRSADWQDRRCNRRNDEADaeILTDILDIISDMPSYGYRRVWGILRKQ 189
Cdd:PRK09409    1 MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTD--VLLRIHHVIGELPTYGYRRVWALLRRQ 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 190 RRTEGEPPVNAKRLYRIMSEhNLLLLHDKPERP--KREHKGKIAVAESDMRWCSDGFEFGCDNGEKLRVTFALDCCDREA 267
Cdd:PRK09409   79 AELDGMPAINAKRVYRIMRQ-NALLLERKPAVPpsKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 268 IDWAASTGGYDSSTVQDVMLRSVEKRFGDRLPDTPVQWLTDNGSAYTAHETRRFARELNLEPCTTAVSSPQSNGMAERFV 347
Cdd:PRK09409  158 LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFV 237
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1611156425 348 KTMKEDYIAFMPKPDVRTALQNLAAAFTHYNENHPHSALGYYSPREYRRQRA 399
Cdd:PRK09409  238 KTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRA 289
HTH_CRP super family cl46859
helix_turn_helix, cAMP Regulatory protein C-terminus; DNA binding domain of prokaryotic ...
5-109 6.12e-46

helix_turn_helix, cAMP Regulatory protein C-terminus; DNA binding domain of prokaryotic regulatory proteins belonging to the catabolite activator protein family.


The actual alignment was detected with superfamily member PRK09413:

Pssm-ID: 481199  Cd Length: 121  Bit Score: 154.19  E-value: 6.12e-46
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425   5 LLGQEVRKRKTPQEKIAIIQQTMEPGMNVSHVARLHGIQPSLLFKWKKQYQEGSLTAVAAGEEVVPASELTAALKQVREL 84
Cdd:PRK09413    4 VLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKEL 83
                          90       100
                  ....*....|....*....|....*
gi 1611156425  85 QRLLGKKTMEVEILKEAVEYGQSPK 109
Cdd:PRK09413   84 QRLLGKKTMENELLKEAVEYGRAKK 108
 
Name Accession Description Interval E-value
PRK09409 PRK09409
IS2 transposase TnpB; Reviewed
110-399 1.29e-130

IS2 transposase TnpB; Reviewed


Pssm-ID: 181829 [Multi-domain]  Cd Length: 301  Bit Score: 377.52  E-value: 1.29e-130
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 110 MDSARALVAKGRGIALVSRTMGVSRAQLSLRVNRSADWQDRRCNRRNDEADaeILTDILDIISDMPSYGYRRVWGILRKQ 189
Cdd:PRK09409    1 MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTD--VLLRIHHVIGELPTYGYRRVWALLRRQ 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 190 RRTEGEPPVNAKRLYRIMSEhNLLLLHDKPERP--KREHKGKIAVAESDMRWCSDGFEFGCDNGEKLRVTFALDCCDREA 267
Cdd:PRK09409   79 AELDGMPAINAKRVYRIMRQ-NALLLERKPAVPpsKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 268 IDWAASTGGYDSSTVQDVMLRSVEKRFGDRLPDTPVQWLTDNGSAYTAHETRRFARELNLEPCTTAVSSPQSNGMAERFV 347
Cdd:PRK09409  158 LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFV 237
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1611156425 348 KTMKEDYIAFMPKPDVRTALQNLAAAFTHYNENHPHSALGYYSPREYRRQRA 399
Cdd:PRK09409  238 KTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRA 289
transpos_IS3 NF033516
IS3 family transposase;
17-395 4.42e-90

IS3 family transposase;


Pssm-ID: 468052 [Multi-domain]  Cd Length: 369  Bit Score: 276.37  E-value: 4.42e-90
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425  17 QEKIAIIQQTMEPGMNVSHVARLHGIQPSLLFKWKKQYQEGSLTAVAAgeevVPASELTAALKQVRELQRLLGKKTMEVE 96
Cdd:NF033516    1 EFKLEAVREVLEGGKSVAEVARELGISPSTLYRWRKKYRGGGEAADAG----RLKELLTPEEEENRRLKRELAELRLENE 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425  97 ILKEAVEYGQSPKMdsaRALVAKGRG---IALVSRTMGVSRAqlSLRvnrsaDWQDRRCNRRNDEaDAEILTDILDIISD 173
Cdd:NF033516   77 ILKKARKLLRPAVK---YALIDALRGeysVRRACRVLGVSRS--TYY-----YWRKRPPSRRAPD-DAELRARIREIFEE 145
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 174 MP-SYGYRRVWGILRKqrrtEGEPpVNAKRLYRIMSEHNLLLLHDKPERPKREHKGKIA------------VAESDMRWC 240
Cdd:NF033516  146 SRgRYGYRRITALLRR----EGIR-VNHKRVYRLMRELGLLARRRRKRRPYTTDSGHVHpvapnllnrqftATRPNQVWV 220
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 241 SDGFEFGCDNGeKLRVTFALDCCDREAIDWAASTGgYDSSTVQDVMLRSVEKRfgdRLPDTPVqWLTDNGSAYTAHETRR 320
Cdd:NF033516  221 TDITYIRTAEG-WLYLAVVLDLFSREIVGWSVSTS-MSAELVLDALEMAIEWR---GKPEGLI-LHSDNGSQYTSKAYRE 294
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1611156425 321 FARELNLEPCTTAVSSPQSNGMAERFVKTMKEDYIAFMPKPDVRTALQNLAAAFTHYNENHPHSALGYYSPREYR 395
Cdd:NF033516  295 WLKEHGITQSMSRPGNCWDNAVAESFFGTLKRECLYRRRFRTLEEARQAIEEYIEFYNHERPHSSLGYLTPAEFE 369
Tra5 COG2801
Transposase InsO and inactivated derivatives [Mobilome: prophages, transposons];
94-399 1.06e-52

Transposase InsO and inactivated derivatives [Mobilome: prophages, transposons];


Pssm-ID: 442053 [Multi-domain]  Cd Length: 309  Bit Score: 178.04  E-value: 1.06e-52
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425  94 EVEILKEAVEYGQSPKMDSARALVAKGRGIALVSRTMGVSRAQLSLRVNRSADWQDRRCNR-RNDEADAEILTDILDIIS 172
Cdd:COG2801     3 AEEEELRKEEELLRRLLLLLRLLLLRRRVLRRVSRRRRRLLRLLRRRRARSRRRRRLRRPRsYRADEDAELLERIKEIFA 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 173 DMPSYGYRRVWGILRKQRRtegepPVNAKRLYRIMSEHNLLLLHDKPERP--KREHKGKIA------VAESDMRWCSDGF 244
Cdd:COG2801    83 ESPRYGYRRITAELRREGI-----AVNRKRVRRLMRELGLQARRRRKKKYttYSGHGGPIApnllftATAPNQVWVTDIT 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 245 EFGCDNGeKLRVTFALDCCDREAIDWAASTGGyDSSTVQDVMLRSVEKRFgdrlPDTPVQWLTDNGSAYTAHETRRFARE 324
Cdd:COG2801   158 YIPTAEG-WLYLAAVIDLFSREIVGWSVSDSM-DAELVVDALEMAIERRG----PPKPLILHSDNGSQYTSKAYQELLKK 231
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1611156425 325 LNLEPCTTAVSSPQSNGMAERFVKTMKEDYIAFMPKPDVRTALQNLAAAFTHYNENHPHSALGYYSPREYRRQRA 399
Cdd:COG2801   232 LGITQSMSRPGNPQDNAFIESFFGTLKYELLYRRRFESLEEAREAIEEYIEFYNHERPHSSLGYLTPAEYEKQLA 306
PRK09413 PRK09413
IS2 repressor TnpA; Reviewed
5-109 6.12e-46

IS2 repressor TnpA; Reviewed


Pssm-ID: 181833  Cd Length: 121  Bit Score: 154.19  E-value: 6.12e-46
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425   5 LLGQEVRKRKTPQEKIAIIQQTMEPGMNVSHVARLHGIQPSLLFKWKKQYQEGSLTAVAAGEEVVPASELTAALKQVREL 84
Cdd:PRK09413    4 VLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKEL 83
                          90       100
                  ....*....|....*....|....*
gi 1611156425  85 QRLLGKKTMEVEILKEAVEYGQSPK 109
Cdd:PRK09413   84 QRLLGKKTMENELLKEAVEYGRAKK 108
transpos_IS481 NF033577
IS481 family transposase; null
113-391 3.22e-21

IS481 family transposase; null


Pssm-ID: 468094 [Multi-domain]  Cd Length: 283  Bit Score: 92.65  E-value: 3.22e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 113 ARALVAKGRGIALVSRTMGVSRAQLslrvnrsADWQDR-----------------RCNRRNDEADAEIltdILDIISDMp 175
Cdd:NF033577    6 VRLVLEDGWSVREAARRFGISRKTV-------YKWLKRyraggeeglidrsrrphRSPRRTSPETEAR---ILALRREL- 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 176 SYGYRRVWGILRKQRrtegePPVNAKRLYRIMSEHNLLLLHDKpERPKREHKgKIAVAESDMRWCSDGFEFG--CDNGeK 253
Cdd:NF033577   75 RLGPRRIAYELERQG-----PGVSRSTVHRILRRHGLSRLRAL-DRKTGKVK-RYERAHPGELWHIDIKKLGriPDVG-R 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 254 LRVTFALDCCDREAIdwaasTGGYDSSTVQDVM--LRSVEKRFGdrlpdTPVQW-LTDNGSAYTAHET--RRFARELNLE 328
Cdd:NF033577  147 LYLHTAIDDHSRFAY-----AELYPDETAETAAdfLRRAFAEHG-----IPIRRvLTDNGSEFRSRAHgfELALAELGIE 216
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1611156425 329 PCTTAVSSPQSNGMAERFVKTMKEdyiAFMPKPDVRTaLQNLAAAFT----HYNENHPHSALGYYSP 391
Cdd:NF033577  217 HRRTRPYHPQTNGKVERFHRTLKD---EFAYARPYES-LAELQAALDewlhHYNHHRPHSALGGKTP 279
rve pfam00665
Integrase core domain; Integrase mediates integration of a DNA copy of the viral genome into ...
238-337 1.04e-19

Integrase core domain; Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic domain. The carboxyl terminal domain that is a non-specific DNA binding domain pfam00552. The catalytic domain acts as an endonuclease when two nucleotides are removed from the 3' ends of the blunt-ended viral DNA made by reverse transcription. This domain also catalyzes the DNA strand transfer reaction of the 3' ends of the viral DNA to the 5' ends of the integration site.


Pssm-ID: 459897 [Multi-domain]  Cd Length: 98  Bit Score: 83.13  E-value: 1.04e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 238 RWCSDGFEFGCDNGE-KLRVTFALDCCDREAIDWAASTGGyDSSTVQDVMLRSVEKRFGDrlpdtPVQWLTDNGSAYTAH 316
Cdd:pfam00665   4 LWQGDFTYIRIPGGGgKLYLLVIVDDFSREILAWALSSEM-DAELVLDALERAIAFRGGV-----PLIIHSDNGSEYTSK 77
                          90       100
                  ....*....|....*....|.
gi 1611156425 317 ETRRFARELNLEPCTTAVSSP 337
Cdd:pfam00665  78 AFREFLKDLGIKPSFSRPGNP 98
InsE COG2963
Transposase InsE and inactivated derivatives [Mobilome: prophages, transposons];
11-104 2.17e-19

Transposase InsE and inactivated derivatives [Mobilome: prophages, transposons];


Pssm-ID: 442203 [Multi-domain]  Cd Length: 93  Bit Score: 82.28  E-value: 2.17e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425  11 RKRKTPQEKIAIIQQTMEPGMNVSHVARLHGIQPSLLFKWKKQYQEGSLTAVAAGEEVVPASEltaalkQVRELQRLLGK 90
Cdd:COG2963     5 RRRYSPEFKAEAVRLVLEGGASVAEVARELGISPSTLYRWVRQYREGGLGGFPGDGRTTPEQA------EIRRLRKELRR 78
                          90
                  ....*....|....
gi 1611156425  91 KTMEVEILKEAVEY 104
Cdd:COG2963    79 LEMENDILKKAAAL 92
HTH_Tnp_1 pfam01527
Transposase; Transposase proteins are necessary for efficient DNA transposition. This family ...
11-88 3.97e-14

Transposase; Transposase proteins are necessary for efficient DNA transposition. This family consists of various E. coli insertion elements and other bacterial transposases some of which are members of the IS3 family.


Pssm-ID: 426308 [Multi-domain]  Cd Length: 75  Bit Score: 66.99  E-value: 3.97e-14
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1611156425  11 RKRKTPQEKIAIIQQTMEPGMNVSHVARLHGIQPSLLFKWKKQYQEgsltavaaGEEVVPAS-ELTAALKQVRELQRLL 88
Cdd:pfam01527   4 RRRFSEEFKLRAVKEVLEPGRTVKEVARRHGVSPNTLYQWRRQYEG--------GMGASPARpRLTALEEENRRLKREL 74
IS66_access_TnpA NF038385
IS66-like element accessory protein TnpA;
8-60 6.94e-12

IS66-like element accessory protein TnpA;


Pssm-ID: 439678 [Multi-domain]  Cd Length: 190  Bit Score: 63.96  E-value: 6.94e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 1611156425   8 QEVRKRKTPQEKIAIIQQTMEPGMNVSHVARLHGIQPSLLFKWKKQYQEGSLT 60
Cdd:NF038385   83 KIRRPNFSREFKIRLVEQTLQPGACVAQIARENGINDNLLFNWRHLYRNGLLQ 135
 
Name Accession Description Interval E-value
PRK09409 PRK09409
IS2 transposase TnpB; Reviewed
110-399 1.29e-130

IS2 transposase TnpB; Reviewed


Pssm-ID: 181829 [Multi-domain]  Cd Length: 301  Bit Score: 377.52  E-value: 1.29e-130
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 110 MDSARALVAKGRGIALVSRTMGVSRAQLSLRVNRSADWQDRRCNRRNDEADaeILTDILDIISDMPSYGYRRVWGILRKQ 189
Cdd:PRK09409    1 MDSARALIARGWGVSLVSRCLRVSRAQLHVILRRTDDWMDGRRSRHTDDTD--VLLRIHHVIGELPTYGYRRVWALLRRQ 78
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 190 RRTEGEPPVNAKRLYRIMSEhNLLLLHDKPERP--KREHKGKIAVAESDMRWCSDGFEFGCDNGEKLRVTFALDCCDREA 267
Cdd:PRK09409   79 AELDGMPAINAKRVYRIMRQ-NALLLERKPAVPpsKRAHTGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREA 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 268 IDWAASTGGYDSSTVQDVMLRSVEKRFGDRLPDTPVQWLTDNGSAYTAHETRRFARELNLEPCTTAVSSPQSNGMAERFV 347
Cdd:PRK09409  158 LHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWLTDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFV 237
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|..
gi 1611156425 348 KTMKEDYIAFMPKPDVRTALQNLAAAFTHYNENHPHSALGYYSPREYRRQRA 399
Cdd:PRK09409  238 KTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSALGYRSPREYLRQRA 289
PRK14702 PRK14702
insertion element IS2 transposase InsD; Provisional
149-399 4.85e-113

insertion element IS2 transposase InsD; Provisional


Pssm-ID: 237792 [Multi-domain]  Cd Length: 262  Bit Score: 331.31  E-value: 4.85e-113
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 149 DRRCNRRNDEADaeILTDILDIISDMPSYGYRRVWGILRKQRRTEGEPPVNAKRLYRIMSEhNLLLLHDKPERP--KREH 226
Cdd:PRK14702    1 DGRRSRHTDDTD--VLLRIHHVIGELPTYGYRRVWALLRRQAELDGMPAINAKRVYRLMRQ-NALLLERKPAVPpsKRAH 77
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 227 KGKIAVAESDMRWCSDGFEFGCDNGEKLRVTFALDCCDREAIDWAASTGGYDSSTVQDVMLRSVEKRFGDRLPDTPVQWL 306
Cdd:PRK14702   78 TGRVAVKESNQRWCSDGFEFCCDNGERLRVTFALDCCDREALHWAVTTGGFNSETVQDVMLGAVERRFGNDLPSSPVEWL 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 307 TDNGSAYTAHETRRFARELNLEPCTTAVSSPQSNGMAERFVKTMKEDYIAFMPKPDVRTALQNLAAAFTHYNENHPHSAL 386
Cdd:PRK14702  158 TDNGSCYRANETRQFARMLGLEPKNTAVRSPESNGIAESFVKTIKRDYISIMPKPDGLTAAKNLAEAFEHYNEWHPHSAL 237
                         250
                  ....*....|...
gi 1611156425 387 GYYSPREYRRQRA 399
Cdd:PRK14702  238 GYRSPREYLRQRA 250
transpos_IS3 NF033516
IS3 family transposase;
17-395 4.42e-90

IS3 family transposase;


Pssm-ID: 468052 [Multi-domain]  Cd Length: 369  Bit Score: 276.37  E-value: 4.42e-90
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425  17 QEKIAIIQQTMEPGMNVSHVARLHGIQPSLLFKWKKQYQEGSLTAVAAgeevVPASELTAALKQVRELQRLLGKKTMEVE 96
Cdd:NF033516    1 EFKLEAVREVLEGGKSVAEVARELGISPSTLYRWRKKYRGGGEAADAG----RLKELLTPEEEENRRLKRELAELRLENE 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425  97 ILKEAVEYGQSPKMdsaRALVAKGRG---IALVSRTMGVSRAqlSLRvnrsaDWQDRRCNRRNDEaDAEILTDILDIISD 173
Cdd:NF033516   77 ILKKARKLLRPAVK---YALIDALRGeysVRRACRVLGVSRS--TYY-----YWRKRPPSRRAPD-DAELRARIREIFEE 145
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 174 MP-SYGYRRVWGILRKqrrtEGEPpVNAKRLYRIMSEHNLLLLHDKPERPKREHKGKIA------------VAESDMRWC 240
Cdd:NF033516  146 SRgRYGYRRITALLRR----EGIR-VNHKRVYRLMRELGLLARRRRKRRPYTTDSGHVHpvapnllnrqftATRPNQVWV 220
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 241 SDGFEFGCDNGeKLRVTFALDCCDREAIDWAASTGgYDSSTVQDVMLRSVEKRfgdRLPDTPVqWLTDNGSAYTAHETRR 320
Cdd:NF033516  221 TDITYIRTAEG-WLYLAVVLDLFSREIVGWSVSTS-MSAELVLDALEMAIEWR---GKPEGLI-LHSDNGSQYTSKAYRE 294
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1611156425 321 FARELNLEPCTTAVSSPQSNGMAERFVKTMKEDYIAFMPKPDVRTALQNLAAAFTHYNENHPHSALGYYSPREYR 395
Cdd:NF033516  295 WLKEHGITQSMSRPGNCWDNAVAESFFGTLKRECLYRRRFRTLEEARQAIEEYIEFYNHERPHSSLGYLTPAEFE 369
Tra5 COG2801
Transposase InsO and inactivated derivatives [Mobilome: prophages, transposons];
94-399 1.06e-52

Transposase InsO and inactivated derivatives [Mobilome: prophages, transposons];


Pssm-ID: 442053 [Multi-domain]  Cd Length: 309  Bit Score: 178.04  E-value: 1.06e-52
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425  94 EVEILKEAVEYGQSPKMDSARALVAKGRGIALVSRTMGVSRAQLSLRVNRSADWQDRRCNR-RNDEADAEILTDILDIIS 172
Cdd:COG2801     3 AEEEELRKEEELLRRLLLLLRLLLLRRRVLRRVSRRRRRLLRLLRRRRARSRRRRRLRRPRsYRADEDAELLERIKEIFA 82
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 173 DMPSYGYRRVWGILRKQRRtegepPVNAKRLYRIMSEHNLLLLHDKPERP--KREHKGKIA------VAESDMRWCSDGF 244
Cdd:COG2801    83 ESPRYGYRRITAELRREGI-----AVNRKRVRRLMRELGLQARRRRKKKYttYSGHGGPIApnllftATAPNQVWVTDIT 157
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 245 EFGCDNGeKLRVTFALDCCDREAIDWAASTGGyDSSTVQDVMLRSVEKRFgdrlPDTPVQWLTDNGSAYTAHETRRFARE 324
Cdd:COG2801   158 YIPTAEG-WLYLAAVIDLFSREIVGWSVSDSM-DAELVVDALEMAIERRG----PPKPLILHSDNGSQYTSKAYQELLKK 231
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 1611156425 325 LNLEPCTTAVSSPQSNGMAERFVKTMKEDYIAFMPKPDVRTALQNLAAAFTHYNENHPHSALGYYSPREYRRQRA 399
Cdd:COG2801   232 LGITQSMSRPGNPQDNAFIESFFGTLKYELLYRRRFESLEEAREAIEEYIEFYNHERPHSSLGYLTPAEYEKQLA 306
PRK09413 PRK09413
IS2 repressor TnpA; Reviewed
5-109 6.12e-46

IS2 repressor TnpA; Reviewed


Pssm-ID: 181833  Cd Length: 121  Bit Score: 154.19  E-value: 6.12e-46
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425   5 LLGQEVRKRKTPQEKIAIIQQTMEPGMNVSHVARLHGIQPSLLFKWKKQYQEGSLTAVAAGEEVVPASELTAALKQVREL 84
Cdd:PRK09413    4 VLGPEKRRRRTTQEKIAIVQQSFEPGMTVSLVARQHGVAASQLFLWRKQYQEGSLTAVAAGEQVVPASELAAAMKQIKEL 83
                          90       100
                  ....*....|....*....|....*
gi 1611156425  85 QRLLGKKTMEVEILKEAVEYGQSPK 109
Cdd:PRK09413   84 QRLLGKKTMENELLKEAVEYGRAKK 108
transpos_IS481 NF033577
IS481 family transposase; null
113-391 3.22e-21

IS481 family transposase; null


Pssm-ID: 468094 [Multi-domain]  Cd Length: 283  Bit Score: 92.65  E-value: 3.22e-21
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 113 ARALVAKGRGIALVSRTMGVSRAQLslrvnrsADWQDR-----------------RCNRRNDEADAEIltdILDIISDMp 175
Cdd:NF033577    6 VRLVLEDGWSVREAARRFGISRKTV-------YKWLKRyraggeeglidrsrrphRSPRRTSPETEAR---ILALRREL- 74
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 176 SYGYRRVWGILRKQRrtegePPVNAKRLYRIMSEHNLLLLHDKpERPKREHKgKIAVAESDMRWCSDGFEFG--CDNGeK 253
Cdd:NF033577   75 RLGPRRIAYELERQG-----PGVSRSTVHRILRRHGLSRLRAL-DRKTGKVK-RYERAHPGELWHIDIKKLGriPDVG-R 146
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 254 LRVTFALDCCDREAIdwaasTGGYDSSTVQDVM--LRSVEKRFGdrlpdTPVQW-LTDNGSAYTAHET--RRFARELNLE 328
Cdd:NF033577  147 LYLHTAIDDHSRFAY-----AELYPDETAETAAdfLRRAFAEHG-----IPIRRvLTDNGSEFRSRAHgfELALAELGIE 216
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 1611156425 329 PCTTAVSSPQSNGMAERFVKTMKEdyiAFMPKPDVRTaLQNLAAAFT----HYNENHPHSALGYYSP 391
Cdd:NF033577  217 HRRTRPYHPQTNGKVERFHRTLKD---EFAYARPYES-LAELQAALDewlhHYNHHRPHSALGGKTP 279
rve pfam00665
Integrase core domain; Integrase mediates integration of a DNA copy of the viral genome into ...
238-337 1.04e-19

Integrase core domain; Integrase mediates integration of a DNA copy of the viral genome into the host chromosome. Integrase is composed of three domains. The amino-terminal domain is a zinc binding domain pfam02022. This domain is the central catalytic domain. The carboxyl terminal domain that is a non-specific DNA binding domain pfam00552. The catalytic domain acts as an endonuclease when two nucleotides are removed from the 3' ends of the blunt-ended viral DNA made by reverse transcription. This domain also catalyzes the DNA strand transfer reaction of the 3' ends of the viral DNA to the 5' ends of the integration site.


Pssm-ID: 459897 [Multi-domain]  Cd Length: 98  Bit Score: 83.13  E-value: 1.04e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 238 RWCSDGFEFGCDNGE-KLRVTFALDCCDREAIDWAASTGGyDSSTVQDVMLRSVEKRFGDrlpdtPVQWLTDNGSAYTAH 316
Cdd:pfam00665   4 LWQGDFTYIRIPGGGgKLYLLVIVDDFSREILAWALSSEM-DAELVLDALERAIAFRGGV-----PLIIHSDNGSEYTSK 77
                          90       100
                  ....*....|....*....|.
gi 1611156425 317 ETRRFARELNLEPCTTAVSSP 337
Cdd:pfam00665  78 AFREFLKDLGIKPSFSRPGNP 98
InsE COG2963
Transposase InsE and inactivated derivatives [Mobilome: prophages, transposons];
11-104 2.17e-19

Transposase InsE and inactivated derivatives [Mobilome: prophages, transposons];


Pssm-ID: 442203 [Multi-domain]  Cd Length: 93  Bit Score: 82.28  E-value: 2.17e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425  11 RKRKTPQEKIAIIQQTMEPGMNVSHVARLHGIQPSLLFKWKKQYQEGSLTAVAAGEEVVPASEltaalkQVRELQRLLGK 90
Cdd:COG2963     5 RRRYSPEFKAEAVRLVLEGGASVAEVARELGISPSTLYRWVRQYREGGLGGFPGDGRTTPEQA------EIRRLRKELRR 78
                          90
                  ....*....|....
gi 1611156425  91 KTMEVEILKEAVEY 104
Cdd:COG2963    79 LEMENDILKKAAAL 92
rve_3 pfam13683
Integrase core domain;
332-391 2.93e-15

Integrase core domain;


Pssm-ID: 433402 [Multi-domain]  Cd Length: 67  Bit Score: 69.94  E-value: 2.93e-15
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 332 TAVSSPQSNGMAERFVKTMKEDYIAFMPKPDVRTALQNLAAAFTHYNENHPHSALGYYSP 391
Cdd:pfam13683   8 IAPGKPMQNGLVESFNGTLRDECLNEHLFSSLAEARALLAAWREDYNTERPHSSLGYRTP 67
HTH_Tnp_1 pfam01527
Transposase; Transposase proteins are necessary for efficient DNA transposition. This family ...
11-88 3.97e-14

Transposase; Transposase proteins are necessary for efficient DNA transposition. This family consists of various E. coli insertion elements and other bacterial transposases some of which are members of the IS3 family.


Pssm-ID: 426308 [Multi-domain]  Cd Length: 75  Bit Score: 66.99  E-value: 3.97e-14
                          10        20        30        40        50        60        70
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1611156425  11 RKRKTPQEKIAIIQQTMEPGMNVSHVARLHGIQPSLLFKWKKQYQEgsltavaaGEEVVPAS-ELTAALKQVRELQRLL 88
Cdd:pfam01527   4 RRRFSEEFKLRAVKEVLEPGRTVKEVARRHGVSPNTLYQWRRQYEG--------GMGASPARpRLTALEEENRRLKREL 74
PHA02517 PHA02517
putative transposase OrfB; Reviewed
130-393 7.55e-13

putative transposase OrfB; Reviewed


Pssm-ID: 222853 [Multi-domain]  Cd Length: 277  Bit Score: 68.35  E-value: 7.55e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 130 MGVSRAQLSLRVNRSADwQDRRCNRRNDeaDAEILTDILDIISD-MPSYGYRRVWGILRKQRRTEGepPVNAKRLYRIMS 208
Cdd:PHA02517    1 LGIAPSTYYRCQQQRHH-PDKRRARAQH--DDWLKSEILRVYDEnHQVYGVRKVWRQLNREGIRVA--RCTVGRLMKELG 75
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 209 EHNLL--------LLHDKPERPKREHKgKIAVAESDMRWCSDgFEFGCDNGEKLRVTFALDCCDREAIDWAAStggydSS 280
Cdd:PHA02517   76 LAGVLrgkkvrttISRKAVAAPDRVNR-QFVATRPNQLWVAD-FTYVSTWQGWVYVAFIIDVFARRIVGWRVS-----SS 148
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1611156425 281 TVQDVMLRSVEKRFGDRLPDTPVQWLTDNGSAYTAHETRRFARELNLEPCTTAVSSPQSNGMAERFVKTMKEDYIAFMPK 360
Cdd:PHA02517  149 MDTDFVLDALEQALWARGRPGGLIHHSDKGSQYVSLAYTQRLKEAGIRASTGSRGDSYDNAPAESINGLYKAEVIHRVSW 228
                         250       260       270
                  ....*....|....*....|....*....|...
gi 1611156425 361 PDVRTALQNLAAAFTHYNENHPHSALGYYSPRE 393
Cdd:PHA02517  229 KNREEVELATLEWVAWYNNRRLHERLGYTPPAE 261
IS66_access_TnpA NF038385
IS66-like element accessory protein TnpA;
8-60 6.94e-12

IS66-like element accessory protein TnpA;


Pssm-ID: 439678 [Multi-domain]  Cd Length: 190  Bit Score: 63.96  E-value: 6.94e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|...
gi 1611156425   8 QEVRKRKTPQEKIAIIQQTMEPGMNVSHVARLHGIQPSLLFKWKKQYQEGSLT 60
Cdd:NF038385   83 KIRRPNFSREFKIRLVEQTLQPGACVAQIARENGINDNLLFNWRHLYRNGLLQ 135
HTH_21 pfam13276
HTH-like domain; This domain contains a predicted helix-turn-helix suggesting a DNA-binding ...
158-213 6.66e-06

HTH-like domain; This domain contains a predicted helix-turn-helix suggesting a DNA-binding function.


Pssm-ID: 463824 [Multi-domain]  Cd Length: 60  Bit Score: 43.33  E-value: 6.66e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1611156425 158 EADAEILTDILDIISD-MPSYGYRRVWGILRKQrrteGEPPVNAKRLYRIMSEHNLL 213
Cdd:pfam13276   1 ELEDELLEAIREIFEEsRGTYGYRRITAELRRE----GGIRVNRKRVARLMRELGLR 53
HTH_28 pfam13518
Helix-turn-helix domain; This helix-turn-helix domain is often found in transposases and is ...
18-61 1.90e-03

Helix-turn-helix domain; This helix-turn-helix domain is often found in transposases and is likely to be DNA-binding.


Pssm-ID: 463908 [Multi-domain]  Cd Length: 52  Bit Score: 36.03  E-value: 1.90e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....
gi 1611156425  18 EKIAIIQQTMEpGMNVSHVARLHGIQPSLLFKWKKQYQEGSLTA 61
Cdd:pfam13518   1 ERLKIVLLALE-GESIKEAARLFGISRSTVYRWIRRYREGGLEG 43
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH