Conserved domains on  [gi|1634553350|gb|QCO69814|]

R4 integrase [Cloning vector pLCBRT1]

Protein Classification

recombinase family protein (domain architecture ID 11449350)

A recombinase family protein is a serine recombinase that catalyzes site-specific recombination of DNA molecules by a concerted, four-strand cleavage and rejoining mechanism that involves a transient phosphoserine linkage between the DNA and the enzyme.


List of domain hits

Name Accession Description Interval E-value
SpoIVCA COG1961
Site-specific DNA recombinase SpoIVCA/DNA invertase PinE [Replication, recombination and ...
10-323 9.10e-41

Site-specific DNA recombinase SpoIVCA/DNA invertase PinE [Replication, recombination and repair];



Pssm-ID: 441564 [Multi-domain]  Cd Length: 388  Bit Score: 150.18  E-value: 9.10e-41
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1634553350  10 RADIYVRISLDRtGEELGVERQEESCRELCKSLGMEVGQVWVDNDLSATKKNvvRPDFEAMIA----SNPQAIVCWHTDR 85
Cdd:COG1961     3 RAAGYARVSTDD-QEGLSLERQREALRAYAEKAGWEIVRIYVDEGVSGTSKD--RPGLQRLLAdlraGKFDTLVVWKLDR 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1634553350  86 LIRVTRDLERVIDL----GVNVHAVMAGhLDLSTPAGRAVARTVTAWATYEGEQKAERQKLANIQNARAGKPytPGIRPF 161
Cdd:COG1961    80 LGRNLADLLELVEElkerGVRLISLTEG-IDTSTPMGRLLLTILAAFAEFERELISERTRAGLAAAKARGKY--LGRPPY 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1634553350 162 GY-GDDHMTIVTAEADAIRDGAKMILDGWSLSAVARYWEELKLqsprsmaaggKGWSLRGVKKVLTSPRYVGRSSYLGEV 240
Cdd:COG1961   157 GYrDPKKLVIDEEEAEVVRRIFELYLEGKSLREIARELNERGI----------PTWSRSTVYRILKNPVYGGLGVYGLGK 226
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1634553350 241 VGDAQWPPILDPDVYYGVVAILNNPDRFSGGPRTGRTPGTLLAGIALCGECGKTVSGRGYRGVLVYGCKDTHTRTPRSIA 320
Cdd:COG1961   227 KIVTEKGERKVRRRRKRRVIREEEERVRRRELAAKRARRLLKGKLLRGLRGRKKRGGGRKRRRGGGRRLCGRLGRLRKGG 306

                  ...
gi 1634553350 321 DGR 323
Cdd:COG1961   307 RLR 309
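
The alignment rows above can be summarized as a fraction of identical residues. The following is a minimal sketch, assuming the query and Cdd rows of one block have been copied out as equal-length strings, that `-` marks a gap, and that lowercase letters mark unaligned residues (as they appear opposite gaps in the blocks above). The function name `alignment_identity` is hypothetical and not part of any NCBI tool.

```python
def alignment_identity(query_row: str, subject_row: str) -> float:
    """Return the fraction of aligned columns with identical residues."""
    if len(query_row) != len(subject_row):
        raise ValueError("alignment rows must have equal length")
    aligned = 0
    identical = 0
    for q, s in zip(query_row, subject_row):
        # Skip gap columns and columns holding lowercase (unaligned) residues.
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            identical += 1
    # An empty or fully gapped block has no aligned columns.
    return identical / aligned if aligned else 0.0


# Last block of the COG1961 alignment: query residues 321-323 vs. Cdd 307-309.
print(alignment_identity("DGR", "RLR"))
```

Applying the function block by block and pooling the counts would give a whole-alignment figure; identity alone ignores conservative substitutions, so it is only a rough companion to the bit score reported by CD-Search.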
 
Ser_Recombinase cd00338
Serine Recombinase family, catalytic domain; a DNA binding domain may be present either N- or ...
13-143 5.81e-28

Serine Recombinase family, catalytic domain; a DNA binding domain may be present either N- or C-terminal to the catalytic domain. These enzymes perform site-specific recombination of DNA molecules by a concerted, four-strand cleavage and rejoining mechanism which involves a transient phosphoserine linkage between DNA and serine recombinase. Serine recombinases demonstrate functional versatility and include resolvases, invertases, integrases, and transposases. Resolvases and invertases (e.g. Tn3, gamma-delta, Tn5044 resolvases, Gin and Hin invertases) in this family contain a C-terminal DNA binding domain and comprise a major phylogenetic group. Also included are phage- and bacterial-encoded recombinases such as phiC31 integrase, SpoIVCA excisionase, and Tn4451 TnpX transposase. These integrases and transposases have larger C-terminal domains compared to resolvases/invertases and are referred to as large serine recombinases. Also belonging to this family are proteins with N-terminal DNA binding domains similar to IS607- and IS1535-transposases from Helicobacter and Mycobacterium.


Pssm-ID: 238206 [Multi-domain]  Cd Length: 137  Bit Score: 108.12  E-value: 5.81e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1634553350  13 IYVRISLDRTGEELGVERQEESCRELCKSLGMEVGQVWVDNDLSATkKNVVRPDFEAMIA----SNPQAIVCWHTDRLIR 88
Cdd:cd00338     1 IYARVSTDKQEQGDSLERQREALREYAARNGLEVVGEYEDAGSSAT-SLVDRPGLQRLLAdvkaGKIDVVLVEKLDRLSR 79
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1634553350  89 VTRDLERVIDL----GVNVHAVMaGHLDLSTPAGRAVARTVTAWATYEGEQKAERQKLA 143
Cdd:cd00338    80 NLVDLLELLELleahGVRVVTAD-GEIDLDSEDGRLMLGILAAMAEEESKLISERTKRG 137
Resolvase smart00857
Resolvase, N terminal domain; The N-terminal domain of the resolvase family contains the ...
13-153 2.57e-27

Resolvase, N terminal domain; The N-terminal domain of the resolvase family contains the active site and the dimer interface. The extended arm at the C-terminus of this domain connects to the C-terminal helix-turn-helix domain of resolvase.


Pssm-ID: 214861 [Multi-domain]  Cd Length: 148  Bit Score: 106.55  E-value: 2.57e-27
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1634553350   13 IYVRISLDRTGEElGVERQEESCRELCKSLGMEVGQVWVDNDLSATKKNvvRPDFEAMIAS----NPQAIVCWHTDRLIR 88
Cdd:smart00857   3 GYARVSTDDQADG-SLERQLEALRAYAERNGWEVVRIYEDEGVSGKKAD--RPGLQRLLADlragDIDVLVVYKLDRLGR 79
                           90       100       110       120       130       140
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 1634553350   89 VTRDLERVIDL----GVNVHAVMAGHLDLSTPAGRAVARTVTAWATYEGEQKAERQKLANIQNARAGKP 153
Cdd:smart00857  80 SLRDLLALLELleekGVRLVSLKEGILDTSTPAGRLLLDILAALAEFERELISERTKAGLARAAARGRW 148
Resolvase pfam00239
Resolvase, N terminal domain; The N-terminal domain of the resolvase family (this family) ...
13-150 6.55e-17

Resolvase, N terminal domain; The N-terminal domain of the resolvase family (this family) contains the active site and the dimer interface. The extended arm at the C-terminus of this domain connects to the C-terminal helix-turn-helix domain of resolvase - see pfam02796.


Pssm-ID: 425548 [Multi-domain]  Cd Length: 144  Bit Score: 77.31  E-value: 6.55e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1634553350  13 IYVRISLDrtGEELGVERQEESCRELCKSLGMEVgqVWVDNDLSATKKNvvRPDFEAMI--ASNPQ--AIVCWHTDRLIR 88
Cdd:pfam00239   3 GYARVSTE--DQDDSLERQLEALRAYAACNGKIV--EFEDKGVSGRKLD--RPGLQRLLalLRAGKgdVLVVYKLDRLGR 76
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 1634553350  89 VTRDLERVID----LGVNVHaVMAGHLDLSTPAGRAVARTVTAWATYEGEQKAERQKlANIQNARA 150
Cdd:pfam00239  77 SLRDLLTLVEelreKGVDLV-SLDEGIDTSTPMGRLLLTILAALAEFERALIRERTR-AGLAAAAA 140
 
Recombinase pfam07508
Recombinase; This domain is usually found associated with pfam00239 in putative integrases ...
174-262 1.46e-08

Recombinase; This domain is usually found associated with pfam00239 in putative integrases/recombinases of mobile genetic elements of diverse bacteria and phages.


Pssm-ID: 429502 [Multi-domain]  Cd Length: 102  Bit Score: 52.40  E-value: 1.46e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1634553350 174 EADAIRDGAKMILDGWSLSAVARYWEELKLQSPRsmaagGKGWSLRGVKKVLTSPRYVGRSSY-------------LGEV 240
Cdd:pfam07508   3 EAEIVRLIFELYLEGKSLRSIARYLNEQGIPTPR-----GKDWSPSTVRRILTNPAYIGILVYgktkkkrrkrnpdEEWI 77
                          90       100
                  ....*....|....*....|..
gi 1634553350 241 VGDAQWPPILDPDVYYGVVAIL 262
Cdd:pfam07508  78 VIEGAHPPIISEELFEAVQERL 99
SR_ResInv cd03768
Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members ...
13-141 5.20e-08

Serine Recombinase (SR) family, Resolvase and Invertase subfamily, catalytic domain; members contain a C-terminal DNA binding domain. Serine recombinases catalyze site-specific recombination of DNA molecules by a concerted, four-strand cleavage and rejoining mechanism which involves a transient phosphoserine linkage between DNA and the enzyme. They are functionally versatile and include resolvases, invertases, integrases, and transposases. Resolvases and invertases effect resolution or inversion and comprise a major phylogenetic group. Resolvases (e.g. Tn3, gamma-delta, and Tn5044) normally recombine two sites in direct repeat, causing deletion of the DNA between the sites. Invertases (e.g. Gin and Hin) recombine sites in inverted repeat to invert the DNA between the sites. Cointegrate resolution with gamma-delta resolvase requires the formation of a synaptosome of three resolvase dimers bound to each of two res sites on the DNA. Also included in this subfamily are some putative integrases including a sequence from bacteriophage phi-FC1.


Pssm-ID: 239737 [Multi-domain]  Cd Length: 126  Bit Score: 51.33  E-value: 5.20e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1634553350  13 IYVRISldrTGEELgVERQEESCRELCkslgmEVGQVWVDnDLSATKKNvvRPDFEAMI--ASNPQAIVCWHTDRLIRVT 90
Cdd:cd03768     3 GYARVS---TDDQS-LERQLEALKAAG-----ECDKIFEE-KGSGGKKE--RPELQKLLedLREGDTLVVTKLDRLGRST 70
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 1634553350  91 RDLERVIDL----GVNVHAVMAGhLDLSTPAGRAVARTVTAWATYEGEQKAERQK 141
Cdd:cd03768    71 KDLLEIVEElrekGVSLRSLTEG-IDTSTPSGKLMLTILGAFAEFERELIRERTK 124
Zn_ribbon_recom pfam13408
Recombinase zinc beta ribbon domain; This short bacterial protein contains a zinc ribbon ...
281-314 6.36e-03

Recombinase zinc beta ribbon domain; This short bacterial protein contains a zinc ribbon domain that is likely to be DNA-binding. This domain is found in site specific recombinase proteins. This family appears most closely related to pfam04606.


Pssm-ID: 433183 [Multi-domain]  Cd Length: 58  Bit Score: 34.91  E-value: 6.36e-03
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1634553350 281 LLAGIALCGECGKTVSGRGYRG-VLVYGCKDTHTR 314
Cdd:pfam13408   1 LLSGLLRCGECGSPMTGRTSKGgKRYYRCSTRRRK 35
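
Several hits in the list above cover overlapping stretches of the query. As a rough illustration of how the footprint of the individual domain models might be summarized, the sketch below merges the reported query intervals. The dictionary `HIT_INTERVALS` and the helper names are hypothetical; the coordinates are copied from the Interval column above, and treating the COG1961 hit separately because it spans the whole region is an assumption made only for this example.

```python
from typing import Dict, List, Tuple

Interval = Tuple[int, int]

# Query intervals copied from the hit list above (accession -> (from, to)).
HIT_INTERVALS: Dict[str, Interval] = {
    "COG1961": (10, 323),
    "cd00338": (13, 143),
    "smart00857": (13, 153),
    "pfam00239": (13, 150),
    "pfam07508": (174, 262),
    "cd03768": (13, 141),
    "pfam13408": (281, 314),
}


def merge_intervals(intervals: List[Interval]) -> List[Interval]:
    """Merge overlapping or directly adjacent closed intervals."""
    merged: List[Interval] = []
    for start, end in sorted(intervals):
        if merged and start <= merged[-1][1] + 1:
            # Extend the previous interval instead of opening a new one.
            merged[-1] = (merged[-1][0], max(merged[-1][1], end))
        else:
            merged.append((start, end))
    return merged


def covered_residues(intervals: List[Interval]) -> int:
    """Count query residues covered by at least one interval."""
    return sum(end - start + 1 for start, end in merge_intervals(intervals))


# Leave out the COG1961 hit, which spans the whole region on its own.
domain_only = [iv for acc, iv in HIT_INTERVALS.items() if acc != "COG1961"]
print(merge_intervals(domain_only))
print(covered_residues(domain_only))
```

With the COG1961 hit left out, the remaining models fall into three separate regions of the query: the N-terminal catalytic domain, the recombinase domain, and the zinc ribbon.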
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
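
The E-value threshold listed in the parameters acts as a cutoff on reported hits. The sketch below shows how such a cutoff could be applied after the fact to the hits reported on this page. The list `HITS` and the function `filter_hits` are hypothetical names, the values are copied from the hit list, and the use of `<=` at the boundary is an assumption, not documented CD-Search behavior.

```python
from typing import List, Tuple

# (accession, E-value) pairs copied from the hit list on this page.
HITS: List[Tuple[str, float]] = [
    ("COG1961", 9.10e-41),
    ("cd00338", 5.81e-28),
    ("smart00857", 2.57e-27),
    ("pfam00239", 6.55e-17),
    ("pfam07508", 1.46e-08),
    ("cd03768", 5.20e-08),
    ("pfam13408", 6.36e-03),
]


def filter_hits(hits: List[Tuple[str, float]],
                threshold: float = 0.01) -> List[Tuple[str, float]]:
    """Keep hits at or below the threshold, most significant first."""
    kept = [hit for hit in hits if hit[1] <= threshold]
    return sorted(kept, key=lambda hit: hit[1])


# All seven hits pass the preset threshold of 0.01.
print(len(filter_hits(HITS)))
# A stricter, arbitrarily chosen cutoff keeps only the strongest hits.
print([accession for accession, _ in filter_hits(HITS, threshold=1e-10)])
```

The zinc ribbon hit (pfam13408, E-value 6.36e-03) sits just under the preset threshold, so even a modestly stricter cutoff would drop it.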
