NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|446136619|ref|WP_000214474|]
View 

MULTISPECIES: N4-gp56 family major capsid protein [Enterobacteriaceae]

Protein Classification

DUF4043 domain-containing protein( domain architecture ID 10593963)

DUF4043 domain-containing protein

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
DUF4043 pfam13252
Protein of unknown function (DUF4043); This family of proteins is functionally uncharacterized. ...
1-404 1.16e-144

Protein of unknown function (DUF4043); This family of proteins is functionally uncharacterized. This family of proteins is found in bacteria and viruses. Proteins in this family are typically between 369 and 424 amino acids in length. There is a single completely conserved residue G that may be functionally important.


:

Pssm-ID: 463819  Cd Length: 382  Bit Score: 416.08  E-value: 1.16e-144
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619    1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQqeapkavspdkkstkqTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80
Cdd:pfam13252   1 NTGVNSPLAVKKWSVALFAEANKRSTFLPRLAGQ----------------TSDDMPIVRITDLQKGAGDEVTFDLLNPLG 64
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619   81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGdfv 160
Cdd:pfam13252  65 GAPIMGDEVLEGRGEALSFSSDKLRINQARHAVDAGGTMTQQRTPHDLRATARPALADWFDRLQDQSAFVHLAGARG--- 141
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619  161 aDDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFE-QIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239
Cdd:pfam13252 142 -FHSNWTLASAPKFNDIMVNPVTAPTSNRHLFAGGAASTSgSLTSTDLFTLDLVDKARKLADTMALPPPPVKLRGDVVAG 220
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619  240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYaGMPIRFYQGSKVLVSENNL 319
Cdd:pfam13252 221 GDPLYVLLLHPYQYDDLRTDTDTGAWRDIQKAAMARALVDKNPLFQGELGLWNGVVLRKH-PRVIRFNNGDTGKYAANTF 299
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619  320 TattkevAAATNIDRAMLLGAQALANAYGQKAGG-HFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSG---KMQDHGV 395
Cdd:pfam13252 300 S------GAGYATDRALLLGAQALAIAFGKTNGGtPFGWNEELLDHGNRLEVLIWAIDGLKKTRFNVDDGgvnKPTDFGV 373

                  ....*....
gi 446136619  396 IAVDTAVKL 404
Cdd:pfam13252 374 IVVDTAVKI 382
 
Name Accession Description Interval E-value
DUF4043 pfam13252
Protein of unknown function (DUF4043); This family of proteins is functionally uncharacterized. ...
1-404 1.16e-144

Protein of unknown function (DUF4043); This family of proteins is functionally uncharacterized. This family of proteins is found in bacteria and viruses. Proteins in this family are typically between 369 and 424 amino acids in length. There is a single completely conserved residue G that may be functionally important.


Pssm-ID: 463819  Cd Length: 382  Bit Score: 416.08  E-value: 1.16e-144
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619    1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQqeapkavspdkkstkqTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80
Cdd:pfam13252   1 NTGVNSPLAVKKWSVALFAEANKRSTFLPRLAGQ----------------TSDDMPIVRITDLQKGAGDEVTFDLLNPLG 64
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619   81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGdfv 160
Cdd:pfam13252  65 GAPIMGDEVLEGRGEALSFSSDKLRINQARHAVDAGGTMTQQRTPHDLRATARPALADWFDRLQDQSAFVHLAGARG--- 141
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619  161 aDDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFE-QIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239
Cdd:pfam13252 142 -FHSNWTLASAPKFNDIMVNPVTAPTSNRHLFAGGAASTSgSLTSTDLFTLDLVDKARKLADTMALPPPPVKLRGDVVAG 220
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619  240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYaGMPIRFYQGSKVLVSENNL 319
Cdd:pfam13252 221 GDPLYVLLLHPYQYDDLRTDTDTGAWRDIQKAAMARALVDKNPLFQGELGLWNGVVLRKH-PRVIRFNNGDTGKYAANTF 299
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619  320 TattkevAAATNIDRAMLLGAQALANAYGQKAGG-HFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSG---KMQDHGV 395
Cdd:pfam13252 300 S------GAGYATDRALLLGAQALAIAFGKTNGGtPFGWNEELLDHGNRLEVLIWAIDGLKKTRFNVDDGgvnKPTDFGV 373

                  ....*....
gi 446136619  396 IAVDTAVKL 404
Cdd:pfam13252 374 IVVDTAVKI 382
capsid_maj_N4 TIGR04387
major capsid protein, N4-gp56 family; Members of this family are phage major capsid proteins ...
55-358 3.69e-10

major capsid protein, N4-gp56 family; Members of this family are phage major capsid proteins as found in phage N4 (a double-stranded DNA virus) plus many additional lytic phage and integrated prophage regions. [Mobile and extrachromosomal element functions, Prophage functions]


Pssm-ID: 275180  Cd Length: 315  Bit Score: 60.83  E-value: 3.69e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619   55 APVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLaSSART 134
Cdd:TIGR04387  29 SKFAQVKPLPKNPGDTIKFRRYVPLPGAPTPLTEGVTPKGEKLTFTDLTVTLEQYGKFVELTDVAADTHEDPEL-GEATE 107
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619  135 LLGTYFNDLQDQCAIVHLAGARGDFVAddtilptaehpefkkimindvlppthdrhffgGDATSFEQIEAADIfSIGLVD 214
Cdd:TIGR04387 108 LLGEQAAQTIDELTRDVLAGATNVIYA--------------------------------GAGTARNAVTADDV-TYDDIR 154
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619  215 NLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMmvravnrakGFNHPLFKGECAMWRNi 294
Cdd:TIGR04387 155 RAVRKLKDNRAPKITTVLTASVMVGTEPSYVAVIHPDLEPDLRDDPGFIPVEKY---------GAADPIMKGEIGMIEG- 224
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 446136619  295 lvrkyagmpIRFYQGSKVLVSENNltattkeVAAATNID--RAMLLGAQALANAYGQKAGGHFNMV 358
Cdd:TIGR04387 225 ---------VRFVETPEVLPWADA-------GAAGGNADvyPILIVGKDAFGTVPKNGKASTKHKI 274
 
Name Accession Description Interval E-value
DUF4043 pfam13252
Protein of unknown function (DUF4043); This family of proteins is functionally uncharacterized. ...
1-404 1.16e-144

Protein of unknown function (DUF4043); This family of proteins is functionally uncharacterized. This family of proteins is found in bacteria and viruses. Proteins in this family are typically between 369 and 424 amino acids in length. There is a single completely conserved residue G that may be functionally important.


Pssm-ID: 463819  Cd Length: 382  Bit Score: 416.08  E-value: 1.16e-144
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619    1 MTTVTSAQANKLYQVALFTAANRNRSMVNILTEQqeapkavspdkkstkqTSAGAPVVRITDLNKQAGDEVTFSIMHKLS 80
Cdd:pfam13252   1 NTGVNSPLAVKKWSVALFAEANKRSTFLPRLAGQ----------------TSDDMPIVRITDLQKGAGDEVTFDLLNPLG 64
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619   81 KRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLASSARTLLGTYFNDLQDQCAIVHLAGARGdfv 160
Cdd:pfam13252  65 GAPIMGDEVLEGRGEALSFSSDKLRINQARHAVDAGGTMTQQRTPHDLRATARPALADWFDRLQDQSAFVHLAGARG--- 141
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619  161 aDDTILPTAEHPEFKKIMINDVLPPTHDRHFFGGDATSFE-QIEAADIFSIGLVDNLSLFIDEMAHPLQPVRLSGDELHG 239
Cdd:pfam13252 142 -FHSNWTLASAPKFNDIMVNPVTAPTSNRHLFAGGAASTSgSLTSTDLFTLDLVDKARKLADTMALPPPPVKLRGDVVAG 220
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619  240 EDPYYVLYVTPRQWNDWYTSTSGKDWNQMMVRAVNRAKGFNHPLFKGECAMWRNILVRKYaGMPIRFYQGSKVLVSENNL 319
Cdd:pfam13252 221 GDPLYVLLLHPYQYDDLRTDTDTGAWRDIQKAAMARALVDKNPLFQGELGLWNGVVLRKH-PRVIRFNNGDTGKYAANTF 299
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619  320 TattkevAAATNIDRAMLLGAQALANAYGQKAGG-HFNMVEKKTDMDNRTEIAISWINGLKKIRFPEKSG---KMQDHGV 395
Cdd:pfam13252 300 S------GAGYATDRALLLGAQALAIAFGKTNGGtPFGWNEELLDHGNRLEVLIWAIDGLKKTRFNVDDGgvnKPTDFGV 373

                  ....*....
gi 446136619  396 IAVDTAVKL 404
Cdd:pfam13252 374 IVVDTAVKI 382
capsid_maj_N4 TIGR04387
major capsid protein, N4-gp56 family; Members of this family are phage major capsid proteins ...
55-358 3.69e-10

major capsid protein, N4-gp56 family; Members of this family are phage major capsid proteins as found in phage N4 (a double-stranded DNA virus) plus many additional lytic phage and integrated prophage regions. [Mobile and extrachromosomal element functions, Prophage functions]


Pssm-ID: 275180  Cd Length: 315  Bit Score: 60.83  E-value: 3.69e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619   55 APVVRITDLNKQAGDEVTFSIMHKLSKRPTMGDERVEGRGEDLSHADFSLKINQGRHLVDAGGRMSQQRTKFNLaSSART 134
Cdd:TIGR04387  29 SKFAQVKPLPKNPGDTIKFRRYVPLPGAPTPLTEGVTPKGEKLTFTDLTVTLEQYGKFVELTDVAADTHEDPEL-GEATE 107
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619  135 LLGTYFNDLQDQCAIVHLAGARGDFVAddtilptaehpefkkimindvlppthdrhffgGDATSFEQIEAADIfSIGLVD 214
Cdd:TIGR04387 108 LLGEQAAQTIDELTRDVLAGATNVIYA--------------------------------GAGTARNAVTADDV-TYDDIR 154
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 446136619  215 NLSLFIDEMAHPLQPVRLSGDELHGEDPYYVLYVTPRQWNDWYTSTSGKDWNQMmvravnrakGFNHPLFKGECAMWRNi 294
Cdd:TIGR04387 155 RAVRKLKDNRAPKITTVLTASVMVGTEPSYVAVIHPDLEPDLRDDPGFIPVEKY---------GAADPIMKGEIGMIEG- 224
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 446136619  295 lvrkyagmpIRFYQGSKVLVSENNltattkeVAAATNID--RAMLLGAQALANAYGQKAGGHFNMV 358
Cdd:TIGR04387 225 ---------VRFVETPEVLPWADA-------GAAGGNADvyPILIVGKDAFGTVPKNGKASTKHKI 274
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH