NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1420808780|gb|AXB22574|]
View 

major capsid protein [Alces alces faeces associated microvirus MP12 5423]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
Phage_F super family cl15846
Capsid protein (F protein); This is a family of proteins from single-stranded DNA ...
1-546 3.03e-156

Capsid protein (F protein); This is a family of proteins from single-stranded DNA bacteriophages. Protein F is the major capsid component, sixty copies of which are found in the virion.


The actual alignment was detected with superfamily member PHA00363:

Pssm-ID: 326659  Cd Length: 557  Bit Score: 457.78  E-value: 3.03e-156
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780   1 MNKYFATNPRIKISRTKFDLSSTHKTTFNAGYLVPIKIYENVPGDTISVSMNSVIEMTTPLKPTMDLAVCNVYAFKVPMR 80
Cdd:PHA00363   11 MSHSFAQVPSARIQRSSFDRSCGLKTTFDAGYLIPIFCDEVLPGDTFSLKEAFLARMATPIFPLMDNLRLDTQYFFVPLR 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780  81 LLWEHWQEMFGENDDTywAQPVEYTIPKLTAPTGGWSKDSIMSHMGVRMNTEGLSVSALPARAVAKIYNDWFRDQNVMTP 160
Cdd:PHA00363   91 LLWSNFEKFCGEQDNP--DDSTDFLTPVLTAPSGGFAEGSIHDYFGLPTKVAGIRCVALWHRAYNLIWNQYYRDENLQES 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 161 YLEAKDDSNREGSNGdgkndlvagGKCPRIAKFHDIFTTALPGPIKSTEdVLLPLGEFAPVGTRTNVSPQLTNNTMLTW- 239
Cdd:PHA00363  169 VAVQMGDTTSDEVNN---------YKLLKRGKRYDYFTSCLPWPQKGPA-VTIGVGGIAPVTGLYGDVSSNNPIPAFVWd 238
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 240 ------RMSNGESLTSGTSYNITIGANDYT--PTQARTDVDLGTYSADIQPKNLWADLSQATGATMQDFLYAYSLYKMFT 311
Cdd:PHA00363  239 nsvnptWFNSGGPTPTGTLGIVPVGQAYYIkkPGNDPTAQAANGEPATDSTPRLYADLGSTSPVTINSLREAFQLQKLYE 318
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 312 IDARGGTRYTEILANHYGVISSDARLQRSEYLGGTNFEINMVHVPQTSQTTEESAQGYLTAYSnTAIRNKHLFTTSYDEH 391
Cdd:PHA00363  319 RDARGGTRYVEIIRSHFNVQSPDARLQRAEYLGGSSTPVNISPVPQTSSTDSTSPQGNLAAYG-TAIGSKRVFTKSFTEH 397
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 392 CYTILFAAIRTKQTYAQGIEKMWFHEERTDFYMPTFAHIGEQPIRNRELFAQG--------TDEDAQTFGFGEAFYEYRT 463
Cdd:PHA00363  398 GVILGLASVRADLTYQQGLDRMWSRRTRWDFYWPALSHLGEQAVLNKEIYCQGpavkdaggVVVDEQVFGYQERFAEYRY 477
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 464 QVDLVTGEFSPDAENSLPSYTYTNDFESLPTLSEEFIlETEDNVNRTIAVNSslQDQFKADFYFNVTAVRPLPPRSIPSL 543
Cdd:PHA00363  478 KPSKITGKFRSNATGSLDSWHLAQEFENLPTLSPEFI-EENPPMDRVLAVKT--EPDFLLDFWFSLRCARPMPVYSVPGL 554

                  ...
gi 1420808780 544 ASH 546
Cdd:PHA00363  555 IDH 557
 
Name Accession Description Interval E-value
PHA00363 PHA00363
major capsid protein
1-546 3.03e-156

major capsid protein


Pssm-ID: 222784  Cd Length: 557  Bit Score: 457.78  E-value: 3.03e-156
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780   1 MNKYFATNPRIKISRTKFDLSSTHKTTFNAGYLVPIKIYENVPGDTISVSMNSVIEMTTPLKPTMDLAVCNVYAFKVPMR 80
Cdd:PHA00363   11 MSHSFAQVPSARIQRSSFDRSCGLKTTFDAGYLIPIFCDEVLPGDTFSLKEAFLARMATPIFPLMDNLRLDTQYFFVPLR 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780  81 LLWEHWQEMFGENDDTywAQPVEYTIPKLTAPTGGWSKDSIMSHMGVRMNTEGLSVSALPARAVAKIYNDWFRDQNVMTP 160
Cdd:PHA00363   91 LLWSNFEKFCGEQDNP--DDSTDFLTPVLTAPSGGFAEGSIHDYFGLPTKVAGIRCVALWHRAYNLIWNQYYRDENLQES 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 161 YLEAKDDSNREGSNGdgkndlvagGKCPRIAKFHDIFTTALPGPIKSTEdVLLPLGEFAPVGTRTNVSPQLTNNTMLTW- 239
Cdd:PHA00363  169 VAVQMGDTTSDEVNN---------YKLLKRGKRYDYFTSCLPWPQKGPA-VTIGVGGIAPVTGLYGDVSSNNPIPAFVWd 238
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 240 ------RMSNGESLTSGTSYNITIGANDYT--PTQARTDVDLGTYSADIQPKNLWADLSQATGATMQDFLYAYSLYKMFT 311
Cdd:PHA00363  239 nsvnptWFNSGGPTPTGTLGIVPVGQAYYIkkPGNDPTAQAANGEPATDSTPRLYADLGSTSPVTINSLREAFQLQKLYE 318
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 312 IDARGGTRYTEILANHYGVISSDARLQRSEYLGGTNFEINMVHVPQTSQTTEESAQGYLTAYSnTAIRNKHLFTTSYDEH 391
Cdd:PHA00363  319 RDARGGTRYVEIIRSHFNVQSPDARLQRAEYLGGSSTPVNISPVPQTSSTDSTSPQGNLAAYG-TAIGSKRVFTKSFTEH 397
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 392 CYTILFAAIRTKQTYAQGIEKMWFHEERTDFYMPTFAHIGEQPIRNRELFAQG--------TDEDAQTFGFGEAFYEYRT 463
Cdd:PHA00363  398 GVILGLASVRADLTYQQGLDRMWSRRTRWDFYWPALSHLGEQAVLNKEIYCQGpavkdaggVVVDEQVFGYQERFAEYRY 477
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 464 QVDLVTGEFSPDAENSLPSYTYTNDFESLPTLSEEFIlETEDNVNRTIAVNSslQDQFKADFYFNVTAVRPLPPRSIPSL 543
Cdd:PHA00363  478 KPSKITGKFRSNATGSLDSWHLAQEFENLPTLSPEFI-EENPPMDRVLAVKT--EPDFLLDFWFSLRCARPMPVYSVPGL 554

                  ...
gi 1420808780 544 ASH 546
Cdd:PHA00363  555 IDH 557
Phage_F pfam02305
Capsid protein (F protein); This is a family of proteins from single-stranded DNA ...
9-543 1.28e-135

Capsid protein (F protein); This is a family of proteins from single-stranded DNA bacteriophages. Protein F is the major capsid component, sixty copies of which are found in the virion.


Pssm-ID: 308107  Cd Length: 510  Bit Score: 403.49  E-value: 1.28e-135
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780   9 PRIKISRTKFDLSSTHKTTFNAGYLVPIKIYENVPGDTISVSMNSVIEMTTPLKPTMDLAVCNVYAFKVPMRLLWEHWQE 88
Cdd:pfam02305   4 QTSHIERSPFDLSHLTFTAFKIGRLIPISWTPVLPGDSFEMDEVGLIRLSTLRRPLMDDSRVDTFFFYVPHRHVWDQWEK 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780  89 MFGENDDTYwaQPVEYTIPKLTAPTGGWSKDSIMSHMGVRMNTEGLSVSALPARAVAKIYNDWFRDQNvMTPYLEA-KDD 167
Cdd:pfam02305  84 FMGDGVNAW--DSTDALVPDITAPLGGVTEGSIYDHFGIPGKVATLRIPKLLFRAYLNIYNNYFRDPN-LQESTEAnPGD 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 168 SNREGSNGDgknDLVaggkcpRIAKFHDIFTTALPGPIKSTEdVLLPLGEFAPVGTRTNVSPQLtNNTMLTWRMSNGESL 247
Cdd:pfam02305 161 TNGDDSRYD---ILL------RAAKLKDYFTSPLPWPQKGPS-VTMGIGGMAPVTTSFRPVPNF-VGTPLIFRDLKGRTI 229
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 248 TSGTSYNITIGA-NDYTPTqartdvdlgtysadiqPKNLWADLSQATGATMQDFLYAYSLYKMFTIDARGGTRYTEILAN 326
Cdd:pfam02305 230 KTGQTGNGPIDNgNGETAI----------------PSNLYADLSAATSIDIMGLRAAYALQHTEEEDARGGTRYVEIIKS 293
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 327 HYGVISSDARLQRSEYLGGTNFEINMVHVPQTSQTTEESAQGYLTAYSNTAIR-NKHLFTTSYDEHCYTILFAAIRTKQT 405
Cdd:pfam02305 294 HFGVTSYDARLQRPELLGGSSFWASGYDVPQTSSTDSKSPQGNLAAFSGRVQQtNKHLVPKFFVEHGVIITLAVVRFPPT 373
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 406 YAQGIEKMWFHEE-RTDFYMPTFAHIGEQPIRNRELFAQGTDEDAQTFGFGEAFYEYRTQVDLVTGEFSPDAENSLPSYT 484
Cdd:pfam02305 374 YQQGLHYLWSRGQlTYDIYDPALANLPEQEVSNKEIFCQGSSVDSEKFGYQERYAWYRYKPSKVAGVYRSNATQSLDGWH 453
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1420808780 485 YTNDFESLPTLSEEFILETeDNVNRTIAVNSSLQdQFKADFYFNVTAVRPLPPRSIPSL 543
Cdd:pfam02305 454 FAQHFANLPDLSERFLEEN-TPYDRCLAVSDQLP-QWNMDFKFNYTVYRPMPTYSDPGM 510
 
Name Accession Description Interval E-value
PHA00363 PHA00363
major capsid protein
1-546 3.03e-156

major capsid protein


Pssm-ID: 222784  Cd Length: 557  Bit Score: 457.78  E-value: 3.03e-156
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780   1 MNKYFATNPRIKISRTKFDLSSTHKTTFNAGYLVPIKIYENVPGDTISVSMNSVIEMTTPLKPTMDLAVCNVYAFKVPMR 80
Cdd:PHA00363   11 MSHSFAQVPSARIQRSSFDRSCGLKTTFDAGYLIPIFCDEVLPGDTFSLKEAFLARMATPIFPLMDNLRLDTQYFFVPLR 90
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780  81 LLWEHWQEMFGENDDTywAQPVEYTIPKLTAPTGGWSKDSIMSHMGVRMNTEGLSVSALPARAVAKIYNDWFRDQNVMTP 160
Cdd:PHA00363   91 LLWSNFEKFCGEQDNP--DDSTDFLTPVLTAPSGGFAEGSIHDYFGLPTKVAGIRCVALWHRAYNLIWNQYYRDENLQES 168
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 161 YLEAKDDSNREGSNGdgkndlvagGKCPRIAKFHDIFTTALPGPIKSTEdVLLPLGEFAPVGTRTNVSPQLTNNTMLTW- 239
Cdd:PHA00363  169 VAVQMGDTTSDEVNN---------YKLLKRGKRYDYFTSCLPWPQKGPA-VTIGVGGIAPVTGLYGDVSSNNPIPAFVWd 238
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 240 ------RMSNGESLTSGTSYNITIGANDYT--PTQARTDVDLGTYSADIQPKNLWADLSQATGATMQDFLYAYSLYKMFT 311
Cdd:PHA00363  239 nsvnptWFNSGGPTPTGTLGIVPVGQAYYIkkPGNDPTAQAANGEPATDSTPRLYADLGSTSPVTINSLREAFQLQKLYE 318
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 312 IDARGGTRYTEILANHYGVISSDARLQRSEYLGGTNFEINMVHVPQTSQTTEESAQGYLTAYSnTAIRNKHLFTTSYDEH 391
Cdd:PHA00363  319 RDARGGTRYVEIIRSHFNVQSPDARLQRAEYLGGSSTPVNISPVPQTSSTDSTSPQGNLAAYG-TAIGSKRVFTKSFTEH 397
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 392 CYTILFAAIRTKQTYAQGIEKMWFHEERTDFYMPTFAHIGEQPIRNRELFAQG--------TDEDAQTFGFGEAFYEYRT 463
Cdd:PHA00363  398 GVILGLASVRADLTYQQGLDRMWSRRTRWDFYWPALSHLGEQAVLNKEIYCQGpavkdaggVVVDEQVFGYQERFAEYRY 477
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 464 QVDLVTGEFSPDAENSLPSYTYTNDFESLPTLSEEFIlETEDNVNRTIAVNSslQDQFKADFYFNVTAVRPLPPRSIPSL 543
Cdd:PHA00363  478 KPSKITGKFRSNATGSLDSWHLAQEFENLPTLSPEFI-EENPPMDRVLAVKT--EPDFLLDFWFSLRCARPMPVYSVPGL 554

                  ...
gi 1420808780 544 ASH 546
Cdd:PHA00363  555 IDH 557
Phage_F pfam02305
Capsid protein (F protein); This is a family of proteins from single-stranded DNA ...
9-543 1.28e-135

Capsid protein (F protein); This is a family of proteins from single-stranded DNA bacteriophages. Protein F is the major capsid component, sixty copies of which are found in the virion.


Pssm-ID: 308107  Cd Length: 510  Bit Score: 403.49  E-value: 1.28e-135
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780   9 PRIKISRTKFDLSSTHKTTFNAGYLVPIKIYENVPGDTISVSMNSVIEMTTPLKPTMDLAVCNVYAFKVPMRLLWEHWQE 88
Cdd:pfam02305   4 QTSHIERSPFDLSHLTFTAFKIGRLIPISWTPVLPGDSFEMDEVGLIRLSTLRRPLMDDSRVDTFFFYVPHRHVWDQWEK 83
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780  89 MFGENDDTYwaQPVEYTIPKLTAPTGGWSKDSIMSHMGVRMNTEGLSVSALPARAVAKIYNDWFRDQNvMTPYLEA-KDD 167
Cdd:pfam02305  84 FMGDGVNAW--DSTDALVPDITAPLGGVTEGSIYDHFGIPGKVATLRIPKLLFRAYLNIYNNYFRDPN-LQESTEAnPGD 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 168 SNREGSNGDgknDLVaggkcpRIAKFHDIFTTALPGPIKSTEdVLLPLGEFAPVGTRTNVSPQLtNNTMLTWRMSNGESL 247
Cdd:pfam02305 161 TNGDDSRYD---ILL------RAAKLKDYFTSPLPWPQKGPS-VTMGIGGMAPVTTSFRPVPNF-VGTPLIFRDLKGRTI 229
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 248 TSGTSYNITIGA-NDYTPTqartdvdlgtysadiqPKNLWADLSQATGATMQDFLYAYSLYKMFTIDARGGTRYTEILAN 326
Cdd:pfam02305 230 KTGQTGNGPIDNgNGETAI----------------PSNLYADLSAATSIDIMGLRAAYALQHTEEEDARGGTRYVEIIKS 293
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 327 HYGVISSDARLQRSEYLGGTNFEINMVHVPQTSQTTEESAQGYLTAYSNTAIR-NKHLFTTSYDEHCYTILFAAIRTKQT 405
Cdd:pfam02305 294 HFGVTSYDARLQRPELLGGSSFWASGYDVPQTSSTDSKSPQGNLAAFSGRVQQtNKHLVPKFFVEHGVIITLAVVRFPPT 373
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1420808780 406 YAQGIEKMWFHEE-RTDFYMPTFAHIGEQPIRNRELFAQGTDEDAQTFGFGEAFYEYRTQVDLVTGEFSPDAENSLPSYT 484
Cdd:pfam02305 374 YQQGLHYLWSRGQlTYDIYDPALANLPEQEVSNKEIFCQGSSVDSEKFGYQERYAWYRYKPSKVAGVYRSNATQSLDGWH 453
                         490       500       510       520       530
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1420808780 485 YTNDFESLPTLSEEFILETeDNVNRTIAVNSSLQdQFKADFYFNVTAVRPLPPRSIPSL 543
Cdd:pfam02305 454 FAQHFANLPDLSERFLEEN-TPYDRCLAVSDQLP-QWNMDFKFNYTVYRPMPTYSDPGM 510
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH