Conserved domains on [gi|332205949|ref|NP_001193770|]

trans-Golgi network integral membrane protein 2 isoform 3 precursor [Homo sapiens]

List of domain hits

Name Accession Description Interval E-value
MSCRAMM_ClfA super family cl41352
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
44-383 6.64e-14

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.
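The two sequence features named in this description, a long run of Ser-Asp dipeptide repeats and an LPXTG motif, can be checked for with simple pattern matching. The following is a minimal sketch, not part of the CD-Search output; the function names and the toy sequence are hypothetical and chosen only for illustration.

```python
import re
from typing import List


def longest_sd_run(seq: str) -> int:
    """Length in residues of the longest run of consecutive Ser-Asp (SD) dipeptides."""
    runs = re.findall(r"(?:SD)+", seq.upper())
    return max((len(run) for run in runs), default=0)


def find_lpxtg(seq: str) -> List[int]:
    """1-based start positions of LPXTG motifs, where X is any residue."""
    # The lookahead lets overlapping candidates be reported as well.
    return [m.start() + 1 for m in re.finditer(r"(?=LP.TG)", seq.upper())]


# Hypothetical toy sequence, not taken from any database record.
toy = "MKASDSDSDSDGGLPETGAA"
print(longest_sd_run(toy))
print(find_lpxtg(toy))
```

Lowercase residues in an alignment row mark unaligned columns, so the sketch upper-cases its input before matching.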


The actual alignment was detected with superfamily member NF033609:

Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 74.18  E-value: 6.64e-14
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  44 PSLSQRPGGSTKSHPEPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKTQK--DSSNKSGAEAKTQKGSTSKSGSEAQTTKD 121
Cdd:NF033609 542 PVVPEQPDEPGEIEPIPEDSDSDPGSDSGSDSSNSDSGSDSGSDSTSDSgsDSASDSDSASDSDSASDSDSASDSDSASD 621
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 122 STSKSHPELQTPKDSTGKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKSGADGQTPKDGSSKSGAEDQ 201
Cdd:NF033609 622 SDSASDSDSASDSDSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 701
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 202 TPKDVPNKSGAEKQTPKDGSNKSGAEEQGPIDGPSKSGAEEQTSKDSPNKVVPEQPSRKDHSKPISNPSDNKELPKADTN 281
Cdd:NF033609 702 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD 781
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 282 QLADKGKLSPHAFKTESGEETDLISPPQEEVKS---SEPTEDVEPKEAEDDDTGPEEGSPPKEEKEKMSGSASSENREGT 358
Cdd:NF033609 782 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSdsdSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSN 861
                        330       340
                 ....*....|....*....|....*
gi 332205949 359 LSDSTGSEKDDLYPNGSGNGSAESS 383
Cdd:NF033609 862 SDSESGSNNNVVPPNSPKNGTNASN 886
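The summary line and alignment rows above follow a regular layout, so they can be pulled apart with regular expressions. The sketch below is an assumption about that layout based only on the lines shown in this report; the names `parse_summary` and `parse_row` are hypothetical, and this is not an official NCBI parser.

```python
import re
from typing import Dict

# Layout assumed from lines such as:
# "Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 74.18  E-value: 6.64e-14"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*(?:\[(?P<tag>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[0-9.]+)\s*"
    r"E-value:\s*(?P<e_value>[0-9.eE+-]+)"
)

# Layout assumed from rows such as "gi 332205949 359 LSDSTG... 383".
ROW_RE = re.compile(
    r"^\s*(?P<label>gi \d+|Cdd:\S+)\s+(?P<start>\d+)\s+(?P<seq>\S+)\s+(?P<end>\d+)\s*$"
)


def parse_summary(line: str) -> Dict[str, object]:
    """Extract Pssm-ID, Cd length, bit score and E-value from a summary line."""
    m = SUMMARY_RE.search(line)
    if m is None:
        raise ValueError("not a Pssm-ID summary line")
    return {
        "pssm_id": int(m["pssm_id"]),
        "multi_domain": m["tag"] == "Multi-domain",
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def parse_row(line: str) -> Dict[str, object]:
    """Split one alignment row and check that its coordinates match its residues."""
    m = ROW_RE.match(line)
    if m is None:
        raise ValueError("not an alignment row")
    start, end, seq = int(m["start"]), int(m["end"]), m["seq"]
    residues = sum(1 for c in seq if c != "-")  # gap columns do not advance coordinates
    return {
        "label": m["label"],
        "start": start,
        "end": end,
        "seq": seq,
        "consistent": end - start + 1 == residues,
    }


print(parse_summary(
    "Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 74.18  E-value: 6.64e-14"
))
print(parse_row("gi 332205949 359 LSDSTGSEKDDLYPNGSGNGSAESS 383"))
```

The `consistent` flag is a cheap sanity check: the number of non-gap residues in a row should equal `end - start + 1`, which helps catch rows damaged during copy and paste.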
 
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
44-383 6.64e-14
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
77-384 5.22e-10

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 61.85  E-value: 5.22e-10
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  77 PEDTPNKSGAEAKTqkDSSNkSGAEAKTQKGSTSKSGSEAQTTKDSTSKSHPELQTPKDSTGKSGAEAQTPEDSPNRSGA 156
Cdd:NF033609 558 PEDSDSDPGSDSGS--DSSN-SDSGSDSGSDSTSDSGSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDSDSASDS 634
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 157 EAKTQKDSPSKSGSEAQTTKDVPNKSGADGQTPKDGSSKSGAEDQTPKDVPNKSGAEKQTPKDGSNKSGAEEQGPIDGPS 236
Cdd:NF033609 635 DSASDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 714
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 237 KSGAEEQTSKDSPNKVVPEQPSRKDHSKPISNPSDNKELPKADTNQLADKGKLSPHAFKTESGEETDLISPPQEEvKSSE 316
Cdd:NF033609 715 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSD-SDSD 793
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 332205949 317 PTEDVEPKEAEDDDTGPEEGSPPKEEKEKMSGSASSENREGTLSDSTGSEKDDLYPNGSGNGSAESSH 384
Cdd:NF033609 794 SDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSESDSN 861
KCT2 pfam17818
Keratinocyte-associated gene product; This entry includes Keratinocyte-associated ...
306-437 6.23e-08

Keratinocyte-associated gene product; This entry includes Keratinocyte-associated transmembrane protein 2 found in humans. Functional studies show that KCP2 localizes to the endoplasmic reticulum, consistent with a role in protein biosynthesis, and has a functional KKxx retrieval signal at its cytosolic C-terminus.
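A KKxx retrieval signal of the kind mentioned in this description is two lysines followed by any two residues at the very C-terminus. A minimal sketch of that check follows; the function name and the test sequences are hypothetical, and a match shows only that the pattern is present, not that the signal is functional.

```python
import re


def has_kkxx_signal(seq: str) -> bool:
    """True if the sequence ends with K-K-x-x, where x is any residue."""
    return re.search(r"KK..$", seq.strip().upper()) is not None


# Hypothetical toy sequences, not taken from any database record.
print(has_kkxx_signal("MAAAGGKKLE"))
print(has_kkxx_signal("MAAAGGKLKE"))
```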


Pssm-ID: 407686 [Multi-domain]  Cd Length: 187  Bit Score: 52.63  E-value: 6.23e-08
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  306 SPPQEEVKSSEptedvEPKEAEDDDTGPEEGSPPKEE------------------KEKMSGSASSENRE-GTLSDSTGSE 366
Cdd:pfam17818  33 SVPPEEADNNE-----DPSIEEEDLLTLNSSPPTAKDtldngdygepdydwttspRDEESDEILEENRGyKEIEQSVKSF 107
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 332205949  367 KddlypNGSGNGSAESSHFFAYLVTAAILVAVLYIAHHNKRKIIAFVlEGKRSKVTRRPKASDYQRLDQKI 437
Cdd:pfam17818 108 K-----SPPSNVEEEDSHFFFHLIIFAFCVAVVYVTYHNKRKIFLLV-QSRKWRDGLCSKTVEYHRLDQNV 172
PTZ00121 PTZ00121
MAEBL; Provisional
55-330 2.34e-07



Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 53.61  E-value: 2.34e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949   55 KSHPEPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKtQKDSSNKSGAEAKTQKGSTSKSGSEAQTTKDSTSKSHPELQTPK 134
Cdd:PTZ00121 1308 KKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAK-KAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKK 1386
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  135 DSTGKSGAEAQTPEDSPNRSG------AEAKTQKDSPSKSGSEAQTTKDVPNKSGadgQTPKDGSSKSGAEDQTPKDVPN 208
Cdd:PTZ00121 1387 AEEKKKADEAKKKAEEDKKKAdelkkaAAAKKKADEAKKKAEEKKKADEAKKKAE---EAKKADEAKKKAEEAKKAEEAK 1463
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  209 KSGAEKQTPKDGSNKsgAEEQGPIDgPSKSGAEEQTSKDSPNKVVPEQPSRKDHSKPISNPSDNKELPKADTNQLADKGK 288
Cdd:PTZ00121 1464 KKAEEAKKADEAKKK--AEEAKKAD-EAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAK 1540
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|..
gi 332205949  289 LSPHAFKTESGEETdlisppqEEVKSSEPTEDVEPKEAEDDD 330
Cdd:PTZ00121 1541 KAEEKKKADELKKA-------EELKKAEEKKKAEEAKKAEED 1575
MSCRAMM_ClfA NF033609
MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial ...
53-210 2.95e-04

MSCRAMM family adhesin clumping factor ClfA; Clumping factor A is an MSCRAMM (Microbial Surface Components Recognizing Adhesive Matrix Molecules). It is heavily studied in Staphylococcus aureus both for its biological role in adhesion and for its potential for vaccination. Features of the sequence, but also of other MSCRAMM adhesins, include a long run of Ser-Asp dipeptide repeats and a C-terminal cell wall anchoring LPXTG motif.


Pssm-ID: 468110 [Multi-domain]  Cd Length: 934  Bit Score: 43.36  E-value: 2.95e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  53 STKSHPEPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKTQKDSSNKSGAEAKTQKGSTSKSGSEAQTTKDSTSKSHPELQT 132
Cdd:NF033609 751 DSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDSDS 830
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 332205949 133 PKDSTGKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKSGADGQTPKDGS-SKSGAEDQTPKDVPNKS 210
Cdd:NF033609 831 DSDSDSDSDSDSDSDSDSDSDSDSDSESDSNSDSESGSNNNVVPPNSPKNGTNASNKNEAKdSKEPLPDTGSEDEANTS 909
 
PLN03237 PLN03237
DNA topoisomerase 2; Provisional
64-330 4.00e-06



Pssm-ID: 215641 [Multi-domain]  Cd Length: 1465  Bit Score: 49.48  E-value: 4.00e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949   64 KDSPSKSSAEAQTPEDTPNKSGAEAKTQKDSSNKSGAEAKTQKGSTSKSGSEAQTTKDSTSKSHPELQTPKDSTGKS--- 140
Cdd:PLN03237 1171 EDAKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKPKGRAGAKkka 1250
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  141 -GAEAQTPEDSPNRSGAEAKTQ---KDSPSKSGSEAQTTKDVPNKSGADGQTPKDGSSKSGAEDQTPKDVPNKSGAEKQT 216
Cdd:PLN03237 1251 pAAAKEKEEEDEILDLKDRLAAynlDSAPAQSAKMEETVKAVPARRAAARKKPLASVSVISDSDDDDDDFAVEVSLAERL 1330
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  217 PKDGSNKSGAEEQGPIDGPSKSGAEEQTSKDSP-------------NKVVPEQPSRKDHSKPIsNPSDNKELPKADTNQl 283
Cdd:PLN03237 1331 KKKGGRKPAAANKKAAKPPAAAKKRGPATVQSGqklltemlkpaeaIGISPEKKVRKMRASPF-NKKSGSVLGRAATNK- 1408
                         250       260       270       280       290
                  ....*....|....*....|....*....|....*....|....*....|.
gi 332205949  284 ADKGKLSPHAFKTESGEETDLISPPQEEVKSSEPT----EDVEPKEAEDDD 330
Cdd:PLN03237 1409 ETESSENVSGSSSSEKDEIDVSAKPRPQRANRKQTtyvlSDSESESADDSD 1459
PHA00430 PHA00430
tail fiber protein
46-177 6.44e-05



Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 45.27  E-value: 6.44e-05
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  46 LSQRPGGSTKSHPEPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKTQKDSSNKSGAEAKTQKGSTSKSGSEAQTTKDSTSK 125
Cdd:PHA00430 154 IKTWNQSAWNARNEANRSRNEADRARNQAERFNNESGASATNTKQWRSEADGSNSEANRFKGYADSMTSSVEAAKGQAES 233
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 332205949 126 SHPELQTPKDSTGKSGAE---AQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKD 177
Cdd:PHA00430 234 SSKEANTAGDYATKAAASasaAHASEVNAANSATAAATSANRAKQQADRAKTEAD 288
PHA03307 PHA03307
transcriptional regulator ICP4; Provisional
13-365 9.40e-05



Pssm-ID: 223039 [Multi-domain]  Cd Length: 1352  Bit Score: 45.16  E-value: 9.40e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949   13 AAAGAVPLLATESVKQEEAGVRPSAGN-----VSTHPSLSQRPGGSTKSHPEPQTPKDSPSKSSAEAQTPEDTPNKSGAE 87
Cdd:PHA03307   13 AAAEGGEFFPRPPATPGDAADDLLSGSqgqlvSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSL 92
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949   88 AKTQKDSSNKSGAEakTQKGSTSKSGSEAQTTKDSTSKSH-PELQTPKDSTGKSGAEAQT-PEDSPNRSGAEAKTQKDS- 164
Cdd:PHA03307   93 STLAPASPAREGSP--TPPGPSSPDPPPPTPPPASPPPSPaPDLSEMLRPVGSPGPPPAAsPPAAGASPAAVASDAASSr 170
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  165 ----PSKSGSEAQTTKDVPNKS---------GADGQTPKDGSSKSGAEDQTPKDVPNKSGAEKQTPKDGSNKSGAE-EQG 230
Cdd:PHA03307  171 qaalPLSSPEETARAPSSPPAEpppstppaaASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGcGWG 250
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  231 PIDGPSKSGAEEQTSKDSPNKVVPEQPSRKDHSKPISNPSDNKELPKADTNQlADKGKLSPHAFKTESGEetdlispPQE 310
Cdd:PHA03307  251 PENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSS-PGSGPAPSSPRASSSSS-------SSR 322
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....*
gi 332205949  311 EVKSSEPTEDVEPKEAEDDDTGPEEGSPPKEEKEKMSGSASSENREGTLSDSTGS 365
Cdd:PHA03307  323 ESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSS 377
PHA00430 PHA00430
tail fiber protein
61-197 2.50e-04



Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 43.34  E-value: 2.50e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  61 QTPKDSPSKSSAEAQTPEDTPNKSGAEAKTQKDSSNKSGAEAKTQKGSTSKSGSEAQTTKDSTSKSHPELQTPKDSTGKS 140
Cdd:PHA00430 155 KTWNQSAWNARNEANRSRNEADRARNQAERFNNESGASATNTKQWRSEADGSNSEANRFKGYADSMTSSVEAAKGQAESS 234
                         90       100       110       120       130
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 332205949 141 GAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKSGADGQTPKDGSSKSG 197
Cdd:PHA00430 235 SKEANTAGDYATKAAASASAAHASEVNAANSATAAATSANRAKQQADRAKTEADKLG 291
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
62-377 3.97e-04

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 42.98  E-value: 3.97e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949   62 TPKDSPSKSSAEAQTPEDTPNKSGAEAKTQKDSSNKSGAEAKTQKGSTSKSGSEAQTTKDSTSKSHPELQTPKDSTGKSG 141
Cdd:pfam05109 358 TETDFKCKWTLTSGTPSGCENISGAFASNRTFDITVSGLGTAPKTLIITRTATNATTTTHKVIFSKAPESTTTSPTLNTT 437
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  142 AEAQTPEDSPNRSGAEAKTQKDSPSKSGseaqttkdvPNKSGADGQTPKDGSSKSGAEDQTPKDVPNKSGAEKQTPKDGS 221
Cdd:pfam05109 438 GFAAPNTTTGLPSSTHVPTNLTAPASTG---------PTVSTADVTSPTPAGTTSGASPVTPSPSPRDNGTESKAPDMTS 508
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  222 NKSGAEEQGP--------IDGPSKSGAEEQTSKDSPNKVVPE-QPSRKDHSKPISNPSDNKELPK-ADTNQLADKGKLSP 291
Cdd:pfam05109 509 PTSAVTTPTPnatsptpaVTTPTPNATSPTLGKTSPTSAVTTpTPNATSPTPAVTTPTPNATIPTlGKTSPTSAVTTPTP 588
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  292 HAFKTESGEETDLISPPQEEV--KSSEPTEDVEPKEAEDD-DTGPEEGSPPKEEKEKMSGSASSENREGTLSDSTGSEKD 368
Cdd:pfam05109 589 NATSPTVGETSPQANTTNHTLggTSSTPVVTSPPKNATSAvTTGQHNITSSSTSSMSLRPSSISETLSPSTSDNSTSHMP 668
                         330
                  ....*....|..
gi 332205949  369 DL---YPNGSGN 377
Cdd:pfam05109 669 LLtsaHPTGGEN 680
PRK08581 PRK08581
amidase domain-containing protein;
64-290 8.53e-04



Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 41.70  E-value: 8.53e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  64 KDSPSKSSAEAQTPEDTPNKSGAEAKTQKDSSNKSgaEAKTQKGSTSKSGSEAQTTKDSTSKSHPELQTPKDSTGKSGAE 143
Cdd:PRK08581  58 KDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNII--DFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISD 135
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 144 AQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKD-VPNKSGADGQTPKDGSSKSGAEDQTPKDvPNKSGAEKQTPKDGSN 222
Cdd:PRK08581 136 YEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDkADNQKAPSSNNTKPSTSNKQPNSPKPTQ-PNQSNSQPASDDTANQ 214
                        170       180       190       200       210       220
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 332205949 223 KSGAEEQGPIDGPSKSGAEEQTSKDSP-NKVVPEQPSRKDHSKpiSNPSDNKELPKADTNQLADKGKLS 290
Cdd:PRK08581 215 KSSSKDNQSMSDSALDSILDQYSEDAKkTQKDYASQSKKDKTE--TSNTKNPQLPTQDELKHKSKPAQS 281
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
50-358 1.12e-03



Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 41.60  E-value: 1.12e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  50 PGGSTKSHPEPQTPKDSPSKSSAEAQTPE-DTPNKSGAEAKTQKDSSNKSGAEAKTQKGSTSKSGSE-AQTTKDSTSKSH 127
Cdd:PTZ00449 511 PEGPEASGLPPKAPGDKEGEEGEHEDSKEsDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKkPEFPKDPKHPKD 590
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 128 PElqTPKDStgKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKSGADGQTPKDGSS-KSGAEDQTPKDV 206
Cdd:PTZ00449 591 PE--EPKKP--KRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIiKSPKPPKSPKPP 666
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 207 PNKSGAEK--QTPKDGSNKSGAEEQGPIDGPSKSGAEEQTSKDSPNkvVPEQPSRKDHSKPISNPSDNKELPKADTNQLA 284
Cdd:PTZ00449 667 FDPKFKEKfyDDYLDAAAKSKETKTTVVLDESFESILKETLPETPG--TPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQP 744
                        250       260       270       280       290       300       310
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 332205949 285 DkgklsPHAFKTESGEETDLIsppqEEVKSSEPTEDVEPKEAEDDDTGPEEGSPPKEEKEKMSGSASSENREGT 358
Cdd:PTZ00449 745 D-----DIEFFTPPEEERTFF----HETPADTPLPDILAEEFKEEDIHAETGEPDEAMKRPDSPSEHEDKPPGD 809
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
107-336 1.56e-03

Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 41.12  E-value: 1.56e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 107 GSTSKSGSEAQTTKDSTSKSHPELQTPKDSTGKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKSGADG 186
Cdd:PRK07764 589 GPAPGAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDG 668
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 187 QTPKDGSSKSGAEDQTPKDVPNKSGAEKQTPkdgsnksgAEEQGPIDGPSKSGAEEQTSKDspnkvvpeQPSRKDHSKPI 266
Cdd:PRK07764 669 WPAKAGGAAPAAPPPAPAPAAPAAPAGAAPA--------QPAPAPAATPPAGQADDPAAQP--------PQAAQGASAPS 732
                        170       180       190       200       210       220       230
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 267 SNPSDNKELPKADTNQLADKGKLSPHAFKTESGEETDlISPPQEEVKSSEPTEDVEPKEAEDDDTGPEEG 336
Cdd:PRK07764 733 PAADDPVPLPPEPDDPPDPAGAPAQPPPPPAPAPAAA-PAAAPPPSPPSEEEEMAEDDAPSMDDEDRRDA 801
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
25-356 2.77e-03

Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 40.44  E-value: 2.77e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  25 SVKQEEAGVRPSAG---NVSTHPSLSQRPGGSTKS-HP-EPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKTQKDSSNKSG 99
Cdd:PTZ00449 551 ETKEGEVGKKPGPAkehKPSKIPTLSKKPEFPKDPkHPkDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESP 630
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 100 AEAKTQKGSTSKSGSEAQTTKDS--TSKSHPELQTPKDSTGK-------SGAEAQTPE---DSPNRSGAEAKTQKDSPSK 167
Cdd:PTZ00449 631 KSPKRPPPPQRPSSPERPEGPKIikSPKPPKSPKPPFDPKFKekfyddyLDAAAKSKEtktTVVLDESFESILKETLPET 710
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 168 SGSEAQTTKDVPNKSGADGQTP----KDGSSKSGAEDQTPKDVPNKSGAEKQTPKDGSNKS-GAEEQGPIDGPSKSGAEE 242
Cdd:PTZ00449 711 PGTPFTTPRPLPPKLPRDEEFPfepiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDiLAEEFKEEDIHAETGEPD 790
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 243 QTSKDspnkvvPEQPSRKDHSKPISNPSDNKELPKAD------TNQLADKGKLSPHAF-------KTESGEE-TDLISPP 308
Cdd:PTZ00449 791 EAMKR------PDSPSEHEDKPPGDHPSLPKKRHRLDglalstTDLESDAGRIAKDASgkivklkRSKSFDDlTTVEEAE 864
                        330       340       350       360       370
                 ....*....|....*....|....*....|....*....|....*....|....*..
gi 332205949 309 QEEVKSSEPTEDVEPKEAEDDDTGPEEGS---------PPKEEKEKMSGSASSENRE 356
Cdd:PTZ00449 865 EMGAEARKIVVDDDGTEADDEDTHPPEEKhksevrrrrPPKKPSKPKKPSKPKKPKK 921
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
189-334 3.03e-03

Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 39.96  E-value: 3.03e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 189 PKDGSSKSGAEDQTPKDVPNKSGAEKQTPKDGSNKSGAEEQGPIDGPSKSGAEEQTSKDSPNKVVPEQPSRKDHSKPISN 268
Cdd:PRK13108 298 REPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPAVEETSEADIEREQPG 377
                         90       100       110       120       130       140       150
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 269 PSDnKELPKADTNQLADKGKL--SPHAFKTESGEETDLISPPQEEVKsSEPTEDVEP--KEAEDDDTGPE 334
Cdd:PRK13108 378 DLA-GQAPAAHQVDAEAASAApeEPAALASEAHDETEPEVPEKAAPI-PDPAKPDELavAGPGDDPAEPD 445
PRK13108 PRK13108
prolipoprotein diacylglyceryl transferase; Reviewed
59-233 3.14e-03

Pssm-ID: 237284 [Multi-domain]  Cd Length: 460  Bit Score: 39.96  E-value: 3.14e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  59 EPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKTQKDSSNKSGAEAKTQKGSTSKSGSEAQTTKDSTSKSHPElqtpkdSTG 138
Cdd:PRK13108 294 EALEREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDGESTPAVE------ETS 367
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 139 KSGAEAQTPEDspnrsgaeakTQKDSPSKSGSEAQTTKDVPNKSGADGQTPKDGSSKSGAEDQTPK---DVPNKSGAEKQ 215
Cdd:PRK13108 368 EADIEREQPGD----------LAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKAAPIpdpAKPDELAVAGP 437
                        170
                 ....*....|....*...
gi 332205949 216 TPkDGSNKSGAEEQGPID 233
Cdd:PRK13108 438 GD-DPAEPDGIRRQDDFS 454
PRK08691 PRK08691
DNA polymerase III subunits gamma and tau; Validated
129-316 3.70e-03

Pssm-ID: 236333 [Multi-domain]  Cd Length: 709  Bit Score: 39.69  E-value: 3.70e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 129 ELQTPKDSTGKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKSGADGQtPKDGSSKSGAEDQTPKDVPN 208
Cdd:PRK08691 376 ELQSPSAQTAEKETAAKKPQPRPEAETAQTPVQTASAAAMPSEGKTAGPVSNQENNDVP-PWEDAPDEAQTAAGTAQTSA 454
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 209 KS---GAEKQTPKDGSNKSGAEEQGPIDGPSKSGAEEQTSKDSPNK--VVPEQPSRKDHSKPISN---------PSDNKE 274
Cdd:PRK08691 455 KSiqtASEAETPPENQVSKNKAADNETDAPLSEVPSENPIQATPNDeaVETETFAHEAPAEPFYGygfpdndcpPEDGAE 534
                        170       180       190       200
                 ....*....|....*....|....*....|....*....|..
gi 332205949 275 LPKADTNQLADKGklSPHAFKTESGEETDLISPPQEEVKSSE 316
Cdd:PRK08691 535 IPPPDWEHAAPAD--TAGGGADEEAEAGGIGGNNTPSAPPPE 574
PRK07003 PRK07003
DNA polymerase III subunit gamma/tau;
12-241 5.03e-03

Pssm-ID: 235906 [Multi-domain]  Cd Length: 830  Bit Score: 39.45  E-value: 5.03e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  12 VAAAGAVPLLATESVKQEEAGVRPSAGNVSTHPSLSQRPGGSTkshPEPQTPKDSPSKSSAEAQTPEDTPNKSGAEAKTQ 91
Cdd:PRK07003 374 ARVAGAVPAPGARAAAAVGASAVPAVTAVTGAAGAALAPKAAA---AAAATRAEAPPAAPAPPATADRGDDAADGDAPVP 450
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949  92 KDSSNKSGAEAKTQKGST---SKSGSEAQTTKDSTSKSHPELQTPKDSTGKSGA---------------EAQTPEDSPNR 153
Cdd:PRK07003 451 AKANARASADSRCDERDAqppADSGSASAPASDAPPDAAFEPAPRAAAPSAATPaavpdarapaaasreDAPAAAAPPAP 530
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 154 SGAEAKTQKDSPSKSGSEAQTTKDVPNKSG----ADGQTPKDGSSKSGAEdQTPKDVPNKSGAEKQTPKDGSNKSGAEEQ 229
Cdd:PRK07003 531 EARPPTPAAAAPAARAGGAAAALDVLRNAGmrvsSDRGARAAAAAKPAAA-PAAAPKPAAPRVAVQVPTPRARAATGDAP 609
                        250
                 ....*....|..
gi 332205949 230 GPIDGPSKSGAE 241
Cdd:PRK07003 610 PNGAARAEQAAE 621
PHA03169 PHA03169
hypothetical protein; Provisional
134-347 6.58e-03

Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 38.80  E-value: 6.58e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 134 KDSTGKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDvpnksgadgqtpkDGSSKSGAEDQTPKDvpNKSGAE 213
Cdd:PHA03169  20 RGHCKRHGGTREQAGRRRGTAARAAKPAPPAPTTSGPQVRAVAE-------------QGHRQTESDTETAEE--SRHGEK 84
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 214 KQTPKDGSNKSGAEEQGPIDGPSKSGAEEQTSKDSPNKVVPEQP-SRKDHSKPISNPSDNKELPKADTNQLADKGKLSPH 292
Cdd:PHA03169  85 EERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPeSPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPS 164
                        170       180       190       200       210
                 ....*....|....*....|....*....|....*....|....*....|....*
gi 332205949 293 AFKTESGEETDlisPPQEEVKSSEPTEDVEPKEAEDDDTGPEEGSPPKEEKEKMS 347
Cdd:PHA03169 165 SFLQPSHEDSP---EEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDEPGEPQS 216
PHA00430 PHA00430
tail fiber protein
103-199 8.50e-03

Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 38.33  E-value: 8.50e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 332205949 103 KTQKGSTSKSGSEAQTTKDSTSKSHPELQTPKDSTGKSGAEAQTPEDSPNRSGAEAKTQKDSPSKSGSEAQTTKDVPNKS 182
Cdd:PHA00430 155 KTWNQSAWNARNEANRSRNEADRARNQAERFNNESGASATNTKQWRSEADGSNSEANRFKGYADSMTSSVEAAKGQAESS 234
                         90
                 ....*....|....*..
gi 332205949 183 GADGQTPKDGSSKSGAE 199
Cdd:PHA00430 235 SKEANTAGDYATKAAAS 251
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd   Low complexity filter: no   Composition Based Adjustment: yes   E-value threshold: 0.01
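
Each hit above carries a summary line with a fixed layout (Pssm-ID, Cd Length, Bit Score, E-value), so the hits can be checked against the E-value threshold listed in the preset options. The following is a minimal Python sketch under that assumption. The function name `parse_hit_summary`, the regular expression, and the choice of `<=` for the threshold comparison are illustrative assumptions and are not part of any NCBI tool or API.

```python
import re

# Assumed layout of a summary line as it appears on this page, e.g.
# "Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 41.60  E-value: 1.12e-03"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<hit_type>[^\]]+)\])?\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s+"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s+"
    r"E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_hit_summary(line):
    """Parse one summary line into a dict, or return None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m.group("pssm_id")),
        "hit_type": m.group("hit_type"),  # e.g. "Multi-domain"; None if absent
        "cd_length": int(m.group("cd_length")),
        "bit_score": float(m.group("bit_score")),
        "e_value": float(m.group("e_value")),
    }


# Threshold copied from the preset options on this page.
E_VALUE_THRESHOLD = 0.01

# Summary line of the last hit listed above (PHA00430).
hit = parse_hit_summary(
    "Pssm-ID: 222790 [Multi-domain]  Cd Length: 568  Bit Score: 38.33  E-value: 8.50e-03"
)

# Assumption: a hit is reported when its E-value does not exceed the threshold.
passes = hit is not None and hit["e_value"] <= E_VALUE_THRESHOLD
```

Returning `None` for lines that do not match lets the same function be run over every line of the page, keeping only the summary lines.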